A Survey on Scalable and Parallel High Utility Itemset Mining

Authors

  • Dalal S
  • Dahiya V Maharshi Dayanand University Rohtak, Haryana,

Keywords:

frequent pattern mining, association rules

Abstract

Utility itemset mining aims to the discovery of itemsets or patterns with stimulating interest. While frequent itemset or pattern mining finds the interesting patterns based on the occurrence frequency of a pattern, utility itemset mining (UIM) is a further development in this field. It integrates the aspect of utility in some form like weight, cost, amount, profit or any other factor of interest. Utility mining is thus an objective-oriented approach, which aims to find the patterns with a high utility such as more profit or low cost/side-effect etc. However, utility mining is a complex process than frequency itemset mining (FIM) as anti-monotonicity property does not hold for the itemsets like as in FIM. Many algorithms have been developed to mine the itemsets with utility information in recent years. But most of them are not scalable for the nature of data with which we deal nowadays, called big data. This paper focuses on reviewing the recent advances in the field of high utility pattern mining for large datasets with scalable and parallel processing algorithms. The paper is concluded with open problems and future directions for research in the arena of big data.

References

Mooney, C.H., Roddick, J.F.: Sequential pattern mining approaches and algorithms. ACM Comput. Surv. 45(2), 19:1–19:39 (2013)

Yu, G., Li, K., Shao, S.: Mining high utility itemsets in large high dimensional data. In: First International Workshop on Knowledge Discovery and Data Mining (WKDD), pp. 17–20 (2008). https://doi.org/10.1109/WKDD.2008.64

Subramanian, K., Kandhasamy, P., Subramanian, S.: A novel approach to extract high utility itemsets from distributed databases. Comput. Inform. 31(6+), 1597–1615 (2013)

Szlichta, J., Godfrey, P., Golab, L., Kargar, M., Srivastava, D .: Effective and complete discovery of order dependencies via set-based axiomatization. In: Proceedings of the VLDB Endowment, vol. 10, no. 7, pp. 721–732 (2017)

Zida, S., Fournier-Viger, P., Lin, J.C.W., Wu, C.W., Tseng, V.S.: Efim: a fast and memory efficient algorithm for high-utility itemset mining. Knowl. Inf. Syst. 51(2), 595–625 (2017). https://doi.org/10.1007/s10115-016-0986-0

Shie, B. E., Yu, P.S., Tseng, V. S.: Efficient algorithms for mining maximal high utility itemsets from data streams with different models. Expert Syst. Appl. 39, 12947–12960 (2012)

Chan, R., Yang, Q., Shen, Y.: Mining high-utility itemsets. In: Proceedings of Third IEEE International Conference on Data Mining, pp. 19–26 (2003)

Erwin, A., Gopalan, R. P., Achuthan, N. R.: Efficient mining of high utility itemsets from large datasets, pp. 554–561. Springer, Berlin (2008)

Ahmed, C.F., Tanbeer, S.K., Jeong, B.S., Lee, Y.K.: Efficient tree structures for high-utility pattern mining in incremental databases. IEEE Trans. Knowl. Data Eng. 21, 1708–1721 (2009)

Ahmed, C.F., Tanbeer, S.K., Jeong, B. S.: Interactive mining of high utility patterns over data streams. Expert Syst. Appl. 39, 11979–11991 (2012)

Dawar, S., Sharma, V., Goyal, V. : Mining top-k high-utility itemsets from a data stream under sliding window model. Appl. Intell. 47(4), 1240–1255 (2017)

Cao, L., Zhao, Y., Zhang, H., Luo, D., Zhang, C., Park, E. : Flexible frameworks for actionable knowledge discovery. IEEE Trans. Knowl. Data Eng. 22(9), 1299–1312 (2010)

Downloads

Published

2018-06-30

How to Cite

Dalal, S., & Dahiya, V. (2018). A Survey on Scalable and Parallel High Utility Itemset Mining. Universal Research Reports, 5(5), 88–93. Retrieved from https://urr.shodhsagar.com/index.php/j/article/view/794

Issue

Section

Original Research Article