Analyze the Novel Apriori Algorithm for Frequent Pattern Generation and Improving the Performance by Generating Powersets for Unique Records

Authors

DOI:

https://doi.org/10.26713/cma.v14i3.2375

Keywords:

Apriori, Frequent pattern

Abstract

Knowledge discovery is the course of absorbing facts and relations from immense amounts of data. Algorithms like Apriori let an experimenter locate the hidden pattern within a dataset. However, numerous applications do not utilize the Apriori technique because it takes a prolonged time to discover the frequent itemset. If the largest frequent itemset with length \(k\) exists, the algorithm accomplishes a \(k\) scan to deal with the time-consuming difficulty caused by the \(k\) number of scans. Rajeswari and Vaithiyanathan (A novel method for frequent pattern mining, International Journal of Engineering and Technology 5(3) (2013), 2150 - 2154) devised a Novel algorithm for frequent pattern mining that uses a single scan to discover a frequent itemset of length \(k\) by constructing a subset of transactions. In this work, the Novel algorithm's performance is embellished by generating powersets for unique records. Powerset generation is a costly operation that takes exponential time to compute. So, avoidance of unnecessary computation results in performance enhancement. To accomplish used two mechanisms to improve the performance of the Novel method viz Transaction Subset Plus Cache (TSPC) and Transaction Subset Without Duplicate Record (TSWDR). Finally, performance analysis is done between four algorithms Apriori for frequent pattern generation, Novel method for frequent pattern generation, TSPC and TSWDR.

Downloads

Download data is not yet available.

References

R. Agrawal and R. Srikant, Fast algorithms for mining association rules, in: Proceedings of the 20th International Conference on Very Large Data Bases (VLDB’94), Vol. 1215, 1994, pp. 487 – 499, URL: https://www.vldb.org/conf/1994/P487.PDF.

S. Chai, J. Yang and Y. Cheng, The research of improved Apriori algorithm for mining association rules, in: Proceedings of 2007 International Conference on Service Systems and Service Management, Chengdu, China, 2007, pp. 1 – 4, DOI: 10.1109/ICSSSM.2007.4280173.

Y. Cong, Research on data association rules mining method based on improved Apriori algorithm, in: Proceedings of 2020 International Conference on Big Data & Artificial Intelligence & Software Engineering (ICBASE), Bangkok, Thailand, 2020, pp. 373 – 376, DOI: 10.1109/ICBASE51474.2020.00085.

M. Dehghani, A. Kamandi, M. Shabankhah and A. Moeini, toward a distinguishing approach for improving the Apriori algorithm, in: Proceedings of 2019 9th International Conference on Computer and Knowledge Engineering (ICCKE), Mashhad, Iran, 2019, pp. 309 – 314, DOI: 10.1109/ICCKE48569.2019.8965206.

J. Du, X. Zhang, H. Zhang and L. Chen, Research and improvement of Apriori algorithm, in: Proceedings of 2016 Sixth International Conference on Information Science and Technology (ICIST), Dalian, China, 2016, pp. 117 – 121, DOI: 10.1109/ICIST.2016.7483396.

Y. Jia, G. Xia, H. Fan, Q. Zhang and X. Li, An improved Apriori algorithm based on association analysis, in: Proceedings of 2012 Third International Conference on Networking and Distributed Computing, Hangzhou, China, 2012, pp. 208 – 211, DOI: 10.1109/ICNDC.2012.56.

N. Karimtabar and M. J. S. Fard, An extension of the apriori algorithm for finding frequent items, in: Proceedings of 2020 6th International Conference on Web Research (ICWR), Tehran, Iran, 2020, pp. 330 – 334, DOI: 10.1109/ICWR49608.2020.9122282.

J. Liu and M. Cong, The optimization of apriori algorithm based on array and its application in the analysis of insurance clients, in: Proceedings of 2015 Forth International Conference on e-Technologies and Networks for Development (ICeND), Lodz, Poland, 2015, pp. 1 – 4, DOI: 10.1109/ICeND.2015.7328539.

S. Nayak, M. K. Gourisaria, M. Pandey and S. S. Rautaray, Prediction of heart disease by mining frequent items and classification techniques, in: Proceedings of 2019 International Conference on Intelligent Computing and Control Systems (ICCS), Madurai, India, 2019, pp. 607 – 611, DOI: 10.1109/ICCS45141.2019.9065805.

P. R. Puneeth and K. P. Rao, A comparative study on Apriori and reverse apriori in generation of frequent item set, in: Proceedings of 2019 1st International Conference on Advances in Information Technology (ICAIT), Chikmagalur, India, 2019, pp. 337 – 341, DOI: 10.1109/ICAIT47043.2019.8987430.

K. Rajeswari and V. Vaithiyanathan, A novel method for frequent pattern mining, International Journal of Engineering and Technology 5(3) (2013), 2150 – 2154, URL: https://www.enggjournals.com/ijet/docs/IJET13-05-03-031.pdf.

V. Vaithiyanathan, K. Rajeswari, R. Phalnikar and S. Tonge, Improved apriori algorithm based on selection criterion, in: Proceedings of 2012 IEEE International Conference on Computational Intelligence and Computing Research, Coimbatore, India, 2012, pp. 1 – 4, DOI: 10.1109/ICCIC.2012.6510229.

C. Yadav, S. Wang and M. Kumar, An approach to improve apriori algorithm based on association rule mining, in: Proceedings of 2013 Fourth International Conference on Computing, Communications and Networking Technologies (ICCCNT), Tiruchengode, India, 2013, pp. 1 – 9, DOI: 10.1109/ICCCNT.2013.6726678.

Q. Yang, Q. Fu, C. Wang and J. Yang, A matrix-based Apriori algorithm improvement, in: Proceedings of 2018 IEEE Third International Conference on Data Science in Cyberspace (DSC), Guangzhou, China, 2018, pp. 824 – 828, DOI: 10.1109/DSC.2018.00132.

J. Yang, Z. Li, W. Xiang and L. Xiao, An improved Apriori algorithm based on features, in: Proceedings of 2013 Ninth International Conference on Computational Intelligence and Security, Emeishan, China, 2013, pp. 125 – 128, DOI: 10.1109/CIS.2013.33.

Downloads

Published

18-10-2023
CITATION

How to Cite

Lahade, S., Ali, A., More, R., Mahajan, S., Rajeswari, K., Vispute, S., Kharat, R., & Vivekandandan, N. (2023). Analyze the Novel Apriori Algorithm for Frequent Pattern Generation and Improving the Performance by Generating Powersets for Unique Records. Communications in Mathematics and Applications, 14(3), 1143–1152. https://doi.org/10.26713/cma.v14i3.2375

Issue

Section

Research Article