Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » Malaysian Journals
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top Journal Cited
 » Top Article Cited
 » Journal Citation Statistics
 » Usage Since Sept 2007


 
 
 

Login | Create Account

Performance Testing on Two Methods of Mining Frequent Patterns from Dense Data

Norwati Mustapha, and Md Nasir Sulaiman, and Mohamed Othman, and Mohd Hasan Selamat, (2004) Performance Testing on Two Methods of Mining Frequent Patterns from Dense Data. In: Proceedings of the Joint Conference on Informatics and Research on Women in ICT (RWICT) 2004 , 28 - 30 July 2004, Putra World Trade Center, Kuala Lumpur, Malaysia .

Full text not available from this repository.

Affiliations

Universiti Putra Malaysia, Faculty of Computer Science and Information Technology
Universiti Putra Malaysia, Faculty of Computer Science and Information Technology
Universiti Putra Malaysia, Faculty of Computer Science and Information Technology
Universiti Putra Malaysia, Faculty of Computer Science and Information Technology

Abstract

Many algorithms of mining frequent patterns were developed to tackle datasets primarily from the domain of market-basket analysis, where the frequent patterns are very short There are some interests in applying these algorithms to other domains such as census data and telecommunication data that very dense. These kinds of datasets contain very long frequent patterns and very correlated data. This study is experimentally examined the two existing methods, Apriori and DIC for mining frequent patterns on dense data. The both methods are implemented and compared their performances of enumerating frequent patterns. An extensive experimental evaluation on d number of datasets shows that Apriori is outperformed DIC on the high support but DIC beat out Apriori for the lower support on the dense data. Strengths and weaknesses of these two methods are also discussed.

Item Type:Conference or Workshop Item (Paper)
Keywords:Data mining, frequent patterns, association rules
Subjects:Q Science, Computer Science
ID Code:1112

[1] Agrawal, R., Imielinski, T. and Swami, A., "Mining Association Rules between Sets of Items in Very Large Databases", Proceedings of the ACM SIGMOD Conference on Management of Data, 1993, pp. 207-216.

[2] Agrawal, R. and Srikant, R., "Fast Algorithms for Mining Association Rules", Proceedings of the 20'*' International Conference on Vety Large Databases (VLDB'94), 1994, pp. 487-499.

[3] Brin, S., Motwani, R., Ullman, J. D. and Tsur, S., "Dynamic Itemset Counting and Implication Rules for Market Basket Data", Proceedings of ACM SIGMOD. 1997, pp. 255-264.

[4] Manilla, H., Toivonen H. and Verkamo, A.L., "Efficient Algorithms for Discovering Association Rules", AAAI Workshop on Knowledge Discovery in Databases,1994,pp.181-92.

[5] Park, J.S., Chen, M.S. and Yu, P.S., "Using a Hash-based Methods with Transaction Trimming for Mining Association Rules", IEEE Transaction of Knowledge and Data Engineering 1997, Vol. 9, No. 5, pp. 813-825.

[6] Savasere, A., Omiecinski, E. and Navathe, S., "An Efficient Algorithm for Mining Association Rules in Large Databases", Proceedings of the International Conference on Very Large Databases (VLDB'95), 1995, pp. 432-443.

[7] Toivonen, H., "Sampling Large Databases for Association Rules", Proceedings of the 22"** International Conference on Very Large Databases, Bombay, India, 1996.

[8] Murphy, P.M., Repository of Machine Learning and Domain Theories. URL://http://www.ics.uci.edu/~mleam/MLRepository.html.

[9] Chan, M., Han, J. and Yu, P.S., "Data Mining: An Overview from Database Perspective", IEEE Transactions on Knowledge and Data Engineering, 1996,Vol. 8, pp. 866-833.

[10] Agrawal, R., Imielinski, T. and Swami, A., "Database Mining: A Performance Perspective". IEEE Transactions on Knowledge and Data Engineering 1993, Vol. 5, pp. 914-925.

[II] Srikant, R., Vu, Q. and Agrawal, R., "Mining Association Rules with It item. Constraints", in Proc. of the Third International Conf. On Knowledge Discovery in Databases and Data Mining, 1997, pp. 67-73.

[12] Bayardo, R.J., "Efficiently Mining Long Patterns From Databases", ACM SIGMOD Conference Proceedings, 1998, pp. 85-93.

Repository Staff Only: item control page