Author, Subjects, Keywords

Cited Author

 

 
   » By Author or Editor
 » Browse Author by Alphabet
 » By Journal
 » By Subjects
 » By Affiliations
 » By Type
 » By Year
 » By Latest Additions
 
 
   » By Author
 » Top 20 Authors
 » Top 20 Article
 » Top 20 Journal Cited
 » Top 20 Cited
 » Top 20 Author Cited
 » Usage Since Sept 2007


 
 
 

Login | Create Account

Clustering of CPU Usage Data in Grid Environment using Evoc Algorithm

Yong, Chan Huah and Ee, Kee Sim and Fazilah, Haron (2006) Clustering of CPU Usage Data in Grid Environment using Evoc Algorithm. Malaysian Journal of Computer Science, 19 (2). pp. 105-116. ISSN 0127-9084

Full text not available from this repository.

Official URL: http://mjcs.fsktm.um.edu.my/detail.asp?AID=391

Affiliations

Universiti Sains Malaysia
Universiti Sains Malaysia
Universiti Sains Malaysia

Abstract

Clustering is a process of organizing objects into groups whose members are similar in some way. It is one of the data mining techniques is an unsupervised learning. In a grid environment, the number of computing nodes and users may reach up to thousands or millions. The grid is said to be dynamic in that the behaviors and values of these resources change all the time. Hence, these data are not suitable to be processed in an off-line mode. The existing clustering techniques today however emphasize more on the data’s behaviors categorization but not the data’s stability. Furthermore, the normal clustering techniques are more suitable to be used for static data type in an off-line mode. This paper addresses these issues by presenting an Evolving Clustering (Evoc Algorithm) which is an improved version of Evolving Clustering Method (ECM). We apply both methods on CPU usage to identify computers behaviors. The algorithm has been evaluated using three main criteria; that is dynamicity, accuracy and the ability to identify the stable cluster members. Our results show the improvements of the algorithm to process the data in an on-line mode in the evaluation of the algorithm’s dynamicity and accuracy criteria compare to other existing clustering techniques. Furthermore, the stability evaluation was a success where we were able to identify the stable cluster members from the filtered stable clusters. However, the result was highly affected by three factors namely threshold value, stability value and stability hour.

Item Type:Journal
Keywords:Clustering, Grid, Data Stability, Evolving Clustering (Evoc), Evolving Clustering Method (ECM)
Subjects:Q Science
ID Code:326

I. Foster and C. Kesselman., The Grid: Blueprint for the New Computing Infrastructure, Morgan Kaufman Publishers, Inc., 1998.

M.S Chen, J.Han, and P.S. Yu., Data Mining: An Overview from database perspective, IEEE Transactions on Knowledge and Data Eng., 1996. pp. 866-883.

A.K. Jain, M.N. Murty and P.J. Flynn, Data Clustering: A Review, ACM Computing Surveys, Vol. 31, No. 3, Sept 1999. pp. 264-323.

Peter A. Dinda, “The Statistical Properties of Host Load”, Proceedings of the Fourth Workshop on Languages, Compilers, 1998.

S. Guha, R. Rastogi, and K. Shim, “CURE: An efficient clustering algorithm for large databases”, In Proceedings of ACM SIGMOD International Conference on Management of Data, New York, 1998, pp. 73-84.

KARYPIS, G., HAN, E.-H., and KUMAR, V. CHAMELEON: A hierarchical clustering algorithm using dynamic modeling, COMPUTER, 1999. pp. 32, 68-75.

L. Kaufman and P.J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis, John Wiley and Sons, 1990.

A.K. Jain and R.C. Dubes, Algorithms for Clustering Data, Prentice Hall, 1988.

Raymond T. Ng and Jiawei Han, Member, IEEE Computer Society, CLARANS: A Method for Clustering Objects for Spatial Data Mining, IEEE Transactions on Knowledge and Data Engineering, Vol. 14, No. 5, September / October 2002. pp. 1003 – 1016.

Jeffrey Heer, Ed H. Chi., “Mining the Structure of User Activity using Cluster Stability”, In Proceedings of the Web Analytics Workshop, SIAM Conference on Data Mining, 2002.

Ben-Hur, A., Elisseff, A., and Guyon, I., “A Stability Based Method for Discovering Structure in Clustered Data”, in Proceedings of the Pacific Symposium on Biocomputing (PSB2002), Kaua’I, HI, January 2002.

Qun Song, Nikola Kasabov, “Dynamic Evolving Neuro-Fuzzy Inference System (DENFIS): On-line Learning and Application for Time-Series Prediction”, Proc. 6th International Conference on Soft Computing, Iizuka, Fukuoka, Japan, 2000, pp. 696-701.

Nikola Kasabov, Qun Song, DENFIS: dynamic evolving neural-fuzzy inference system and its application for time-series prediction, Fuzzy Systems, IEEE Transactions on Volume 10, Issue 2, April 2002. pp. 144–154

Ooi Boon Yaik, Chan Huah Yong, Fazilah Haron, “CPU Usage Pattern Discovery Using Suffix Tree. 2nd International Conference on Distributed Frameworks for Multimedia Applications”, Distributed Frameworks for Multimedia Application 2006, IEEE. Penang, Malaysia. May 15-17, 2006, pp. 14-21.

Rich Wolski, Spring, N. and Hayes, J., “Predicting the CPU Availability of Time-shared Unix Systems on the Computational Grid”, Proceedings of 8tj High-performance Distributed Computing Systems Conference, August, 1999.

Repository Staff Only: item control page