Jin, HuidongHuidongJinProf. LEUNG Kwong SakWong, Man-LeungMan-LeungWong2023-03-292023-03-292002Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2002, vol. 2412, pp. 569 - 575978-354044025-303029743http://hdl.handle.net/20.500.11861/7646In this paper, we propose EMACF (Expectation- Maximization Algorithm for Clustering Features) to generate clusters from data summaries rather than data items directly. Incorporating with an adaptive grid-based data summarization procedure, we establish a scalable clustering algorithm: gEMACF. The experimental results show that gEMACF can generate more accurate results than other scalable clustering algorithms. The experimental results also indicate that gEMACF can run two order of magnitude faster than the traditional expectation-maximization algorithm with little loss of accuracy. © Springer-Verlag Berlin Heidelberg 2002.enCluster AlgorithmMixture ModelData ItemGaussian Mixture ModelCluster AccuracyScaling-up model-based clustering algorithm by working on clustering featuresConference Paper10.1007/3-540-45675-9_86