Scalable model-based clustering for large databases based on data summarization

Jin, Huidong; Wong, Man-Leung; Prof. LEUNG Kwong Sak

Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7597

DC Field	Value	Language
dc.contributor.author	Jin, Huidong	en_US
dc.contributor.author	Wong, Man-Leung	en_US
dc.contributor.author	Prof. LEUNG Kwong Sak	en_US
dc.date.accessioned	2023-03-27T03:15:26Z	-
dc.date.available	2023-03-27T03:15:26Z	-
dc.date.issued	2005	-
dc.identifier.citation	IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, vol. 27 (11), pp. 1710 - 1719	en_US
dc.identifier.issn	01628828	-
dc.identifier.uri	http://hdl.handle.net/20.500.11861/7597	-
dc.description.abstract	The scalability problem in data mining involves the development of methods for handling large databases with limited computational resources such as memory and computation time. In this paper, two scalable clustering algorithms, bEMADS and gEMADS, are presented based on the Gaussian mixture model. Both summarize data into subclusters and then generate Gaussian mixtures from their data summaries. Their core algorithm, EMADS, is defined on data summaries and approximates the aggregate behavior of each subcluster of data under the Gaussian mixture model. EMADS is provably convergent. Experimental results substantiate that both algorithms can run several orders of magnitude faster than expectation-maximization with little loss of accuracy © 2005 IEEE.	en_US
dc.language.iso	en	en_US
dc.relation.ispartof	IEEE Transactions on Pattern Analysis and Machine Intelligence	en_US
dc.title	Scalable model-based clustering for large databases based on data summarization	en_US
dc.type	Peer Reviewed Journal Article	en_US
dc.identifier.doi	10.1109/TPAMI.2005.226	-
item.fulltext	No Fulltext	-
crisitem.author.dept	Department of Applied Data Science	-
Appears in Collections:	Applied Data Science - Publication

Find@HKSYU

Show simple item record

SCOPUS^TM
Citations

35

checked on Jul 6, 2025

Page view(s)

55

Last Week
0

Last month

checked on Jul 10, 2025

Google Scholar^TM

Impact Indices

SCOPUS^TM
Citations

Page view(s)

Google Scholar^TM

Altmetric

PlumX
Metrics

Publisher copyright policies & self-archiving

SCOPUSTM Citations

Page view(s)

Google ScholarTM

Altmetric

PlumX Metrics

SCOPUS^TM
Citations

Google Scholar^TM

PlumX
Metrics