Soft Clustering Criterion Functions for Partitional Document Clustering
2004-05-26
Loading...
View/Download File
Persistent link to this item
Statistics
View StatisticsJournal Title
Journal ISSN
Volume Title
Title
Soft Clustering Criterion Functions for Partitional Document Clustering
Alternative title
Authors
Published Date
2004-05-26
Publisher
Type
Report
Abstract
Recently published studies have shown that partitional clustering algorithms that optimize certain criterion functions, which measure key aspects of inter- and intra-cluster similarity, are very effective in producing hard clustering solutions for document datasets and outperform traditional partitional and agglomerative algorithms. In this paper we study the extent to which these criterion functions can be modified to include soft membership functions and whether or not the resulting soft clustering algorithms can further improve the clustering solutions. Specifically, we focus on four of these hard criterion functions, derive their soft-clustering extensions, present a comprehensive experimental evaluation involving twelve different datasets, and analyze their overall characteristics. Our results show that introducing softness into the criterion functions tends to lead to better clustering results for most datasets.
Keywords
Description
Related to
Replaces
License
Series/Report Number
Technical Report; 04-022
Funding information
Isbn identifier
Doi identifier
Previously Published Citation
Other identifiers
Suggested citation
Zhao, Ying; Karypis, George. (2004). Soft Clustering Criterion Functions for Partitional Document Clustering. Retrieved from the University Digital Conservancy, https://hdl.handle.net/11299/215616.
Content distributed via the University Digital Conservancy may be subject to additional license and use restrictions applied by the depositor. By using these files, users agree to the Terms of Use. Materials in the UDC may contain content that is disturbing and/or harmful. For more information, please see our statement on harmful content in digital repositories.