Fundamentals of Predictive Text Mining by Sholom M. Weiss Nitin Indurkhya & Tong Zhang

Fundamentals of Predictive Text Mining by Sholom M. Weiss Nitin Indurkhya & Tong Zhang

Author:Sholom M. Weiss, Nitin Indurkhya & Tong Zhang
Language: eng
Format: epub
Publisher: Springer London, London


5.6 Summary

Document collections are frequently encountered without labels. Labels may be determined by clustering the documents into disparate groups and implicitly finding common themes among the document clusters. This chapter describes methods for clustering documents. A key theme for document clustering is computing measures of similarity. We review the major clustering methods: k-means clustering, hierarchical clustering and the EM algorithm. Strategies for assigning meaning to algorithmically generated clusters and labels are considered. Performance evaluation helps determine the empirical characteristics of desirable clusters.



Download



Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.