WIT Press


Spectral Clustering And Community Detection In Document Networks

Price

Free (open access)

Volume

42

Pages

10

Page Range

41 - 50

Published

2009

Size

368 kb

Paper DOI

10.2495/DATA090051

Copyright

WIT Press

Author(s)

C. K. dos Santos, A. G. Evsukoff & B. S. L. P. de Lima

Abstract

Document clustering is one of the most active research topics in text mining. In this work two approaches issued from very different fields are explored for document clustering: spectral clustering and community detection in complex networks. Both approaches are based on a representation of the document collection as a graph, of which the nodes represent the documents and the edges represent the similarities between each pair of documents, such that the two approaches have many issues in common. The results of the application of these two types of techniques to benchmark text mining problems show that they are complementary and are useful for finding structure in large collections of documents Keywords: text mining, document clustering, spectral clustering, community detection, complex networks, modularity.

Keywords

text mining, document clustering, spectral clustering, community detection, complex networks, modularity.