K-tree: Large Scale Document Clustering

Jan-6-2010–arXiv.org Artificial Intelligence

We introduce K-tree in an information retrieval context. It is an efficient approximation of the k-means clustering algorithm. Unlike k-means it forms a hierarchy of clusters. It has been extended to address issues with sparse representations. We compare performance and quality to CLUTO using document collections. The K-tree has a low time complexity that is suitable for large document collections. This tree structure allows for efficient disk based implementations where space requirements exceed that of main memory.

artificial intelligence, k-tree, machine learning, (15 more...)

arXiv.org Artificial Intelligence

Jan-6-2010

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Queensland (0.16)
- North America > United States
  - Massachusetts (0.15)

Genre:
- Research Report (0.40)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.96)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found