AITopics | clustergram

Collaborating Authors

clustergram

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Clustergam: visualisation of cluster analysis – Martin Fleischmann

#artificialintelligenceMay-20-2021, 23:45:15 GMT

In this post, I introduce a new Python package to generate clustergrams from clustering solutions. The library has been developed as part of the Urban Grammar research project, and it is compatible with scikit-learn and GPU-enabled libraries such as cuML or cuDF within RAPIDS.AI. When we want to do some cluster analysis to identify groups in our data, we often use algorithms like K-Means, which require the specification of a number of clusters. But the issue is that we usually don't know how many clusters there are. There are many methods on how to determine the correct number, like silhouettes or elbow plot, to name a few.

artificial intelligence, clustergram, machine learning, (16 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.95)

Add feedback

Generalized Clustergrams for Overlapping Biclusters

Badea, Liviu (National Institute for Research in Informatics)

AAAI ConferencesJun-23-2009

Many real-life datasets, such as those produced by gene expression studies, exhibit complex substructures at various levels of granularity and thus do not have unique well-defined numbers of clusters. In such cases, it is important to be able to trace the evolution of the individual clusters as the number of dimensions of the clustering is varied. While the dendrograms produced by bottom-up clustering methods such as hierarchical clustering are very useful for this purpose, the approach is known to produce unreliable clusters due to its instability w.r.t. resampling. Moreover, hierarchical clustering does not apply to overlapping (bi)clusters, such as those obtained in gene expression studies. On the other hand, the instability w.r.t. the initialization of top-down methods, such as k-means, prevents the comparison between clusters obtained at different dimensionalities. In this paper, we present a method for constructing generalized dendrograms for overlapping biclusters, which depict the evolution of the biclusters as their number is varied. An essential ingredient is a stable biclustering method based on positive tensor factorization of a number of nonnegative matrix factorization runs. We apply our approach to a large colon cancer dataset, which shows several distinct subclasses whose dimensional evolution must be carefully analyzed to enable a more meaningful biological interpretation and sub-classification.

artificial intelligence, factorization, machine learning, (17 more...)

AAAI Conferences

Twenty-First International Joint Conference on Artificial Intelligence

Industry: Health & Medicine > Therapeutic Area > Oncology > Colorectal Cancer (0.52)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback