AITopics | Clustering

Clustering

Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of exploratory data mining, and a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, bioinformatics, data compression, and computer graphics. (Wikipedia)

Localized Data Fusion for Kernel k-Means Clustering with Application to Cancer Biology

Mehmet Gönen, Adam A. Margolin

Neural Information Processing Systems, Oct-2-2025, 21:51:16 GMT

In many modern applications, for example in bioinformatics and computer vision, samples have multiple feature representations coming from different data sources. Multiview learning algorithms try to exploit all of this available information to obtain a better learner in such scenarios. In this paper, we propose a novel multiple kernel learning algorithm that extends kernel k-means clustering to the multiview setting and combines the kernels calculated on the views in a localized way to better capture sample-specific characteristics of the data. We demonstrate the better performance of our localized data fusion approach on a human colon and rectal cancer data set by clustering patients. Our method finds more relevant prognostic patient groups than global data fusion methods when we evaluate the results with respect to three commonly used clinical biomarkers.
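To make the idea of localized kernel combination concrete, here is a minimal sketch. The abstract does not state the combination formula, so the elementwise form used here (each view's kernel entry scaled by the weights of both samples involved) is an assumption, and the names `localized_kernel`, `theta`, and the toy matrices are hypothetical. The weights are fixed by hand rather than learned.

```python
from typing import List

Matrix = List[List[float]]


def localized_kernel(kernels: List[Matrix], theta: List[List[float]]) -> Matrix:
    """Combine per-view kernels using sample-specific weights.

    kernels[m][i][j] is the kernel value between samples i and j in view m.
    theta[i][m] is the (assumed, hand-picked) weight of view m for sample i.
    """
    n = len(theta)
    combined = [[0.0] * n for _ in range(n)]
    for m, K in enumerate(kernels):
        for i in range(n):
            for j in range(n):
                # Entry (i, j) of view m is scaled by both samples' weights.
                combined[i][j] += theta[i][m] * K[i][j] * theta[j][m]
    return combined


# Two toy views over three samples (made-up numbers for illustration).
K1 = [[1.0, 0.5, 0.0], [0.5, 1.0, 0.0], [0.0, 0.0, 1.0]]
K2 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.8], [0.0, 0.8, 1.0]]

# Global weighting: every sample shares the same view weights.
global_theta = [[0.5, 0.5] for _ in range(3)]
# Localized weighting: sample 0 relies on view 1, sample 2 on view 2.
local_theta = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]

K_global = localized_kernel([K1, K2], global_theta)
K_local = localized_kernel([K1, K2], local_theta)
```

When every sample shares the same weights, this form reduces to an ordinary weighted sum of the view kernels, which is the global setting the abstract contrasts against. The combined matrix would then be passed to a kernel k-means solver.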

algorithm, biomarker, kernel, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > Oregon > Multnomah County > Portland (0.04)
Europe > Denmark (0.04)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Data Science > Data Integration (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems, Oct-2-2025, 21:51:16 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. This paper generalizes multi-kernel k-means clustering (solved via relaxation) to the case where each clustered item (here, a person) gets an item-specific set of weights over the multiple kernels, rather than the traditional, shared, global weighting of the kernels. Using TCGA (cancer) data with 3 modalities, the authors demonstrate that this generalization yields better clusterings than the traditional (global) approach when measured against 3 bronze-standard clusterings arising from known clinical clusters. The writing is clear, making for an easy read. Although this is a somewhat incremental-seeming tweak, I think it was clever, with the potential to actually be used (rather than lost in the NIPS archives), and therefore of some significance. Other comments: In the introduction you mention that k-means is susceptible to local minima, and then use this to motivate the relaxation approach.

algorithm, biomarker, experiment, (12 more...)

Country: North America > Canada > Quebec > Montreal (0.04)

Genre: Summary/Review (0.35)

Industry:

Health & Medicine > Therapeutic Area > Oncology (0.92)
Health & Medicine > Pharmaceuticals & Biotechnology (0.75)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.51)

k-Means Clustering of Lines for Big Data

Yair Marom, Dan Feldman

Neural Information Processing Systems, Oct-2-2025, 20:35:56 GMT

A very common heuristic for solving this problem is Lloyd's algorithm [
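For readers unfamiliar with Lloyd's algorithm, the following is a minimal sketch of the classic version for points, not the line-clustering variant this paper studies. It alternates between assigning each point to its nearest center and moving each center to the mean of its assigned points. The function name `lloyd`, the seeded random initialization, and the toy data are assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple

Point = Tuple[float, ...]


def squared_distance(a: Point, b: Point) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def lloyd(points: Sequence[Point], k: int, iterations: int = 50,
          seed: int = 0) -> Tuple[List[Point], List[int]]:
    """Classic Lloyd iteration for k-means on points (a heuristic, not exact)."""
    rng = random.Random(seed)
    # Initialize centers with k distinct input points chosen at random.
    centers: List[Point] = rng.sample(list(points), k)
    labels: List[int] = []
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest center.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centers[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centers are too
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centers, labels


toy_points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
              (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
toy_centers, toy_labels = lloyd(toy_points, k=2)
```

Lloyd's algorithm only guarantees convergence to a local optimum, so the result depends on the initialization. On well-separated toy data like this, it recovers the two obvious groups.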

artificial intelligence, coreset, machine learning, (16 more...)

Country: Asia > Middle East > Israel (0.15)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.64)

SMYRF: Efficient Attention using Asymmetric Clustering

Neural Information Processing Systems, Oct-2-2025, 20:06:30 GMT

We propose a novel type of balanced clustering algorithm to approximate attention.
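The general idea of approximating attention with balanced clusters can be sketched as follows. This is not the SMYRF algorithm: the summary above does not describe how clusters are formed, so this sketch assumes a stand-in scheme that sorts queries and keys by a fixed one-dimensional projection and cuts them into equal-size groups. All names (`balanced_clusters`, `clustered_attention`, `direction`) are hypothetical, and the number of vectors is assumed to be divisible by the number of clusters.

```python
import math
from typing import List

Vector = List[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def softmax(scores: List[float]) -> List[float]:
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def balanced_clusters(vectors: List[Vector], direction: Vector,
                      num_clusters: int) -> List[List[int]]:
    """Sort indices by a 1-D projection and cut into equal-size groups."""
    order = sorted(range(len(vectors)), key=lambda i: dot(vectors[i], direction))
    size = len(vectors) // num_clusters  # assumes exact divisibility
    return [order[c * size:(c + 1) * size] for c in range(num_clusters)]


def clustered_attention(queries: List[Vector], keys: List[Vector],
                        values: List[Vector], direction: Vector,
                        num_clusters: int) -> List[Vector]:
    """Each query attends only to the keys in its matching cluster."""
    query_groups = balanced_clusters(queries, direction, num_clusters)
    key_groups = balanced_clusters(keys, direction, num_clusters)
    dim = len(values[0])
    output: List[Vector] = [[0.0] * dim for _ in queries]
    for q_idx, k_idx in zip(query_groups, key_groups):
        for i in q_idx:
            weights = softmax([dot(queries[i], keys[j]) for j in k_idx])
            output[i] = [
                sum(w * values[j][d] for w, j in zip(weights, k_idx))
                for d in range(dim)
            ]
    return output


toy_q = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0], [0.0, 2.0]]
toy_k = [[1.0, 0.0], [0.0, 1.0], [2.0, 0.0], [0.0, 2.0]]
toy_v = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0], [3.0, 1.0]]
approx = clustered_attention(toy_q, toy_k, toy_v, direction=[1.0, -1.0], num_clusters=2)
```

Because every cluster has the same size, each query computes scores against a fixed number of keys rather than all of them. With a single cluster the sketch reduces to ordinary full softmax attention.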

artificial intelligence, machine learning, natural language, (19 more...)

Country: North America > United States > Minnesota (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
(2 more...)

Dependent nonparametric trees for dynamic hierarchical clustering

Kumar Avinava Dubey, Qirong Ho, Sinead A. Williamson, Eric P. Xing

Neural Information Processing Systems, Oct-2-2025, 17:33:44 GMT

Neural Information Processing Systems http://nips.cc/

dependent nonparametric tree

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.40)

Export Reviews, Discussions, Author Feedback and Meta-Reviews

Neural Information Processing Systems, Oct-2-2025, 17:12:24 GMT

First provide a summary of the paper, and then address the following criteria: Quality, clarity, originality and significance. In this paper the authors propose a novel bi-clustering approach based on a message passing algorithm. The motivation is that most current bi-clustering techniques overcome the computational difficulty of the problem by performing greedy local optimisations. In this paper, the authors propose to overcome this by defining a global likelihood function (eq 1) and then maximising an approximation to this function via message passing. Quality: this is a high quality paper.

algorithm, application, iteration, (11 more...)

Country: North America > Canada > Quebec > Montreal (0.05)

Genre: Summary/Review (0.30)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.50)

Looking Beyond Single Images for Contrastive Semantic Segmentation Learning - Supplementary Material - 1 Additional results 1.1 Controlled experiment on auxiliary label generation

Neural Information Processing Systems, Oct-2-2025, 16:06:48 GMT

Table 1 reports the results of a controlled experiment evaluating different components in our framework for auxiliary label generation. Positive correspondences are generated by matching pixels across different augmentations of the same image. With respect to the clustering algorithm, K-means performs better than DBSCAN (#4 vs. #5), which is [...] We show qualitative results comparing different feature extractors in Figure 1. DBSCAN is limited by its memory and computational complexity. Corresponding qualitative results are shown in Figure 3. Tables 3-5 show [...] We observe the best performance when 5% of outliers are removed.

auxiliary label, experiment, feature extractor, (13 more...)

Genre:

Research Report > Strength High (1.00)
Research Report > Experimental Study (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.52)

Strongly local p-norm-cut algorithms for semi-supervised learning and local graph clustering

Neural Information Processing Systems, Oct-2-2025, 15:58:25 GMT

For this problem, we propose a novel generalization of random walk, diffusion, or smooth function methods in the literature to a convex p-norm cut function.

artificial intelligence, inductive learning, machine learning, (14 more...)

Country: North America > United States (0.69)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology (0.46)

Technology: