Active clustering for labeling training data

Apr-25-2026, 16:55:48 GMT–Neural Information Processing Systems

We also algorithm family, propose as a conjecture that they reach the minimum average items and analyze their complexity. In the second model, we analyze a specific the algorithms that minimize the average number of queries required to cluster the independently following a fixed distribution. In the first model, we characterize they form is drawn uniformly, the other one where each item chooses its class items, we consider two random models for the classes: one where the set partition classes (which can be labeled cheaply at the very end of the process). Given the cheap task of answering pairwise queries, and the computer groups the items into for training data gathering where the human experts perform the comparatively to see whether they belong to the same class. Thus motivated, we propose a setting determining the correct labels is much more expensive than comparing two items most practical cases rely on humans-in-the-loop to label the data. The process of has a high impact on the performance of the learned function.

algorithm, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Apr-25-2026, 16:55:48 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom > England (0.28)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Duplicate Docs Excel Report

Title
47841cc9e552bd5c40164db7073b817b-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found