AITopics | Clustering

Collaborating Authors

Clustering

Cluster analysis or clustering is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense) to each other than to those in other groups (clusters). It is a main task of exploratory data mining, and a common technique for statistical data analysis, used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, bioinformatics, data compression, and computer graphics. (Wikipedia)

News Overviews Instructional Materials AI-Alerts Classics

Customer Segmentation using K-Means Clustering

#artificialintelligenceJan-17-2022, 09:42:48 GMT

Originally published on Towards AI the World's Leading AI and Technology News and Media Company. If you are building an AI-related product or service, we invite you to consider becoming an AI sponsor. At Towards AI, we help scale AI and technology startups. Let us help you unleash your technology to the masses. "The aim of marketing is to know and understand the customer so well the product or service fits him and sells itself".

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.40)

Add feedback

Tk-merge: Computationally Efficient Robust Clustering Under General Assumptions

Insolia, Luca, Perrotta, Domenico

arXiv.org Machine LearningJan-17-2022

We address general-shaped clustering problems under very weak parametric assumptions with a two-step hybrid robust clustering algorithm based on trimmed k-means and hierarchical agglomeration. The algorithm has low computational complexity and effectively identifies the clusters also in presence of data contamination. We also present natural generalizations of the approach as well as an adaptive procedure to estimate the amount of contamination in a data-driven fashion. Our proposal outperforms state-of-the-art robust, model-based methods in our numerical simulations and real-world applications related to color quantization for image analysis, human mobility patterns based on GPS data, biomedical images of diabetic retinopathy, and functional data across weather stations.

contamination, partition, tclust, (16 more...)

arXiv.org Machine Learning

2201.06391

Country:

Europe > Italy (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (0.48)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Doing More with Less: Overcoming Data Scarcity for POI Recommendation via Cross-Region Transfer

Gupta, Vinayak, Bedathur, Srikanta

arXiv.org Artificial IntelligenceJan-16-2022

Variability in social app usage across regions results in a high skew of the quantity and the quality of check-in data collected, which in turn is a challenge for effective location recommender systems. In this paper, we present Axolotl (Automated cross Location-network Transfer Learning), a novel method aimed at transferring location preference models learned in a data-rich region to significantly boost the quality of recommendations in a data-scarce region. Axolotl predominantly deploys two channels for information transfer, (1) a meta-learning based procedure learned using location recommendation as well as social predictions, and (2) a lightweight unsupervised cluster-based transfer across users and locations with similar preferences. Both of these work together synergistically to achieve improved accuracy of recommendations in data-scarce regions without any prerequisite of overlapping users and with minimal fine-tuning. We build Axolotl on top of a twin graph-attention neural network model used for capturing the user- and location-conditioned influences in a user-mobility graph for each region. We conduct extensive experiments on 12 user mobility datasets across the U.S., Japan, and Germany, using 3 as source regions and 9 of them (that have much sparsely recorded mobility data) as target regions. Empirically, we show that Axolotl achieves up to 18% better recommendation performance than the existing state-of-the-art methods across all metrics.

axolotl, recommendation, target region, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3511711

2201.06095

Country:

North America > United States > California (0.04)
Asia > India > NCT > Delhi (0.04)
Europe > Germany > North Rhine-Westphalia (0.04)
(8 more...)

Genre: Research Report > Promising Solution (1.00)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Effective and Efficient Graph Learning for Multi-view Clustering

Gao, Quanxue, Xia, Wei, Gao, Xinbo, Zhang, Xiangdong, Li, Qin, Tao, Dacheng

arXiv.org Artificial IntelligenceJan-14-2022

Despite the impressive clustering performance and efficiency in characterizing both the relationship between data and cluster structure, existing graph-based multi-view clustering methods still have the following drawbacks. They suffer from the expensive time burden due to both the construction of graphs and eigen-decomposition of Laplacian matrix, and fail to explore the cluster structure of large-scale data. Moreover, they require a post-processing to get the final clustering, resulting in suboptimal performance. Furthermore, rank of the learned view-consensus graph cannot approximate the target rank. In this paper, drawing the inspiration from the bipartite graph, we propose an effective and efficient graph learning model for multi-view clustering. Specifically, our method exploits the view-similar between graphs of different views by the minimization of tensor Schatten p-norm, which well characterizes both the spatial structure and complementary information embedded in graphs of different views. We learn view-consensus graph with adaptively weighted strategy and connectivity constraint such that the connected components indicates clusters directly. Our proposed algorithm is time-economical and obtains the stable results and scales well with the data size. Extensive experimental results indicate that our method is superior to state-of-the-art methods.

artificial intelligence, graph, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TPAMI.2022.3187976

2108.06734

Country:

North America > United States (0.14)
Asia > China > Chongqing Province > Chongqing (0.04)
Asia > China > Shaanxi Province > Xi'an (0.04)
(5 more...)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.66)

Add feedback

A Survey of Opponent Modeling in Adversarial Domains

Nashed, Samer | Zilberstein, Shlomo (UMass Amherst)

Journal of Artificial Intelligence ResearchJan-14-2022

Opponent modeling is the ability to use prior knowledge and observations in order to predict the behavior of an opponent. This survey presents a comprehensive overview of existing opponent modeling techniques for adversarial domains, many of which must address stochastic, continuous, or concurrent actions, and sparse, partially observable payoff structures. We discuss all the components of opponent modeling systems, including feature extraction, learning algorithms, and strategy abstractions. These discussions lead us to propose a new form of analysis for describing and predicting the evolution of game states over time. We then introduce a new framework that facilitates method comparison, analyze a representative selection of techniques using the proposed framework, and highlight common trends among recently proposed methods. Finally, we list several open problems and discuss future research directions inspired by AI research on opponent modeling and related research in other disciplines.

artificial intelligence, opponent, proceedings, (13 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.1.12889

AI Access Foundation

12889

Journal of Artificial Intelligence Research

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
Asia > Japan > Honshū > Tōhoku > Fukushima Prefecture > Fukushima (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
(9 more...)

Genre: Overview (1.00)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Modeling & Simulation (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Robots > Soccer Robots (1.00)
(12 more...)

Add feedback

The Mathematics of Comparing Objects

Weber, Marcus, Fackeldey, Konstantin

arXiv.org Artificial IntelligenceJan-14-2022

After reading two different crime stories, an artificial intelligence concludes that in both stories the police has found the murderer just by random.

algorithm, characteristic, mathematics, (14 more...)

arXiv.org Artificial Intelligence

2201.07032

Country:

Europe > Germany > Berlin (0.04)
Europe > Germany > Lower Saxony > Gottingen (0.04)

Genre: Research Report (0.82)

Industry:

Media > Film (0.93)
Leisure & Entertainment (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.67)

Add feedback

Probabilistic spatial clustering based on the Self Discipline Learning (SDL) model of autonomous learning

Gu, Zecang, Sun, Xiaoqi, Sun, Yuan, Zhang, Fuquan

arXiv.org Artificial IntelligenceJan-14-2022

Unsupervised clustering algorithm can effectively reduce the dimension of high-dimensional unlabeled data, thus reducing the time and space complexity of data processing. However, the traditional clustering algorithm needs to set the upper bound of the number of categories in advance, and the deep learning clustering algorithm will fall into the problem of local optimum. In order to solve these problems, a probabilistic spatial clustering algorithm based on the Self Discipline Learning(SDL) model is proposed. The algorithm is based on the Gaussian probability distribution of the probability space distance between vectors, and uses the probability scale and maximum probability value of the probability space distance as the distance measurement judgment, and then determines the category of each sample according to the distribution characteristics of the data set itself. The algorithm is tested in Laboratory for Intelligent and Safe Automobiles(LISA) traffic light data set, the accuracy rate is 99.03%, the recall rate is 91%, and the effect is achieved.

algorithm, feature vector, probability space, (9 more...)

arXiv.org Artificial Intelligence

2201.03449

Country:

Asia > Middle East > Jordan (0.04)
Asia > Japan (0.04)
Asia > China > Sichuan Province (0.04)

Genre: Research Report (0.50)

Industry:

Transportation > Ground > Road (0.77)
Transportation > Infrastructure & Services (0.57)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Clustering : The Craft of Segmentation

#artificialintelligenceJan-13-2022, 22:50:13 GMT

Clustering is the unsupervised learning process of segmenting observations, or dataset into number of groups such that data points have more similarity within the groups than those to the data points in other groups. It is one of the most frequently used analytical process. It helps tremendously in finding of homogeneous groups across the dataset. Clustering algorithms uses different measures to check similarity in order to cluster the observations. The type of similarity measure plays an important role in the final cluster formation.

centroid, clustering, hierarchical clustering, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Add feedback

Manifold learning via quantum dynamics

Kumar, Akshat, Sarovar, Mohan

arXiv.org Machine LearningJan-12-2022

We introduce an algorithm for computing geodesics on sampled manifolds that relies on simulation of quantum dynamics on a graph embedding of the sampled data. Our approach exploits classic results in semiclassical analysis and the quantum-classical correspondence, and forms a basis for techniques to learn the manifold from which a dataset is sampled, and subsequently for nonlinear dimensionality reduction of high-dimensional datasets. We illustrate the new algorithm with data sampled from model manifolds and also by a clustering demonstration based on COVID-19 mobility data. Finally, our method reveals interesting connections between the discretization provided by data sampling and quantization.

coherent state, manifold, operator, (17 more...)

arXiv.org Machine Learning

2112.11161

Country: