AITopics | claran

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
Europe > Switzerland > Vaud > Lausanne (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Finland > North Karelia > Joensuu (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Neural Information Processing SystemsNov-21-2025, 15:56:28 GMT

K-Medoids For K-Means Seeding

We show experimentally that the algorithm CLARANS of Ng and Han (1994) finds better K-medoids solutions than the Voronoi iteration algorithm of Hastie et al. (2001). This finding, along with the similarity between the Voronoi iteration algorithm and Lloyd's K-means algorithm, motivates us to use CLARANS as a K-means initializer. We show that CLARANS outperforms other algorithms on 23/23 datasets with a mean decrease over k-means++ of 30% for initialization mean squared error (MSE) and 3% for final MSE. We introduce algorithmic improvements to CLARANS which improve its complexity and runtime, making it a viable initialization scheme for large datasets.

k-means seeding, k-medoid, name change, (4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

James Newling, François Fleuret

K-Medoids For K-Means Seeding

Neural Information Processing SystemsOct-4-2024, 01:38:06 GMT

We show experimentally that the algorithm clarans of Ng and Han (1994) finds better K-medoids solutions than the Voronoi iteration algorithm of Hastie et al. (2001). This finding, along with the similarity between the Voronoi iteration algorithm and Lloyd's K-means algorithm, motivates us to use clarans as a K-means initializer. We show that clarans outperforms other algorithms on 23/23 datasets with a mean decrease over k-means-++ (Arthur and Vassilvitskii, 2007) of 30% for initialization mean squared error (MSE) and 3% for final MSE. We introduce algorithmic improvements to clarans which improve its complexity and runtime, making it a viable initialization scheme for large datasets.

algorithm, claran, initialization, (16 more...)

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > Switzerland > Vaud > Lausanne (0.04)
(4 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

#artificialintelligenceJan-29-2021, 20:31:16 GMT

Understanding Core Data Science Algorithms: K-Means and K-Medoids Clustering - DZone Big Data

Clustering is one of the major techniques used for statistical data analysis. As the term suggests, "clustering" is defined as the process of gathering similar objects into different groups or distribution of datasets into subsets with a defined distance measure. K-means clustering is touted as a foundational algorithm every data scientist ought to have in their toolbox. K-means and k-medoids are methods used in partitional clustering algorithms whose functionality works based on specifying an initial number of groups or, more precisely, iteratively by reallocation of objects among groups. The algorithm works by first segregating all the points into an already selected number of clusters.

algorithm, clustering, euclidean distance, (11 more...)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Schubert, Erich, Rousseeuw, Peter J.

Fast and Eager k-Medoids Clustering: O(k) Runtime Improvement of the PAM, CLARA, and CLARANS Algorithms

arXiv.org Artificial IntelligenceAug-12-2020

Clustering non-Euclidean data is difficult, and one of the most used algorithms besides hierarchical clustering is the popular algorithm Partitioning Around Medoids (PAM), also simply referred to as k-medoids clustering. In Euclidean geometry the mean-as used in k-means-is a good estimator for the cluster center, but this does not exist for arbitrary dissimilarities. PAM uses the medoid instead, the object with the smallest dissimilarity to all others in the cluster. This notion of centrality can be used with any (dis-)similarity, and thus is of high relevance to many domains and applications. A key issue with PAM is its high run time cost. We propose modifications to the PAM algorithm that achieve an O(k)-fold speedup in the second ("SWAP") phase of the algorithm, but will still find the same results as the original PAM algorithm. If we relax the choice of swaps performed (while retaining comparable quality), we can further accelerate the algorithm by eagerly performing additional swaps in each iteration. With the substantially faster SWAP, we can now explore faster initialization strategies, because (i) the classic ("BUILD") initialization now becomes the bottleneck, and (ii) our swap is fast enough to compensate for worse starting conditions. We also show how the CLARA and CLARANS algorithms benefit from the proposed modifications. While we do not study the parallelization of our approach in this work, it can easily be combined with earlier approaches to use PAM and CLARA on big data (some of which use PAM as a subroutine, hence can immediately benefit from these improvements), where the performance with high k becomes increasingly important. In experiments on real data with k=100,200, we observed a 458x respectively 1191x speedup compared to the original PAM SWAP algorithm, making PAM applicable to larger data sets, and in particular to higher k.

artificial intelligence, data mining, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2008.05171

Country:

Europe > Belgium > Flanders > Flemish Brabant > Leuven (0.04)
North America > United States (0.04)
Europe > Germany > North Rhine-Westphalia > Arnsberg Region > Dortmund (0.04)
Asia (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)

Newling, James, Fleuret, François

K-Medoids For K-Means Seeding

Neural Information Processing SystemsFeb-14-2020, 16:57:51 GMT

We show experimentally that the algorithm CLARANS of Ng and Han (1994) finds better K-medoids solutions than the Voronoi iteration algorithm of Hastie et al. (2001). This finding, along with the similarity between the Voronoi iteration algorithm and Lloyd's K-means algorithm, motivates us to use CLARANS as a K-means initializer. We show that CLARANS outperforms other algorithms on 23/23 datasets with a mean decrease over k-means of 30% for initialization mean squared error (MSE) and 3% for final MSE. We introduce algorithmic improvements to CLARANS which improve its complexity and runtime, making it a viable initialization scheme for large datasets. Papers published at the Neural Information Processing Systems Conference.

k-means seeding, k-medoid, voronoi iteration algorithm, (2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

#artificialintelligenceNov-2-2018, 03:18:29 GMT

Artificial intelligence program trained to recognise galaxies

This artificial intelligence program, named ClaRAN, has the ability to scan images taken by radio telescopes. With the responsibility to identify radio galaxies, galaxies that emit powerful radio jets from supermassive black holes at their centres, ClaRAN is the brainchild of big data specialist Dr Chen Wu and astronomer Dr Ivy Wong, both from The University of Western Australia in partnership with the International Centre for Radio Astronomy Research (ICRAR). Wong explains: "These supermassive black holes occasionally burp out jets that can be seen with a radio telescope." "Over time, the jets can stretch a long way from their host galaxies, making it difficult for traditional computer programs to figure out where the galaxy is." "That's what we're trying to teach ClaRAN to do." Describing the origin of the artificial intelligence program, Dr Wu discusses how ClaRAN grew out of an open source version of Microsoft and Facebook's object detection software. The program was completely overhauled and trained to recognise galaxies instead of people.

artificial intelligence, galaxy, social media, (8 more...)

Country: Oceania > Australia > Western Australia (0.27)

Technology:

Information Technology > Artificial Intelligence (1.00)
Information Technology > Communications > Social Media (0.82)

#artificialintelligenceNov-1-2018, 11:17:30 GMT

Artificial intelligence bot trained to recognize galaxies

Researchers have taught an artificial intelligence program used to recognise faces on Facebook to identify galaxies in deep space. The result is an AI bot named ClaRAN that scans images taken by radio telescopes. Its job is to spot radio galaxies - galaxies that emit powerful radio jets from supermassive black holes at their centres. ClaRAN is the brainchild of big data specialist Dr Chen Wu and astronomer Dr Ivy Wong, both from The University of Western Australia node of the International Centre for Radio Astronomy Research (ICRAR). Dr Wong said black holes are found at the centre of most, if not all, galaxies.

artificial intelligence, galaxy, social media, (12 more...)

Country: Oceania > Australia > Western Australia (0.26)

Industry: Information Technology > Services (0.37)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceNov-1-2018, 11:16:20 GMT

AI bot "ClaRAN" can spot radio galaxy too. – TechGraph

An artificial intelligence (AI) programme used to recognize faces on Facebook can also identify galaxies in deep space, scientists said Wednesday. The AI bot named ClaRAN scans images taken by radio telescopes, said researchers from the International Centre for Radio Astronomy Research (ICRAR) in Australia. Its job is to spot radio galaxies -- galaxies that emit powerful radio jets from supermassive black holes at their centers, according to the research published in the journal Monthly Notices of the Royal Astronomical Society. Black holes are found at the center of most, if not all, galaxies. "These supermassive black holes occasionally burp out jets that can be seen with a radio telescope," said Ivy Wong from The University of Western Australia node of the International Centre for Radio Astronomy Research (ICRAR).

artificial intelligence, galaxy, social media, (12 more...)

Country: Oceania > Australia > Western Australia (0.27)

Genre: Research Report (0.95)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

#artificialintelligenceNov-1-2018, 11:15:58 GMT

Artificial intelligence bot trained to recognize galaxies

Researchers have taught an artificial intelligence program used to recognise faces on Facebook to identify galaxies in deep space. The result is an AI bot named ClaRAN that scans images taken by radio telescopes. Its job is to spot radio galaxies--galaxies that emit powerful radio jets from supermassive black holes at their centres. ClaRAN is the brainchild of big data specialist Dr. Chen Wu and astronomer Dr. Ivy Wong, both from The University of Western Australia node of the International Centre for Radio Astronomy Research (ICRAR). Dr. Wong said black holes are found at the centre of most, if not all, galaxies.

artificial intelligence, galaxy, social media, (13 more...)

Country: Oceania > Australia > Western Australia (0.26)

Industry: Information Technology > Services (0.37)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)