Accelerating Barnes-Hut t-SNE Algorithm by Efficient Parallelization on Multi-Core CPUs

Narendra Chaudhary, Alexander Pivovar, Pavel Yakovlev, Andrey Gorshkov, Sanchit Misra

Dec-22-2022, arXiv.org Artificial Intelligence

t-SNE remains one of the most popular embedding techniques for visualizing high-dimensional data. Most standard t-SNE packages, such as scikit-learn, use the Barnes-Hut t-SNE (BH t-SNE) algorithm for large datasets. However, existing CPU implementations of this algorithm are inefficient. In this work, we accelerate BH t-SNE on CPUs via cache optimizations, SIMD vectorization, parallelization of previously sequential steps, and improved parallelization of the multithreaded steps. Our implementation (Acc-t-SNE) is up to 261x faster than scikit-learn and up to 4x faster than the state-of-the-art BH t-SNE implementation from daal4py on a 32-core Intel(R) Icelake cloud instance.
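For readers unfamiliar with the Barnes-Hut step that the abstract refers to, the following is a minimal sketch of the general idea, not the Acc-t-SNE implementation. It assumes a 2-D embedding inside the unit square and approximates the repulsive normalization term Z_i = sum over j != i of 1 / (1 + ||y_i - y_j||^2) by summarizing distant quadtree cells with their center of mass. All names (`Node`, `build_tree`, `approx_z`, `exact_z`) and the parameter `theta` are illustrative assumptions.

```python
class Node:
    """Square quadtree cell that tracks its point count and coordinate sums."""

    def __init__(self, cx, cy, half):
        self.cx, self.cy, self.half = cx, cy, half
        self.n = 0                # number of points inside this cell
        self.sx = self.sy = 0.0   # coordinate sums, used for the center of mass
        self.points = []          # populated only while the cell is a leaf
        self.children = None

    def _child(self, p):
        # Child order matches _split: index = (x >= cx) + 2 * (y >= cy).
        return self.children[(p[0] >= self.cx) + 2 * (p[1] >= self.cy)]

    def _split(self):
        h = self.half / 2
        self.children = [Node(self.cx + dx * h, self.cy + dy * h, h)
                         for dy in (-1, 1) for dx in (-1, 1)]

    def insert(self, p):
        self.n += 1
        self.sx += p[0]
        self.sy += p[1]
        if self.children is None:
            # Stay a leaf while empty; tiny cells just collect near-duplicates.
            if not self.points or self.half < 1e-9:
                self.points.append(p)
                return
            self._split()
            old, self.points = self.points, []
            for q in old:
                self._child(q).insert(q)
        self._child(p).insert(p)


def build_tree(points):
    """Build a quadtree over points assumed to lie in the unit square."""
    root = Node(0.5, 0.5, 0.5)
    for p in points:
        root.insert(p)
    return root


def approx_z(node, y, theta):
    """Approximate sum over j != i of 1 / (1 + ||y - y_j||^2)."""
    if node.n == 0:
        return 0.0
    if node.children is None:
        # Leaf: exact contributions, skipping the query point itself
        # (identity comparison, so y must be an element of the input list).
        return sum(1.0 / (1.0 + (y[0] - q[0]) ** 2 + (y[1] - q[1]) ** 2)
                   for q in node.points if q is not y)
    mx, my = node.sx / node.n, node.sy / node.n
    d2 = (y[0] - mx) ** 2 + (y[1] - my) ** 2
    # Barnes-Hut criterion: cell width / distance to center of mass < theta.
    if (2 * node.half) ** 2 < theta ** 2 * d2:
        return node.n / (1.0 + d2)   # whole cell treated as one summary point
    return sum(approx_z(c, y, theta) for c in node.children)


def exact_z(points, y):
    """Brute-force reference over all points other than y."""
    return sum(1.0 / (1.0 + (y[0] - q[0]) ** 2 + (y[1] - q[1]) ** 2)
               for q in points if q is not y)
```

With `theta = 0` no cell is ever summarized, so the traversal reduces to the exact sum; larger values trade accuracy for fewer visited cells. This sketch is sequential and scalar, so it shows only the approximation itself and none of the cache, SIMD, or multithreading optimizations the paper describes.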



