Accelerating Barnes-Hut t-SNE Algorithm by Efficient Parallelization on Multi-Core CPUs
Chaudhary, Narendra, Pivovar, Alexander, Yakovlev, Pavel, Gorshkov, Andrey, Misra, Sanchit
–arXiv.org Artificial Intelligence
t-SNE remains one of the most popular embedding techniques for visualizing high-dimensional data. Most standard packages of t-SNE, such as scikit-learn, use the Barnes-Hut t-SNE (BH t-SNE) algorithm for large datasets. However, existing CPU implementations of this algorithm are inefficient. In this work, we accelerate the BH t-SNE on CPUs via cache optimizations, SIMD, parallelizing sequential steps, and improving parallelization of multithreaded steps. Our implementation (Acc-t-SNE) is up to 261x and 4x faster than scikit-learn and the state-of-the-art BH t-SNE implementation from daal4py, respectively, on a 32-core Intel(R) Icelake cloud instance.
arXiv.org Artificial Intelligence
Dec-22-2022
- Country:
- North America > United States
- Texas > Travis County
- Austin (0.04)
- New York > New York County
- New York City (0.04)
- Texas > Travis County
- Europe > Netherlands
- South Holland > Delft (0.04)
- North America > United States
- Genre:
- Research Report (0.64)
- Workflow (0.48)
- Industry:
- Health & Medicine (0.50)
- Technology: