How to Choose the Best Nearest Neighbors Algorithm
In my previous post [KNN is Dead!], I have compared an ANN algorithm called HNSW with sklearn's KNN and proved that HNSW has vastly superior performance with a 380X speed up while delivering 99.3% of the same results. As a data scientist, I am a huge proponent of making data-driven decisions, as I mentioned in How to Choose the Best Keras Pre-Trained Model. So, in this post, I'll demonstrate a data-driven way to decide which ANN algorithm is the best choice for your custom dataset by using the excellent ann-benchmarks GitHub repository. The ann-benchmarks code compares multiple ANN algorithms by plotting each algorithm's Recall vs Queries per second.
Dec-5-2020, 05:55:04 GMT