Julien Jerphanion 12/17/2021

Performance and scikit-learn (2/4)

Read Original

This technical article examines the performance limitations of the exact k-nearest neighbors (k-NN) search in scikit-learn. It details how the current implementation's high-level parallelization with joblib leads to inefficient CPU cache usage and poor hardware scalability. The post promises a follow-up discussing the design of a new, more scalable implementation to address these issues.

Performance and scikit-learn (2/4)

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser

Top of the Week

1
The Beautiful Web
Jens Oliver Meiert 2 votes
3
LLM Use in the Python Source Code
Miguel Grinberg 1 votes
4
Wagon’s algorithm in Python
John D. Cook 1 votes