Top 10 interesting facts about Scikit-Learn

Dhiraj K
2 min readMar 18, 2019

--

Scikit-learn is open source machine learning library used with Python programming language.

Photo by Chris Ried on Unsplash
  1. Scikit-learn was initially developed by David Cournapeau as a Google summer of code project in 2007.
  2. Scikit-learn is mainly written in Python but some core algorithms are written in Cython to achieve performance. Cython is a programming language that is super set of the Python language, that gives C-like performance.
  3. Scikit-learn is built upon the SciPy (Scientific Python) which must be installed before you can use scikit-learn.
  4. Scikit-learn can be used to develop various regression, classification and clustering algorithms.
  5. Scikit-learn implements a range of machine learning, preprocessing, cross-validation and visualization algorithms.
  6. Scikit-learn needs input data to be numeric and stored as numpy arrays or scipy matrices to be accepted to processing.
  7. Scikit-learn provides three Regression Metrics namely Mean Absolute Error, Mean Squared Error, R² Score.
  8. Scikit-learn provides three Classification Metrics namely Accuracy Score, Classification Report, Confusion Matrix.
  9. Scikit-learn provides three Clustering Metrics namely Adjusted rand Index, Homogeneity, V-measure.
  10. Scikit-learn provides the functions to tune the model using grid search and randomized parameter optimization.
Photo by Markus Spiske on Unsplash

--

--

Dhiraj K
Dhiraj K

Written by Dhiraj K

Data Scientist & Machine Learning Evangelist. I love transforming data into impactful solutions and sharing my knowledge through teaching. dhiraj10099@gmail.com

Responses (1)