Machine Learning articles

10/5/2014 • EN

Naive Bayes and Text Classification

Explores Naive Bayes classifiers for text classification, covering theory and applications like spam filtering and song lyric analysis.

Machine Learning Naive Bayes Spam Filtering Supervised Learning Text Classification

Sebastian Raschka

9/14/2014 • EN

Kernel tricks and nonlinear dimensionality reduction via RBF kernel PCA

A guide to performing nonlinear dimensionality reduction using RBF Kernel PCA, including theory, implementation, and examples.

dimensionality reduction Kernel Pca Kernel Trick Machine Learning Radial Basis Function

Sebastian Raschka

8/25/2014 • EN

Predictive modeling, supervised machine learning, and pattern classification

An overview of predictive modeling, supervised machine learning, and the core workflow for pattern classification tasks.

Machine Learning Pattern Classification Predictive Modeling Supervised Learning

Sebastian Raschka

8/25/2014 • EN

Predictive modeling, supervised machine learning, and pattern classification

An overview of predictive modeling, supervised machine learning, and pattern classification concepts, workflows, and applications.

classification Machine Learning Pattern Classification Predictive Modeling Supervised Learning

Sebastian Raschka

8/3/2014 • EN

Linear Discriminant Analysis

A technical guide to Linear Discriminant Analysis (LDA) for dimensionality reduction and classification in machine learning, with comparisons to PCA.

classification dimensionality reduction Linear Discriminant Analysis Machine Learning Scikit Learn

Sebastian Raschka

8/3/2014 • EN

Linear Discriminant Analysis

A technical guide to Linear Discriminant Analysis (LDA) for dimensionality reduction and classification in machine learning, including a Python implementation.

classification dimensionality reduction Feature Extraction Machine Learning Scikit Learn

Sebastian Raschka

7/15/2014 • EN

Scikit-learn 0.15 release: highlights

Highlights of the scikit-learn 0.15 release, including performance improvements, new features, and deprecations.

Machine Learning performance optimization Python Release Notes Scikit Learn

Gael Varoquaux

7/11/2014 • EN

About Feature Scaling and Normalization

Explains feature scaling and normalization in machine learning, comparing standardization and Min-Max scaling, with examples using scikit-learn.

Feature Scaling Machine Learning Normalization pca Scikit Learn

Sebastian Raschka

7/11/2014 • EN

About Feature Scaling and Normalization

A guide to feature scaling and normalization in machine learning, covering standardization, Min-Max scaling, and their implementation in scikit-learn.

Feature Scaling Machine Learning Normalization pca Scikit Learn

Sebastian Raschka

6/27/2014 • EN

Entry Point Data

A Python tutorial covering essential tools and techniques for machine learning, including data visualization, PCA, LDA, and classification.

Anaconda data analysis Machine Learning Python Scikit Learn

Sebastian Raschka

6/27/2014 • EN

Entry Point Data

A tutorial on using Python tools for machine learning, covering data loading, visualization, preprocessing, and classification with scikit-learn.

Anaconda data analysis Machine Learning Python Scikit Learn

Sebastian Raschka

6/10/2014 • EN

Getting to iHub: The Letter

A blog post sharing the author's cover letter for an internship at iHub Research, focusing on their interest in automating hate speech detection using AI and NLP.

artificial intelligence Cover Letter Internship Machine Learning Natural Language Processing

Young Kenyan

5/8/2014 • EN

Personas, data science, k-means

Explores how personas, data science, and k-means clustering can be used together to analyze user data and gain actionable business insights.

Clustering Data Science K Means Machine Learning Personas

Craig Kerstiens

4/23/2014 • EN

Google summer of code projects for scikit-learn

Announcing the four students accepted for Google Summer of Code 2024 to work on scikit-learn projects, including neural networks and performance improvements.

Google Summer Of Code Machine Learning Neural Networks open source Scikit Learn

Gael Varoquaux

4/13/2014 • EN

Implementing a Principal Component Analysis (PCA)

A technical guide to implementing Principal Component Analysis (PCA) for dimensionality reduction, comparing it with MDA and providing code examples.

dimensionality reduction Machine Learning Matplotlib pca Scikit Learn

Sebastian Raschka

12/13/2013 • EN

PCA is not a panacea

An author critiques the overuse of PCA in data science, arguing it's not a universal solution for classification problems.

classification dimensionality reduction Machine Learning matrix factorization pca

Dan Luu

11/14/2013 • EN

Stochastic Outlier Selection

Introduces Stochastic Outlier Selection (SOS), an unsupervised machine learning algorithm for detecting outliers based on affinity between data points.

Machine Learning Outlier Detection Python Stochastic Outlier Selection Unsupervised Learning

Jeroen Janssens