Machine Learning articles

3/30/2023 • EN

Autoregressive Models, OOD Prompts and the Interpolation Regime

Explores autoregressive models, their relationship to joint distributions, and how they handle out-of-distribution prompts, with insights relevant to LLMs.

Autoregressive Models, Generative Modeling, Inductive Biases, LLM, Machine Learning

Ferenc Huszár

3/27/2023 • EN

Replacing an A/B Test with GPT

Explores using GPT-3 text embeddings and a simple classifier to predict the winner of a headline A/B test, potentially replacing traditional testing.

A/B Testing, GPT-3, LLM, Machine Learning, NLP

Will Kurt

3/27/2023 • EN

Class imbalance: bug or feature?

Explores the concept of class imbalance in machine learning, drawing parallels to medical training and questioning if it's a problem or an inherent feature.

Class Imbalance, Machine Learning, Medical AI, Training Data

Thomas Lumley

3/23/2023 • EN

Keeping Up With AI Research And News

A guide on managing the overwhelming volume of AI/ML research, sharing strategies and tools for prioritizing and staying updated effectively.

Artificial Intelligence, arXiv, Machine Learning, Productivity, Research

Sebastian Raschka

3/22/2023 • EN

We May be Surprised Again: Why I take LLMs seriously.

A reflection on past skepticism of deep learning and why similar dismissal of Large Language Models (LLMs) might be a mistake.

Artificial Intelligence, Deep Learning, LLM, Machine Learning, Statistical Learning Theory

Ferenc Huszár

3/20/2023 • EN

Deploy FLAN-UL2 20B on Amazon SageMaker

A technical guide on deploying Google's FLAN-UL2 20B large language model for real-time inference using Amazon SageMaker and Hugging Face.

Amazon SageMaker, Hugging Face, Inference, Machine Learning, Model Deployment

Philipp Schmid

3/12/2023 • EN

How to Write Data Labeling/Annotation Guidelines

A guide on creating effective data labeling guidelines for machine learning, covering principles like Why, What, and How, with examples from Google and Bing.

Data Annotation, Data Labeling, Guidelines, Machine Learning, Search Relevance

Eugene Yan

3/4/2023 • EN

Linear Regression, the essential theory

Explains the core theory behind linear regression models, a fundamental machine learning algorithm for predicting continuous numerical values.

Linear Regression, Machine Learning, Model Interpretability, Statistics

Stern Semasuka

2/26/2023 • EN

Content Moderation & Fraud Detection - Patterns in Industry

Explores five industry patterns for building robust content moderation and fraud detection systems using ML, including human-in-the-loop and data augmentation.

Anomaly Detection, Content Moderation, Fraud Detection, Machine Learning, Supervised Learning

Eugene Yan

2/8/2023 • EN

Diffusion models; or Yet another way to sample from an arbitrary distribution

A non-expert's humorous exploration of diffusion models as a method for sampling from arbitrary probability distributions, touching on measure transport.

Diffusion Models, Machine Learning, Measure Transport, Probability Distributions, Sampling

Dan Simpson

2/7/2023 • EN

Understanding Large Language Models -- A Transformative Reading List

A curated reading list of key academic papers for understanding the development and architecture of large language models and transformers.

Attention Mechanism, Large Language Models, Machine Learning, Natural Language Processing, Transformers

Sebastian Raschka

1/31/2023 • EN

2022, a new scientific adventure: machine learning for health and social sciences

A retrospective on forming a research team in 2022 to apply machine learning to challenges in health and social sciences, including data management and validation.

Data Science, Health Data, Machine Learning, scikit-learn, Social Sciences

Gael Varoquaux

1/22/2023 • EN

Mechanisms for Effective Machine Learning Projects

Explores practical mechanisms like pilot/copilot roles and literature reviews to improve the success rate of machine learning projects.

Code Review, Data Validation, Machine Learning, Project Management, Team Collaboration

Eugene Yan

1/15/2023 • EN

Training an XGBoost Classifier Using Cloud GPUs Without Worrying About Infrastructure

A guide to training XGBoost models on cloud GPUs using the Lightning AI framework, bypassing complex infrastructure setup.

Cloud Computing, GPU, Lightning AI, Machine Learning, XGBoost

Sebastian Raschka

1/15/2023 • EN

AI adoption: is it obvious yet?

Analyzes common pitfalls in AI adoption, arguing that technical and product maturity models can hinder practical implementation.

AI Adoption, Artificial Intelligence, ChatGPT, Machine Learning, MLOps

Neal Lathia

1/5/2023 • EN

Open Source Highlights 2022 for Machine Learning & AI

A curated list of the top 10 open-source machine learning and AI projects released or updated in 2022, including PyTorch 2.0 and scikit-learn 1.2.

Deep Learning, Machine Learning, Neural Networks, Open Source, PyTorch

Sebastian Raschka
