Machine Learning articles

4/22/2024 • EN

Running Python on a serverless GPU instance for machine learning inference

A guide to running Python code on serverless GPU instances using Modal.com for faster machine learning inference, demonstrated with a speech-to-text example.

Gpu Inference Machine Learning Modal serverless

Saeed Esmaili

4/20/2024 • EN

Using and Finetuning Pretrained Transformers

Explores methods for using and finetuning pretrained large language models, including feature-based approaches and parameter updates.

ai Finetuning large language models Machine Learning Transformers

Sebastian Raschka

4/12/2024 • EN

Diffusion Models for Video Generation

Explores the application of diffusion models to video generation, covering technical challenges, parameterization, and sampling methods.

Deep Learning Diffusion Models generative ai Machine Learning Video Generation

Lilian Weng

4/10/2024 • EN

Should you discretize continuous features for Machine Learning? 🤖

Explores the pros and cons of discretizing continuous features in machine learning, with a practical guide using scikit-learn's KBinsDiscretizer.

Data Preprocessing Discretization Feature Engineering Machine Learning Scikit Learn

Kevin Markham

4/8/2024 • EN

Building, Serving, and Operating a Recommendation System Using AWS SageMaker

A developer's journey building a TV show recommendation engine using AWS SageMaker, from data collection to model deployment.

ai AWS Sagemaker cloud computing Machine Learning Recommendation System

Mavrick Laakso

4/3/2024 • EN

Exam AI-900: Microsoft Azure AI Fundamentals Study Guide 2024

A study guide for the Microsoft AI-900 Azure AI Fundamentals exam, covering AI workloads, machine learning, and generative AI.

AI 900 artificial intelligence Azure AI generative ai Machine Learning

Hugo Barona

3/28/2024 • EN

6 New books added to Big Book of R

Announces the addition of 6 new R programming books to the Big Book of R collection, covering statistics, machine learning, and data science.

Data Science Feature Engineering Machine Learning R Programming statistics

Oscar Baruffa

3/14/2024 • EN

What I learned from looking at 900 most popular open source AI tools

An analysis of 900 popular open-source AI tools, categorizing them into infrastructure, model development, and application layers.

ai tools Foundation Models github Machine Learning open source

Chip Huyen

3/6/2024 • EN

My inputs in February

A monthly tech digest covering Meta's DotSlash tool, AI-powered code reviews, AWS Lambda scaling, observability trends, and Cloudflare's logging pipeline.

AWS Lambda cloudflare code review Machine Learning observability

Gaspare Vitta

2/25/2024 • EN

Don't Mock Machine Learning Models In Unit Tests

Explains why mocking ML models in unit tests is problematic and offers guidelines for effectively testing machine learning code.

Machine Learning Model Testing Python software engineering unit testing

Eugene Yan

2/20/2024 • EN

There Is a Huge Gap in Generative AI

Explores the gap between generative AI's perceived quality in open-ended play and its practical effectiveness for specific, goal-oriented tasks.

ai development generative ai llm Machine Learning software engineering

Saeed Esmaili

2/5/2024 • EN

Thinking about High-Quality Human Data

Explores the importance of high-quality human-annotated data for training AI models, covering task design, rater selection, and the wisdom of the crowd.

Data Quality Human Annotation LLM Alignment Machine Learning Rlhf

Lilian Weng

1/16/2024 • EN

Generation configurations: temperature, top-k, top-p, and test time compute

Explains key AI model generation parameters like temperature, top-k, and top-p, and how they control output creativity and consistency.

Machine Learning sampling Temperature Top K Top P

Chip Huyen

12/24/2023 • EN

Push Notifications: What to Push, What Not to Push, and How Often

Analyzes push notifications as a recommender system, discussing intent, personalization, timeliness, and user engagement challenges.

Machine Learning Personalization Push Notifications Recommender Systems user engagement

Eugene Yan

12/1/2023 • EN

Revelations and Innovations: Unveiling the Spectacular Journey of re:Invent 2023 - Part 2

A recap of key announcements from the second half of AWS re:Invent 2023, focusing on new AI/ML services and management tools.

Amazon Bedrock aws cloud computing generative ai Machine Learning

Konstantinos Bessas

12/1/2023 • EN

Ideas for AI-Driven CI/CD

Explores how AI and LLMs can enhance CI/CD pipelines by predicting test failures, generating tests, enabling intelligent rollbacks, and detecting anomalies.

ai ci/cd DevOps Machine Learning test automation

Tigran Hakobyan

11/27/2023 • EN

People underestimate how impactful Scikit-learn continues to be

Scikit-learn remains a dominant and impactful machine learning library, especially for classic ML and tabular data, despite the hype around deep learning.

Gradient Boosting Machine Learning Scikit Learn tabular data Tree Based Models

Gael Varoquaux