Deep Learning articles

9/14/2018 • EN

Variational Autoencoders Explained

A technical explanation of Variational Autoencoders (VAEs), covering their theory, latent space, and how they generate new data.

Deep Learning Generative Models Machine Learning Probability Density Function Variational Autoencoder

Yoel Zeldes

8/12/2018 • EN

From Autoencoder to Beta-VAE

Explores the evolution from basic Autoencoders to Beta-VAE, covering their architecture, mathematical notation, and applications in dimensionality reduction.

Beta-VAE Deep Learning Generative Models Neural Networks Variational Autoencoder

Lilian Weng

8/6/2018 • EN

Neural Networks from a Bayesian Perspective

Explores Bayesian methods for quantifying uncertainty in deep neural networks, moving beyond single-point weight estimates.

Bayesian Statistics Deep Learning Maximum A Posteriori Estimation Neural Networks Uncertainty Estimation

Yoel Zeldes

7/30/2018 • EN

OMSCS CS7642 (Reinforcement Learning) Review and Tips

A review and tips for Georgia Tech's OMSCS CS7642 Reinforcement Learning course, covering workload, projects, and key learnings.

Deep Learning Machine Learning OMSCS Python Reinforcement Learning

Eugene Yan

6/24/2018 • EN

Attention? Attention!

Explains the attention mechanism in deep learning, its motivation from human perception, and its role in improving seq2seq models and in architectures such as the Transformer.

Attention Mechanism Deep Learning Machine Learning Neural Networks Transformer

Lilian Weng

6/18/2018 • EN

Deep Learning: Theory & Practice

Highlights from a deep learning conference covering optimization algorithms' impact on generalization and human-in-the-loop efficiency.

Deep Learning Generalization Machine Learning Neural Networks Optimization

Yoel Zeldes

6/14/2018 • EN

The Hitchhiker's Guide to Hyperparameter Tuning

A practical guide to implementing a hyperparameter tuning script for machine learning models, based on real-world experience from Taboola's engineering team.

Deep Learning Hyperparameter Tuning Machine Learning Neural Networks scikit-learn

Yoel Zeldes

4/8/2018 • EN

Policy Gradient Algorithms

A comprehensive overview of policy gradient algorithms in reinforcement learning, covering key concepts, notations, and various methods.

Algorithms Deep Learning Machine Learning Policy Gradient Reinforcement Learning

Lilian Weng

3/22/2018 • EN

Gated Multimodal Units for Information Fusion

Explains the Gated Multimodal Unit (GMU), a deep learning architecture for intelligently fusing data from different sources like images and text.

Attention Mechanism Deep Learning Multimodal Fusion Neural Networks TensorFlow

Yoel Zeldes

2/19/2018 • EN

A (Long) Peek into Reinforcement Learning

An introductory guide to Reinforcement Learning (RL), covering key concepts, algorithms like SARSA and Q-learning, and its role in AI breakthroughs.

Artificial Intelligence Deep Learning Machine Learning Q-Learning Reinforcement Learning

Lilian Weng

12/31/2017 • EN

Object Detection for Dummies Part 3: R-CNN Family

Explores the R-CNN family of models for object detection, covering R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN with technical details.

CNN Computer Vision Deep Learning Object Detection R-CNN

Lilian Weng

12/17/2017 • EN

Training Sequence Models with Attention

Practical tips for training sequence-to-sequence models with attention, focusing on debugging and ensuring the model learns to condition on input.

Attention Mechanism Deep Learning Language Model Neural Networks Sequence To Sequence

Awni Hannun

12/15/2017 • EN

Object Detection for Dummies Part 2: CNN, DPM and Overfeat

Explores classic CNN architectures for image classification, including AlexNet, VGG, and ResNet, as foundational models for object detection.

CNN Computer Vision Convolutional Neural Networks Deep Learning Object Detection

Lilian Weng

12/4/2017 • EN

The Last 5 Years In Deep Learning

A retrospective on the transformative impact of deep learning over the past five years, covering its rise, key applications, and future potential.

AI Computer Vision Deep Learning Machine Learning Neural Networks

Adit Deshpande

11/15/2017 • EN

After PyData Warsaw 2017

A recap of PyData Warsaw 2017, covering key talks, new package announcements, and analytics on the conference's international attendees.

Data Science Deep Learning Machine Learning PyData Python

Piotr Migdał

10/11/2017 • EN

Speech Recognition Is Not Solved

Argues that speech recognition hasn't reached human-level performance, highlighting persistent challenges with accents, noise, and semantic errors.

Accent Recognition ASR Deep Learning Speech Recognition Word Error Rate

Awni Hannun

9/28/2017 • EN

Anatomize Deep Learning with Information Theory

Explores applying information theory, specifically the Information Bottleneck method, to analyze training phases and learning bounds in deep neural networks.

Deep Learning Information Bottleneck Information Theory Neural Networks Training Dynamics

Lilian Weng

8/20/2017 • EN

From GAN to WGAN

Explains the math behind GANs, their training challenges, and introduces WGAN as a solution for improved stability.

Deep Learning GAN Generative Adversarial Networks Machine Learning WGAN

Lilian Weng

8/17/2017 • EN

PyTorch or TensorFlow?

A comparison of PyTorch and TensorFlow deep learning frameworks, focusing on programmability, flexibility, and ease of use for different project scales.

Deep Learning Machine Learning Frameworks Neural Networks PyTorch TensorFlow

Awni Hannun

8/1/2017 • EN

How to Explain the Prediction of a Machine Learning Model?

Explores the importance of interpreting ML model predictions, especially in regulated fields, and reviews interpretable models such as linear regression.

Deep Learning Ethics Explainable AI Machine Learning Model Interpretability

Lilian Weng