Computer vision articles

4/3/2021 • EN

How To Solve Sudoku Using Azure Form Recognizer

A tutorial on building a Sudoku solver application using Azure Form Recognizer AI, .NET backend, and Angular frontend.

.NET Angular Azure Form Recognizer computer vision Sudoku Solver

3/24/2021 • EN

GeoGuessing with Deep Learning

Exploring how deep learning and a pre-trained geolocation model can be used to automate and improve performance in the GeoGuessr geographic discovery game.

computer vision Deep Learning Geolocation Machine Learning Selenium

Andrew Healey

2/26/2021 • EN

Why your phone’s portrait mode fakes the blur

Explains the physics and optics behind why smartphone portrait mode uses artificial blur instead of true optical depth of field.

artificial intelligence computer vision image processing Photography Portrait Mode

Surma

2/11/2021 • EN

Datasets for Machine Learning and Deep Learning

A curated list of public dataset repositories for machine learning and deep learning projects, including sources for computer vision, NLP, and more.

computer vision Data Repositories Datasets Deep Learning Machine Learning

Sebastian Raschka

4/23/2020 • EN

Rotoscoping: Hollywood’s video data segmentation?

Explores video segmentation techniques like rotoscoping and green screens used in Hollywood VFX, comparing them to modern AI models like DeepLab v3+.

computer vision Data Annotation Deep Learning VFX Video Segmentation

Igor Susmelj

3/9/2020 • EN

Optical Character Reader Using Angular And Azure Computer Vision

A tutorial on building an OCR application using Angular for the frontend and Azure Computer Vision API for text extraction from images.

Angular ASP.NET Core Azure computer vision OCR

Ankit Sharma

2/28/2020 • EN

List of Data Annotation Companies

A curated list of top data annotation companies worldwide, grouped by annotation type, focusing on services for computer vision, NLP, and audio data.

AI Training computer vision Data Annotation Data Labeling Machine Learning

Igor Susmelj

2/28/2020 • EN

Getting started with CNNs by calculating LeNet-Layer manually

A technical tutorial explaining the fundamentals of Convolutional Neural Networks (CNNs) by manually calculating layers from the classic LeNet-5 architecture.

CNN computer vision Convolutional Neural Networks Deep Learning LeNet-5

Philipp Schmid

2/17/2020 • EN

Humans Powering the Machines

Explores the human effort behind AI training data, covering challenges of data annotation and techniques like transfer learning to reduce labeling workload.

computer vision Data Labeling Deep Learning Machine Learning Transfer Learning

Igor Susmelj

2/14/2020 • EN

Tools and Frameworks

A curated list of open-source and free tools for data annotation across computer vision, NLP, audio, and other domains, including image and video labeling.

computer vision Data Annotation Machine Learning NLP open source

Igor Susmelj

5/29/2019 • EN

Representation Theory for Robotics

Explores efficient state representations for robots to accelerate Reinforcement Learning training, comparing pixel-based and model-based approaches.

computer vision Deep Learning Reinforcement Learning Robotics State Representation

Mark Saroufim

3/8/2019 • EN

Enriching SharePoint Image Libraries with Azure Cognitive Services

Using Azure Cognitive Services and Logic Apps to automatically tag and enrich images stored in SharePoint with metadata.

Azure Cognitive Services computer vision image processing Logic Apps SharePoint

Simon Waight

2/15/2019 • EN

Learning Latent Dynamics for Planning from Pixels

Introduces PlaNet, a model-based AI agent that learns environment dynamics from pixels and plans actions in latent space for efficient control tasks.

computer vision Deep Learning Latent Dynamics Model Based Planning Reinforcement Learning

David Ha

12/27/2018 • EN

Object Detection Part 4: Fast Detection Models

Explores fast, one-stage object detection models like YOLO, SSD, and RetinaNet, comparing them to slower two-stage R-CNN models.

computer vision Deep Learning Object Detection SSD YOLO

Lilian Weng

10/20/2018 • EN

Playing Mortal Kombat with TensorFlow.js. Transfer learning and data augmentation

Building a Mortal Kombat controller using TensorFlow.js, CNNs, and transfer learning for posture classification from a webcam feed.

computer vision Convolutional Neural Networks Data Augmentation TensorFlow.js Transfer Learning

Minko Gechev

10/10/2018 • EN

Zooming Past the Competition

A team wins a tech hackathon by creating an AR app that uses AI and computer vision to recommend web content based on what a phone camera sees.

AI Augmented Reality computer vision Microservices Web Application

Yoel Zeldes

12/31/2017 • EN

Object Detection for Dummies Part 3: R-CNN Family

Explores the R-CNN family of models for object detection, covering R-CNN, Fast R-CNN, Faster R-CNN, and Mask R-CNN with technical details.

CNN computer vision Deep Learning Object Detection R-CNN

Lilian Weng

12/15/2017 • EN

Object Detection for Dummies Part 2: CNN, DPM and Overfeat

Explores classic CNN architectures for image classification, including AlexNet, VGG, and ResNet, as foundational models for object detection.

CNN computer vision Convolutional Neural Networks Deep Learning Object Detection

Lilian Weng

12/4/2017 • EN

The Last 5 Years In Deep Learning

A retrospective on the transformative impact of deep learning over the past five years, covering its rise, key applications, and future potential.

AI computer vision Deep Learning Machine Learning Neural Networks

Adit Deshpande
