Computer vision articles

12/21/2023 • EN

Test Run - Using Multimodal Vision AI In Test Automation

Explores using multimodal vision AI models like LLaVA for advanced UI/UX test automation, moving beyond traditional methods.

artificial intelligence computer vision Multimodal AI test automation ui testing

Unmesh Gundecha

12/2/2023 • EN

Rough Experiments with Llamafile and LLaVA 1.5

A developer experiments with Llamafile and LLaVA 1.5 to extract structured data from comedy show posters, testing its accuracy and JSON output capabilities.

computer vision Image Recognition Llamafile Llava Multimodal AI

Michael Lynch

11/30/2023 • EN

Finding images with text and image queries with the help of GPT-4 Vision

Building an image search system using GPT-4 Vision and Azure AI to find images via text queries or similar pictures.

Azure AI Search computer vision Embeddings Gpt 4 Vision Multimodal Search

Geert Baeke

10/11/2023 • EN

Segmenting Satellite Images

A technical guide on using Meta AI's Segment Anything model to perform object segmentation on satellite imagery from Maxar.

artificial intelligence computer vision Image Segmentation Satellite Imagery Segment Anything

Mark Litwintschik

9/10/2023 • EN

TWIL: September 10, 2023

A weekly tech learning digest covering Microsoft Fabric, AI topics, computer vision, Azure AI Document Intelligence, embeddings, and vector search.

Azure AI computer vision Data Engineering Etl Microsoft Fabric

André Vala

5/30/2023 • EN

Datacast Episode 117: Vector Databases, The Embeddings Revolution, and Working in China with Frank Liu

Interview with Frank Liu on vector databases, embeddings, his career in ML/hardware, and work culture differences between China and the US.

computer vision Embeddings Hardware Engineering Machine Learning Vector Databases

James Le

4/30/2023 • EN

supercells: universal superpixels algorithm for applications to geospatial data

Explains the Supercells algorithm for generating superpixels to improve segmentation of geospatial and satellite imagery.

computer vision Geospatial Data Image Segmentation Regionalization Superpixels

Jakub Nowosad

3/24/2023 • EN

AI-assisted computer interfaces of the future

Explores a future AI-assisted computer interface model inspired by sci-fi, where AI highlights data anomalies for human specialist review.

ai computer vision data analysis Human AI Interaction user interface

Hugo

1/26/2023 • EN

Hugging Face Transformers Examples

A guide to using Hugging Face Transformers library with examples for fine-tuning models like BERT and BART for NLP and computer vision tasks.

Bart Bert computer vision Hugging Face Transformers Natural Language Processing

Philipp Schmid

1/3/2023 • EN

Influential Machine Learning Papers Of 2022

A review of the top 10 most influential machine learning papers from 2022, including ConvNeXt and MaxViT, with technical analysis.

computer vision Convnext Convolutional Neural Networks Machine Learning Research Papers

Sebastian Raschka

11/29/2022 • EN

Using Computer Vision to make €6,147,455 Overnight in In-Game Currency

A developer uses Python, OpenCV, and computer vision to automate collecting in-game currency in City Island 5, earning millions overnight.

automation computer vision Game Scripting Opencv Python

Paul Onteri

10/4/2022 • EN

Document AI: Fine-tuning LayoutLM for document-understanding using Hugging Face Transformers

A tutorial on fine-tuning Microsoft's LayoutLM model for document understanding and information extraction using the Hugging Face Transformers library.

computer vision Document AI Hugging Face Transformers Information Extraction Layoutlm

Philipp Schmid

5/3/2022 • EN

Semantic Segmantion with Hugging Face's Transformers and Amazon SageMaker

A technical guide on using Hugging Face's SegFormer model with Amazon SageMaker for semantic image segmentation tasks.

Amazon Sagemaker computer vision Image Segmentation Semantic Segmentation Transformers

Philipp Schmid

3/4/2022 • EN

The 2030 Self-Driving Car Bet

A $10,000 charity bet on whether fully autonomous (Level 5) self-driving cars will be commercially available in major US cities by 2030.

artificial intelligence autonomous vehicles computer vision Machine Learning Self Driving Cars

Jeff Atwood

2/12/2022 • EN

Why Write

A developer revives their blog to improve thinking through public writing, planning posts on research, career, and technical topics.

blogging computer vision Research Thinking Writing

Avi Singh

8/1/2021 • EN

Making an interactive digital frame with head-tracking using Three.js and TensorFlow.js

A tutorial on creating an interactive digital frame with head-tracking perspective effects using Three.js and TensorFlow.js.

3d Graphics computer vision Head Tracking Tensorflowjs three.js

Charlie Gerard

7/9/2021 • EN

Introduction to Deep Learning

A comprehensive deep learning course covering fundamentals, neural networks, computer vision, and generative models using PyTorch.

computer vision Deep Learning Machine Learning Neural Networks Pytorch

Sebastian Raschka

7/9/2021 • EN

Introduction to Deep Learning

A comprehensive deep learning course overview with PyTorch tutorials, covering fundamentals, neural networks, and advanced topics like CNNs and GANs.

computer vision Deep Learning Machine Learning Neural Networks Pytorch

Sebastian Raschka

6/11/2021 • EN

Where Are Pixels? -- a Deep Learning Perspective

Explores how images are discretized into pixels, the impact of sampling grids on deep learning models, and inconsistencies in image processing libraries.

Cnn computer vision Deep Learning image processing Sampling Theory

Yuxin Wu