Llm articles

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

11/30/2023 • EN

LLM-Supported Development

A developer's experience using Sweep, an LLM-powered tool that generates pull requests to write unit tests and fix code in a GitHub workflow.

ai coding github llm software development unit testing

James Smith

11/14/2023 • EN

Exploring ChatGPT’s Knowledge Cutoff

An analysis of ChatGPT's knowledge cutoff date, testing its accuracy on celebrity death dates to understand the limits of its training data.

api Chatgpt Gpt 4 Knowledge Cutoff llm

Matt Mazur

11/5/2023 • EN

Out-of-Domain Finetuning to Bootstrap Hallucination Detection

Explores using out-of-domain data to improve LLM finetuning for detecting factual inconsistencies (hallucinations) in text summaries.

Finetuning Hallucination Detection llm Machine Learning Natural Language Inference

Eugene Yan

10/25/2023 • EN

Adversarial Attacks on LLMs

Explores adversarial attacks and jailbreak prompts that can make large language models produce unsafe or undesired outputs, bypassing safety measures.

Adversarial Attacks Jailbreak Prompts large language models llm security

Lilian Weng

10/22/2023 • EN

Ollama - Building a Custom Model

A guide on using Ollama's Modelfile to create and deploy a custom large language model (LLM) for specific tasks, like an API security assistant.

API Security Custom Model llm Modelfile Ollama

Unmesh Gundecha

10/15/2023 • EN

Reflections on AI Engineer Summit 2023

Key takeaways from the AI Engineer Summit 2023, focusing on challenges in LLM deployment like evaluation methods and serving costs.

AI Engineering deployment Eval llm Serving Costs

Eugene Yan

10/14/2023 • EN

Ollama - running large language models on your machine

A guide to using Ollama, an open-source CLI tool for running and customizing large language models like Llama 2 locally on your own machine.

command line llm Local AI Ollama Transformer

Unmesh Gundecha

10/10/2023 • EN

Multimodality and Large Multimodal Models (LMMs)

An in-depth exploration of Large Multimodal Models (LMMs), covering their fundamentals, key architectures like CLIP and Flamingo, and current research directions.

Clip Flamingo Large Multimodal Models llm Multimodal AI

Chip Huyen

10/9/2023 • EN

AI Engineer 2023 Keynote - Building Blocks for LLM Systems

A summary of a keynote talk on essential building blocks for production LLM systems, covering evaluations, RAG, and guardrails.

AI Engineering Evaluations llm production Retrieval Augmented Generation

Eugene Yan

9/24/2023 • EN

TWIL: September 24, 2023

A developer's weekly learning log covering Azure Machine Learning, Prompt Flow, Microsoft Fabric, Copilot, and an LLM hallucination paper.

Azure Machine Learning llm Microsoft Copilot Microsoft Fabric Prompt Flow

André Vala

9/20/2023 • EN

LLMs Demand Observability-Driven Development

Explains why traditional debugging fails for LLMs and advocates for observability-driven development to manage their non-deterministic nature in production.

debugging llm observability Production Systems software development

Charity Majors

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Strategies for improving LLM performance through dataset-centric fine-tuning, focusing on instruction datasets rather than model architecture changes.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

9/15/2023 • EN

Optimizing LLMs From a Dataset Perspective

Explores dataset-centric strategies for fine-tuning LLMs, focusing on instruction datasets to improve model performance without altering architecture.

Dataset Finetuning Instruction Tuning llm Neural Networks

Sebastian Raschka

9/8/2023 • EN

Asking a Large Language Model How YouTube Works

A technical guide on using an LLM (Platypus2) with LangChain and pgvector to analyze YouTube's Procella database paper.

Langchain Llamacpp llm Pgvector postgresql

Mark Litwintschik

8/31/2023 • EN

Optimize open LLMs using GPTQ and Hugging Face Optimum

A guide to using GPTQ quantization with Hugging Face Optimum to compress open-source LLMs for efficient deployment on smaller hardware.

Gptq Hugging Face llm Optimum Quantization

Philipp Schmid

8/29/2023 • EN

AI crap

A critical analysis of the machine learning bubble, arguing its lasting impact will be a proliferation of low-quality, automated content and services, not true AGI.

Agi AI Bubble automation llm Machine Learning

Drew DeVault

8/27/2023 • EN

TWIL: August 27, 2023

A developer's weekly learning log covering Power BI data refresh, LLM architectures, Azure OpenAI costs, AI news, Python in Excel, and Azure SQL updates.

Azure Azure Openai Data Refresh llm Power Bi

André Vala