Llm articles

9/1/2024 • EN

Building LLMs from the Ground Up: A 3-hour Coding Workshop

A 3-hour coding workshop video covering the implementation, training, and use of Large Language Models (LLMs) from scratch.

Coding Workshop Deep Learning llm Machine Learning Transformer Architecture

Sebastian Raschka

9/1/2024 • EN

Building LLMs from the Ground Up: A 3-hour Coding Workshop

A 3-hour coding workshop teaching how to implement, train, and use Large Language Models (LLMs) from scratch with practical examples.

Gpt 2 Implementation llm Tokenizer Training

Sebastian Raschka

8/27/2024 • EN

World Model + Next Token Prediction = Answer Prediction

A philosophical and technical exploration of how Large Language Models (LLMs) transform 'next token prediction' into meaningful answer generation.

AI Reasoning Language Models llm Next Token Prediction World Models

Daniel Miessler

8/25/2024 • EN

Trying K8sGPT – AI For Kubernetes

A hands-on review of K8sGPT, an AI-powered CLI tool for analyzing and troubleshooting Kubernetes clusters, including setup with local LLMs.

ai Cncf DevOps Kubernetes llm

Jonathan

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

A technical review of the latest pre-training and post-training methodologies used in state-of-the-art large language models (LLMs) like Qwen 2 and Llama 3.1.

ai large language models llm Post Training Pre Training

Sebastian Raschka

8/17/2024 • EN

New LLM Pre-training and Post-training Paradigms

Analyzes the latest pre-training and post-training methodologies used in state-of-the-art LLMs like Qwen 2, Apple's models, Gemma 2, and Llama 3.1.

Fine Tuning Language Models llm Post Training Pre Training

Sebastian Raschka

8/16/2024 • EN

Re: q What do I title this article?

A developer creates a Bash script called 'qq' to query Kagi's FastGPT API from the terminal, improving on a command-line LLM tool concept.

API Integration Bash Scripting Command Line Tools Fastgpt llm

Paul's Weblog

8/13/2024 • EN

Building a ChatGPT Clone with Ruby on Rails and Claude 3.5 Sonnet 🚀

A tutorial on building a ChatGPT-like chat application using Ruby on Rails and the Claude 3.5 Sonnet AI model, covering setup, models, and integration.

Chat Application Claude 35 Sonnet llm Openai API Ruby On Rails

Landon Gray

8/11/2024 • EN

How good can you be at Codenames without knowing any words?

Analyzing if a Codenames bot can win using only card layout patterns, without understanding word meanings.

game ai llm Machine Learning programming simulation

Dan Luu

8/6/2024 • EN

Connecting the Dots with AI

Explores how AI can revolutionize communication by bridging context gaps between people, using tools like RAG and AI assistants as proxies.

ai communication context llm Rag

Julien Danjou

8/3/2024 • EN

Re: GitHub Roaster

A satirical web app uses LLMs to roast GitHub profiles, highlighting a Svelte-based implementation and straightforward API.

api github llm Satire svelte

Paul's Weblog

7/20/2024 • EN

Instruction Pretraining LLMs

Explores recent research on instruction finetuning for LLMs, including cost-effective data generation methods and an overview of new models like Gemma 2.

Alignment Data Gemma 2 Instruction Finetuning llm Pretraining

Sebastian Raschka

7/15/2024 • EN

How to run a local LLM for inference with an offline-first approach

A guide on running Large Language Models (LLMs) locally for inference, covering tools like Ollama and Open WebUI for privacy and cost control.

Inference llm Local Machine Learning offline

Liran Tal

7/11/2024 • EN

Token consumption in Microsoft’s Graph RAG

Analyzes token consumption in Microsoft's Graph RAG for local and global queries, including setup with LiteLLM and Langfuse for monitoring.

Langfuse Litellm llm Microsoft Graph Rag Token Consumption

Geert Baeke

7/7/2024 • EN

Trying out Microsoft’s Graph RAG

Explores Microsoft's Graph RAG, an advanced RAG technique using knowledge graphs to answer global questions about datasets, with a hands-on setup guide.

Azure AI Search Graph Rag Knowledge Graphs llm Retrieval Augmented Generation

Geert Baeke

7/7/2024 • EN

Extrinsic Hallucinations in LLMs

Explores the causes and types of hallucinations in large language models, focusing on extrinsic hallucinations and how training data affects factual accuracy.

Factuality Fine Tuning Hallucination llm Pre Training

Lilian Weng

7/5/2024 • EN

Anthropic Claude 3 vs. Claude 3.5: A Comprehensive Comparison

A detailed comparison of Anthropic's Claude 3 and the newer Claude 3.5 Sonnet AI models, covering performance, capabilities, and benchmarks.

Anthropic artificial intelligence Claude llm Model Comparison

Varun Kumar

6/27/2024 • EN

AI Engineer 2024 Keynote - What We Learned from a Year of LLMs

Reflections on delivering the closing keynote at the AI Engineer World's Fair 2024, sharing lessons from a year of building with LLMs.

AI Engineering Keynote llm production software development

Eugene Yan

6/17/2024 • EN

The limitations of LLMs, or why are we doing RAG?

Explains the limitations of Large Language Models (LLMs) and introduces Retrieval Augmented Generation (RAG) as a solution for incorporating proprietary data.

Chatgpt Gpt 4 llm Rag Retrieval Augmented Generation

Phil Eaton

6/12/2024 • EN

GenAI Predictions and The Future of LLMs as local-first offline Small Language Models (SLMs)

The article argues for a shift from subscription-based online LLMs to offline-first Small Language Models (SLMs) due to privacy, security, and cost concerns.

llm Local First Offline Inference privacy Slm

Liran Tal