Llm articles

1/17/2025 • EN

Implementing A Byte Pair Encoding (BPE) Tokenizer From Scratch

A step-by-step educational guide to building a Byte Pair Encoding (BPE) tokenizer from scratch, as used in models like GPT and Llama.

algorithm Bpe llm NLP Tokenization

Sebastian Raschka

1/17/2025 • EN

Implementing A Byte Pair Encoding (BPE) Tokenizer From Scratch

A step-by-step guide to implementing the Byte Pair Encoding (BPE) tokenizer from scratch, used in models like GPT and Llama.

algorithm Byte Pair Encoding llm NLP Tokenizer

Sebastian Raschka

1/17/2025 • EN

How to use Anthropic MCP Server with open LLMs, OpenAI or Google Gemini

A guide on using Anthropic's Model Context Protocol (MCP) to connect AI agents with tools and data sources using various LLMs like OpenAI or Gemini.

AI Agents llm mcp Model Context Protocol Openai

Philipp Schmid

1/16/2025 • EN

Common pitfalls when building generative AI applications

A guide to common mistakes developers make when building applications with generative AI, including overuse and poor UX integration.

AI Applications Common Pitfalls generative ai llm software development

Chip Huyen

1/14/2025 • EN

Don't use cosine similarity carelessly

A guide on the pitfalls of blindly using cosine similarity with text embeddings and how to apply it more intentionally for better results.

Cosine Similarity llm Sentence Embeddings Vector Similarity Word Embeddings

Piotr Migdał

1/13/2025 • EN

Assembling the Prompt: Notes on ‘Prompt Engineering for LLMs’ ch 6

A summary of Chapter 6 from 'Prompt Engineering for LLMs', covering prompt structure, document templates, and strategies for effective context inclusion.

ai development Context Management llm prompt engineering Structured Documents

Alex Strick van Linschoten

1/12/2025 • EN

Building AI Reading Club: Features & Behind the Scenes

A developer builds an AI-powered reading companion called Dewey, detailing its features, design, and technical implementation.

AI Reading Companion Context Retrieval Learning Tool llm prototyping

Eugene Yan

1/8/2025 • EN

Making my startup come back to life

Developer revives his old AI startup's brainstorming tool by building a GitHub Copilot Extension, using VS Code's speech features and LLMs.

ai tools Github Copilot llm Voice Interface Vscode Extension

Cassidy Williams

12/29/2024 • EN

LLM Research Papers: The 2024 List

A curated list of notable LLM and AI research papers published in 2024, providing a resource for those interested in the latest developments.

AI Research Arxiv llm Machine Learning Research Papers

Sebastian Raschka

12/25/2024 • EN

Getting Starting with Intelligent Java Applications using Spring AI

A tutorial on building a simple AI-powered chat client in Java using the Spring AI framework, covering setup, configuration, and provider abstraction.

API Abstraction Java llm Spring AI spring boot

Loiane Groner

12/19/2024 • EN

Pydantic Logfire for LLM and API Observability

Introducing Logfire, Pydantic's new observability tool for Python, with easy integration for OpenAI LLM calls, FastAPI, and logging.

api llm Logfire observability Pydantic

Saeed Esmaili

12/11/2024 • EN

Using the Azure AI Inference Service

Explores using Azure AI Inference Service to simplify LLM integration, focusing on Python SDK and GitHub Marketplace for experimentation.

AI Inference Azure AI llm Python SDK Serverless Endpoints

Geert Baeke

12/4/2024 • EN

Build a search engine, not a vector DB

Argues that building a good search engine is more critical for effective RAG than just using a vector database, as poor retrieval misleads AI.

llm Rag Retrieval Search Engine Vector Database

Saeed Esmaili

11/26/2024 • EN

Building LLMs is probably not going be a brilliant business

Analyzes why building Large Language Models (LLMs) may be a poor business, comparing the AI industry's structure to historically unprofitable sectors like airlines.

AI Business Industry Structure llm Technology Economics venture capital

Cal Paterson

11/11/2024 • EN

Using the Smartest AI to Rate Other AI

Explores a method using a 'Judging AI' (like o1-preview) to evaluate the performance of other AI models on tasks, relative to human capability.

AI Benchmarking AI Evaluation Fabric Pattern llm prompt engineering

Daniel Miessler

10/15/2024 • EN

How To T̶r̶a̶i̶n̶ Synthesize Your D̶r̶a̶g̶o̶n̶ Data

Explores the use of LLMs to generate synthetic data for training AI models, discussing challenges, an experiment with coding data, and a new library.

Data Generation Fastdata llm Synthetic Data Tinystories

Jeremy Howard

10/11/2024 • EN

Overcoming writer's block — lessons from AI

The article explores how the writing process of AI models can inspire humans to overcome writer's block by adopting a less perfectionist approach.

ai llm software development Transformers Writing

Piotr Migdał

9/30/2024 • EN

Transformers Create Shapes of the Universe

Explores the philosophical argument that AI, particularly LLMs, possess a form of understanding and model reality, challenging the notion they are mere token predictors.

artificial intelligence llm Philosophy Of AI Transformer Models Understanding

Daniel Miessler