DGX Spark and Mac Mini for Local PyTorch Development
A technical comparison of the DGX Spark and Mac Mini M4 Pro for local PyTorch development and LLM inference, including benchmarks.
A guide to improving LLM-generated code quality by using contextual rules and agents to enforce production-ready patterns and architecture.
A timeline and analysis of major generative AI model releases and a security framework for AI agents from late 2025.
Explores how GenAI and agentic tools are shifting developer workflows towards rapid prototyping and focusing on output over implementation details.
Argues against using API keys for securing enterprise AI tools like LLMs and agents, highlighting security flaws and recommending better alternatives.
A developer reflects on how over-reliance on LLMs like Claude for coding tasks is making them impatient and hindering deep learning.
Explores the evolution from simple, stateless AI agents (Agent 1.0) to advanced, deep agents (Agent 2.0) capable of complex, multi-step tasks.
Explores the concept of AI Agents, defining them and examining their role in the AI ecosystem, with references to LangChain and Anthropic.
A discussion of AI researcher Rich Sutton's critique of LLMs and his vision for AI inspired by animal learning, contrasting with current approaches.
Introducing Cachy, an open-source Python package that caches LLM API calls to speed up development and testing and keep notebook diffs clean.
Explores how conversational LLMs actively reshape human thought patterns through neural mirroring, unlike passive social media algorithms.
A technical AI researcher questions if human 'world models' are as emergent and training-dependent as those in large language models (LLMs).
Argues that LLMs serve as a baseline for developer tools, not replacements, due to their general but non-specialized capabilities.
A blog post exploring the differences between AI and ML, clarifying terminology and common misconceptions in the field.
Explores training a hybrid LLM-recommender system using Semantic IDs for steerable, explainable recommendations.
Explains Retrieval-Augmented Generation (RAG), a pattern for improving LLM accuracy by augmenting prompts with retrieved context.
Explains LLM API token limits (TPM) and strategies for managing concurrent requests to avoid rate limiting in production applications.
The development story of Circuits Royale, a fast-paced, communal web-based word game powered by LLMs for real-time validation.
Explores the role of Large Language Models (LLMs) in AI, covering major model families, providers, and concepts like hallucinations.
A hands-on guide to understanding and implementing the Qwen3 large language model architecture from scratch using pure PyTorch.