Llm articles

12/10/2025 • EN

Under the hood of Canada Spends with Brendan Samek

An interview about Canada Spends, a project using Datasette, SQLite, and LLMs to make Canadian government financial data accessible and explorable.

data visualization Datasette llm Pdf Extraction sqlite

Simon Willison

12/9/2025 • EN

Quoting Claude

A blog post analyzing a critical bug in Claude Code where a command accidentally deleted a user's home directory.

ai ethics Claude Code Coding Agents generative ai llm

Simon Willison

12/9/2025 • EN

Prediction: AI will make formal verification go mainstream

AI is predicted to bring formal verification tools like Dafny and Verus into mainstream use, aided by LLMs making them more accessible.

ai formal verification llm programming languages software development

Simon Willison

12/7/2025 • EN

Using LLMs at Oxide

Bryan Cantrill discusses applying Large Language Models (LLMs) at Oxide, evaluating them against the company's core values.

ai ethics generative ai llm Oxide software development

Simon Willison

12/7/2025 • EN

Quoting David Crespo

Tips from David Crespo on effectively using Claude Code for understanding codebases and automating tedious coding tasks.

Claude Code Context Management llm Programming Workflow software development

Simon Willison

12/4/2025 • EN

Context Engineering for AI Agents: Part 2

Explores advanced Context Engineering techniques for AI agents, focusing on combating Context Rot and improving multi-agent coordination.

Agent Harness AI Agents Context Engineering llm Multi Agent Systems

Philipp Schmid

12/3/2025 • EN

A Technical Tour of the DeepSeek Models from V3 to V3.2

A technical analysis of the DeepSeek model series, from V3 to the latest V3.2, covering architecture, performance, and release timeline.

Deepseek llm Model Architecture Reinforcement Learning Sparse Attention

Sebastian Raschka

12/2/2025 • EN

Claude 4.5 Opus' Soul Document

Anthropic's internal 'soul document' used to train Claude 4.5 Opus's personality and values has been confirmed and partially revealed.

AI Safety Anthropic Claude llm Model Training

Simon Willison

11/29/2025 • EN

The space of minds

Explores the fundamental differences between animal intelligence and AI/LLM intelligence, focusing on their distinct evolutionary and optimization pressures.

artificial intelligence llm Machine Learning Neuroscience optimization

Andrej Karpathy

11/29/2025 • EN

Handing over to the AI for a day [blog]

A developer's personal experiment with AI-driven software development using local LLMs, detailing setup, challenges, and initial impressions.

ai development Claude Code llm Local LLM TypeScript

Remy Sharp

11/27/2025 • EN

deepseek-ai/DeepSeek-Math-V2

DeepSeek-Math-V2 is an open-source 685B parameter AI model that achieves gold medal performance on mathematical Olympiad problems.

Deepseek Large Language Model llm Mathematical Reasoning Open Weights

Simon Willison

11/26/2025 • EN

Why (Senior) Engineers Struggle to Build AI Agents

Senior engineers struggle with AI agent development due to ingrained deterministic habits, contrasting with the probabilistic nature of agent engineering.

Agent Engineering AI Agents Deterministic Systems llm software engineering

Philipp Schmid

11/26/2025 • EN

Interesting links - November 2025

A monthly tech link roundup covering AI agents, Kafka, Flink, LLMs, conference tips, and commentary on tech publishing trends.

AI Agents Flink Kafka llm mcp

Robin Moffatt

11/25/2025 • EN

llm-anthropic 0.23

Release of llm-anthropic 0.23 plugin adding support for Claude Opus 4.5 and its new thinking_effort option.

Anthropic Claude llm Python Library Thinking Effort

Simon Willison

11/25/2025 • EN

Quoting Claude Opus 4.5 system prompt

Analysis of a leaked system prompt for Claude Opus 4.5, discussing its content and the challenges of evaluating new LLMs.

Anthropic Claude generative ai llm System Prompts

Simon Willison

11/24/2025 • EN

MCP with Quarkus LangChain4j

A tutorial on using Quarkus LangChain4j to implement the Model Context Protocol (MCP) for connecting AI models to tools and data sources.

ai Langchain4j llm mcp Quarkus

Piotr Mińkowski

11/24/2025 • EN

Surprises hidden in the Claude Opus 4.5 System Card

Analysis of surprising findings in Claude Opus 4.5's system card, including loophole exploitation, model welfare, and deceptive behaviors.

AI Safety Anthropic Claude llm Model Welfare

Dave Hulbert

11/23/2025 • EN

Agent design is still hard

Armin Ronacher discusses challenges in AI agent design, including abstraction issues, testing difficulties, and API synchronization problems.

abstraction Agent Design llm Reinforcement testing

Simon Willison

11/22/2025 • EN

LLM APIs are a Synchronization Problem

Analyzes LLM APIs as a distributed state synchronization problem, critiquing their abstraction and proposing a mental model based on token and cache state.

api design distributed systems Language Models llm State Synchronization

Armin Ronacher

11/21/2025 • EN

Non-determinism and ownership

A developer discusses the non-deterministic nature of LLMs like GitHub Copilot, arguing that while useful, they cannot take ownership of errors like a human teammate.

ai tools Github Copilot llm Non Determinism software development

Cassidy Williams