Llm articles

4/7/2025 • EN

Java with Generative AI and LLMs

Explores the integration of Java with Generative AI and Large Language Models (LLMs) for building innovative applications like AI chatbots.

AI Integration generative ai Java llm software development

Mark Heckler

4/7/2025 • EN

A Journey from AI to LLMs and MCP - 3 - Boosting LLM Performance — Fine-Tuning, Prompt Engineering, and RAG

Explores three key methods to enhance LLM performance: fine-tuning, prompt engineering, and RAG, detailing their use cases and trade-offs.

ai Fine Tuning llm prompt engineering Retrieval Augmented Generation

Alex Merced

4/6/2025 • EN

A Journey from AI to LLMs and MCP - 2 - How LLMs Work — Embeddings, Vectors, and Context Windows

Explains how LLMs work by converting words to numerical embeddings, using vector spaces for semantic understanding, and managing context windows.

Context Windows Embeddings llm Transformers Vectors

Alex Merced

4/5/2025 • EN

A Journey from AI to LLMs and MCP - 1 - What Is AI and How It Evolved Into LLMs

Explores the evolution of AI from symbolic systems to modern Large Language Models (LLMs), detailing their capabilities and limitations.

artificial intelligence Deep Learning llm Machine Learning Model Context Protocol

Alex Merced

4/4/2025 • EN

I don't know what MCP is and at this point I'm too afraid to ask

An explanation of the Model Context Protocol (MCP), an open standard for connecting LLMs to data and tools, and why it's important for AI development.

AI Agents Language Server Protocol llm mcp Model Context Protocol

Cassidy Williams

4/3/2025 • EN

Model Context Protocol (MCP) an overview

An overview of the Model Context Protocol (MCP), an open standard for connecting AI applications to external tools and data sources.

AI Integration api client-server llm mcp

Philipp Schmid

4/2/2025 • EN

Building a Production-Ready, Pluggable A2A Agent with IBM watsonx.ai, MatrixHub, and MCP Gateway

A guide to building a production-ready, vendor-neutral AI agent using IBM watsonx.ai, MatrixHub, and MCP Gateway, focusing on decoupled architecture.

Agent To Agent Ibm Watsonxai llm Matrixhub Production Ready

Ruslan Magana Vsevolodovna

3/31/2025 • EN

ReAct agent from scratch with Gemini 2.5 and LangGraph

A tutorial on building a ReAct AI agent from scratch using Google's Gemini 2.5 Pro/Flash and the LangGraph framework for complex reasoning and tool use.

AI Agents Gemini Langgraph llm React

Philipp Schmid

3/31/2025 • EN

Poisoning Well

Explores the ethics of LLM training data and proposes a technical method to poison AI crawlers using nofollow links.

ai ethics Data Poisoning llm robots.txt web scraping

Heydon Pickering

3/30/2025 • EN

Bootstrapping ranking models with an LLM judge

Using an LLM to label Hacker News titles and train a Ridge regression model for personalized article ranking based on user preferences.

llm Machine Learning Ranking Models Ridge Regression Sentence Transformers

Emir U

3/29/2025 • EN

First Look at Reasoning From Scratch: Chapter 1

An introduction to reasoning in Large Language Models, covering key concepts like chain-of-thought and methods to improve LLM reasoning abilities.

artificial intelligence Deep Learning llm Machine Learning Reasoning

Sebastian Raschka

3/18/2025 • EN

Enhancing Text-to-SQL With Synthetic Summaries

Explains a technique using AI-generated summaries of SQL queries to improve the accuracy of text-to-SQL systems with LLMs.

llm Retrieval Augmented Generation SQL Generation Synthetic Data Text To SQL

Saeed Esmaili

3/18/2025 • EN

NVIDIA GTC 2025 - Building LLM-Powered Applications

Summary of a panel discussion at NVIDIA GTC 2025 on insights and lessons learned from building real-world LLM-powered applications.

Engineering generative ai llm Nvidia Gtc production

Eugene Yan

3/16/2025 • EN

Improving Recommendation Systems & Search in the Age of LLMs

Explores how large language models (LLMs) are transforming industrial recommendation systems and search, covering hybrid architectures, data generation, and unified frameworks.

llm Model Architecture recommendation systems Search Semantic Ids

Eugene Yan

3/14/2025 • EN

Google Gemma 3 Function Calling Example

A tutorial on implementing function calling with Google's Gemma 3 27B LLM, showing how to connect it to external tools and APIs.

api Function Calling Google Gemma llm Structured Output

Philipp Schmid

3/6/2025 • EN

Understanding Attention in LLMs

A clear explanation of the attention mechanism in Large Language Models, focusing on how words derive meaning from context using vector embeddings.

Attention Mechanism llm Machine Learning Natural Language Processing Transformers

Bartosz Milewski

3/5/2025 • EN

Headroom for AI development

Argues that AI can improve beyond current transformer models by examining biological examples of superior sample efficiency and planning.

artificial intelligence Deep Learning llm Machine Learning Transformers

John Langford

3/5/2025 • EN

Generality

Explores the concept of 'generality' in AI models, using examples of ML failures and LLM inconsistencies to question how we assess their capabilities.

ai Generality llm Machine Learning programming

Alex Gaynor

3/5/2025 • EN

Function Calling Guide: Google DeepMind Gemini 2.0 Flash

A practical guide to implementing function calling with Google's Gemini 2.0 Flash model, enabling LLMs to interact with external tools and APIs.

API Integration Function Calling Google Gemini llm Structured Output

Philipp Schmid

3/3/2025 • EN

Add Logprobs to Openai Structured Output

Explains how to extract logprobs from OpenAI's structured JSON outputs using the structured-logprobs Python library for better LLM confidence insights.

llm Logprobs Openai API Python Library Structured Output

Saeed Esmaili