RAG articles

12/4/2024 • EN

Build a search engine, not a vector DB

Argues that effective RAG depends more on building a good search engine than on simply adopting a vector database, because poor retrieval misleads the model.

llm Rag Retrieval Search Engine Vector Database

Saeed Esmaili

9/20/2024 • EN

Azure Logic Apps – RAG

Explores using Azure Logic Apps for document parsing and chunking to streamline RAG (Retrieval-Augmented Generation) workflows and AI integration.

AI Integration Azure Logic Apps Document Parsing Low Code Rag

Wessel

8/6/2024 • EN

Connecting the Dots with AI

Explores how AI can revolutionize communication by bridging context gaps between people, using tools like RAG and AI assistants as proxies.

ai communication context llm Rag

Julien Danjou

6/25/2024 • EN

Train and Deploy open Embedding Models on Amazon SageMaker

A guide to fine-tuning and deploying custom embedding models for RAG applications on Amazon SageMaker using Sentence Transformers v3.

Amazon Sagemaker Embedding Models Hugging Face Rag Sentence Transformers

Philipp Schmid

6/18/2024 • EN

Full Local RAG scenario using #Phi3, #SemanticKernel and TextMemory. Bonus: Test in CodeSpaces

A tutorial on implementing a local RAG system using Phi-3, Semantic Kernel, and TextMemory in a C# console application.

C# Phi 3 Rag Semantic Kernel Text Memory

Bruno Capuano

6/17/2024 • EN

The limitations of LLMs, or why are we doing RAG?

Explains the limitations of Large Language Models (LLMs) and introduces Retrieval Augmented Generation (RAG) as a solution for incorporating proprietary data.

Chatgpt Gpt 4 llm Rag Retrieval Augmented Generation

Phil Eaton

6/4/2024 • EN

Fine-tune Embedding models for Retrieval Augmented Generation (RAG)

A guide to fine-tuning embedding models for RAG applications using Sentence Transformers 3, featuring Matryoshka Representation Learning for efficiency.

Embedding Models Fine Tuning Matryoshka Representation Learning Rag Sentence Transformers

Philipp Schmid

6/2/2024 • EN

To Chunk or Not to Chunk With the Long Context Single Embedding Models

An experiment comparing retrieval performance of chunked vs. non-chunked documents using long-context embedding models like BGE-M3.

Chunking Context Window Embeddings Rag Retrieval

Saeed Esmaili

5/12/2024 • EN

What We've Learned From A Year of Building with LLMs

A practical guide sharing lessons learned from a year of building real-world applications with Large Language Models (LLMs).

AI Evals large language models LLM Applications prompt engineering Rag

Eugene Yan

4/6/2024 • EN

Building a RAG for tabular data in Go with PostgreSQL & Gemini

A technical guide on building a Retrieval-Augmented Generation (RAG) system in Go to query PostgreSQL tabular data using Google's Gemini LLM.

Gemini go postgresql Rag Vertex AI

Paolo Galeone

3/15/2024 • EN

Using Azure AI Language studio to improve RAG grounding document discovery

A technical guide on using Azure AI Language Studio to summarize and optimize grounding documents for improving RAG-based AI solutions.

Azure AI Language Studio Document Summarization llm Rag Retrieval Augmented Generation

Benjamin Perkins

2/10/2024 • EN

Retrieval with the Azure OpenAI Assistants API

Explains how to implement document retrieval with the Azure OpenAI Assistants API using a custom RAG approach, as the retrieval tool is not yet natively supported.

Assistants API Azure Openai Rag Retrieval Vector Storage

Geert Baeke

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

10/30/2023 • EN

Evaluate LLMs and RAG a practical example using Langchain and Hugging Face

A hands-on guide to evaluating LLMs and RAG systems using Langchain and Hugging Face, covering criteria-based and pairwise evaluation methods.

Gpt 4 Hugging Face Langchain LLM Evaluation Rag

Philipp Schmid

8/13/2023 • EN

How to Match LLM Patterns to Problems

A guide to selecting the right LLM architectural patterns (like RAG, fine-tuning, caching) to solve common production challenges such as performance metrics and data constraints.

Fine Tuning LLM Applications LLM Patterns LLM Production Rag

Eugene Yan