Large language models articles

7/11/2024 • EN

LLM Evaluation doesn't need to be complicated

A guide to simplifying LLM evaluation workflows using clear metrics, chain-of-thought, and few-shot prompts, inspired by real-world examples.

AI Applications Chatbot generative ai large language models LLM Evaluation

Philipp Schmid

6/28/2024 • EN

Evaluating Open LLMs with MixEval: The Closest Benchmark to LMSYS Chatbot Arena

Introduces MixEval, a cost-effective LLM benchmark with high correlation to Chatbot Arena, for evaluating open-source language models.

benchmark Chatbot Arena large language models LLM Evaluation open source

Philipp Schmid

5/29/2024 • EN

HeavyIQ: Understanding 220M Flights with AI

HeavyIQ is an AI-powered English-to-SQL interface from HEAVY.AI, using a fine-tuned LLM to query and visualize massive datasets like flight records.

ai data visualization Gpu Database large language models sql

Mark Litwintschik

5/26/2024 • EN

Prompting Fundamentals and How to Apply them Effectively

Explains core prompting fundamentals for effective LLM use, including mental models, role assignment, and practical workflow with examples.

Chain Of Thought Claude API Evaluation large language models prompt engineering

Eugene Yan

5/12/2024 • EN

What We've Learned From A Year of Building with LLMs

A practical guide sharing lessons learned from a year of building real-world applications with Large Language Models (LLMs).

AI Evals large language models LLM Applications prompt engineering Rag

Eugene Yan

5/2/2024 • EN

Deploy open LLMs with vLLM on Hugging Face Inference Endpoints

A tutorial on deploying open-source large language models (LLMs) like Llama 3 using the vLLM framework on Hugging Face Inference Endpoints.

Hugging Face Inference Endpoints large language models LLM Deployment Vllm

Philipp Schmid

4/22/2024 • EN

Efficiently fine-tune Llama 3 with PyTorch FSDP and Q-Lora

A technical guide on fine-tuning the Llama 3 70B model using PyTorch FSDP and Q-Lora for efficient training on limited GPU hardware.

Fine Tuning large language models Llama 3 Pytorch Fsdp Q Lora

Philipp Schmid

4/20/2024 • EN

Using and Finetuning Pretrained Transformers

Explores methods for using and finetuning pretrained large language models, including feature-based approaches and parameter updates.

ai Finetuning large language models Machine Learning Transformers

Sebastian Raschka

4/18/2024 • EN

Deploy Llama 3 on Amazon SageMaker

A technical guide on deploying Meta's Llama 3 70B model on Amazon SageMaker using the Hugging Face LLM DLC and Text Generation Inference.

Amazon Sagemaker Hugging Face large language models Llama 3 Model Deployment

Philipp Schmid

3/20/2024 • EN

PALE Large Language Models, instead of "Open Source."

Argues that the term 'Open Source' is misleading for LLMs and proposes the new term 'PALE LLMs' (Publicly Available, Locally Executable).

ai ethics Free Software large language models Licensing open source

Fernando Castor

3/9/2024 • EN

Using AI tools for coding: good or bad?

Explores the balanced use of AI coding tools like GitHub Copilot, discussing benefits, risks of hallucinations, and best practices for developers.

AI Coding Tools Chatgpt code generation Github Copilot large language models

Andrea Grandi

3/4/2024 • EN

Measuring and Mitigating Hallucinations in Large Language Models: A Multifaceted Approach

A technical paper exploring the causes, measurement, and mitigation strategies for hallucinations in Large Language Models (LLMs).

AI Safety Hallucination Mitigation large language models LLM Evaluation Model Alignment

Xavier Amatriain

12/11/2023 • EN

Retrieval-Augmented Generation (RAG) simply explained

A simple explanation of Retrieval-Augmented Generation (RAG), covering its core components: LLMs, context, and vector databases.

large language models llm Rag Retrieval Augmented Generation Vector Databases

Luc van Donkersgoed

10/25/2023 • EN

Adversarial Attacks on LLMs

Explores adversarial attacks and jailbreak prompts that can make large language models produce unsafe or undesired outputs, bypassing safety measures.

Adversarial Attacks Jailbreak Prompts large language models llm security

Lilian Weng

10/18/2023 • EN

Building Intelligent Enterprise-Grade applications with Azure OpenAI and Microsoft Data Platform

Explores building enterprise applications using Azure OpenAI and Microsoft's data platform for secure, integrated AI solutions.

Azure Openai Enterprise Applications generative ai large language models Microsoft Data Platform

Hugo Barona

10/12/2023 • EN

Deploy Idefics 9B and 80B on Amazon SageMaker

A technical guide on deploying Hugging Face's IDEFICS visual language models (9B & 80B parameters) to Amazon SageMaker using the LLM DLC.

Amazon Sagemaker Idefics large language models Model Deployment Multimodal AI

Philipp Schmid

9/26/2023 • EN

Llama 2 on Amazon SageMaker a Benchmark

A benchmark analysis of deploying Meta's Llama 2 models on Amazon SageMaker using Hugging Face's LLM Inference Container, evaluating cost, latency, and throughput.

Amazon Sagemaker benchmark large language models Llama 2 Model Deployment

Philipp Schmid

9/20/2023 • EN

Fine-tune Falcon 180B with DeepSpeed ZeRO, LoRA and Flash Attention

A technical guide on fine-tuning the massive Falcon 180B language model using DeepSpeed ZeRO, LoRA, and Flash Attention for efficient training.

Deepspeed Falcon 180b Flash Attention large language models Lora

Philipp Schmid

6/1/2023 • EN

Semantic Kernel Planner 101

An introduction to Semantic Kernel's Planner, a tool for automatically generating and executing complex AI tasks using plugins and natural language goals.

AI Integration large language models Planner plugins Semantic Kernel

Geert Baeke