Llm articles

6/2/2024 • EN

LLM Research Insights: Instruction Masking and New LoRA Finetuning Experiments?

Explores new research on instruction masking and LoRA finetuning techniques for improving large language models (LLMs).

Finetuning Instruction Tuning llm Lora Research

Sebastian Raschka

5/31/2024 • EN

Impact of LLMs on Interviewing in 2024

Analyzes how LLMs and AI are making technical interviews harder, leading to more complex coding questions and increased cheating, and proposes work sample tests as a better alternative.

ai Leetcode llm software development technical interviews

Kris Kula

5/31/2024 • EN

Netflix PRS 2024 - Applying LLMs to Recommendation Experiences

A summary of a talk on applying Large Language Models (LLMs) to build and deploy recommendation systems at scale, presented at Netflix's PRS workshop.

llm Machine Learning Personalization recommendation systems Recsys

Eugene Yan

5/28/2024 • EN

Are LLMs going to replace us?

Analyzing if AI can replace humans using computational theory, comparing countable vs. uncountable problems and AI's inherent limitations.

ai limitations Computability llm Theory Of Computation Turing Machine

Minko Gechev

5/27/2024 • EN

Lessons After a Half Billion Gpt Tokens

Practical lessons from integrating LLMs into a product, focusing on prompt design pitfalls like over-specification and handling null responses.

API Integration Claude 3 Gpt 4 llm prompt engineering

Saeed Esmaili

5/12/2024 • EN

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

A technical review of April 2024's major open LLM releases (Mixtral, Llama 3, Phi-3, OpenELM) and a comparison of DPO vs PPO for LLM alignment.

Dpo llm Ppo Reinforcement Learning Transformer

Sebastian Raschka

5/12/2024 • EN

How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?

A review and comparison of the latest open LLMs (Mixtral, Llama 3, Phi-3, OpenELM) and a study on DPO vs. PPO for LLM alignment.

llm Mixture Of Experts Ppo Reinforcement Learning Transformer

Sebastian Raschka

3/18/2024 • EN

LLMs Shouldn't Write SQL

Argues against using LLMs to generate SQL queries for novel business questions, highlighting the importance of human analysts for precision.

data analysis Data Modeling llm Query Generation sql

Saeed Esmaili

3/15/2024 • EN

Using Azure AI Language studio to improve RAG grounding document discovery

A technical guide on using Azure AI Language Studio to summarize and optimize grounding documents for improving RAG-based AI solutions.

Azure AI Language Studio Document Summarization llm Rag Retrieval Augmented Generation

Benjamin Perkins

3/12/2024 • EN

Optimizing Technical Docs for LLMs

Practical tips for writing technical documentation that is optimized for LLM question-answering tools, improving developer experience.

API Documentation Code Snippets developer experience llm Technical Documentation

Saeed Esmaili

3/3/2024 • EN

Research Papers in February 2024

A summary of key AI research papers from February 2024, focusing on new open-source LLMs, small fine-tuned models, and efficient fine-tuning techniques.

AI Research Finetuning Gemma llm open source

Sebastian Raschka

3/3/2024 • EN

Research Papers in February 2024

A summary of February 2024 AI research, covering new open-source LLMs like OLMo and Gemma, and a study on small, fine-tuned models for text summarization.

AI Research Finetuning llm open source Summarization

Sebastian Raschka

2/20/2024 • EN

There Is a Huge Gap in Generative Ai

Explores the gap between generative AI's perceived quality in open-ended play and its practical effectiveness for specific, goal-oriented tasks.

ai development generative ai llm Machine Learning software engineering

Saeed Esmaili

2/13/2024 • EN

I worry our Copilot is leaving some passengers behind

A developer's critical reflection on GitHub Copilot's impact, questioning if its AI assistance is creating accessibility and quality divides in software development.

AI Coding Assistant developer tools Github Copilot llm software development

Josh Collinsworth

2/11/2024 • EN

How to Generate and Use Synthetic Data for Finetuning

Explores methods for generating synthetic data (distillation & self-improvement) to fine-tune LLMs for pretraining, instruction-tuning, and preference-tuning.

Finetuning Instruction Tuning llm Preference Tuning Synthetic Data

Eugene Yan

1/25/2024 • EN

Running a local LLM with Ollama

A guide on running a Large Language Model (LLM) locally using Ollama for privacy and offline use, covering setup and performance tips.

llm Local AI Model Deployment Ollama privacy

Jan Ouwens

1/23/2024 • EN

RLHF in 2024 with DPO and Hugging Face

A technical guide on using Direct Preference Optimization (DPO) with Hugging Face's TRL library to align and improve open-source large language models in 2024.

Dpo Hugging Face llm Rlhf Trl

Philipp Schmid

1/7/2024 • EN

Language Modeling Reading List (to Start Your Paper Club)

A curated reading list of fundamental language modeling papers with summaries, designed to help start a weekly paper club for learning and discussion.

Language Modeling llm Paper Club Research Transformer

Eugene Yan