Accelerating Large Language Models with Mixed-Precision Techniques
Exploring mixed-precision techniques to speed up large language model training and inference by up to 3x while reducing memory use, without losing accuracy.
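The mixed-precision recipe summarized above can be sketched in PyTorch: run the forward pass in a low-precision dtype under `torch.autocast`, and use a gradient scaler to protect small FP16 gradients from underflow. Everything here (the tiny model, shapes, learning rate) is illustrative, not taken from the article:

```python
import torch
import torch.nn as nn

# Pick the autocast dtype per device: fp16 on GPU, bf16 on CPU (illustrative choice).
device = "cuda" if torch.cuda.is_available() else "cpu"
amp_dtype = torch.float16 if device == "cuda" else torch.bfloat16

model = nn.Linear(16, 4).to(device)          # tiny stand-in for an LLM
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
# GradScaler rescales FP16 gradients to avoid underflow; bf16 does not need it,
# so it is disabled on CPU but kept here to show the full recipe.
scaler = torch.cuda.amp.GradScaler(enabled=(device == "cuda"))

x = torch.randn(8, 16, device=device)
target = torch.randn(8, 4, device=device)

optimizer.zero_grad()
with torch.autocast(device_type=device, dtype=amp_dtype):
    loss = nn.functional.mse_loss(model(x), target)   # forward in low precision
scaler.scale(loss).backward()   # backward on the (possibly) scaled loss
scaler.step(optimizer)          # unscales grads, skips the step on inf/nan, else updates
scaler.update()
```

The speed and memory win comes from the low-precision forward/backward; the optimizer still holds full-precision master weights, which is why accuracy is preserved.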
A curated list of open-source Large Language Models (LLMs) available for commercial use, including community-contributed updates and details.
A technical tutorial on fine-tuning a 20B+ parameter LLM using PyTorch FSDP and Hugging Face on Amazon SageMaker's multi-GPU infrastructure.
Explains parameter-efficient finetuning methods for large language models, covering techniques like prefix tuning and LLaMA-Adapters.
Introduces IGEL, an instruction-tuned German large language model based on BLOOM, for NLP tasks like translation and QA.
A guide to finetuning large language models like BLOOM on a single GPU using gradient accumulation to overcome memory limits.
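Gradient accumulation, as described above, amounts to summing gradients over several small micro-batches before taking a single optimizer step, so the effective batch size exceeds what fits in memory. A minimal PyTorch sketch, with all sizes hypothetical:

```python
import torch
import torch.nn as nn

model = nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
w0 = model.weight.detach().clone()   # snapshot to confirm a single update happened

accum_steps = 4   # 4 micro-batches of 2 samples approximate one batch of 8 (illustrative)
micro_batches = [(torch.randn(2, 8), torch.randn(2, 1)) for _ in range(accum_steps)]

optimizer.zero_grad()
for step, (x, y) in enumerate(micro_batches):
    loss = nn.functional.mse_loss(model(x), y)
    (loss / accum_steps).backward()       # divide so the summed grads match the big batch
    if (step + 1) % accum_steps == 0:     # step only once per accumulation window
        optimizer.step()
        optimizer.zero_grad()
```

Only one micro-batch's activations live in memory at a time; the cost is extra wall-clock time per effective batch.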
A technical guide on fine-tuning the large FLAN-T5 XXL model efficiently using LoRA and Hugging Face libraries on a single GPU.
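LoRA's core idea (freeze the pretrained weight, learn a low-rank additive update) can be shown without the Hugging Face libraries; the `LoRALinear` class below is a hypothetical sketch of the technique, not the peft API:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update (illustrative sketch)."""
    def __init__(self, base: nn.Linear, rank: int = 4, alpha: float = 8.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False       # pretrained weights stay frozen
        # Effective weight is W + (alpha/rank) * B @ A; only A and B train.
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init: no change at start
        self.scaling = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scaling * (x @ self.A.T) @ self.B.T

layer = LoRALinear(nn.Linear(64, 64))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
total = sum(p.numel() for p in layer.parameters())
```

With rank 4 on a 64x64 layer, only the two low-rank factors (512 parameters here) receive gradients, which is why a model as large as FLAN-T5 XXL can be finetuned on a single GPU.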
Guide to fine-tuning the large FLAN-T5 XXL model using Amazon SageMaker managed training and DeepSpeed for optimization.
Argues against the 'lossy compression' analogy for LLMs like ChatGPT, proposing instead that they are simulators creating temporary simulacra.
A curated reading list of key academic papers for understanding the development and architecture of large language models and transformers.
Analyzes the limitations of AI chatbots like ChatGPT in providing accurate technical answers and discusses the need for curated data and human experts.
Learn to optimize GPT-J inference using DeepSpeed-Inference and Hugging Face Transformers for faster GPU performance.
An engineer shares insights and tutorials on applying Cohere's large language models to real-world use cases such as prompt engineering and semantic search.
Explores how Large Language Models perform implicit Bayesian inference through in-context learning, connecting exchangeable sequence models to prompt-based learning.
Explains how retrieval-augmented language models like RETRO achieve GPT-3 performance with far fewer parameters by querying external knowledge.
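The retrieval-augmentation idea can be sketched minimally: embed a query, look up its nearest neighbors in an external store, and prepend them to the model's input. This toy example (documents, embeddings, and similarity scores all made up) omits RETRO's chunked cross-attention entirely:

```python
import numpy as np

# Toy "external database": documents with precomputed embeddings (hypothetical data).
docs = ["LoRA freezes base weights.", "RETRO retrieves text chunks.", "FSDP shards parameters."]
doc_emb = np.array([[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]])

def retrieve(query_emb, k=1):
    # Cosine similarity of the query against every stored embedding; keep the top-k.
    sims = doc_emb @ query_emb / (np.linalg.norm(doc_emb, axis=1) * np.linalg.norm(query_emb))
    return [docs[i] for i in np.argsort(-sims)[:k]]

query_emb = np.array([0.1, 0.9])   # pretend this embeds "how does RETRO work?"
context = retrieve(query_emb)
prompt = " ".join(context) + " Q: how does RETRO work?"
```

Because knowledge lives in the database rather than in the weights, the language model itself can stay small; RETRO pairs this lookup with cross-attention over the retrieved chunks instead of plain prompt concatenation.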