Philipp Schmid

Philipp Schmid is a Staff Engineer at Google DeepMind, building AI Developer Experience and DevRel initiatives. He specializes in LLMs, RLHF, and making advanced AI accessible to developers worldwide.

https://www.philschmid.de

RSS Feed

1/22/2026

AI LLMs developer experience Google DeepMind RLHF

Articles from this Blog

189 articles from this blog

9/20/2023 • EN

Fine-tune Falcon 180B with DeepSpeed ZeRO, LoRA and Flash Attention

A technical guide on fine-tuning the massive Falcon 180B language model using DeepSpeed ZeRO, LoRA, and Flash Attention for efficient training.

large language models Lora Deepspeed

9/12/2023 • EN

Fine-tune Falcon 180B with QLoRA and Flash Attention on Amazon SageMaker

A technical guide on fine-tuning the massive Falcon 180B language model using QLoRA and Flash Attention on Amazon SageMaker.

Amazon Sagemaker Qlora LLM Fine Tuning

9/7/2023 • EN

Deploy Falcon 180B on Amazon SageMaker

A technical guide on deploying the Falcon 180B open-source large language model to Amazon SageMaker using the Hugging Face LLM DLC.

Hugging Face Amazon Sagemaker Text Generation Inference

8/31/2023 • EN

Optimize open LLMs using GPTQ and Hugging Face Optimum

A guide to using GPTQ quantization with Hugging Face Optimum to compress open-source LLMs for efficient deployment on smaller hardware.

llm Hugging Face Quantization

8/15/2023 • EN

LLMOps: Deploy Open LLMs using Infrastructure as Code with AWS CDK

A technical guide on deploying open-source LLMs like Llama 2 using Infrastructure as Code with AWS CDK and the Hugging Face LLM construct.

Hugging Face Infrastructure As Code AWS Cdk

8/7/2023 • EN

Deploy Llama 2 7B/13B/70B on Amazon SageMaker

A technical guide on deploying Meta's Llama 2 large language models (7B, 13B, 70B) on Amazon SageMaker using the Hugging Face LLM DLC.

Hugging Face Amazon Sagemaker Text Generation Inference

8/3/2023 • EN

Introducing EasyLLM - streamline open LLMs

Introduces EasyLLM, an open-source Python package for streamlining work with open large language models via OpenAI-compatible clients.

Python open source llm

7/26/2023 • EN

Extended Guide: Instruction-tune Llama 2

A technical guide on instruction-tuning Meta's Llama 2 model to generate instructions from inputs, enabling personalized LLM applications.

aws ec2 Instruction Tuning Llama 2

7/21/2023 • EN

LLaMA 2 - Every Resource you need

A comprehensive guide to Meta's LLaMA 2 open-source language model, covering resources, playgrounds, benchmarks, and technical details.

open source ai development Large Language Model

7/18/2023 • EN

Fine-tune LLaMA 2 (7-70B) on Amazon SageMaker

A technical guide on fine-tuning LLaMA 2 models (7B to 70B) using QLoRA and PEFT on Amazon SageMaker for efficient large language model adaptation.

Amazon Sagemaker Peft Model Fine Tuning

7/13/2023 • EN

Train LLMs using QLoRA on Amazon SageMaker

A technical guide on using QLoRA to efficiently fine-tune the Falcon 40B large language model on Amazon SageMaker.

Hugging Face Amazon Sagemaker Parameter Efficient Fine Tuning

7/4/2023 • EN

Deploy LLMs with Hugging Face Inference Endpoints

A guide to deploying open-source Large Language Models (LLMs) like Falcon using Hugging Face's managed Inference Endpoints service.

Machine Learning api Hugging Face

6/28/2023 • EN

Optimize and Deploy BERT on AWS inferentia2

A tutorial on optimizing and deploying a BERT model for low-latency inference using AWS Inferentia2 accelerators and Amazon SageMaker.

Amazon Sagemaker Model Optimization Bert

6/20/2023 • EN

Securely deploy LLMs inside VPCs with Hugging Face and Amazon SageMaker

A technical guide on deploying open-source Large Language Models (LLMs) from Amazon S3 to Amazon SageMaker using Hugging Face's LLM Inference Container within a VPC.

Hugging Face Amazon Sagemaker AWS Vpc

6/7/2023 • EN

Deploy Falcon 7B and 40B on Amazon SageMaker

A technical guide on deploying the open-source Falcon 7B and 40B large language models to Amazon SageMaker using the Hugging Face LLM Inference Container.

Hugging Face LLM Inference Model Deployment