Philipp Schmid

Philipp Schmid is a Staff Engineer at Google DeepMind, building AI Developer Experience and DevRel initiatives. He specializes in LLMs, RLHF, and making advanced AI accessible to developers worldwide.

https://www.philschmid.de

1/22/2026

AI LLMs developer experience Google DeepMind RLHF

Articles from this Blog

189 articles from this blog

1/17/2025 • EN

How to use Anthropic MCP Server with open LLMs, OpenAI or Google Gemini

A guide on using Anthropic's Model Context Protocol (MCP) to connect AI agents with tools and data sources using various LLMs like OpenAI or Gemini.

LLM MCP OpenAI

1/17/2025 • EN

Bite: How Deepseek R1 was trained

Explains the training of DeepSeek-R1, focusing on the Group Relative Policy Optimization (GRPO) reinforcement learning method.

Reinforcement Learning DeepSeek LLM Training

12/25/2024 • EN

Fine-tune classifier with ModernBERT in 2025

A tutorial on fine-tuning the ModernBERT model for classification tasks to build an efficient LLM router, covering setup, training, and evaluation.

Classification Fine Tuning BERT

12/20/2024 • EN

How to fine-tune open LLMs in 2025 with Hugging Face

A technical guide on optimizing and scaling the fine-tuning of open-source large language models using Hugging Face tools in 2025.

Hugging Face PEFT Distributed Training

12/3/2024 • EN

Deploy QwQ-32B-Preview the best open Reasoning Model on AWS with Hugging Face

A technical guide on deploying the QwQ-32B-Preview open-source reasoning model on AWS SageMaker using Hugging Face's tools.

AWS Hugging Face Amazon SageMaker

10/17/2024 • EN

Deploy Llama 3.2 Vision on Amazon SageMaker

A technical guide on deploying Meta's Llama 3.2 Vision model on Amazon SageMaker using the Hugging Face LLM DLC.

large language models Hugging Face Llama 3.2

9/30/2024 • EN

How to Fine-Tune Multimodal Models or VLMs with Hugging Face TRL

A technical guide on fine-tuning Vision-Language Models (VLMs) using Hugging Face's TRL library for custom applications like image-to-text generation.

Hugging Face Fine Tuning Multimodal Models

9/24/2024 • EN

Evaluate open LLMs with Vertex AI and Gemini

A technical guide on using Google's Vertex AI Gen AI Evaluation Service with Gemini to evaluate open LLMs like Llama 3.1.

Gemini LLM Evaluation Model Deployment

9/19/2024 • EN

Evaluate LLMs using Evaluation Harness and Hugging Face TGI/vLLM

A guide to evaluating Large Language Models (LLMs) using the Evaluation Harness framework and optimized serving tools like Hugging Face TGI and vLLM.

benchmarking LLM Evaluation Evaluation Harness

8/5/2024 • EN

Deploy open LLMs with Terraform and Amazon SageMaker

A guide to deploying open-source LLMs like Llama 3 to Amazon SageMaker using Terraform for Infrastructure as Code.

Machine Learning Infrastructure As Code Terraform

7/11/2024 • EN

LLM Evaluation doesn't need to be complicated

A guide to simplifying LLM evaluation workflows using clear metrics, chain-of-thought, and few-shot prompts, inspired by real-world examples.

generative AI large language models Chatbot

6/28/2024 • EN

Evaluating Open LLMs with MixEval: The Closest Benchmark to LMSYS Chatbot Arena

Introduces MixEval, a cost-effective LLM benchmark with high correlation to Chatbot Arena, for evaluating open-source language models.

open source large language models benchmark

6/25/2024 • EN

Train and Deploy open Embedding Models on Amazon SageMaker

A guide to fine-tuning and deploying custom embedding models for RAG applications on Amazon SageMaker using Sentence Transformers v3.

RAG Hugging Face Amazon SageMaker

6/18/2024 • EN

Deploy Mixtral 8x7B on AWS Inferentia2 with Hugging Face Optimum

A technical guide on deploying the Mixtral 8x7B LLM on AWS Inferentia2 using Hugging Face Optimum and Amazon SageMaker.

Amazon SageMaker Hugging Face Optimum LLM Deployment

6/11/2024 • EN

Fine-tune Llama 3 with PyTorch FSDP and Q-Lora on Amazon SageMaker

A technical guide on fine-tuning the Llama 3 LLM using PyTorch FSDP and Q-Lora on Amazon SageMaker for efficient training.

Llama 3 Fine Tuning Amazon SageMaker

6/4/2024 • EN

Fine-tune Embedding models for Retrieval Augmented Generation (RAG)

A guide to fine-tuning embedding models for RAG applications using Sentence Transformers 3, featuring Matryoshka Representation Learning for efficiency.

RAG Fine Tuning Sentence Transformers