Inference Endpoints articles

5/2/2024 • EN

Deploy open LLMs with vLLM on Hugging Face Inference Endpoints

A tutorial on deploying open-source large language models (LLMs) like Llama 3 using the vLLM framework on Hugging Face Inference Endpoints.

Hugging Face Inference Endpoints large language models LLM Deployment Vllm

Philipp Schmid

12/20/2023 • EN

Programmatically manage 🤗 Inference Endpoints

Learn to programmatically manage Hugging Face Inference Endpoints using the huggingface_hub Python library for automated model deployment.

generative ai Huggingface_hub Inference Endpoints Infrastructure As Code Python

Philipp Schmid

7/4/2023 • EN

Deploy LLMs with Hugging Face Inference Endpoints

A guide to deploying open-source Large Language Models (LLMs) like Falcon using Hugging Face's managed Inference Endpoints service.

api Hugging Face Inference Endpoints LLM Deployment Machine Learning

Philipp Schmid

3/3/2023 • EN

Controlled text-to-image generation with ControlNet on Inference Endpoints

Learn how to deploy and use ControlNet for controlled text-to-image generation via Hugging Face Inference Endpoints as a scalable API.

Controlnet Custom Handler Diffusion Models Inference Endpoints Text To Image Generation

Philipp Schmid

12/20/2022 • EN

Managed Transcription with OpenAI Whisper and Hugging Face Inference Endpoints

A tutorial on deploying OpenAI's Whisper speech recognition model using Hugging Face Inference Endpoints for scalable transcription APIs.

Automatic Speech Recognition Hugging Face Inference Endpoints openai whisper Transformer

Philipp Schmid

12/15/2022 • EN

Stable Diffusion Inpainting example with Hugging Face inference Endpoints

A tutorial on using Hugging Face Inference Endpoints to deploy and run Stable Diffusion 2 for AI image inpainting via a custom API.

generative ai Hugging Face Inference Endpoints Inpainting stable diffusion

Philipp Schmid

11/28/2022 • EN

Stable Diffusion with Hugging Face Inference Endpoints

A tutorial on deploying Stable Diffusion 2.0 for image generation using Hugging Face Inference Endpoints and integrating it via an API.

Diffusers Hugging Face Inference Endpoints stable diffusion Text To Image

Philipp Schmid

11/17/2022 • EN

Multi-Model GPU Inference with Hugging Face Inference Endpoints

Learn how to deploy multiple ML models on a single GPU using Hugging Face Inference Endpoints for scalable, cost-effective inference.

Gpu Inference Hugging Face Inference Endpoints Machine Learning Multi Model Inference

Philipp Schmid

10/25/2022 • EN

Deploy T5 11B for inference for less than $500

A tutorial on deploying the T5 11B language model for inference using Hugging Face Inference Endpoints on a budget.

Hugging Face Inference Endpoints Model Deployment T5 Model Transformer

Philipp Schmid

10/6/2022 • EN

Deploy LayoutLM with Hugging Face Inference Endpoints

A tutorial on deploying the LayoutLM document understanding model using Hugging Face Inference Endpoints for production API integration.

Document Understanding Hugging Face Inference Endpoints Layoutlm Transformer Model

Philipp Schmid

9/29/2022 • EN

Custom Inference with Hugging Face Inference Endpoints

A tutorial on creating custom inference handlers for Hugging Face Inference Endpoints to add business logic and dependencies.

Custom Handler Hugging Face Inference Endpoints Machine Learning Transformers

Philipp Schmid

Inference Endpoints Articles

Deploy open LLMs with vLLM on Hugging Face Inference Endpoints

Programmatically manage 🤗 Inference Endpoints

Deploy LLMs with Hugging Face Inference Endpoints

Controlled text-to-image generation with ControlNet on Inference Endpoints

Managed Transcription with OpenAI Whisper and Hugging Face Inference Endpoints

Stable Diffusion Inpainting example with Hugging Face inference Endpoints

Stable Diffusion with Hugging Face Inference Endpoints

Multi-Model GPU Inference with Hugging Face Inference Endpoints

Deploy T5 11B for inference for less than $500

Deploy LayoutLM with Hugging Face Inference Endpoints

Custom Inference with Hugging Face Inference Endpoints

Select Language

We use cookies