Deepspeed Inference Articles

Page 1 of 1 (3 articles)

11/8/2022 • EN

Learn to optimize Stable Diffusion for faster GPU inference using DeepSpeed-Inference and Hugging Face Diffusers.

aws ec2 Deepspeed Inference Gpu Optimization Hugging Face Diffusers stable diffusion

9/13/2022 • EN

Learn to optimize GPT-J inference using DeepSpeed-Inference and Hugging Face Transformers for faster GPU performance.

Deepspeed Inference Gpt J Gpu Optimization large language models Transformer Models

8/16/2022 • EN

Learn to optimize BERT and RoBERTa models for faster GPU inference using DeepSpeed-Inference, reducing latency from 30ms to 10ms.

Bert Deepspeed Inference Gpu Inference Optimization Transformers

Select Language