Accelerating Stable Diffusion Inference on Sapphire Rapids using OpenVINO and Intel AI Optimizations
AI Impact Summary
Intel Sapphire Rapids enables substantial acceleration for Stable Diffusion when paired with OpenVINO and Intel-specific PyTorch optimizations. The article demonstrates an end-to-end path: swapping the Diffusers pipeline for Optimum Intel's OVStableDiffusionPipeline to export the model to OpenVINO, and, on the PyTorch side, applying system-level tuning (jemalloc, libiomp, numactl) plus IPEX bf16 to leverage AMX/VNNI. Reported latency drops from about 32 seconds with the vanilla pipeline to 16.7 seconds with OpenVINO, and to 4.7 seconds once input shapes are fixed, pointing to 2-10x throughput gains over Ice Lake depending on configuration. Cloud provisioning notes (EC2 R7iz/Sapphire Rapids preview) imply a migration window and environment changes for production deployments.
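For the OpenVINO path, here is a minimal sketch of the export-and-reshape flow summarized above, assuming Optimum Intel with the OpenVINO extra is installed (e.g. `pip install optimum[openvino]`); the model id and prompt are stand-ins, not necessarily the ones used in the article. The static `reshape` call is what accounts for the reported drop from 16.7 s to 4.7 s.

```python
from optimum.intel import OVStableDiffusionPipeline

# Export the PyTorch checkpoint to OpenVINO IR on the fly (export=True).
model_id = "stabilityai/stable-diffusion-2-1"  # stand-in; use your deployed checkpoint
pipe = OVStableDiffusionPipeline.from_pretrained(model_id, export=True)

# Dynamic shapes work out of the box (~16.7 s reported); fixing the input
# shapes lets OpenVINO specialize and precompile the graph (~4.7 s reported).
pipe.reshape(batch_size=1, height=512, width=512, num_images_per_prompt=1)
pipe.compile()

image = pipe("sailing ship in a storm by Rembrandt").images[0]
image.save("ship.png")
```

The trade-off of static shapes is that the pipeline then only accepts requests matching the compiled dimensions, so this fits best behind an endpoint with a fixed output size.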
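For the PyTorch path, a hedged sketch of the IPEX bf16 optimization follows, with the system-level tuning shown as launch-time settings in the leading comments. The library paths, model id, and script name are assumptions and will vary by distribution; this is an illustration of the technique, not the article's exact script.

```python
# System-level tuning is applied at launch time (paths are assumptions;
# adjust to your system), e.g.:
#   LD_PRELOAD=/usr/lib/x86_64-linux-gnu/libjemalloc.so:/opt/intel/oneapi/lib/libiomp5.so \
#   numactl --cpunodebind=0 --membind=0 python generate.py
import torch
import intel_extension_for_pytorch as ipex
from diffusers import StableDiffusionPipeline

model_id = "runwayml/stable-diffusion-v1-5"  # stand-in checkpoint
pipe = StableDiffusionPipeline.from_pretrained(model_id)

# Optimize the compute-heavy submodules with IPEX in bf16 so that the
# AMX tile instructions on Sapphire Rapids are exercised.
pipe.unet = ipex.optimize(pipe.unet.eval(), dtype=torch.bfloat16)
pipe.vae = ipex.optimize(pipe.vae.eval(), dtype=torch.bfloat16)
pipe.text_encoder = ipex.optimize(pipe.text_encoder.eval(), dtype=torch.bfloat16)

# Run inference under bf16 autocast on CPU.
with torch.inference_mode(), torch.cpu.amp.autocast(dtype=torch.bfloat16):
    image = pipe("sailing ship in a storm by Rembrandt").images[0]
image.save("ship_ipex.png")
```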
Affected Systems
- Date: not specified
- Change type: capability
- Severity: info