InfoCapability

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

AI Impact Summary

Hugging Face has enabled accelerated Stable Diffusion XL (SDXL) inference on Google Cloud TPUs v5e by integrating JAX and Diffusers. This leverages JAX's just-in-time (jit) compilation and XLA compiler-driven parallelism to achieve high throughput and cost-efficiency, particularly for image generation workloads. The demo showcases a 4-chip TPU v5e instance generating four 1024x1024 images in approximately 2.3 seconds, highlighting the potential for significant speed improvements compared to traditional inference methods.

Affected Systems

Stable Diffusion XLJAX

Date: Date not specified
Change type: capability
Severity: info

Accelerating Stable Diffusion XL Inference with JAX on Cloud TPU v5e

More from Hugging Face

Get alerts for Hugging Face