Scaleway Generative APIs becomes a Hugging Face Inference Provider with gpt-oss, Qwen3, DeepSeek R1, and Gemma 3
AI Impact Summary
Scaleway Generative APIs is now a supported Inference Provider on the Hugging Face Hub, enabling serverless, managed inference directly from model pages and through the Hugging Face Python and JavaScript SDKs. Supported models include gpt-oss, Qwen3, DeepSeek R1, and Gemma 3. Calls can be made in two modes: with a custom key, where requests go directly to Scaleway and are billed by Scaleway, or routed by Hugging Face, where requests are billed at Hugging Face rates. The announcement highlights European data residency (Paris data centers), sub-200 ms first-token latency, pricing starting at €0.20 per million tokens, and capabilities including structured outputs, function calling, and multimodal support.
Affected Systems
- Date: not specified
- Change type: capability
- Severity: info