Together AI: Serverless Model Bring Ups & Prepaid Billing
AI Impact Summary
Together AI has initiated a significant shift in its serverless model offerings, adding several new models including deepseek-ai/DeepSeek-V4-Pro, google/gemma-4-31B-it, and Wan-AI/wan2.7-i2v. This expansion is coupled with a complete transition to a prepaid billing model and the deprecation of dynamic rate limits, impacting existing users and requiring careful monitoring of costs. The move to Dedicated Container Inference (DCI) further expands deployment options, but necessitates a migration from the legacy Python SDK v1.
Affected Systems
- Date
- Date not specified
- Change type
- capability
- Severity
- info