Together AI Batch API: 50% Cost Savings for LLM Processing
AI Impact Summary
Together AI has launched a Batch API for processing large volumes of LLM requests, offering a 50% cost reduction compared to real-time inference. The API targets asynchronous workloads such as data transformations and synthetic data generation, which don't require immediate responses. It accepts up to 50,000 requests per batch and works with models such as DeepSeek-V3 and Llama 3, providing a scalable option for businesses that need to process large amounts of data efficiently.
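As a concrete illustration of how such a batch workload might be prepared, the sketch below builds a JSONL input file with one request per line. The field names (`custom_id`, `body`, `messages`) and the model identifier are assumptions modeled on common batch-API conventions, not a confirmed Together AI schema; consult the official Batch API documentation for the actual format.

```python
import json

def build_batch_file(prompts, path="batch_requests.jsonl",
                     model="deepseek-ai/DeepSeek-V3"):
    """Write one JSON request per line for batch submission.

    NOTE: field names and the model string are illustrative
    assumptions, not the confirmed Together AI request schema.
    """
    # The summary above notes a 50,000-requests-per-batch limit.
    if len(prompts) > 50_000:
        raise ValueError("Batch exceeds the 50,000-request limit")
    with open(path, "w") as f:
        for i, prompt in enumerate(prompts):
            request = {
                "custom_id": f"req-{i}",  # lets results be matched back later
                "body": {
                    "model": model,
                    "messages": [{"role": "user", "content": prompt}],
                },
            }
            f.write(json.dumps(request) + "\n")
    return path
```

The resulting file would then be uploaded and the batch submitted asynchronously; results arrive later at the discounted rate, which is why this pattern suits non-interactive jobs like bulk transformations.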
Affected Systems
- Date: not specified
- Change type: capability
- Severity: info