OpenAI introduces Whisper speech-to-text capability in the OpenAI API
AI Impact Summary
Whisper adds a dedicated speech-to-text capability to the OpenAI API, enabling transcription of audio files across multiple languages. This expands input modalities for voice-enabled apps, captions, and media workflows, allowing automated transcription without external ASR services. Teams should anticipate larger payloads for audio, potential latency differences, and data governance implications for processing voice data, including storage and compliance considerations.
Affected Systems
- Date
- Date not specified
- Change type
- capability
- Severity
- medium