MediumCapability

OpenAI introduces Whisper speech-to-text capability in the OpenAI API

AI Impact Summary

Whisper adds a dedicated speech-to-text capability to the OpenAI API, enabling transcription of audio files across multiple languages. This expands input modalities for voice-enabled apps, captions, and media workflows, allowing automated transcription without external ASR services. Teams should anticipate larger payloads for audio, potential latency differences, and data governance implications for processing voice data, including storage and compliance considerations.

Affected Systems

WhisperOpenAI API

Date: Date not specified
Change type: capability
Severity: medium

OpenAI introduces Whisper speech-to-text capability in the OpenAI API

More from OpenAI

Get alerts for OpenAI