OpenAI GPT-4o Audio Models
Build Powerful Voice Agents
2025-03-21

New OpenAI audio models for developers: gpt-4o powered speech-to-text (more accurate than Whisper) and steerable text-to-speech. Build voice agents, transcriptions, and more.
OpenAI GPT-4o Audio Models empower developers with advanced speech-to-text and customizable text-to-speech capabilities, surpassing Whisper in accuracy. Ideal for creating voice agents, transcription tools, and other voice-driven applications, these models enable seamless, high-quality audio interactions.
Artificial Intelligence
Audio
Development