The three are GPT-Realtime-2, a successor to the company’s existing realtime voice model with what OpenAI describes as GPT-5-class reasoning; GPT-Realtime-Translate, a live translation model with more ...
Xiaomi has open-sourced OmniVoice, a multilingual AI voice cloning model supporting hundreds of languages with fast speech ...
OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more conversational and capable of completing tasks in real time.