OpenAI releases voice Agent
On March 21 at 01:00 (UTC+8), OpenAI conducted a technical live broadcast and released three new voice models specifically for developing AI Voice Agents. Two are voice-to-text models, GPT-40 Transcribe and GPT-4 Mini Transcribe; one is a text-to-speech model, GPT-40 Mini TTS. It's worth mentioning that developers can control the vocal emotion and style of the GPT-40 Mini TTS model. OpenAI has added a powerful streaming mode to its voice-to-text API, allowing developers to input continuous audio streams into the model in real time, and the model can also return continuous text and responses in real time. This feature of real-time interaction is very helpful for applications that require immediate feedback, such as real-time voice dialogue systems, transcription of voice meetings etc. (AIGC Open Community)
Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.
You may also like
BTC Surpasses 110,000 USD
US Treasury Secretary: Trump Has Brought Substantial Improvement to US Inflation
All Three Major U.S. Stock Indexes Turn Negative
US President Trump: The Federal Reserve Should Cut Interest Rates by 1%
Trending news
MoreCrypto prices
More








