OpenAI said Thursday that its API will now include several new voice intelligence features designed to help developers create apps that can talk with users and transcribe and translate their conversations. The company's new GPT-Realtime-2 is a voice model built to create a realistic vocal simulation that can converse with users. Unlike its predecessor (GPT-Realtime-1.5), however, this one is built with GPT-5-class reasoning, which OpenAI says was created to deal with more complicated requests from users. The company is also launching GPT-Realtime-Translate, which, just as it sounds, is designed to provide real-time translation that "keeps pace" with the user in conversation. The feature supports more than 70 input languages (the languages it can comprehend) and 13 output languages (the languages it relays to the speaker). "Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do...