Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Meet Rassvet, Russia’s Answer to Starlink

Meet Rassvet, Russia’s Answer to Starlink

8 May 2026
Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

8 May 2026
OpenAI’s new voice AI can listen, think, and talk back in 70+ languages

OpenAI’s new voice AI can listen, think, and talk back in 70+ languages

8 May 2026
The Canvas Hack Is a New Kind of Ransomware Debacle

The Canvas Hack Is a New Kind of Ransomware Debacle

8 May 2026
Even brief AI use could hurt your ability to think, a new study finds

Even brief AI use could hurt your ability to think, a new study finds

8 May 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » OpenAI’s new voice AI can listen, think, and talk back in 70+ languages
Tech News

OpenAI’s new voice AI can listen, think, and talk back in 70+ languages

By technologistmag.com8 May 20262 Mins Read
OpenAI’s new voice AI can listen, think, and talk back in 70+ languages
Share
Facebook Twitter Reddit Telegram Pinterest Email

OpenAI has launched three new audio models in its Realtime API, and they are a big deal for anyone building voice-powered apps. The three models are GPT-Realtime-2, GPT-Realtime-Translate, and GPT-Realtime-Whisper. 

Together, they move voice AI beyond simple back-and-forth responses toward something that can understand you, take action, and keep up with a real conversation.

If their demo is anything to go by, we have just seen the next evolution in how voice AI models work. 

So what can these models actually do?

GPT-Realtime-2 is the headline act. It brings GPT-5-class reasoning to live voice interactions, meaning it can handle harder requests without dropping the thread of the conversation. 

It can call multiple tools simultaneously and even narrate what it’s doing with phrases like “checking your calendar” or “let me look into that.” It also has a larger context window of 128K tokens, which means longer, more coherent sessions. Developers can even adjust the reasoning effort based on the complexity of the request.

GPT-Realtime-Translate is probably my favorite. It’s the closest we have come to having Star Trek’s Universal Translator in real life. It supports live speech translation across 70+ input languages and 13 output languages. 

The best part of the demo was that even when a new person joined and spoke a different language, GPT-Realtime-Translate had no issues in translating both speakers into English in real time. 

Finally, there’s the GPT-Realtime-Whisper. Most speech-to-text models wait for the speaker to finish before providing the full translation. This one is a streaming transcription model that converts speech to text as the speaker talks. It is useful for live captions, meeting notes, and any voice-powered workflow where waiting for a transcription is not an option.

Can anyone use these new voice AI models?

Currently, OpenAI has released these models for developers. But the apps they build will affect everyone. For example, a developer can build a real-time translator app, allowing users to converse with people in different languages. 

Many companies are already testing these new models. Zillow is building a voice assistant that can search homes and schedule tours from a single spoken request. Priceline can check your flights and hotels, cancel them, and book new ones. Vimeo is using it for real-time transcription, and so on. 

new voice ai model working inside Priceline

Pricing starts at $0.017 per minute for Whisper, $0.034 per minute for Translate, and $32 per 1M audio input tokens for GPT-Realtime-2.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleThe Canvas Hack Is a New Kind of Ransomware Debacle
Next Article Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

Related Articles

Meet Rassvet, Russia’s Answer to Starlink

Meet Rassvet, Russia’s Answer to Starlink

8 May 2026
Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

8 May 2026
The Canvas Hack Is a New Kind of Ransomware Debacle

The Canvas Hack Is a New Kind of Ransomware Debacle

8 May 2026
Even brief AI use could hurt your ability to think, a new study finds

Even brief AI use could hurt your ability to think, a new study finds

8 May 2026
Google responds to Chrome’s silent Gemini Nano install, stops short of addressing consent

Google responds to Chrome’s silent Gemini Nano install, stops short of addressing consent

8 May 2026
Google pulls the plug on Project Mariner, the AI agent that browsed the web like a human

Google pulls the plug on Project Mariner, the AI agent that browsed the web like a human

7 May 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss
Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

Nintendo is raising Switch 2 price in the US, but there’s still time left to snag one for less

By technologistmag.com8 May 2026

Nintendo is the latest company to bend its knee in the face of a pricing…

OpenAI’s new voice AI can listen, think, and talk back in 70+ languages

OpenAI’s new voice AI can listen, think, and talk back in 70+ languages

8 May 2026
The Canvas Hack Is a New Kind of Ransomware Debacle

The Canvas Hack Is a New Kind of Ransomware Debacle

8 May 2026
Even brief AI use could hurt your ability to think, a new study finds

Even brief AI use could hurt your ability to think, a new study finds

8 May 2026
Google responds to Chrome’s silent Gemini Nano install, stops short of addressing consent

Google responds to Chrome’s silent Gemini Nano install, stops short of addressing consent

8 May 2026
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2026 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.