Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On

Alcatel V3 Pro 5G, V3 Classic 5G Teased Ahead of May 27 India Launch

21 May 2025

Which Microsoft Surface Is Best for You?

21 May 2025

Lies of P: Overture makes a great Soulslike more approachable than ever

21 May 2025

iQOO Watch 5 With 1.43-Inch AMOLED Display and TWS Air 3 With Up to 45 Hours of Total Battery Life Launched

21 May 2025

CyberPowerPC India Announces ‘Play Guarantee’ for a Transparent Buying Experience

21 May 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » Google I/O 2025: Gemini 2.5 AI Models Upgraded With Deep Think Mode, Native Audio Output
Apps

Google I/O 2025: Gemini 2.5 AI Models Upgraded With Deep Think Mode, Native Audio Output

By technologistmag.com20 May 20254 Mins Read
Share
Facebook Twitter Reddit Telegram Pinterest Email

Google showcased several new features for the Gemini 2.5 family of artificial intelligence (AI) models at the Google I/O 2025 on Tuesday. The Mountain View-based tech giant introduced an enhanced reasoning mode dubbed Deep Think, which is powered by the Gemini 2.5 Pro model. It also unveiled a new, natural and human-like speech called Native Audio Output, which will be available via the Live application programming interface (API). Additionally, the company is also bringing thought summaries and thinking budgets with the latest Gemini models for developers.

Gemini 2.5 Pro Ranks on top of the LMArena Leaderboard

In a blog post, the tech giant detailed all the new capabilities and features that it will be shipping to the Gemini 2.5 AI model series throughout the next few months. Earlier this month, Google released an updated version of the Gemini 2.5 Pro with improved coding capabilities. The updated model also ranked in the top position on the WebDev Arena and LMArena leaderboards.

Now, Google is improving the AI model further with the Deep Think mode. The new reasoning mode allows Gemini 2.5 Pro to consider multiple hypotheses before responding. The company says it uses a different research technique compared to the Thinking versions of the older models.

Based on internal testing, the tech giant shared the reasoning mode’s benchmark scores across different parameters. Notably, the Gemini 2.5 Pro Deep Think is claimed to score 49.4 percent on the 2025 UAMO, one of the toughest mathematics benchmark tests. It also scores competitively on LiveCodeBench v6 and MMMU.

Deep Think is currently under testing, and Google says it is conducting safety evaluations and getting input from safety experts. Currently, the reasoning mode is only available to trusted testers via the Gemini API. There is no word on its release date.

Google also announced adding new capabilities to the Gemini 2.5 Flash model, which was released just a month ago. The company said the AI model’s key benchmarks for reasoning, multimodality, code and long context have been improved. Additionally, it is also more efficient and uses 20-30 percent fewer tokens, the company claimed.

This new version of Gemini 2.5 Flash is currently available in preview to developers via Google AI Studio. Enterprises can access it via the Vertex AI platform, and individuals can find it in the Gemini app. Notably, the model will be widely available for production in June.

Developers accessing the Live API will now get a new feature with the Gemini 2.5 series of AI models. The company is introducing a preview version of Native Audio Output, which can generate speech in a more expressive and human-like manner. Google said the feature allows users to control the tone, accent, and style of speech generated.

The early version of the capability comes with three features. First is Affective Dialogue, where the AI model can detect emotions in the user’s voice and respond accordingly. The second is Proactive Audio, which enables the model to ignore background conversations and only respond when it is spoken to. And finally, Thinking, which lets the speech generation leverage Gemini’s thinking capabilities to verbally answer complex queries.

Apart from this, the 2.5 Pro and Flash models in the Gemini API and in Vertex AI will also show thought summaries. These are essentially the model’s raw thought process, which were previously only visible in Gemini’s reasoning models. Now, Google will show a detailed summary including headers, key details and information about model actions with every response.

In the coming weeks, developers will also be able to use thinking budgets with the Gemini 2.5 Pro. This will allow them to decide how many tokens a model consumes before responding. Finally, Project Mariner’s Computer Use agentic function will also be added to the API and in Vertex AI soon.

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleApple WWDC 2025 to Be Held From June 9 to June 13: All You Need to Know
Next Article Fortnite is finally back on Apple’s App Store … sort of

Related Articles

WhatsApp Had No Plans to Compete With Facebook, Co-Founder Says

21 May 2025

Asus ExpertBook P3 (P3406) Price (21 May 2025) Specification & Reviews । Asus Laptops

21 May 2025

Amazon’s Drones Can Now Deliver New Categories of Devices Like iPhone, AirPods and More

21 May 2025

Google Unveils New Workspace Features at I/O for Meet, Docs, Vids; Gmail Gets Personalised Smart Replies

21 May 2025

Google Introduces Beam, an AI-Driven Communication Platform That Turns 2D Video Into 3D Experiences

20 May 2025

Garmin Forerunner 570 Online at Lowest Price in India

20 May 2025
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss

Which Microsoft Surface Is Best for You?

By technologistmag.com21 May 2025

The Microsoft Surface is the flagship PC brand from the brains behind the Windows operating…

Lies of P: Overture makes a great Soulslike more approachable than ever

21 May 2025

iQOO Watch 5 With 1.43-Inch AMOLED Display and TWS Air 3 With Up to 45 Hours of Total Battery Life Launched

21 May 2025

CyberPowerPC India Announces ‘Play Guarantee’ for a Transparent Buying Experience

21 May 2025

Xiaomi to Equip Premium Smartphones With Snapdragon 8-Series Chips as Part of Multi-Year Agreement

21 May 2025
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2025 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.