Technologist Mag

Google finds AI chatbots are only 69% accurate… at best

By technologistmag.com | 15 December 2025 | 2 Mins Read

Google has published a blunt assessment of how reliable today’s AI chatbots really are, and the numbers are not flattering. Using its newly introduced FACTS Benchmark Suite, the company found that even the best AI models struggle to break past a 70% factual accuracy rate. The top performer, Gemini 3 Pro, reached 69% overall accuracy, while other leading systems from OpenAI, Anthropic, and xAI scored even lower. The takeaway is simple and uncomfortable. These chatbots still get roughly one out of every three answers wrong, even when they sound confident doing it.

The benchmark matters because most existing AI tests focus on whether a model can complete a task, not whether the information it produces is actually true. For industries like finance, healthcare, and law, that gap can be costly. A fluent response that sounds confident but contains errors can do real damage, especially when users assume the chatbot knows what it is talking about.

What Google’s accuracy test reveals

The FACTS Benchmark Suite was built by Google’s FACTS team with Kaggle to directly test factual accuracy across four real-world use cases. One test measures parametric knowledge, which checks whether a model can answer fact-based questions using only what it learned during training. Another evaluates search performance, testing how well models use web tools to retrieve accurate information. A third focuses on grounding, meaning whether the model sticks to a provided document without adding false details. The fourth examines multimodal understanding, such as reading charts, diagrams, and images correctly.

[Figure: AI accuracy rankings by Google’s FACTS benchmark]

The results show sharp differences between models. Gemini 3 Pro led the leaderboard with a 69% FACTS score, followed by Gemini 2.5 Pro and OpenAI’s ChatGPT-5, both at nearly 62%. Grok 4 scored ~54%, while Claude 4.5 Opus landed at ~51%. Multimodal tasks were the weakest area across the board, with accuracy often below 50%. This matters because these tasks involve reading charts, diagrams, or images, where a chatbot could confidently misread a sales graph or pull the wrong number from a document, leading to mistakes that are easy to miss but hard to undo.
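To put those scores in everyday terms, here is a minimal Python sketch that turns each reported score into an error rate and a rough "one answer in N is wrong" figure. It assumes the FACTS score can be read as a plain share of correct answers, which is how the numbers are described above. The percentages are the approximate values quoted in this article, and the names `facts_scores`, `error_rate`, and `one_in_n_wrong` are illustrative only, not part of any Google or Kaggle tool.

```python
# Approximate overall FACTS scores as quoted in the article (percent).
facts_scores = {
    "Gemini 3 Pro": 69,
    "Gemini 2.5 Pro": 62,
    "ChatGPT-5": 62,
    "Grok 4": 54,
    "Claude 4.5 Opus": 51,
}


def error_rate(accuracy_pct):
    """Percent of answers that are wrong, assuming score = share correct."""
    return 100 - accuracy_pct


def one_in_n_wrong(accuracy_pct):
    """Express the error rate as 'about one answer in N is wrong'."""
    wrong = error_rate(accuracy_pct)
    if wrong == 0:
        return float("inf")  # a perfect score means no wrong answers
    return 100 / wrong


for model, score in facts_scores.items():
    n = one_in_n_wrong(score)
    print(f"{model}: {error_rate(score)}% wrong, about 1 answer in {n:.1f}")
```

For the top score of 69%, this works out to 31% wrong, or about one answer in 3.2, which is where the "roughly one in three" figure comes from.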

The takeaway isn’t that chatbots are useless, but that blind trust is risky. Google’s own data suggests AI is improving, yet it still needs verification, guardrails, and human oversight before it can be treated as a reliable source of truth.
