Technologist Mag
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

What's On
Crimson Desert Specs For PC, Console, And Other Platforms Revealed

Crimson Desert Specs For PC, Console, And Other Platforms Revealed

10 March 2026
Our Favorite Earbuds for Samsung Owners Are on Sale

Our Favorite Earbuds for Samsung Owners Are on Sale

10 March 2026
Sonos is wading into budget speaker waters with the new Era 100 SL speaker

Sonos is wading into budget speaker waters with the new Era 100 SL speaker

10 March 2026
Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

10 March 2026
Nvidia is leveling up game visuals with the new DLSS 4.5 update

Nvidia is leveling up game visuals with the new DLSS 4.5 update

10 March 2026
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Technologist Mag
SUBSCRIBE
  • Home
  • Tech News
  • AI
  • Apps
  • Gadgets
  • Gaming
  • Guides
  • Laptops
  • Mobiles
  • Wearables
  • More
    • Web Stories
    • Trending
    • Press Release
Technologist Mag
Home » New study shows AI isn’t ready for office work
Tech News

New study shows AI isn’t ready for office work

By technologistmag.com24 January 20262 Mins Read
New study shows AI isn’t ready for office work
Share
Facebook Twitter Reddit Telegram Pinterest Email

It has been nearly two years since Microsoft CEO Satya Nadella predicted that generative AI would take over knowledge work, but if you look around a typical law firm or investment bank today, the human workforce is still very much in charge. Despite all the hype about “reasoning” and “planning,” a new study from training-data company Mercor explains exactly why the robot revolution is stalled: AI just can’t handle the messiness of real work.

A reality check for the “replacement” theory

Mercor released a new benchmark called APEX-Agents, and it is brutal. unlike the usual tests that ask AI to write a poem or solve a math problem, this one uses actual queries from lawyers, consultants, and bankers. It asks the models to do complete, multi-step tasks that require jumping between different types of information.

The results? Even the absolute best models on the market—we are talking about Gemini 3 Flash and GPT-5.2—couldn’t crack a 25% accuracy rate. Gemini led the pack at 24%, with GPT-5.2 right behind it at 23%. Most others were stuck in the teens.

Why AI is failing the “office test”

Mercor CEO Brendan Foody points out that the issue isn’t raw intelligence; it’s context. In the real world, answers aren’t served up on a silver platter. A lawyer has to check a Slack thread, read a PDF policy, look at a spreadsheet, and then synthesize all that to answer a question about GDPR compliance.

Humans do this context-switching naturally. AI, it turns out, is terrible at it. When you force these models to hunt for information across “scattered” sources, they either get confused, give the wrong answer, or just give up entirely.

The “Unreliable Intern”

For anyone worried about their job security, this is a bit of a relief. The study suggests that right now, AI functions less like a seasoned professional and more like an unreliable intern who gets things right about a quarter of the time.

That said, the progress is terrifyingly fast. Foody noted that just a year ago, these models were scoring between 5% and 10%. Now they are hitting 24%. So, while they aren’t ready to take the wheel yet, they are learning to drive much faster than we expected. For now, though, the “knowledge work” revolution is on hold until the bots learn how to multitask p

Share. Facebook Twitter Pinterest LinkedIn Telegram Reddit Email
Previous ArticleICE Asks Companies About ‘Ad Tech and Big Data’ Tools It Could Use in Investigations
Next Article The Instant Smear Campaign Against Border Patrol Shooting Victim Alex Pretti

Related Articles

Our Favorite Earbuds for Samsung Owners Are on Sale

Our Favorite Earbuds for Samsung Owners Are on Sale

10 March 2026
Sonos is wading into budget speaker waters with the new Era 100 SL speaker

Sonos is wading into budget speaker waters with the new Era 100 SL speaker

10 March 2026
Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

10 March 2026
Nvidia is leveling up game visuals with the new DLSS 4.5 update

Nvidia is leveling up game visuals with the new DLSS 4.5 update

10 March 2026
Pete Hegseth Is Pushing Defense Employees to Volunteer With DHS

Pete Hegseth Is Pushing Defense Employees to Volunteer With DHS

10 March 2026
Sonos ends long product draught with the 9 Play speaker

Sonos ends long product draught with the $299 Play speaker

10 March 2026
Stay In Touch
  • Facebook
  • Twitter
  • Pinterest
  • Instagram
  • YouTube
  • Vimeo

Subscribe to Updates

Get the latest tech news and updates directly to your inbox.

Don't Miss
Our Favorite Earbuds for Samsung Owners Are on Sale

Our Favorite Earbuds for Samsung Owners Are on Sale

By technologistmag.com10 March 2026

While Apple and Pixel owners have their own earbuds with special features, Samsung phone owners…

Sonos is wading into budget speaker waters with the new Era 100 SL speaker

Sonos is wading into budget speaker waters with the new Era 100 SL speaker

10 March 2026
Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

Interstellar Comet 3I/Atlas Has Another Surprise: It’s Full of Alcohol

10 March 2026
Nvidia is leveling up game visuals with the new DLSS 4.5 update

Nvidia is leveling up game visuals with the new DLSS 4.5 update

10 March 2026
Mega Man Voice Actor Ben Diskin Won’t Return For Dual Override, Claims Capcom Won’t Offer Union Contract

Mega Man Voice Actor Ben Diskin Won’t Return For Dual Override, Claims Capcom Won’t Offer Union Contract

10 March 2026
Technologist Mag
Facebook X (Twitter) Instagram Pinterest
  • Privacy
  • Terms
  • Advertise
  • Contact
© 2026 Technologist Mag. All Rights Reserved.

Type above and press Enter to search. Press Esc to cancel.