AI's Freelance Fail: Even Top Agents Struggle to Earn a Living

Artificial Intelligence AI Freelance Work Scale AI Center for AI Safety Automation Job Displacement

October 29, 2025

Source: Wired AI

Reality Check

Media Hype 6/10

Real Impact 8/10

What is the Viqus Verdict?

We evaluate each news story based on its real impact versus its media hype to offer a clear and objective perspective.

AI Analysis:

While AI hype surrounding job displacement remains, this benchmark provides a critical dose of reality, significantly reducing the short-term impact score due to the demonstrable limitations of current models.

Article Summary

Researchers at Scale AI and the Center for AI Safety have developed the Remote Labor Index, a novel benchmark designed to assess the economic viability of frontier AI agents. Their experiment tested leading AI agents—including Manus, Grok, Claude, ChatGPT, and Gemini—across a range of simulated freelance tasks sourced from verified Upwork workers. The results were sobering: the best agents only managed to complete less than 3% of the work, generating a paltry $1,810 from a potential $143,991. The agents’ struggles stem from their inability to handle complex, multi-step tasks, lack of long-term memory, and failure to continually learn like human freelancers. This challenges previous optimistic predictions about widespread AI job displacement, particularly following OpenAI’s GDPval benchmark, which had suggested AI models were nearing human-level performance. The Remote Labor Index highlights a critical gap – AI’s current capabilities are far from ready to replace a significant portion of the freelance workforce, and raises questions about the speed of AI’s evolution.

Key Points

Even the most advanced AI agents perform poorly in economically valuable freelance tasks, achieving less than 3% completion.
AI’s primary limitation is its inability to handle complex, multi-step tasks and lacks the ‘long-term memory’ and continual learning capabilities of human freelancers.
The Remote Labor Index offers a crucial counterpoint to previous optimistic predictions about AI’s immediate impact on the workforce, particularly in light of the GDPval benchmark.

Why It Matters

This news is significant for professionals across numerous sectors. It provides a more grounded and realistic assessment of AI’s current capabilities, directly challenging prevailing narratives of imminent job displacement. It’s crucial for investors, policymakers, and workers to understand the limitations of current AI technology and avoid inflated expectations. The findings complicate the ongoing debate about AI’s societal impact and refocus attention on the practical barriers to AI adoption in the freelance economy.

AI's Freelance Fail: Even Top Agents Struggle to Earn a Living

What is the Viqus Verdict?

Article Summary

Key Points

Why It Matters

You might also be interested in

Robotics Funding Surge: A Decade of Maturation Signals New Opportunity

Levie Shares Cloud Survival Secrets at TechCrunch Disrupt 2025

AI Browser Wars Heat Up: Atlas Launches with a Security Flaw