Viqus Logo Viqus Logo
Home
Categories
Language Models Generative Imagery Hardware & Chips Business & Funding Ethics & Society Science & Robotics
Resources
AI Glossary Academy CLI Tool Labs
About Contact

AI's Freelance Fail: Even Top Agents Struggle to Earn a Living

Artificial Intelligence AI Freelance Work Scale AI Center for AI Safety Automation Job Displacement
October 29, 2025
Source: Wired AI
Viqus Verdict Logo Viqus Verdict Logo 8
Reality Check
Media Hype 6/10
Real Impact 8/10

Article Summary

Researchers at Scale AI and the Center for AI Safety have developed the Remote Labor Index, a novel benchmark designed to assess the economic viability of frontier AI agents. Their experiment tested leading AI agents—including Manus, Grok, Claude, ChatGPT, and Gemini—across a range of simulated freelance tasks sourced from verified Upwork workers. The results were sobering: the best agents only managed to complete less than 3% of the work, generating a paltry $1,810 from a potential $143,991. The agents’ struggles stem from their inability to handle complex, multi-step tasks, lack of long-term memory, and failure to continually learn like human freelancers. This challenges previous optimistic predictions about widespread AI job displacement, particularly following OpenAI’s GDPval benchmark, which had suggested AI models were nearing human-level performance. The Remote Labor Index highlights a critical gap – AI’s current capabilities are far from ready to replace a significant portion of the freelance workforce, and raises questions about the speed of AI’s evolution.

Key Points

  • Even the most advanced AI agents perform poorly in economically valuable freelance tasks, achieving less than 3% completion.
  • AI’s primary limitation is its inability to handle complex, multi-step tasks and lacks the ‘long-term memory’ and continual learning capabilities of human freelancers.
  • The Remote Labor Index offers a crucial counterpoint to previous optimistic predictions about AI’s immediate impact on the workforce, particularly in light of the GDPval benchmark.

Why It Matters

This news is significant for professionals across numerous sectors. It provides a more grounded and realistic assessment of AI’s current capabilities, directly challenging prevailing narratives of imminent job displacement. It’s crucial for investors, policymakers, and workers to understand the limitations of current AI technology and avoid inflated expectations. The findings complicate the ongoing debate about AI’s societal impact and refocus attention on the practical barriers to AI adoption in the freelance economy.

You might also be interested in