AI Agents Make Legal Leap

AI, Artificial Intelligence, Legal Tech, Benchmarks, Anthropic, Opus 4.6, TechCrunch
February 06, 2026
Viqus Verdict: 8
Momentum, Not Mastery
Media Hype 7/10
Real Impact 8/10

Article Summary

Anthropic’s Opus 4.6 has shifted expectations for AI agent benchmarks. Initial reports from Mercor last month showed AI agents struggling in professional domains, with scores below 25%. Opus 4.6 improved substantially on those results, scoring nearly 30% in one-shot trials and an average of 45% after multiple attempts. Crucially, the release included new ‘agent swarms’ designed to facilitate multi-step problem-solving. Although these scores remain far from 100%, they represent a considerable jump and raise questions about how soon AI could take on work previously considered exclusively human, such as legal analysis. Mercor CEO Brendan Foody called the advance ‘insane,’ highlighting the speed of progress. The results point to continued progress in foundation models and suggest that AI capabilities are evolving more quickly than initially anticipated.

Key Points

  • Anthropic’s Opus 4.6 scored nearly 30% in one-shot AI agent trials.
  • The release included ‘agent swarms’ to aid in complex problem-solving.
  • The result is a significant jump from the sub-25% scores Mercor reported last month.

Why It Matters

This news matters for professionals, particularly in legal and business fields. The rapid improvement in AI agent performance suggests a potential shift in the competitive landscape, one that calls on individuals and organizations to assess and adapt to AI’s evolving capabilities. It could have long-term implications for job roles, business processes, and how technology is used overall.
