The Apex-Agents AI benchmark 2026 shows that even GPT-5.2 and Gemini 3 Flash fail to exceed 25% accuracy on real-world professional work tasks.