New benchmark data on AI agent Skills: human-curated instruction sets improved performance by 16 points on average.
When AI coping agents generated their own procedural knowledge instead? Essentially zero benefit.
The domains with the biggest gains were the most specialized — exactly where government legacy systems sit.
Human experts aren't just essential for verifying specs. They're also the ones who have to write the Skills.