Post · bonfire.cafe

New benchmark data on AI agent Skills: human-curated instruction sets improved performance by 16 points on average.

When AI coding agents generated their own procedural knowledge instead? Essentially zero benefit.

The domains with the biggest gains were the most specialized — exactly where government legacy systems sit.

Human experts aren't just essential for verifying specs. They're also the ones who have to write the Skills.

What the Research Is Starting to Tell Us About Agent Skills
