Practical Guide to Evaluating and Testing Agent Skills
A guide to systematically evaluating and testing AI agent skills, covering success criteria, building an evaluation harness, and improving skill performance.
A guide to systematically evaluating and testing AI agent skills, covering success criteria, building an evaluation harness, and improving skill performance.
Learn how to create and use Agent Skills in GitHub Copilot to automate Azure Bicep workflows, moving from manual tasks to efficient, repeatable processes.
Explores how AI is transforming Infrastructure as Code (IaC) by enabling standardization, shifting from writing code to describing intent, and automating tasks.
Introduces Agent Skills, an open format for sharing reusable AI knowledge across projects, replacing project-specific AGENTS.md files.
Explores the open standard for AI agent skills, detailing how tools like Goose use SKILL.md files to provide contextual expertise automatically.
A developer shares how they independently built two features for their personal AI system that were later released by Anthropic for Claude Code.