@hashwnath

Summary

This PR adds a new agentic-eval skill to the skills collection, focused on patterns for evaluating and improving AI agent outputs.

Skill Contents

  • Reflection Pattern: Self-critique and iterative improvement loops (see the sketch after this list)
  • Evaluator-Optimizer Pattern: Separate generation/evaluation components
  • Code-Specific Reflection: Test-driven refinement workflows
  • Evaluation Strategies: Outcome-based, LLM-as-Judge, Rubric-based
  • Best Practices: Clear criteria, iteration limits, convergence checks

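To make the first two patterns concrete, here is a minimal sketch of a reflection / evaluator-optimizer loop. The `generate` and `evaluate` callables, the `Critique` dataclass, the iteration limit, and the score threshold are illustrative assumptions, not the skill's actual API.

```python
# Minimal sketch of a reflection / evaluator-optimizer loop.
# The generate/evaluate callables are hypothetical stand-ins for LLM calls;
# names and thresholds are illustrative, not the skill's actual API.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Critique:
    score: float   # 0.0 - 1.0, higher is better
    feedback: str  # actionable notes fed back to the generator

def refine(
    task: str,
    generate: Callable[[str, str], str],       # (task, feedback) -> draft
    evaluate: Callable[[str, str], Critique],  # (task, draft) -> critique
    max_iters: int = 3,                        # iteration limit (best practice)
    threshold: float = 0.8,                    # convergence check
) -> str:
    draft = generate(task, "")
    for _ in range(max_iters):
        critique = evaluate(task, draft)
        if critique.score >= threshold:        # good enough: stop early
            break
        draft = generate(task, critique.feedback)
    return draft

# Stub components so the sketch runs without an LLM backend.
def fake_generate(task: str, feedback: str) -> str:
    return f"answer to {task!r}" + (" (revised)" if feedback else "")

def fake_evaluate(task: str, draft: str) -> Critique:
    done = "(revised)" in draft
    return Critique(score=1.0 if done else 0.5,
                    feedback="" if done else "add detail")

print(refine("summarize the design doc", fake_generate, fake_evaluate))
```

Keeping the evaluator separate from the generator, as the Evaluator-Optimizer Pattern recommends, makes the critique criteria explicit and the convergence check easy to tune.
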
Use Cases

  • Implementing self-critique and reflection loops
  • Building evaluator-optimizer pipelines for quality-critical generation
  • Creating test-driven code refinement workflows
  • Designing rubric-based or LLM-as-judge evaluation systems (see the sketch after this list)
  • Measuring and improving agent response quality

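For the rubric-based / LLM-as-judge use case, a weighted rubric can be reduced to a single score along these lines. The rubric criteria, weights, 1-5 scale, and `judge` callable are hypothetical placeholders rather than anything defined by the skill.

```python
# Minimal sketch of rubric-based, LLM-as-judge scoring.
# The `judge` callable stands in for an LLM call that returns a 1-5 score
# per criterion; the rubric and weights are illustrative assumptions.
from typing import Callable, Dict

RUBRIC: Dict[str, float] = {  # criterion -> weight (weights sum to 1.0)
    "correctness": 0.5,
    "completeness": 0.3,
    "clarity": 0.2,
}

def score_response(
    prompt: str,
    response: str,
    judge: Callable[[str, str, str], int],  # (prompt, response, criterion) -> 1..5
) -> float:
    # Weighted average of per-criterion judgments, normalized to 0..1.
    total = sum(
        weight * judge(prompt, response, criterion)
        for criterion, weight in RUBRIC.items()
    )
    return total / 5.0

# Stub judge so the sketch runs without a model; a real judge would prompt
# an LLM with the criterion definition and parse a numeric score.
def stub_judge(prompt: str, response: str, criterion: str) -> int:
    return 4 if criterion == "correctness" else 3

print(score_response("What is 2+2?", "4", stub_judge))
# -> (0.5*4 + 0.3*3 + 0.2*3) / 5 = 0.7
```
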
This skill is domain-agnostic and can be applied to any AI agent system requiring output quality improvement.

@aaronpowell (Contributor) left a comment:

Please ensure you run the update script so that the README is updated with the changes.

Ran the update script as requested by the reviewer to regenerate the skills table.