Content Preview
# Multi-Agent System Evaluation Methodology ## Overview This document provides a comprehensive methodology for evaluating multi-agent systems across multiple dimensions including performance, reliability, cost-effectiveness, and user satisfaction. The methodology is designed to provide actionable insights for system optimization. ## Evaluation Framework ### Evaluation Dimensions #### 1. Task Performance - **Success Rate:** Percentage of tasks completed successfully - **Completion Time:** Ti
How to Use
Recommended: Install to project (local)
mkdir -p .claude/skills
curl -o .claude/skills/evaluation_methodology.md \
https://raw.githubusercontent.com/alirezarezvani/claude-skills/main/engineering/agent-designer/references/evaluation_methodology.mdSkill is scoped to this project only. Add .claude/skills/ to your .gitignoreif you don't want to commit it.
Alternative: Clone full repo
git clone https://github.com/alirezarezvani/claude-skillsThen reference at engineering/agent-designer/references/evaluation_methodology.md
Related Skills
Conducting Chaos Engineering
This skill enables Claude to design and execute chaos engineering experiments to test system resilience. It is used when the user requests help with failure injection, latency simulation, resource exhaustion testing, or resilience validation. The skill is triggered by discussions of chaos experiment
skill-adapterconducting chaos engineering
by jeremylongshore · plugins-plus-skills
Engineering Features for Machine Learning
This skill empowers Claude to perform feature engineering tasks for machine learning. It creates, selects, and transforms features to improve model performance. Use this skill when the user requests feature creation, feature selection, feature transformation, or any request that involves improving t
skill-adapterengineering features for machine learning
by jeremylongshore · plugins-plus-skills
engineering_metrics
Engineering Metrics & KPIs Guide
c-level-advisorengineeringmetrics
by alirezarezvani · alirezarezvani-claude-skills
cs-engineering-lead
Engineering Team Lead agent for coordinating QA, security, data engineering, ML, and frontend/backend teams. Orchestrates engineering-team skills for team-level technical decisions. Spawn when users need team coordination, tech stack evaluation, incident response, or cross-functional engineering work.
agentsengineeringleadagent
by alirezarezvani · alirezarezvani-claude-skills