Engineering Notes
Thoughts and Ideas on AI by Muthukrishnan
Tag: Evaluation
19 Feb 2026
Agent Evaluation and Benchmarking for Measuring What Matters
Learn how to systematically evaluate AI agent performance using benchmarks, metrics, and evaluation frameworks that go beyond simple accuracy.
29 Dec 2025
A Practical Guide to Evaluating Your AI Agents with DeepEval
Learn how to systematically test and evaluate AI agents using DeepEval's metrics for relevancy, faithfulness, and task completion.