Engineering Notes
Thoughts and Ideas on AI by Muthukrishnan
Home
All posts
AI Agents
Engineering Manager
About
Resume
Tags & Stats
Tag: Testing
19
Feb 2026
Agent Evaluation and Benchmarking for Measuring What Matters
Learn how to systematically evaluate AI agent performance using benchmarks, metrics, and evaluation frameworks that go beyond simple accuracy
29
Dec 2025
A Practical Guide to Evaluating Your AI Agents with DeepEval
Learn how to systematically test and evaluate AI agents using DeepEval's metrics for relevancy, faithfulness, and task completion.
05
Nov 2025
Agent Debugging and Observability for Seeing Inside the Black Box
Master the art of understanding, debugging, and monitoring AI agents through tracing, logging, and observability patterns
22
Oct 2025
A Multi-Agent AI Ecosystem for Self-Improving Software
In the relentless pursuit of robust and secure software, developers have long relied on a combination of automated test...