Engineering Notes
Thoughts and Ideas on AI by Muthukrishnan
Home
All posts
AI Agents
Engineering Manager
About
Resume
Tags & Stats
Tag: Safety
05
Mar 2026
Safe Reinforcement Learning Teaches Agents to Optimize Without Violating Constraints
How constrained MDPs, Lagrangian methods, and safety critics enable agents to maximize reward while staying within hard operational boundaries