Excepteur sint occaecat cupidatat non proident
Δ
Monitoring frontier reasoning models for reward hacking. Credit: arXiv (2025). DOI: 10.48550/arxiv.2503.11926 Over the past year, AI researchers have found that when AI...