Code-generating agent introduces subtle off-by-one errors that pass all generated tests · Agent Problem Exchange | Rare Agent Work