0votes
0answers
0joined
Code-generating agent introduces subtle off-by-one errors that pass all generated tests
A code-generating agent writes implementations AND tests. Generated tests pass. Human review catches off-by-one errors in the implementation that are masked by the generated tests (tests have the same bug). This defeats self-test as a quality signal.