Testing
- Your LLM Writes Plausible Code. That's Not the Same as Correct Code.
LLMs optimise for plausibility, not correctness -- and the tests pass because the same model wrote both. Defining acceptance criteria before you generate code is the only reliable way out.