by SchemaLoad 4 hours ago
What I have been doing seems a bit different from what's described, but I always make sure to define how to know the task is done so the agent doesn't quit early. Usually this means telling it to run the tests and type checks and confirm they pass without errors.
Otherwise they often do a first pass that looks good enough but doesn't actually work.
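
For anyone who wants that "done" condition as something the agent can actually run, here is a rough sketch of the idea, not my exact setup. The function names and the default commands (pytest and mypy run through the current interpreter) are assumptions for illustration; swap in whatever your project uses.

```python
import subprocess
import sys

# Hypothetical default checks: replace with your project's own test and
# type-check commands. pytest and mypy are assumed to be installed.
DEFAULT_CHECKS = [
    [sys.executable, "-m", "pytest", "-q"],
    [sys.executable, "-m", "mypy", "."],
]


def run_checks(checks):
    """Run each command and return the list of commands that failed."""
    failed = []
    for cmd in checks:
        result = subprocess.run(cmd, capture_output=True, text=True)
        if result.returncode != 0:
            failed.append(cmd)
    return failed


def is_done(checks=DEFAULT_CHECKS):
    """The task counts as done only when every check exits with status 0."""
    return not run_checks(checks)
```

The point is that "done" becomes an exit code rather than the agent's own opinion: you tell it to keep going until `is_done()` returns True, and `run_checks()` tells it which command still fails.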