I’m assuming the test cases that are generated here are in automated script format, for me it would miss the point if it generated cases for manual execution and it for any reason it was the latter I’d assume those executing them would validate them as they go.
I suspect its a good question as we may be fasttracking down a path that most automation is generated as the product code is built, i.e nobody writing automated scripts any more.
Let me expand on that a bit. We already have playwright and llms where a couple of prompts update all your product code with test ID’s, create POM models for each views, generates, runs and fixes test cases and very quickly you can have a basic UI health check.
That with some learning may advance to no prompting required and apply at all layers in the stack that all automated tests are covered with minimal intervention.
Have you got a lot of slop though, are the tests useful, do they find the things they are intended to find. Will it scale is another decent question.
At this point probably a lot of slop but if its taught better that should hopefully reduce.
I’m of the view that having a critical thinking, skeptical human at the helm of most AI use is not going to go away and I doubt is going to be an admin activity so likely having someone taking specific ownership of each layer makes sense and developer or quality engineer can likely do that role.
I’m not sure if this will be similar to current validation process though those skills will apply as I suspect it will need more than current, its not just a validation activity at least for now. It’s a throw away, refine and expand activity so decent automation and testing skills.
I’m also not sure about usefulness outside the automated coverage, what uses did you have in mind? Testing as a learning, discovery and investigation activity may become even more important as the machines take over the scripted coverage.
I remain skeptical whether this path will occur but I’m not ruling it out as things are moving fast even if the learning of the tools remains slow or at times non existent.