After publishing my recent article on using AI as a partner in Quality Engineering, I realised many teams are moving beyond “one-off prompts” and starting to build more structured GPT agents for testing tasks.
I’m curious :
– Have you built or experimented with GPT agents for testing or QA work?
– What problems are they actually helping you solve (analysis, exploration, automation support, triage, etc.)?
– What didn’t work as expected?
I’d love to hear real examples, especially from teams using agents in day-to-day testing, not just experiments.
I tried playwright test agent for generating test plans , script generation and self healing, based on my experience it was okay. I tried it for website with mandatory login for accessing the feature and initially the agent was bit stuck on login.
After multiple attempts it was able to access the website.
I didn’t tried script generation for any complex features, as I was just exploring so i tried with normal and it worked.
Self healing was good and it worked as expected.
However like other gpt , the test agents results needs manual review before using the results formally.