Hi, Andrew, you raised excellent questions and concerns.
First, as a Quality Manager — not just a tester or test automation engineer — I need to think strategically. My key objective is: how do I minimize critical incidents in production? To answer that, I have to consider the resources I have, the budget, and the complexity of the system under test.
Our application has 1,000+ URLs. Even though I have people doing manual testing, test automation, and domain experts on the team, the sheer complexity makes thorough regression and exploratory testing a serious challenge. We have E2E test automation, unit testing, integration testing — yes, all of it. But you know as well as I do that these tests follow the same workflows every time. They verify known paths. They don’t surprise you.
AutoExplore gives us something different: a daily quality pulse of the system by exploring the app autonomously — paths we didn’t script, data we didn’t prepare, combinations we didn’t think of. Is it shallow compared to a skilled human exploratory tester? Sometimes, yes. But it runs 24/7 across environments without consuming my team’s capacity. That’s the trade-off I’m making consciously.
Your point about context awareness is valid and I see it as an opportunity for the product to develop further and generate more value. Right now it catches generic issues — crashes, errors, accessibility gaps, broken flows. Adding context-aware exploration (requirements, business rules, heuristics) would take it to another level.
I’ve also experimented with Playwright agents that dynamically navigate the browser. My take: the potential is real, but there’s a significant gap between a promising POC and a robust solution that works day after day without requiring maintenance from my QA team. For a small team with limited resources, a production-ready tool like AutoExplore is the practical choice today.
To your bigger question — what do we lose by having machines simulate human investigative testing? I don’t see it as replacement. I see it as coverage expansion. My human testers focus on complex scenarios, business logic validation, and the “does this actually make sense?” questions that require domain knowledge. AutoExplore covers the surface area that no human team could realistically touch every day on a 1,000+ URL application.