How are you actually using AI in testing right now?

On our side, it started small. Generating draft test cases from requirements. Summarising long bug reports. Suggesting edge cases we might not think of immediately. Nothing magical, but it shaved time off the repetitive parts.

More recently, we’ve been experimenting with using AI to review requirement changes and flag which existing tests might be impacted. That’s been surprisingly useful, especially in larger suites where things quietly drift. Some tools have started baking this in directly, such as test management platforms that can analyse gaps or redundant cases and suggest updates instead of forcing manual audits.
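As a toy illustration of that impact-flagging idea, here is a minimal sketch that maps changed requirement text onto test descriptions using naive keyword overlap. The function and data are hypothetical; real tools use much richer traceability, but the principle is the same.

```python
# Hypothetical sketch: flag tests that may be impacted by a requirement
# change, using naive keyword overlap between the changed requirement text
# and each test's description.

def impacted_tests(changed_requirement: str, tests: dict[str, str],
                   min_overlap: int = 2) -> list[str]:
    """Return names of tests whose description shares at least
    `min_overlap` significant words with the changed requirement."""
    stop = {"the", "a", "an", "of", "to", "and", "is", "for", "when"}
    req_words = {w.lower() for w in changed_requirement.split()} - stop
    flagged = []
    for name, description in tests.items():
        desc_words = {w.lower() for w in description.split()} - stop
        if len(req_words & desc_words) >= min_overlap:
            flagged.append(name)
    return flagged

# Hypothetical suite: test name -> plain-language description.
tests = {
    "test_login_lockout": "account is locked after five failed login attempts",
    "test_password_reset": "user can reset password via email link",
    "test_checkout_total": "checkout total includes tax and shipping",
}

print(impacted_tests(
    "failed login attempts now lock the account after three tries", tests))
# -> ['test_login_lockout']
```

An LLM-backed version would replace the keyword overlap with semantic matching, but even this crude form shows why the approach scales better than manual audits.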

For teams leaning into AI in testing:
Are you using it for test generation, maintenance, flakiness detection, coverage analysis, something else entirely?
And has it genuinely reduced effort, or just shifted it around?

2 Likes

One of the things I’ve found is that a lot of people are using it for problems I don’t have. That risks a knock-on effect: if I had a manager seeing all these posts about big savings or 10x multipliers, they could be asking why I’m not getting the same.

The answer to that is that we’ve likely already been operating at 10x relative to some teams for the last decade.

I have not written test cases in well over a decade, I am not doing mundane, boring tasks that machines could do quicker, and I do not have flaky automation suites that need self-healing. Okay, claiming I am already 10x just by not doing those things could be a bit of a push, but it’s definitely a discussion.

So where do we use it, then?

General automation - the same way a developer would use Copilot. I’d say fairly significant gains; it also gives me a bit of a coding edge, since it’s better than me at coding.

New web UI automation - Playwright agent usage here. Often this was deemed too costly for the ROI; now a model can stand up a basic health check running in CI within a few hours. So not so much a saving as extra cover for low cost.

Vibe coding small tools, or general copilot work - data generation, building a mutated code base to test how good the automation tests actually are, adding an English translation to ease my testing. I’d add root cause analysis to this: tell it the issue you are seeing and it can often find the relevant piece of code and offer suggestions. Code access is key to the gains here.
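The "mutated code base" idea above is essentially mutation testing: flip something in the code under test and check whether the suite notices. A minimal sketch (the function and suite are made up; real tools like mutmut automate this across a whole codebase):

```python
# Hypothetical sketch of mutation testing: introduce a deliberate bug
# (a "mutant") and check whether the test suite fails on it. A suite that
# passes on the mutant has a coverage gap.

original = (
    "def apply_discount(price, pct):\n"
    "    return price - price * pct / 100\n"
)

def run_suite(source: str) -> bool:
    """Load the code under test from source, run the assertions against it,
    and report whether the suite passes."""
    ns = {}
    exec(source, ns)
    try:
        assert ns["apply_discount"](200, 10) == 180
        assert ns["apply_discount"](100, 0) == 100
        return True
    except AssertionError:
        return False

# Mutant: '-' becomes '+'. A good suite should "kill" (fail on) it.
mutant = original.replace("price - price", "price + price")

print("original passes:", run_suite(original))   # True
print("mutant killed:", not run_suite(mutant))   # True -> the suite caught it
```

If the mutant had survived, that would be evidence the assertions are too weak, which is exactly what this technique measures about "how good the automation tests actually are".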

Research and ideas - the questioning, going-deeper aspect really matches testing. Whilst this speeds up the activity, it tends to give me more ideas to dig into, so the result is increased coverage rather than savings. This is likely my biggest usage.

Dev buddy copilot - I build a lot of apps locally: lots of installations, dependencies, setup and configs, Windows versus Mac, iOS, Android. When I hit blockers I used to take up developers’ time; now AI is helping me work through them much quicker, with fewer interruptions for developers.

Testing agents - I’m spending a lot of time experimenting here. I’m getting some okay results on specific things like accessibility, where an agent does a lot of scans and some exploration via MCP. For now this takes time, but it’s interesting. I’m still not so clear on its capability for the basic test cycle, though:

“Risk hypotheses, investigate and experiment, interpret findings and review, revise hypothesis and loop”

If anyone has a tool that does this, please do a write-up. My buddy is experimenting with the security tool Shannon, and he’s been impressed to a degree with its exploration ability, but it’s still not the full loop and needs him hands-on.

Some will not have the challenges I have.

If you do not have code access, just getting code access will offer benefits before you even add AI in, and then, in my view, AI will offer more on top.

Note I often test four products a day; a 10x would mean going to 40 products and my brain would pop, so that’s not a goal. I’m fairly fast already.

1 Like

Yeah, I’ve been seeing a similar shift. I’ve been using AI-powered automated pentesting tools like ZeroThreat AI, and it’s genuinely changed how I approach testing. It takes care of a lot of the repetitive discovery and triage work, and I’m getting much higher-signal results compared to traditional scans.

It hasn’t removed effort, but it’s definitely moved it to where it matters more: business logic testing and validation instead of surface-level checks. To put it simply, I feel like we’re finally moving from just finding issues to actually understanding risk faster.

1 Like

Nothing special, just limiting it to helping automate little tasks 🙂

When you say “reduced effort” or “10x” time savings, how do you prove this assertion to your managers? And how do you answer the expected follow-up question: what did you do with the time you gained?

When the whole industry is measuring AI with this kind of metric, something is really, really wrong.