How is everyone testing LLM based applications?

jaswanth.m · 1 September 2025 12:21

Heya everyone Been a while! But great to be back to the club.

AI based apps are everywhere, and companies have incorporated it into their existing offerings in one way or the other, in small and big ways.

Ignoring all the hyped apps, looking into just usecases that are truly a value add, even in the smallest of ways.. testing them have been such a precarious thing.

The aspect that an input drastically affects the output that comes out is now actually a feature and not a bug!

How is everyone looking at testing them ? The obvious ones that everyone have started adopting are Evals.

The non-determinism is just such contrasting that its close to impossible to cover even 80% of the cases.

So my question is… how have you changed your mindset in today’s world to test AI based features, that are non-deterministic, inherently biased and easily manipulated through prompt injections, etc.. ? how has your thought process changed ?

arcigo · 1 September 2025 12:46

One of my takes is that now more than ever, we testers will be increasingly required to evaluate these types of applications.I’ve seen some comments on Internet about people complaining on what AI is generating (talking about apps). So, in theory, testers should be needed more than ever and not replaced by AI.

One of the things that comes to my mind is to do a static analysis of the prompt that’s going to be executed. Does it meet what the stakeholder looks for? But, then we have to do another static analysis for the code generated to avoid security/leak breaches.

Topic		Replies	Views
Looking Some Best Practices for Testing AI-Powered Applications? Discussions tools , learning	2	224	26 February 2025
Is there any automation tool to test AI applications Archive automation , machine-learning	2	852	5 July 2019
How do you test AI systems to handle unexpected inputs? Discussions test-strategies , ai , biases	0	45	14 January 2025
Have you tested an AI tool/app? Discussions tools , ai , polls	14	456	17 July 2025
What do you think about AI features for software testers? Discussions tools , ai	19	1141	11 December 2023

How is everyone testing LLM based applications?

Related topics