Do you test in production?

cassandrahl · 20 February 2025 13:56

Hey all, the idea of testing in production isn’t a new one. However, it seems that there are contexts where it’s a good idea, and others where it creates its own risks.

So I’m just interested to get a feel for things. Are you testing in production, and why (not)?

Are you testing in production? Why

Yes, we are currently testing in production
Yes, but we plan to stop
No, but we plan to start
No, we do not test in production

0 voters

jmosley5 · 20 February 2025 14:20

I feel like I need to explain. Haha! I rarely test in production, but when I do it’s because we have a designated production account that does not touch client accounts. So, I know what boundaries I’m working with when testing.

Otherwise, if I had access to client data, I would delightfully decline. I know my propensity to break things and don’t want to be able to do something I would regret later.

komalgc · 20 February 2025 14:45

In Production , only sanity testing is performed and deep testing needs live user account with orders, even if we create one but it would be limited capabilities and access

ipstefan · 20 February 2025 15:16

Most testing for most product features is done in local/test/pre-prod environments. Some features need extra or specific testing in production.
We check in production things we’ve just deployed, sometimes briefly, other times more in depth due to data availability or environment specific setup/configurations. One example: to get a similar server setup in the test environment for one sub-system as in prod a company didn’t want to pay 200k/year/server, so instead a server with a test setup was used which varied slightly.
We check in production features when we have ab testing or blue-green releases or canary releases. Or when we did demos or alpha/beta testing.
We test in production features fully hidden from the users(with some technical enabling).
We investigate/explore data/logs in production to pinpoint bugs that clients have that are ‘silent’ to them and bugs not reported by anyone.
We test in production release product versions where there’s multiple departments involved in configuration, administration, data, content management, besides product feature deployment; where there’s no possibility to duplicate the work in test environments.
We ‘test’(monitor) in production with automated checks, scripts, dashboards, and other tools the stability, availability, reliability, robustness, etc… of the application over time.
We test/check in production 3rd party system updates, or our systems features linked to 3rd party systems integrations based on a risk based strategy. We had several such surprises where ‘hot-fixes’ updates have caused problems to our system.

testingrequired · 20 February 2025 16:06

Most companies I’ve been at we needed some production testing or dumping prod data to lower environments. It’s not always the cause but real use data like what’s in prod is a major source of bugs and missing test cases. Staging environments are useful as they should mirror production but this almost always means infrastructure wise. It’s rare I’ve seen high quality real use data in staging unless it’s been dumped from prod.

Ideally the system design wouldn’t let data get in to bad states. I’ll be honest and say it’s rare for developers (I say as a senior dev) to even think about preventing bad states let alone actually pulling it off.

*Test locally, test in dev, in staging, in production. Run it before you push up changes. Pull down PRs and run them.

*There isn’t a one size fits all answer unfortunately. Every code base has different testing needs. Every change has different risks and testing needs. Being better requires missing things, feeling the pain, and developing your gut.

ujjwal.singh · 21 February 2025 02:51

So when it comes to testing on production we do it in two ways :

We do sanity testing after a new build is deployed, we have just one user for our project and once the build is deployed we just go to the application, log in with credentials and just load all tabs/screens, and then logout to ensure that the prod build is not getting crashed after the deployment
In this case we don’t do testing we use the prod environment only when there is any issue with a specific scenario that we are not able to replicate on the internal environment. So we access the prod environment only to check the logs to find the Root cause. However, this is done mostly by EMs or QA leads/managers.

There are specific protocols setup for those who access prod environment and they have to ensure that every protocol is being followed while the prod environment is accessed by our team.

And we don’t do any other testing on prod because any change by our activity may impact the data of real-time users, and it might be possible that compliance issue may also occur.

ghawkes · 21 February 2025 09:32

Tricky. So I’d say no we don’t test in production, we do everything we can to gain confidence before deploying to production. Yes our Ops teams will do sanity checking to ensure there are no red flags post deployment, but that to me is common sense - as ops will get the phone calls if it isn’t!
But…we use production at times as part of our testing. When you have realtime data products for multiple clients with multiple formats of data inputs to each of them, one tweak to improve one customer could have an effect on another. So as well as the objective testing of “is it working as we expected?” there is a subjective check against production asking “could the customer see a negative impact when its deployed?”.
So we compare our stagings to prod environments to ensure that the impact of the changes is not going to be seen to have a negative effect on the current production. We could have made the product more accurate with our changes, but if the customer perceives a negative impact we need to be ready to explain why its a positive.

ramanan49 · 24 February 2025 02:05

Hello @cassandrahl ,

Yes, I have experience testing in production. Only sanity testing is performed in production and full regression testing on a live user account linked to our account is only possible when we got confirmation from the project manager.

Additionally, in production, issues reported by end users can be tested. I have experience testing in production and have identified critical bugs during the release phase.

Happy Testing!
Ramanan

andrewkelly2555 · 24 February 2025 08:39

Normally as an extension of testing and not as an alternative to other testing we do.

Health checking the small variations between our test and production environment and making sure we have deployed to production okay is the most common one.

We also do A/B testing which needs to be production, alongside analytics, testing users real usage with heatmaps, devices, OS’s , and monitoring crash reports, security, traffic volumes etc, maybe even some testing for marketing funnelling. All of this testing contributes to ideas for improvement in addition to catching the odd bug we may have missed.

If there is a good reason and value in production testing then I do not see why not, but I’d be wary if it was seen as a replacement for pre-production testing activities which offer different value.

cassandrahl · 24 February 2025 11:53

I can certainly relate to that fear of messing up something. Maybe this is a sign that the recovery processes for something like that aren’t good enough yet?

cassandrahl · 24 February 2025 11:56

That’s kind of a challenge I’m having on my project, in that all records would be “real” records, and they would affect multiple other dependent teams’ systems too. This means there would be a lot of hoops to jump through to change the environment in such a way that testing in prod would be possible… But then the environment no longer has the prod configuration, so what would be the benefit? That benefit vs risk question has been really important for us.

cassandrahl · 24 February 2025 12:02

I love that question!

cassandrahl · 24 February 2025 12:06

Oh, I was definitely thinking of testing on prod as an addition, not a replacement, but it is a good point that some places have used it as a replacement. Reading through the comments so far, that doesn’t seem to be the case for folks here. I imagine you’d need a specific set up in order to enable and warrant testing only on prod.

jmosley5 · 24 February 2025 14:11

No, we could fix things if we needed to. I’m just really good at doing unfortunate things

kristof · 25 February 2025 07:33

Shift Right

But yea mostly sanity checks APM tests etc.
We also have automated tests to check if configuration is set properly.

It doesn’t take days of course, just a few quick tests.

rosie · 25 February 2025 10:31

I posted the poll on LinkedIn too, it might help get some responses/insights.

milos · 25 February 2025 11:59

Yes, to some extent. Most of the testing is done on dev and staging env, but I sometimes check certain features on production too if I consider them risky or see some value in testing there too.

It’s done on my own prod testing accounts and doesn’t affect end users in any way. Cleanup is also performed if needed.

Topic		Replies	Views
Load testing on Production.....is it the right thing to do? 🙋 Questions strategy , performance-testing	9	1776	2 May 2025
Testing / QA in Production 🗄️ Archive process	12	3069	29 August 2022
Test Environments: Do You Have Them? 🗄️ Archive	4	459	25 June 2020
Resources to Learn More About Testing in Production 🗄️ Archive learning , resources	2	498	26 February 2021
Which Environments Should We Be Testing In? CI/CD 🗄️ Archive devops , process	7	3767	25 November 2021

Do you test in production?

Related topics