S.O.S Refactors and lack of communication between different teams (that break each other's features)

antonella · 11 April 2024 11:01

Hi there! I’m not very active here but right now I feel like the community can help me come up with ideas because I very much need them.

Our product is a C2C and B2C solution (kinda like Vinted) and we’re organized by tribes split by user experience, i.e. Seller, Buyer, Marketplace, etc. I was recently moved to a different tribe in the company and one particular team is eager for me to help them avoid lots of bugs (it’s happened) whenever there’s a refactor. I’m lost and I feel like a complete impostor. I am a user of the app but I don’t know “everything” (can anyone even?) and I’m struggling to help them find solutions to plan things better and avoid breaking a lot of things from other teams whenever they refactor something.

I’m currently working on a doc I call “Trigger questions” in which I speak with different people and come up with a list of things we should take into account (and encourage communication with the other teams!) when we plan a refactor. I’ve also suggested modeling before refactoring (like a mindmap) so we can have an idea of different areas that could be affected by our changes. We’ll try these things and see how they work.
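The "model before refactoring" idea can also be sketched in code. This is only an illustration: the area names and the `DEPENDENTS` map below are hypothetical and would have to come from the teams themselves. Given a map of "who depends on what", it lists every area that could be affected by a change.

```python
from collections import deque

# Hypothetical map: area -> areas that depend on it directly.
DEPENDENTS = {
    "listing-service": ["search", "seller-dashboard"],
    "search": ["buyer-home"],
    "seller-dashboard": [],
    "buyer-home": [],
    "payments": ["checkout"],
    "checkout": [],
}


def affected_areas(changed, dependents):
    """Return every area that depends, directly or indirectly, on `changed`."""
    seen = set()
    queue = deque([changed])
    while queue:
        current = queue.popleft()
        for dependent in dependents.get(current, []):
            if dependent not in seen and dependent != changed:
                seen.add(dependent)
                queue.append(dependent)
    return sorted(seen)


if __name__ == "__main__":
    # Areas (and therefore teams) to talk to before refactoring listing-service.
    print(affected_areas("listing-service", DEPENDENTS))
```

The output is only as good as the map, so the real value is probably in the conversations needed to fill it in.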

My question is, have you been in a similar situation? If yes, what helped you help the team?

PS: I tried to provide enough context, I hope it is enough.

Thank you so much for reading!

ipstefan · 11 April 2024 12:02

Hi,

The first thing that comes to mind when I see “to help them avoid lots of bugs” is to find out what your role is and what their concrete expectations are:

Product manager - help them define better specifications and offer technical support if possible;
Development lead - define, design and lead development practices;
Quality Assurance Engineer - analyze development issues, fix them, review code, fix bugs, improve the code, write low-level checks, build scripts/tools to help developers test (Google was using QAE with tasks like these);
Software Tester - quickly identify the problems or potential issues impacting the quality of the product and inform the relevant people;
Release manager - lead the release process and not go through with it if there are pending bugs that would require fixing.
Other?! Maybe an Automation engineer - who codes hundreds of checks in the logic/flows of the application as a net for the obvious issues.

shad0wpuppet · 11 April 2024 12:06

Keeping backward compatibility is key - by doing integration regression testing, you’re making sure everything works as it did before. You don’t need to test anything new, just that the integration still works after refactoring. Refactoring is tricky - there’s no magic solution to avoid breaking stuff and having bugs. Devs from different teams gotta talk and figure out how to handle everything - integrations, APIs, schemas, etc - without breaking the whole system. In some situations, you might run almost 2 systems/interfaces for a bit to switch from the old to the new code.
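A minimal sketch of checking that "everything works as it did before" while old and new code live side by side. Both shipping-cost functions are hypothetical stand-ins for a legacy code path and its refactored replacement; the check feeds the same inputs to both and reports any difference.

```python
def legacy_shipping_cost(weight_kg):
    """Hypothetical old implementation, kept alive during the switch."""
    if weight_kg <= 1:
        return 3.0
    return 3.0 + (weight_kg - 1) * 1.5


def refactored_shipping_cost(weight_kg):
    """Hypothetical refactored implementation; behaviour should not change."""
    extra_kg = max(weight_kg - 1, 0)
    return 3.0 + extra_kg * 1.5


def find_mismatches(old, new, inputs):
    """Run old and new code on the same inputs and collect any differences."""
    mismatches = []
    for value in inputs:
        expected, actual = old(value), new(value)
        if expected != actual:
            mismatches.append((value, expected, actual))
    return mismatches


if __name__ == "__main__":
    sample_weights = [0.5, 1, 2.5, 10]
    # An empty list means the refactor behaves like the legacy code on these inputs.
    print(find_mismatches(legacy_shipping_cost, refactored_shipping_cost, sample_weights))
```

The sample inputs are the weak point: the check only covers the cases someone thought to include, which is another reason to ask the other teams which inputs matter to them.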

Planning is important - break down the refactor into smaller iterations so you can do it step by step. It’s gonna take more time and resources, but it’s way easier to manage and less likely to cause critical issues. Make sure there’s a code review process across all teams, so everyone understands and accepts the changes, and plans any extra refactoring that needs to be done on their sides.

The real issue probably is in having a solid dev process that all affected teams get, and strong engineering management to maintain the needed formalities in processes. As QA, finding every bug or cutting down on them significantly might not be possible, but keeping them manageable is possible. Refactoring is basically a feature. If you’re implementing a big feature or several across different services, you’re in the same situation.

Try to use automation - think e2e API tests, unit testing, contract testing, etc. Use feature flags to ship refactoring so you can switch between old and new code as and when and if needed. Keep tech docs and diagrams up to date, have cross-team meetings, and make sure everyone who needs to know, does.
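A minimal sketch of shipping a refactor behind a feature flag, so the old code stays available as a fallback. The flag name, the `flags` dictionary and both checkout functions are hypothetical; a real setup would usually read flags from a config service or a flag management tool.

```python
def checkout_total_legacy(item_prices_cents):
    """Hypothetical legacy code path."""
    total = 0
    for price in item_prices_cents:
        total += price
    return total


def checkout_total_refactored(item_prices_cents):
    """Hypothetical refactored code path with the same expected behaviour."""
    return sum(item_prices_cents)


def select_checkout_total(flags):
    """Pick the implementation based on a flag; defaults to the legacy code."""
    if flags.get("refactored_checkout_total", False):
        return checkout_total_refactored
    return checkout_total_legacy


if __name__ == "__main__":
    flags = {"refactored_checkout_total": True}  # flip to False to roll back
    compute_total = select_checkout_total(flags)
    print(compute_total([1200, 350]))
```

Defaulting to the legacy path means a missing or misconfigured flag fails safe, and rolling back becomes a config change instead of a redeploy.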

shad0wpuppet · 11 April 2024 12:47

I was motivated to write a more structured but also simplified post about this topic on LI, so maybe it’ll be a bit easier to understand some of my points: Konstantin Sakhchinskiy on LinkedIn.

han_toan_lim · 11 April 2024 13:06

This might be useful.

In the past I worked for a company with two products sharing some code and dealing with different users. The devops engineers had the following things implemented:
For each feature or change, a developer had to write automated tests to assure that the code worked as required. The bulk of the tests were unit tests, which could be executed within seconds. Test Driven Development was their way to develop the code.

To manage the code, they used git, a version control system. There was only one single branch. If the code broke, it would become obvious within minutes.
A developer could only integrate code into this single branch after passing the automated tests of all parts of the system.

There were other house rules for development, but the described ones had a major impact on the quality of the products.
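As an illustration of the test-first rule described above, here is a small unit test written with Python's built-in `unittest`. The fee function and its numbers are hypothetical; the point is that checks like this run in seconds and can gate every merge into the single branch.

```python
import unittest


def price_with_buyer_fee(price_cents, fee_percent):
    """Hypothetical function under test: add a buyer fee, working in whole cents."""
    if price_cents < 0:
        raise ValueError("price_cents must not be negative")
    return price_cents + price_cents * fee_percent // 100


class PriceWithBuyerFeeTest(unittest.TestCase):
    def test_adds_fee_to_price(self):
        self.assertEqual(price_with_buyer_fee(1000, 5), 1050)

    def test_zero_price_stays_zero(self):
        self.assertEqual(price_with_buyer_fee(0, 5), 0)

    def test_negative_price_is_rejected(self):
        with self.assertRaises(ValueError):
            price_with_buyer_fee(-1, 5)
```

Saved as a file, this could be run with `python -m unittest`. During a refactor, the tests stay the same while the implementation changes underneath them.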

conrad.braam · 12 April 2024 08:09

Welcome back @antonella. Yeah, I’m curious as to why the company keeps on refactoring. I mean, that’s not actually a customer-value piece of work, but maintenance work for engineering’s sake, which is either a velocity enabler or a velocity killer.

I am not so sure you are preventing “bugs” so much as preventing “regressions”. I find that the friction a very strong, fully automated regression suite imposes means every refactor has to work the same as the version before, and in reality that’s not true all of the time. During a refactoring, you may find many reasons to update or fix the UI (or, for that matter, to fix any API interfaces and integrations). I think it has been unhelpful to create the myth that regression tests must always pass, so for me that would be the first “trigger” or myth to bust in any conversation. Refactoring is a chance to make security as well as UI and workflow changes.

As for ways you might actively help, it might be in championing the socialising of internal test environments, where the teams integrate all their components into an environment useful not just for automated testing but also for those B2C demos, as a way of turning the integration testing you already do into something far more “social” and better understood by all teams. Things become less “chuck it over the fence” when there is a test sandbox that is always live and up to date. Hope that’s an idea that is highly technical but only requires communication and people-networking effort to do.

Having a place where everyone can see their refactoring and show it off is a step towards the 3 teams/divisions in the company being able to share by seeing new features early. Being allowed to play because nothing can really break is powerful.

ansha_batra · 12 April 2024 08:21

hello @antonella

Thanks for reaching out and sharing your thoughts.
It sounds like you’re navigating some new territory within your company, but you’re not alone in feeling that way.
Remember you were chosen to be part of this team for a reason!

Your approach of creating a Trigger questions doc & suggesting modelling before refactoring shows great initiative and problem-solving skills. It’s all about fostering open communication & collaboration between teams, which is key to successful refactoring.

In similar situations, I’ve found that transparency and teamwork are crucial. Building relationships with others and fostering a culture of knowledge sharing can help immensely. Don’t be afraid to lean on your colleagues for support and bounce ideas off each other.

Keep up the great work, you’ve got this!!!
NO MATTER HOW SMALL, EVERY STEP FORWARD IS PROGRESS.

msh · 12 April 2024 13:17

That’s a rough road.

I think I have an understanding that:
You don’t have the Subject Matter Expertise that longer-term folks have.
Your application is subject to changes that teams external to yours are making, causing regressions that your team didn’t create.

First, find the SMEs that do exist. Often they are in Product Support (they know all the ugly warts in the system), Product Management (they know all the business rules) and engineering management (they know how often their teams have to fix the same things). Learn from them. Always create test documentation, even if it’s brief: either a planned set of tests OR a record of tests that have been done. This is so that all of those mentioned disciplines have something to go over and examine for gaps in testing. I find it also helps me articulate my understanding of the feature or the change being made.

If there are tools in the build process that articulate upstream changes, get to know them. If there aren’t, encourage the implementation of them. Work with engineering leadership on how you all can be aware of dependencies changing in the codebase. Don’t just accept a build. Look at the pull requests involved. Over time you will learn to recognize areas of risk. Or at least ask. In my last role this would happen often. It had taken a long time, but I knew which changes were risky. A single external developer had created a particular email service. He liked to just push changes. Now, we couldn’t detect those changes (and he was a jerk about bothering to announce them). But when we had a feature change that would touch any aspect of email, I would alert on that and loudly communicate that the development and testing needed to be broad and deep around any email dependency.
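One lightweight way to make that kind of risk knowledge visible is a small check over the files changed in a pull request. This is only a sketch: the area names and path patterns in `RISKY_AREAS` are hypothetical, and the list of changed files would come from whatever the build or review tooling provides.

```python
from fnmatch import fnmatchcase

# Hypothetical map: risky area -> path patterns that belong to it.
RISKY_AREAS = {
    "email": ["services/email/*", "*/email_*.py"],
    "payments": ["services/payments/*"],
}


def risky_areas_touched(changed_files, risky_areas):
    """Return the risky areas a change touches, with the files that triggered each."""
    touched = {}
    for area, patterns in risky_areas.items():
        hits = [
            path
            for path in changed_files
            if any(fnmatchcase(path, pattern) for pattern in patterns)
        ]
        if hits:
            touched[area] = hits
    return touched


if __name__ == "__main__":
    changed = ["services/email/sender.py", "buyer/cart.py"]
    for area, files in risky_areas_touched(changed, RISKY_AREAS).items():
        print(f"Heads-up: this change touches '{area}': {files}")
```

It does not replace reading the pull requests, but it turns "I knew which changes were risky" into something the whole team can see and extend.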

And do continue to create a culture of that communication. We continually struggled with it. Our product managers and engineering leads were instrumental in those comms: a PM would call out that the feature change or business rule change they were making would impact other teams’ features, and engineering leads would call out when a PR they were reviewing touched something other teams depended on.

It will never be perfect and it takes relentless leaning on others to get that culture to shift slowly. Keep at it!

joyz · 13 April 2024 01:12

If this problem is an “issue” for you, you may try to dig into finding the cause first before trying to plan on “how to avoid”.

The first thing I would do is understand the dev team’s ability, e.g.:

Does the team composition include too many junior devs?
Does this problem only happen in this team / a specific person’s work?
Is the team too large to manage?
Do pull requests pile up for a long time before review and merge?

This is not to blame anyone, but to check if we need to talk to managers on people arrangement.

Another check point is the timeline: are the devs finishing the refactor in a rush for a deadline? Mistakes happen more easily if they don’t have enough time to review and check their work.

You could also suggest a retrospective with the team to learn more about the reasons behind it. Communication is important to solve problems.

Of course, also check the testing process, but this mostly keeps the situation from getting worse; it doesn’t tackle the root cause.

How early/frequently are the tests run? Projects with many bugs may require early testing to shorten the feedback loop with developers.
Is the defect list well managed? If bugs keep growing, it will be hard to see the priority of issues.
Are the high-value issues being prioritised for reporting and fixing? If it’s impossible to have nice builds, at least we ensure the main features are not breaking.

antonella · 16 April 2024 07:37

Hi @ipstefan! I’m sorry I forgot to mention my role. It’s called “QA Engineer”, but what’s expected from us is to coach the teams when needed so they can come up with solutions that work for them. Also, we aren’t embedded in a team; we split our time working with multiple teams.

antonella · 16 April 2024 07:43

Hi @shad0wpuppet I’m super happy it inspired you to write a post!
I’m happy to say that the list of potential things to take into account that we’ve been working on has all the things you mentioned, so it seems like we’re on the right path and I find that a relief, to be honest.
Thank you so much for taking the time to answer my question so thoroughly!

antonella · 16 April 2024 07:57

Hello @han_toan_lim thank you for your reply. I believe our devs are already doing this with every piece of code they merge, but I’ll ask to learn if they’re doing it with new features mostly or also with refactors.
Thank you so much!

antonella · 16 April 2024 08:16

Hi @conrad.braam, thank you for taking the time to answer my question!

Currently, the problem these refactors cause isn’t really affecting the regression or the automated tests we have (or not in a big way, because no one has complained about it yet haha).

We do have a beta environment and we’ve been suggesting for years that perhaps we should have one more environment, since beta is “everyone’s sandbox” and sometimes things that should work don’t because someone’s deployed something there…but it hasn’t been a problem for at least the past two years, since I think communication improved and we’re alerted when someone’s going to merge something and cause some temporary disruptions.

I think they’re refactoring these important parts because they’re legacy, and they have two goals: 1) implement the latest design system and, 2) update the code so that they can carry out their next ideas on the roadmap…however, reading this question of yours made me think…so I’ll surely be asking the team WHY we’re doing it because I sense there might be more than I think.

Once again, thank you for your reply! Reading all the different replies is making me think that perhaps the issue here (I’m new in this tribe, so I don’t have the full context) might be linked to speed and pressure to deliver…so I think I’ll push some buttons there to see what answers I get

antonella · 16 April 2024 08:24

Hi @ansha_batra thank you thank you thank you for this. Perhaps that’s the solution I was really looking for, someone to remind me that I’m already trying and that I’m trying to find a collaborative solution because, truth is, I think that’s the key to solving this problem: collaborate more, often and better.

Thank you! <3 I hope you have a brilliant day!

antonella · 16 April 2024 08:33

Hi @msh, thank you for taking the time to answer my question.

I think we’re already doing many of the things you mentioned BUT you’re telling me something I’ve been feeling for the little while I’ve been working with this team (3 weeks-ish) which is “if we don’t know many things about the area of the product we’re refactoring, then let’s find someone who does!”. I’ll follow that path to see what I find out! Thank you!!!

antonella · 16 April 2024 08:39

Hi @joyz thank you for taking the time to answer my question.

Your initial questions are super interesting and combined with some suggestions given by other people here, I think they’ll allow me to get a better picture of what could be the actual problem here, so I marked it as a solution.

Fortunately, we’re monitoring bugs and defects pretty well and also prioritizing them well enough, so I think that the actual team + speed they commit themselves to + lack of breaking refactors into smaller deliverable chunks might be causing some pain here…I’ll keep investigating and asking questions to make people reflect.

Thank you so much! Have a lovely day <3
