SRPM – Site Reliability Product Management
We all need to deal with ad-hoc changes, tech debt and balance it with product feature development. Working in the site reliability space, this gets pushed to the next level, since most interruptions are critical and cannot be discussed or prioritized. How can a team have a meaningful velocity when it comes to the development of product features while it is also responsible for the availability, reliability, scaling and performance of the platform? I will give insights, learnings and mistakes on our journey of running our platform and how SRE topics become normal product features. I will show how this helped the organization to learn faster while dealing with a massive growth at the same time.