DevOps Vision Blog

DevOps Vision Blog

About the Blog

Various posts about software delivery in the 21th century. Managed by Anders Lundsgard, @anderslundsgard. Psots are my own but inspired by many.

Autonomous teams Supporting Microservices 24/7

DevOpsPosted by Anders Lundsgård Tue, April 24, 2018 06:47PM

For agile software teams, moving to fully take responsibility not only for Development and Test, but also for Operations is a challenge. On top of that, supporting their services 24/7 may seem more than one can demand.

This post hopefully gives you some useful thoughts to consider when deciding how to effectively deal with support of your software applications/services 24/7.

First. Many have seen the value of a clear ownership of the code that builds your application. With code means: application code, tests, database scripts, configurations. Not only that but also the delivery pipeline definition and infrastructure code. It is important that one application or service has one (and only one) team owner.

At the very start, the team that chooses to create a repository (e.g. Git-repo in GitLab) and store some code in it is the owner of the lifecycle of the code. From the first day code hits production, the team that owns the code must then also take care of the 24/7 operational support of the code. If the team do not, the team is not (at least by my definition) autonomous. A team is usually between 3-8 persons.

Consider below aspects before taking a decision if you want to involve an external team/partner to take care of supporting operations of your code in production:

- What team do you think handle support better than your own team? Your team knows their code best. Agree?

- Do you believe you can write solution templates for an external person to follow when things start to fail at night? And if you do write “disaster instructions”, will the external person who follow those do more good than harm? For all types of things that can go wrong?

- What about offload the heavyweight lifting of for example a database server to a cloud provider. Using for example AWS RDS service your team don’t need to patch servers or care about high availability. The cloud provider manages that for you. Of course, you need to invest in understanding what the cloud provider does and does not do for you. With Serverless services your team does probably not need any capacity planning at all.

- Zero Downtime deployments: When your services can be updated without end user impact your team members will do it on office hours. QA mindset will significant increase when the engineers who writes the code also the same day push the change to production. And when the change is pushed, they are awake ready to deal with good or bad end-user impact.

- If your services are very business critical, consider introducing a pager system. That is an on-call schedule for all team members. If the system goes down at night, the engineer(s) who have the pager will be notified. But for that to work you really need to have a reliable monitoring of your apps. False alarms during night is not that very popular. This is unusual in Europe but more common in US.

Finally. If you seriously consider external support, are you doing that to increase customer satisfaction? Or just to have someone else to blame when your services are not available?





  • Comments(0)//blog.devops.vision/#post4

2016 State of DevOps Report

DevOpsPosted by Anders Lundsgård Mon, June 27, 2016 10:36PM
"We hope the findings, analysis and guidance in this report help you better understand the potential impact of DevOps on your organization."

As always (5th year in a row) exciting when the State of DevOps Report from Puppet comes out. Above quote is the last words from the report and I can't say anything else than the guys behind the report accomplished yet another success. I hope people from my company will read and adopt the practices and findings from this report.

Below are my favorite quotes and key takeaways from the report:

#1
"Our analysis shows that high performers are deploying 200 times more frequently that low performers, with 2555 times faster lead times. They also have the fastest recovery time and the lowest change failure rate."

#2
"The integration of security objectives is just as important as the integration of other business objectives, and security must be integrated into the daily work of delivery teams."

#3
"Always try to minimize the amount to test data you require in order to rum your automated tests, and avoid large database dumps wherever possible."

#4
"The idea that developers should work in small batches off master or trunk rather than on long-lived feature branches is still one of the most controversial ideas in the Agile canon, despite the fact that it is the norm in high-performing organizations such as Google."

#5
"Teams that don't have code freeze periods also achieve higher performance."

#6
"Agile has more or less won the methodology wars, but in larger companies it's still common to see months spent on budgeting, analysis and requirements-gathering before work stars."

#7
"Leaders can change culture. In today's fast-moving and competitive world, the best thing you can do for your products, your company and your people is institute a culture of experimentation and learning, and invest in the technical and management capabilities that enable it".

#8
"DevOps is no longer a mere fad or buzzword, but an understood set of practices and cultural patterns."

- Yes, I made the survey for the report and will do so next year. Thanks to Puppet, Inc!







  • Comments(0)//blog.devops.vision/#post3

My very first blog post

DevOpsPosted by Anders Lundsgård Sun, June 12, 2016 10:14PM
Finally. Could not wait no longer than until today to create my very first blog post in my brand new DevOps blog.

  • Comments(0)//blog.devops.vision/#post0