Architecture Weekly Issue #67. Articles, books, and playlists on architecture and related topics. Split by sections, highlighted with complexity: 🤟 means hardcore, 👷‍♂️ is technically applicable right away, 🍼 is an introduction to the topic or an overview. Now in Telegram as well.

WARNING 🇺🇦

It's already been a year since Russia's crazy, brutal and unjustified war against Ukraine. We condemn this war and want it to stop ASAP. We continue this newsletter so you can advance your skills and help the millions of Ukrainian people in any way possible. If you want to help directly, visit this fund.

This week we held a discussion on Disaster Recovery with Misha Druzhining. And you won't believe what happened in the middle of the broadcast.

Big thanks to Nikita, Anatoly, Oleksandr, Dima, Pavel B, Pavel, Robert, Roman, Iyri, Andrey, Lidia, Vladimir, August, Roman, Egor, Roman, Evgeniy and Nadia for supporting the newsletter. They receive early access to the articles, influence the content and participate in the closed group where we discuss the architecture problems. They also see my daily updates on all the things I am working on. Join them at Patreon or Boosty!

Highlights

Database Sharding Explained 🤟

Sharding is an important technique for ensuring the reliability and performance of the overall system. You can do it in a variety of ways, each of which can cause its own problems. The Architecture Notes blog has a free post explaining in detail what sharding is.

Database Sharding Explained
More features, active users, and data are collected daily. Your database is slowing your application. Many people don't understand database sharding, which could solve their difficulties. This article explains database sharding, its benefits, including how to use it and when not to.
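
To make one of those sharding trade-offs concrete, here is a minimal sketch of hash-based shard routing. It is not taken from the linked post: the function names, the `user-N` keys and the shard counts are assumptions chosen for illustration. It shows the simplest routing rule (stable hash modulo the number of shards) and one problem that rule causes: adding a shard moves most keys to a different shard.

```python
import hashlib


def shard_for(key: str, num_shards: int) -> int:
    """Map a key to a shard index using a stable hash (hypothetical helper)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    # Use the first 8 bytes as an integer so the result is stable across runs.
    return int.from_bytes(digest[:8], "big") % num_shards


def moved_keys(keys, old_shards: int, new_shards: int):
    """Return the keys that land on a different shard after resizing."""
    return [k for k in keys if shard_for(k, old_shards) != shard_for(k, new_shards)]


# Illustrative data only: 1000 made-up user keys, growing from 4 to 5 shards.
keys = [f"user-{i}" for i in range(1000)]
moved = moved_keys(keys, 4, 5)
print(f"{len(moved)} of {len(keys)} keys change shard when going from 4 to 5 shards")
```

With plain modulo routing, a key keeps its shard only when its hash gives the same remainder for both shard counts, so the large majority of keys have to be migrated. Schemes such as consistent hashing or directory-based lookup are the usual ways to reduce that movement.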

#db #sharding

How Tinder built its own API Gateway 👷‍♂️

Tinder tried multiple solutions for API Gateway, including AWS API Gateway, Apigee, Kong and others. But in the end, they decided they needed a bespoke solution to meet their requirements: scalable, reusable and configuration-based. So they took Spring Cloud Gateway and built their solution on top. Find out what they managed to achieve in the article on the Tinder Tech Blog.

How we built the Tinder API Gateway
Authored by: Vijaya Vangapandu, Distinguished Software Engineer

Migrating Critical Traffic at Scale with No Downtime - Part 1 👷‍♂️

Bringing new infrastructure under production load is always a little risky. For Netflix, which wants to ensure an uninterrupted watching experience, doing this safely is a critical technical capability. In the latest blog post, they explain that replaying real traffic plays a crucial role in testing new services, and they describe the special solution they built, including a replay server. Follow the article for the details!

Migrating Critical Traffic At Scale with No Downtime - Part 1
Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, Devang Shah
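
The general idea of traffic replay can be sketched in a few lines. This is not Netflix's implementation: `replay_and_compare`, `old_service`, `new_service` and the request shape are all hypothetical names made up for illustration. The caller always gets the production response, while a copy of the request is replayed against the candidate service and any difference is recorded for later analysis.

```python
def replay_and_compare(request, production_handler, candidate_handler, mismatches):
    """Serve from production; replay a copy to the candidate and record differences."""
    production_response = production_handler(request)
    try:
        # The candidate receives a copy so it cannot mutate the real request.
        candidate_response = candidate_handler(dict(request))
    except Exception as exc:
        # A failure in the candidate must never affect the real caller.
        candidate_response = f"error: {exc!r}"
    if candidate_response != production_response:
        mismatches.append((request, production_response, candidate_response))
    return production_response


def old_service(request):
    """Stand-in for the existing production path (hypothetical)."""
    return {"title_id": request["title_id"], "quality": "HD"}


def new_service(request):
    """Stand-in for the new path, with a deliberate bug for title 42."""
    quality = "HD" if request["title_id"] != 42 else "SD"
    return {"title_id": request["title_id"], "quality": quality}


mismatches = []
responses = [
    replay_and_compare({"title_id": t}, old_service, new_service, mismatches)
    for t in (7, 42, 99)
]
print(len(responses), "served;", len(mismatches), "mismatch(es)")
```

In a real system the replayed call would typically run asynchronously and off the request path, so that the candidate's latency does not slow down production responses.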

#sre #casestudy

Follow-Up

Software Architecture Canvas 🍼

I am a big proponent of Solution Architecture Documents, RFCs and ADRs. But it's always good to take a fresh look. Patrick Roos shared a new format for collaborating on architecture: the Canvas. I especially like the strong emphasis on the business case (top of green) and the risks and challenges (in blue). Give it a try!

Software Architecture Canvas: A Collaborative Way to Your Software Architecture
The Software Architecture Canvas is a collaborative technique for elaborating the software architecture playground of a software initiative. With this canvas, you can work efficiently, iteratively, and in a time-saving manner on the software architecture of your software products as a team sport.

#documentation

The Inner Workings of Distributed Databases 🤟

Alex Pelagenko begins the article with a nice analogy: he gets to the office by bike, but if it fails, should there be a replacement? The same happens with databases: if the first node fails, there should be a standby. But should the replication be sync or async? Should it be master-master replication? Alex considers several databases and uses sequence diagrams to demonstrate how they handle disconnection issues.

The Inner Workings of Distributed Databases | QuestDB
Comparison of replication options of time-series databases.
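
The sync versus async question can be illustrated with a toy model. This sketch is not from the QuestDB article and does not model any specific database: the `Primary` and `Replica` classes and the `lost_on_failover` helper are hypothetical. It shows the core trade-off: a synchronous primary acknowledges a write only after the standby has it, while an asynchronous primary acknowledges immediately and ships the write later, so a crash before shipping loses acknowledged data.

```python
class Replica:
    """Standby node that simply applies the writes it receives."""

    def __init__(self):
        self.data = {}

    def apply(self, key, value):
        self.data[key] = value


class Primary:
    """Primary node with either synchronous or asynchronous replication."""

    def __init__(self, replica, synchronous):
        self.data = {}
        self.replica = replica
        self.synchronous = synchronous
        self.pending = []  # writes acknowledged but not yet shipped (async mode)

    def write(self, key, value):
        self.data[key] = value
        if self.synchronous:
            # Acknowledge only after the standby has the write.
            self.replica.apply(key, value)
        else:
            # Acknowledge right away; replication happens later in flush().
            self.pending.append((key, value))
        return "ack"

    def flush(self):
        for key, value in self.pending:
            self.replica.apply(key, value)
        self.pending.clear()


def lost_on_failover(synchronous):
    """Write three keys, crash the primary before any flush, list lost keys."""
    replica = Replica()
    primary = Primary(replica, synchronous)
    for i in range(3):
        primary.write(f"k{i}", i)
    # The primary "crashes" here: whatever the replica lacks is lost on failover.
    return sorted(set(primary.data) - set(replica.data))


print(lost_on_failover(synchronous=True))   # prints []
print(lost_on_failover(synchronous=False))  # prints ['k0', 'k1', 'k2']
```

The price of the synchronous mode, which this toy model does not show, is that every write waits for the standby, so write latency and availability depend on the replica being reachable.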

#db #timeseries

Building a large scale unsupervised model anomaly detection system 🤟

Lyft leverages tons of ML models to determine a wide range of parameters, from ETAs to pricing. But they also need to understand whether those models perform well. The problem is that different models have different numbers of features and outputs, so Lyft needs to unify and process them efficiently. Find out how they do it in the blog post!

Building a large scale unsupervised model anomaly detection system - Part 1
Distributed Profiling of Model Inference Logs

#ml

2023 State of Platform Engineering Report

The word DevOps is mentioned less frequently while people speak more and more about Platform Engineering. Perforce has published its report on Platform Engineering, and among many valuable insights, you will find the statement that companies underinvest in product managers for their platforms - because a platform is still a product, even when its users are your internal developers. Find the report download below, and while you're going through it, turn on the discussion about developer relations with Baruch Sadogursky here.

2023 State of Platform Engineering Report | Puppet by Perforce
The State of Platform Engineering Report is the 2023 edition of our State of DevOps Report. Download it today & learn platform engineering's impact on DevOps.

#devops #platformengineering

Passwords are no more

Passwords have a long history of problems: they are easy to brute-force, easy to phish and prone to social engineering attacks. With the zero trust world coming, the passwordless approach has finally become publicly available with support from Google and Apple. Read the news post!

So long passwords, thanks for all the phish
By: Arnar Birgisson and Diana K Smetters, Identity Ecosystems and Google Account Security and Safety teams Starting today , you can create a...

#security