How we monitor Monzo

bea · 27 July 2018 09:47

Earlier this week we shared our Reliability Report, which shows that we’ve made Monzo more reliable in the last 12 months and explains how we’ve done it.

Monitoring helps us make sure everything’s working as expected, and alerts us when things do go wrong.

If you’ve ever wondered how we do it, Platform Team lead Chris digs into the details here.

anon72173902 · 27 July 2018 09:52

“hundreds of servers”

Pics or it’s lies

glasgow · 27 July 2018 09:54

Ah and hundreds of physical servers or a few physical servers running hundreds of virtualised instances?

anon72173902 · 27 July 2018 09:58

I want it to be physical but it’s obvs gonna be virtual

theshillito · 27 July 2018 10:29

Thanos Store

My balance doesn’t feel so good…

anon4562461 · 27 July 2018 10:49

don’t feel so good…

༼ つ ◕_ ◕ ༽つ

༼ つ ◕_ ◕ ::;:.::…:. . . . . . . . . . . .

༼ つ ◕_ :;:.::…:. . . . . . . . . . . . . .

༼ つ :;:.::…:::…:.:… . . . . . . . . .

༼ ;::,’:;:.::…:::…:.:… . . . . . . . . .

.

tjvr · 27 July 2018 11:44

While we do have some hardware in physical data centers to interconnect with payment schemes, most of our servers are virtual servers running on EC2, Amazon’s cloud.

Most of these servers are Kubernetes “worker” nodes. Each of these workers runs many different microservices, each in its own container. So we have containers on top of virtual servers on top of physical servers in a few Amazon data centers somewhere…
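As a rough illustration of that layering (containers on workers on physical hosts), here is a minimal Python sketch. It is not Monzo's actual setup: the class names, host and worker names, and service names are all hypothetical, and the counts are made up purely to show how the layers nest.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Container:
    service: str  # hypothetical name of the microservice in this container


@dataclass
class WorkerNode:
    name: str  # a virtual server acting as a Kubernetes worker node
    containers: List[Container] = field(default_factory=list)


@dataclass
class PhysicalHost:
    name: str  # a physical machine in a cloud data center
    workers: List[WorkerNode] = field(default_factory=list)


def count_containers(hosts: List[PhysicalHost]) -> int:
    """Total number of containers across every worker on every host."""
    return sum(len(w.containers) for h in hosts for w in h.workers)


# Hypothetical example: one physical host, two virtual workers, three containers.
hosts = [
    PhysicalHost(
        "host-a",
        [
            WorkerNode("worker-1", [Container("service.a"), Container("service.b")]),
            WorkerNode("worker-2", [Container("service.a")]),
        ],
    )
]

print(count_containers(hosts))  # → 3
```

The same microservice can appear in containers on more than one worker, which is why `service.a` shows up twice in the example.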

anon14294927 · 27 July 2018 12:33

Why do you “want” it to be physical?!

glasgow · 27 July 2018 12:42

Because a bank of physical servers with lots of cables and lights is nerd heaven

anon72173902 · 27 July 2018 12:48

^^^ This, very much this

Peter_R · 27 July 2018 14:16

Only if the cable management is good, otherwise it’s hell

daniel · 27 July 2018 15:06

Hundreds of virtualised servers running thousands of containers.

anon5660699 · 27 July 2018 15:20

Wow that blog post brings loads of questions to mind. Can we nominate Chris for the next Q&A? @simonb

anon61228674 · 27 July 2018 15:44

On holiday for a week but happy to answer anything when I get back

anon72030606 · 30 July 2018 23:18

This is great. I’m just rolling out Prometheus and Thanos across my cloud.

Any chance you can share some further details of your custom template for Slack notifications, along with details on the rules fetcher?

Sounds like a missing piece of the puzzle for my implementation

Pete
