How Prometheus Halved Its Memory Usage - Bryan Boreham, Grafana Labs

  Views: 3,938

CNCF [Cloud Native Computing Foundation]

8 months ago

Prometheus is the most popular CNCF project for monitoring, but if you have millions of metrics it needs gigabytes of memory.
And because Prometheus runs as a single process, it is sometimes a challenge to find a machine large enough to run it.
The good news is that, over the past year, work has been done to reduce Prometheus’ memory usage, in some cases by 50%.
In this talk we’ll look at:
* What kinds of data Prometheus is holding in memory.
* How profiling lets you analyze where the biggest savings could be found.
* How transaction isolation was causing a big blow-up for the largest Prometheus servers.
* Why a whole new data structure was needed for the labels that identify time series.
* The impacts of these changes on related projects such as Cortex and Thanos.

Comments: 8
@Worldwidegam3r 7 months ago
Great to see the progress from Bryan's PromCon talk in 2022.
@pawandeepb5967 8 months ago
This is awesome!!! Thank you for a great talk.
@lichir 8 months ago
This is a masterclass. Thank you very much Sir!
@AbhishekKothari638 7 months ago
Fan! This was just the most awesome storytelling.
@BryanBoreham 5 months ago
😊
@everythingtechrelated 3 months ago
How can I reduce memory usage when I have ~3000 pods running with the Istio proxy sidecar? I have also disabled a few non-required metrics, but Prometheus still uses 40GB. We are using the Prometheus Operator with Thanos. I tried the new version and GOGC at 60%, but it did not help much. Any help here is appreciated.
@scrabblesolver8168 6 months ago
I don't understand how many of these symbol tables are created so that they don't become a bottleneck, and on what basis. This is one of the most important things for this design choice, but it was not mentioned.
@BryanBoreham 5 months ago
If you can read Go, check out the code in the Prometheus repo, PR number 12304. The main idea is that there are multiple symbol tables: one for each scraping target, one for reading each block from disk, etc. Also, the table is locked for writing (i.e. when new series are added) but lock-free for reading, because all the data used by an existing series is immutable.