We are using CASSANDRA for metric time series data store. Looking for dashboarding on the top of it. Looked at Superset. BUT superset likes to speak SQL. That research led me to Apache Pinot. How do you compare cassandra vs Pinot for time series data. Numbers. - 1000 metrics, 500k metrics/Minute, 200K dimensions.
@kishoreg19803 жыл бұрын
If you have only metrics and values, Cassandra is good enough but if you have multiple dimensions for each metric, then something like Pinot is a better option.
@hemanthaugust72172 жыл бұрын
@@kishoreg1980 Do you see any downsides in using a TimeseriesDB such as Prometheus (ignore its alerting & other capabilities, if you don't need them)for this usecase; I agree it's not distributed system. If you have a lot of data, you could explore Grafana Mimir & Grafana UI for dashboarding. Let me know if you see any problems with this solution. It's just 1K metrics and 500k metrics/min datapoints is not a lot. Yes, there are many dimensions to it, and Mimir can shard these and solve it at scale.
@hemanthaugust72172 жыл бұрын
@The Leaf Please explore Grafana Mimir too.
@ashypeshy3 жыл бұрын
SUPERB
@nipuntalukdar4 жыл бұрын
Great talk.
@mudunurisrujitha20844 жыл бұрын
Is pinot is having any graphql integration point as such?
@kishoreg19804 жыл бұрын
No.
@mudunurisrujitha20844 жыл бұрын
and what is the idea behind choosing the samza as stream processing?
@kishoreg19804 жыл бұрын
Samza was built at LinkedIn. One can use any system for stream processing - Flink, Spark Streaming, etc