Apache Cassandra - Tutorial 6 - Partitioning, Rings and Tokens

  Рет қаралды 74,388

jumpstartCS

jumpstartCS

Күн бұрын

Пікірлер: 41
@Thundernixx
@Thundernixx Жыл бұрын
This was one the most explainable videos that I've saw about this topic, thank you so much for sharing it.
@jumpstartCS
@jumpstartCS Жыл бұрын
Really glad you found it useful! Thanks for the support!
@ahmetcihan8025
@ahmetcihan8025 5 жыл бұрын
Keep it up man... Thats one for the books.
@asefnak4764
@asefnak4764 7 ай бұрын
Thanks man day 3 into Cassandra and I've completed 6th today
@jumpstartCS
@jumpstartCS 6 ай бұрын
Great to hear, thanks for watching!
@donrounds7791
@donrounds7791 Жыл бұрын
minor observation - should be referring to tokens in a range that includes either the lower or upper bound. If the number is between 10 and 20 then that means 10 and 20 wont be mapped to a node. So either >=10 n 10 n
@kbsunil7464
@kbsunil7464 5 жыл бұрын
since Cassandra is highly available and fault tolerant .ie if any node fails the data should be available from other nodes. but in you example all partitions with bmw resides in node one. what happens when node 1 fails .....from where did we get data for the partition bmw
@santu4bth
@santu4bth 4 жыл бұрын
sharding is distribution or partition of data across multiple different machines whereas partitioning is distribution of data on the same machine
@valerylukin8446
@valerylukin8446 4 жыл бұрын
Other nodes will have copy of the data from other nodes. This is controlled by replication factor that is set when you create a Keyspace. For example: You have 5 nodes and created a Keyspace with replication factor = 2. Then you have emergency with one of the node. Then Reads that this nodes suppose to be serving will be served from another node, since replication factor was set to 2 and one node contains duplicate data from failed node. I am not in anyway advanced with Cassandra, but this is the way I understand it will behave, based on my knowledge.
@guents
@guents 2 жыл бұрын
So will a query across multiple partitions work but be inefficient, or it won't work at all?
@rohitmania1
@rohitmania1 4 жыл бұрын
Question - What happens when a node runs out of capacity? Is this practically possible? Say a bunch of tokens for any given node represent too much data and needs to be split across more than one nodes. How do we handle that?
@heh2k
@heh2k 3 жыл бұрын
You watch df and add servers (with more drives) when the nodes fill up. You need to add enough nodes to take a slice of keyspace (the token ranges) from each existing node.
@HarshaVardhan-jf9sd
@HarshaVardhan-jf9sd 5 жыл бұрын
If i have a partition key that is being used heavily, then the data that is stored in the node against that partition key would be more than other nodes, this introduces non uniform distribution of data with some nodes being heavily loaded and other not...how do we deal with such kind of issues?
@jumpstartCS
@jumpstartCS 5 жыл бұрын
I would recommend rethinking your partitioning strategy. It is important to partition on an appropriate value and think long term to make sure this will not cause issues down the line.
@muhammedsalama8323
@muhammedsalama8323 Жыл бұрын
Thanks for the great videos, please do you have monitoring shell scripts for cassendra for crontab jobs
@jumpstartCS
@jumpstartCS Жыл бұрын
Hello, thanks for the support! I don't have any scripts handy but I would recommend checking out the book Cassandra: The Definitive Guide by O'Reilly as it might have some content that will help you!
@madhusudandad6028
@madhusudandad6028 4 жыл бұрын
What if a node has exceeds it's storge limit based on token rane and new entry comes? Now I have setup cassandra on my machine...so it has only one node...so by default everything will go in the node only right?
@jackcasey1502
@jackcasey1502 3 жыл бұрын
Great video! Anyone else think the ring diagram at 6:04 looks like a turtle?
@amanali9501
@amanali9501 Жыл бұрын
If i have a 100 different partition key ,so do i need 100 different machines?
@jumpstartCS
@jumpstartCS Жыл бұрын
No you do not. A single node (machine) can handle one or more partitions. This is very common.
@summary017
@summary017 Жыл бұрын
How does 63 value decided ? 2 to the power of 63 .... Where is 63 configured and what does it refer to ?
@amanali9501
@amanali9501 Жыл бұрын
It is 64 bit number range , do Google it
@abhinee
@abhinee 5 жыл бұрын
do we need to worry about token assignment while creating tables in cassandra?
@AwaraGhumakkad
@AwaraGhumakkad 4 жыл бұрын
how do we decide partition key ? what is a best way to decide? just guessing - is it the common column between 2 tables?
@smitalpatel
@smitalpatel 3 жыл бұрын
not the common column but the column on which you will be searching with where clause. Reason is that particular key should be on same partition so that the retrieval is faster.
@aswathis2938
@aswathis2938 4 жыл бұрын
why cant i query my table in cassandra using primary key? It will work right then why its not according to your video?
@wulymammoth
@wulymammoth 4 жыл бұрын
Your question is unclear -- are you talking about query by PK on the table whose partition key is "car make" or the second table where the partition key is the "car ID" which is the same as our idea of a PK in a relational DB? If you're querying by PK, you want to build a table whose partition key is your PK...
@aswathis2938
@aswathis2938 4 жыл бұрын
@@wulymammoth I am a beginner in cassandra, My question is if suppose say i define primary key like PRIMARY KEY((car_id), car_make) then if i use select statement like SELECT * FROM CAR WHERE car_id = "some" and car_make = "some" is it required to mention both car_id & car_make in "where" clause in cassandra query or just "car_id"?
@progfan234
@progfan234 Жыл бұрын
Thanks
@jumpstartCS
@jumpstartCS Жыл бұрын
Thanks so much for your support!
@progfan234
@progfan234 Жыл бұрын
@jumpstartCS you're welcome. Thanks once again!
@heh2k
@heh2k 3 жыл бұрын
But how are the tokens assigned? This seems to be totally undocumented, which is bizarre because it's critical to data safety.
@abirpaul9027
@abirpaul9027 Жыл бұрын
Thanks for sharing the knowledge
@jumpstartCS
@jumpstartCS Жыл бұрын
No problem thanks for watching!
@donl7737
@donl7737 4 жыл бұрын
Why does it sound somewhat similar to consistent having technique?
@melodiessim2570
@melodiessim2570 4 жыл бұрын
it is the consistent hashing (according to Cassandra's website)
@cantwaittowatch
@cantwaittowatch 5 жыл бұрын
Excellent!
@Andrzej3935
@Andrzej3935 Жыл бұрын
Thanks!
@shivendrakadam7596
@shivendrakadam7596 5 жыл бұрын
Awesome
@198273sam
@198273sam Жыл бұрын
That's awesome
@jumpstartCS
@jumpstartCS Жыл бұрын
Thanks for the support Samir!
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
黑天使被操控了#short #angel #clown
00:40
Super Beauty team
Рет қаралды 61 МЛН
Why Ghostty is written in Zig (not Rust or Go)
3:42
Changelog
Рет қаралды 15 М.
Partitioning Key vs Clustering Keys in Cassandra | Cassandra Fundamentals
24:02
Database Sharding and Partitioning
23:53
Arpit Bhayani
Рет қаралды 109 М.
Local GraphRAG with LLaMa 3.1 - LangChain, Ollama & Neo4j
15:01
Coding Crash Courses
Рет қаралды 37 М.
Cassandra Database Crash Course
19:48
Code with Irtiza
Рет қаралды 35 М.
Solving one of PostgreSQL's biggest weaknesses.
17:12
Dreams of Code
Рет қаралды 223 М.
03 | Intro to Cassandra - Tables, Partitions, and Examples
14:13
DataStax Developers
Рет қаралды 31 М.
Apache Cassandra - Tutorial 5 - Query First Approach
7:10
jumpstartCS
Рет қаралды 78 М.