Understanding HDFS using Legos

  Рет қаралды 148,642

InfoQ

InfoQ

9 жыл бұрын

You've been hearing about Hadoop and HDFS. How does it work?
In this video, we use an innovative method to show how HDFS works with Legos. Jesse Anderson shows how HDFS handles files and replicates the data, then covers the read and write paths for the data. Finally, he talks about how HDFS handles failure scenarios and the importance of data locality.
** Hadoop & Apache Spark training from NewCircle: newcircle.com/category/big-data

Пікірлер: 110
@stackinglittlesats
@stackinglittlesats 5 жыл бұрын
Nice explanation man, if I could, I would buy you an air conditioner. You deserve it.
@SIRabhinav
@SIRabhinav 5 жыл бұрын
may the force be with you
@miguelchris6374
@miguelchris6374 2 жыл бұрын
sorry to be offtopic but does anyone know a trick to log back into an Instagram account? I was stupid forgot my account password. I would love any tricks you can give me.
@miguelchris6374
@miguelchris6374 2 жыл бұрын
@Billy Dariel it worked and I actually got access to my account again. Im so happy! Thank you so much you really help me out!
@billydariel9140
@billydariel9140 2 жыл бұрын
@Miguel Chris Happy to help :D
@ZeTamboh
@ZeTamboh 2 жыл бұрын
@@billydariel9140 nice ad
@davidvella7141
@davidvella7141 3 жыл бұрын
This video was uploaded 5 years ago and it still one of the best explanations I've ever seen.
@ThorstenStaerk
@ThorstenStaerk 2 жыл бұрын
the first explanation at all that could tell me what Hadoop has to do with MapReduce
@taxatlanticinc6611
@taxatlanticinc6611 6 жыл бұрын
This is a best explanation I have seen yet! It's a lot more engaging and informative than the traditional PowerPoint! Thank You!!
@Coffingdw
@Coffingdw 8 жыл бұрын
Nice job Jesse. Very informative and creative. Thank you. TeraTom
@AdrianRodriguezWebDevelopment
@AdrianRodriguezWebDevelopment 8 жыл бұрын
This video just made my day! Thank you New Circle Training! And thank you Doug Cutting for sharing this video on Twitter.
@wisdomandpeace4897
@wisdomandpeace4897 9 жыл бұрын
Excellent video. I actually understand Hadoop somewhat after watching this video.
@TheMrsStinsfire
@TheMrsStinsfire 7 жыл бұрын
11:06 R.I.P. Data Node 3
@supermonkey965
@supermonkey965 5 жыл бұрын
He was a good node, admired by his node friends.
@ElCuchu
@ElCuchu 4 жыл бұрын
I'm still crying, can't get over it, such a good node dude rippp
@parthnagdev
@parthnagdev Ай бұрын
He is happy in the Node Heaven and is saving all the replicated data it ever wanted to save.
@AhlamLamo
@AhlamLamo 3 жыл бұрын
Amazing explanation !! one of the best videos I've seen about HDFS
@ravianantharamaiah7567
@ravianantharamaiah7567 3 жыл бұрын
Excellent teaching. Conceptually things are very clear now. Thank you.
@gustavogbfBR
@gustavogbfBR 9 жыл бұрын
Nice work. Really help me to understand how HDFS works.
@amitprakashpandeysonu
@amitprakashpandeysonu 2 жыл бұрын
Really nice and innovative way to teach hdfs concept. loved it and understood it very clearly. Thank you.
@klausdupont6335
@klausdupont6335 5 жыл бұрын
Incredible illustration! Would love to see more on this topic in this form :-)
@maryoleary8660
@maryoleary8660 3 жыл бұрын
I love learning with legos, even watching it at 1.5x, I was able to follow along easily. Well Done.
@MuzamilKhan-rl2sh
@MuzamilKhan-rl2sh 4 жыл бұрын
Wow man, you explain it in a creative way.
@Happymoon789
@Happymoon789 5 жыл бұрын
Thanks for your efforts! Smart display!
@prohouse6088
@prohouse6088 8 жыл бұрын
very nice teaching methodology jesse, thanks for sharing
@dingman081130
@dingman081130 6 жыл бұрын
gorgeous presentation, thanks
@ThomasEhardt
@ThomasEhardt 7 жыл бұрын
Great introduction to HDFS!
@thandekilenzungu7240
@thandekilenzungu7240 2 жыл бұрын
The explanation is so clear I understood everything
@i_e_she
@i_e_she 5 жыл бұрын
This was great, thank you! Should have more views.
@MSDlublin
@MSDlublin 7 жыл бұрын
Very good work for begginers - THANKS A LOT!
@joecordingley7071
@joecordingley7071 8 жыл бұрын
This was great, thanks.
@abhijeet_r
@abhijeet_r 6 жыл бұрын
Very innovative presentation thanks a lot!
@CarlosMercadoINIGTDY
@CarlosMercadoINIGTDY 5 жыл бұрын
Excellent video, thanks!
@olesyagorbacheva6991
@olesyagorbacheva6991 6 ай бұрын
Thank you for such a good explanation!
@thndesmondsaid
@thndesmondsaid 11 ай бұрын
Jesse! Great explanation as always.
@markhellel3371
@markhellel3371 6 жыл бұрын
Great Job Jesse! Nicely done! :-)
@1234abcd2139
@1234abcd2139 7 жыл бұрын
nice illustrative way of teaching HDFS. Would have been wonderful if some more information was given about fallback mechanism of Name node or coordinator
@nirupamaj6140
@nirupamaj6140 7 жыл бұрын
very informative, thank you
@kausaralam2605
@kausaralam2605 8 жыл бұрын
Great explanation!
@yicai7
@yicai7 4 жыл бұрын
Great explanation! Voted!
@haydo8373
@haydo8373 5 жыл бұрын
Superb, I had it running at 1.5x and it was still easy to follow! Thanks :) Can you explain every CS concept with Lego? - that would be amazing
@juiliussmith5702
@juiliussmith5702 2 жыл бұрын
thanks dude lol
@CB-fz3li
@CB-fz3li 4 жыл бұрын
Nice clear explanation
@mrdhksan
@mrdhksan 5 жыл бұрын
Excellent, thank you. A serious question: What would happen if two of the four nodes crash?
@manojprabhakar5522
@manojprabhakar5522 4 жыл бұрын
Awesome, Thank you for the explanation, Could you please make videos of Spark with Yarn and how the communication is handled?
@underlecht
@underlecht 3 жыл бұрын
looks like we have a perfect explaination!
@arisweedler4703
@arisweedler4703 6 жыл бұрын
Great explanation! I assume that another benefit of the HDFS is that reading large files will be quicker, because you would be able to effectively "BitTorrent" from your cluster. Does HDFS do that?
@BabtaOfEinGedi
@BabtaOfEinGedi 6 жыл бұрын
Perfect! Thanks so much
@jesusoliveros9950
@jesusoliveros9950 5 жыл бұрын
Amazing !!! great Job
@rimchatti3807
@rimchatti3807 5 жыл бұрын
Nice job, it is helping getting familiar with HDFS. I'm new to Data Engineering and so on.. Could you please explain to me what is a cluster. Thanks;
@Guitarman01
@Guitarman01 7 жыл бұрын
Good Presentation, however I do have question. Since the file is split to other nodes, doesn't replication also take places so that if a node does go down, then you can retrieve. Node 3 went down, but I would have figured I could get it from another node. Does master save a copy of all files as well? I didn't see how that works on the video.
@alaayari6391
@alaayari6391 3 жыл бұрын
thanks for the explanation
@bugs181
@bugs181 8 жыл бұрын
I'm just now learning about the methods used in distributed file systems. I'm an application developer and it's a bit difficult to wrap my head around the lower level storage systems like HDFS. This video explained replication in an easy to understand way. Now only if I could have one other BIG question answered. What kind of file system would we use if we want an application to use a virtualized file system stored over many nodes? For example, we want each node to add additional storage capacity. To the application layer, this would look like a single big storage drive but to the lower level facilities this would use network coordination to serve the files to the application.
@kishorpatil6260
@kishorpatil6260 8 жыл бұрын
ĺm ĺ
@buzz-uk
@buzz-uk 5 жыл бұрын
Hi, While setting up pseudo or full cluster, do we need to format data node with HDFS file system or we only have to format namenode. I am asking this because, I have read this on many blogs that, HDFS stores the data in sequential order on the hard disk and it is an abstract layer which stores data on big blocks rather than default block size storage provided host file system. If we are not formatting datanode than the powerful feature of HDFS will come to toss.
@CosmeJunior
@CosmeJunior 6 жыл бұрын
Nice Job. Brazil thanks you!
@TzGiwrgos15
@TzGiwrgos15 7 жыл бұрын
Brilliant!
@mahdiamrollahi8456
@mahdiamrollahi8456 3 жыл бұрын
Hello, nice job, I have a question, as a file system how hadoop can manage a database file(like mssql or mysql) file? how can it devide an .MDF file to other separeted files to store them on different machines? Because such files, have meta data and overhead and they are not like a basic txt file. Does hadoop have special system to treat each file type differently? Regards.
@abhimanyukarkara4218
@abhimanyukarkara4218 2 ай бұрын
Question: when we have to read from let's say the red file. Would all three nodes be processing simultaneously different data (chunks) and give us an combined output or would only one node process the complete the data processing alone?
@shyland20
@shyland20 6 жыл бұрын
why s3 service streaming with embedded link is slow (get stock every 2-3 second) when embedded on wp site? after understanding what you saying how can i improve the speed? i read something about the hdfs but i don't understand how it's related to s3 if at all. thanks in advance
@ulrikkallblad6698
@ulrikkallblad6698 5 жыл бұрын
Very nice video! Only one question: If node 3 is down, how can the data from node 3 be moved to the other nodes?
@forbin80
@forbin80 4 жыл бұрын
@@brianbitchballs3902 thanks for the great explanation BrianBitchBalls
@marflem12
@marflem12 5 жыл бұрын
Thank You
@JackyA123
@JackyA123 5 жыл бұрын
yoou haveabsolutely no need to be nervous! Doing a great job here
@ahmedaj2000
@ahmedaj2000 3 жыл бұрын
Thanks 😊
@taharhalloub8721
@taharhalloub8721 3 жыл бұрын
Thanks a lot
@joseenrique6723
@joseenrique6723 2 жыл бұрын
For the red file, are EACH of the replicas still 64 mb in size?
@Gorlung
@Gorlung 3 жыл бұрын
what does happen when you add a new and empty node?
@myeverymusic
@myeverymusic 5 жыл бұрын
What will happen once Data Node 3 is alive again? will the Name Node asks other nodes to copy some data to Data Node 3?
@AbhinavSingh-oq7dk
@AbhinavSingh-oq7dk 3 жыл бұрын
if a data node malfunctions, then name node instructs remaining data nodes to create replicas of files that malfunctioned data node held. why create another replica when there are two others already? I mean they are there for the backup, right? Do correct if I am missing something. Thanks.
6 жыл бұрын
The fact that hbase write on a node as you say is the reason why it corrupts the HDFS filesystem so easily?
@elwyndude
@elwyndude Жыл бұрын
If a node goes down, why does it need 3 replications to pull the data, could it not just read from the existing two?
@stivstivsti
@stivstivsti 5 жыл бұрын
thanx!
@wow376
@wow376 4 жыл бұрын
feel like buying Legos already!
@Irresponsibleful
@Irresponsibleful 4 жыл бұрын
did you get a AC by now ?
@nocontentnoname5922
@nocontentnoname5922 4 жыл бұрын
Did we find who broke node 3 yet?
@satwindersetia4367
@satwindersetia4367 7 жыл бұрын
Very creative, indeed.
@malesamuel7736
@malesamuel7736 5 жыл бұрын
Cool
@amni5tianone263
@amni5tianone263 2 жыл бұрын
tnx
2 жыл бұрын
Interesting, it's very similar to how Elasticsearch works
@KenedyYinkfuChuye
@KenedyYinkfuChuye 7 жыл бұрын
Now I like Legos :)
@barefeg
@barefeg 3 жыл бұрын
What if hulk smashes the naming node?
@GiacomoMilazzo
@GiacomoMilazzo 6 жыл бұрын
I don't understand. If each set of blocks is "one" file (red, yellow, blue) why he says that blocks are replicated? He should say "distributed", not replicated! Replicating involve data resiliency, erasure code and so on. Is it? Then he put the case that one of the cluster's node crash. So in this case replications come on play. And, of course, he should not call the set of blocks "one" file. But it should say there's one file composed of "n" chunks that are replicated among nodes of cluster.
@draganglumac
@draganglumac 6 жыл бұрын
The way I understood it, = . I suppose (if my understanding is correct of course) the confusion then comes from the fact that at the beginning of the video he said that a = . He really should have started with just one row of lego bricks for each file, and just explained that a data node sends a copy of the block it just wrote to one its data peers as directed by the control node.
@user-oi3ce5nj3m
@user-oi3ce5nj3m 4 жыл бұрын
狡兔三窟说的就是这个意思
@user-ml2ci7wl1f
@user-ml2ci7wl1f 5 жыл бұрын
英语不太好,但是觉得很棒
@danielleu.877
@danielleu.877 4 жыл бұрын
SUPER informative, but also i hear "Hadoop" and just think "Hadooken" just me? yeah okay hahaha
@samiulsaeef2076
@samiulsaeef2076 3 жыл бұрын
play in 1.25x
@ravatmehul3906
@ravatmehul3906 7 жыл бұрын
Nican
@guille.p
@guille.p 6 жыл бұрын
It started off pretty well but then it got very confusing. He didn't seem so sure of what he was saying. It didn't work for me. Thank you, anyway.
@marcelscherzer8385
@marcelscherzer8385 4 жыл бұрын
its lego, not legos... but nice vid.
@JM-fp3gf
@JM-fp3gf 9 жыл бұрын
Why is he so sweaty?
@jessetanderson
@jessetanderson 9 жыл бұрын
Yeah, it was the lighting. We tried moving the lights around, but I didn't have any makeup on which mitigates the lights.
@musasall5740
@musasall5740 7 жыл бұрын
You should not answer this moron. u doing a good job for free
@vishusingh008
@vishusingh008 7 жыл бұрын
Looks like you are a moron!
@vishusingh008
@vishusingh008 7 жыл бұрын
He replied so kindly and genuinely, you are calling him moron.
@stonemysterioserusss
@stonemysterioserusss 7 жыл бұрын
Pretty sure he was referring to the initial commenter, not Jesse. Rude remark nevertheless.
@viewerone
@viewerone 5 жыл бұрын
It's been quite a challenge to hear this video. Headphones are in but it doesn't seem to help.
@FredroStarr12
@FredroStarr12 5 жыл бұрын
audio volume is fine to me, must be an issue on your machine
@viewerone
@viewerone 5 жыл бұрын
Freddy yes, that’s what it was. Guess my Mac needed a reboot. Worked fine afterwards.
@cafecapes
@cafecapes 9 жыл бұрын
Why do Americans call Lego bricks Legos? Lego is a company name and small building bricks is what they make, they don't make Legos! You can't implicitly type Lego bricks as Legos it sounds silly.
@lucaborzani56
@lucaborzani56 9 жыл бұрын
we do the same in Europe. Where are you from?
@cafecapes
@cafecapes 9 жыл бұрын
I've been thinking deeply about this and decided I'm the worst person to be dictating English. Briton mate.
@bugs181
@bugs181 8 жыл бұрын
cafecapes Every nation has it's own way of speaking. There's a very elaborate section on the Stack Exchange website that goes into great depth on the difference in languages, pronunciation, and word usage. If this is a serious inquiry, I'd suggest you go there. It's a very informative place to learn anything you want - and if the question hasn't already been asked, you can ask it yourself. One example is that there is a topic on how Americans pronounce the word solder as "sodder" while other countries pronounce it as "sold-er" and where this distinction came from. You might be surprised to know that the language variations have a lot to do with heritage dating way way back. Every language and dialect, regardless of what it is has become bastardized - and it's just a part of how languages evolve. For what it's worth, I used to pronounce it as "sold-er" until I got tired of being corrected - and I have no indication of where I learned this from. I now colloquially pronounce it as "sodder" just because of tradition and geographical linguistics. Also, pronunciations and accent vary widely in the United States from coast to coast also. Apologize for the long comment.
Learn MapReduce with Playing Cards
9:44
Jesse Anderson
Рет қаралды 343 М.
Basic Introduction to Apache Hadoop
14:00
Hortonworks
Рет қаралды 235 М.
The Noodle Picture Secret 😱 #shorts
00:35
Mr DegrEE
Рет қаралды 30 МЛН
MEU IRMÃO FICOU FAMOSO
00:52
Matheus Kriwat
Рет қаралды 18 МЛН
3 wheeler new bike fitting
00:19
Ruhul Shorts
Рет қаралды 45 МЛН
Understanding Kafka with Legos
11:48
Jesse Anderson
Рет қаралды 92 М.
What is MapReduce?
5:37
internet-class
Рет қаралды 242 М.
Radio Hacking: Cars, Hardware, and more! - Samy Kamkar - AppSec California 2016
51:12
MapReduce - Computerphile
6:41
Computerphile
Рет қаралды 250 М.
HDFS Architecture
1:06:33
Durga Software Solutions
Рет қаралды 718 М.
The Tragedy of systemd
47:18
linux.conf.au
Рет қаралды 1,1 МЛН
Busy Beaver Turing Machines - Computerphile
17:56
Computerphile
Рет қаралды 404 М.
Hadoop Distributed File System (HDFS) Introduction
8:28
Hortonworks
Рет қаралды 71 М.
Gizli Apple Watch Özelliği😱
0:14
Safak Novruz
Рет қаралды 2,6 МЛН
WWDC 2024 - June 10 | Apple
1:43:37
Apple
Рет қаралды 10 МЛН
После ввода кода - протирайте панель
0:18