Apache Hadoop YARN: How YARN changed Hadoop from v1 to v2

  Рет қаралды 34,555

Hortonworks

Hortonworks

Күн бұрын

Learn about the impact of Apache Hadoop YARN on Hadoop, and how it transforms Hadoop 2 into a Data Operating System.

Пікірлер: 9
@roguelitedev
@roguelitedev 9 жыл бұрын
I literally have goosebumps I'm so excited!! :D
@jameskpl
@jameskpl 11 жыл бұрын
Horton works, thank you so much for the video. A quick question - is there a way to manage the data that's going into hdfs like to check for duplicates. For an eg: we upload data (several GB's and all structured) for the day. And we are asked to upload data after couple of weeks. Is there a way to check/compare the data that's being uploaded now to the data that was uploaded before. So we don't end up having 6 copies of the same data (limit to 3 with replication). Would really appreciate any feedback. Thank you, James.
@dukegaming2231
@dukegaming2231 7 жыл бұрын
jameskpl in hadoop 2 if over replication is done among datanodes, it will thow overReplicatedBlock exception therefor Replication balancers should be run ie define threshold or specify datanodes
@charleygrossman8368
@charleygrossman8368 9 жыл бұрын
One cluster to store them all.
@homoudalshammari9139
@homoudalshammari9139 11 жыл бұрын
Hi I like the question that jameskpl posted. I would add a simple point which is since the data source file has the same and needed to be uploaded into the same NameNode? Is that can be considered as a duplication or overwritten ? Thank you... Hamoud
@MAZEN_TAEMIN
@MAZEN_TAEMIN 8 ай бұрын
here cuz i'm studying hadoop and it's version at the moment in 2024
@vivek2319
@vivek2319 7 жыл бұрын
Arun looks pissed :D What's the matter Arun? Somebody give him Hadoop to play with ;) #IYKWIM :D
@sn20
@sn20 11 жыл бұрын
the life of me... I still cannot understand why in the hell they call YARN as MR2? To me it sounds a like a layer of abstraction for resource management. & Now you have to go through YARN if you need something done on hdfs. (May be another secondary name node in the making...) in other words - Dismantle existing MR and reorg it. More importantly open up the processing unit underlying HDFS to other applications. Let them all fight for cpu time via YARN
@NitishUpreti
@NitishUpreti 10 жыл бұрын
Accidently Hilarious :D
Why YARN is now the Apache Hadoop Operating System
11:40
Hortonworks
Рет қаралды 5 М.
YARN: Hadoop Beyond MapReduce
47:08
InfoQ
Рет қаралды 62 М.
UFC 310 : Рахмонов VS Мачадо Гэрри
05:00
Setanta Sports UFC
Рет қаралды 1,2 МЛН
IL'HAN - Qalqam | Official Music Video
03:17
Ilhan Ihsanov
Рет қаралды 700 М.
Delta Live Tables A to Z: Best Practices for Modern Data Pipelines
1:27:52
Hadoop - Just the Basics for Big Data Rookies
1:25:32
SpringDeveloper
Рет қаралды 336 М.
[Webinar] How to Build a Modern Agentic System
1:00:55
Arthur
Рет қаралды 15 М.
Understanding Hadoop 2 0 Architechture
1:01:38
TechGig
Рет қаралды 34 М.
GraphRAG: The Marriage of Knowledge Graphs and RAG: Emil Eifrem
19:15
Understanding HDFS using Legos
15:03
InfoQ
Рет қаралды 150 М.
Beginner's Crash Course to Elastic Stack -  Part 1: Intro to Elasticsearch and Kibana
56:42
System Design Interview - Step By Step Guide
1:23:31
System Design Interview
Рет қаралды 829 М.
LISA11 - Fork Yeah! The Rise and Development of illumos
1:04:04