Hadoop

Views: 30,480

Altamira TC

Days ago

Comments: 25
@ravibharathiii 12 years ago
One of the best Hadoop presentations. Thanks a lot!
@misterbruno 12 years ago
Good information. I liked the overlay of slides over the video. I wouldn't think that would work but it does. Sound is excellent except for questions from some members of the audience and when the speaker turns his back.
@srinivassr1985 12 years ago
Thx. Best MapReduce tutorial I have ever watched.
@piyushmishra1289 12 years ago
Ultimate video for a Hadoop overview. Must watch.
@mailvkjain 12 years ago
Awesome video. Loved the presentation and the ease with which it's presented.
@scottleber 11 years ago
Karthik, in Hadoop the replication is for data redundancy. It also provides the map/reduce framework with multiple places to schedule mappers, right? i.e. with default replication of 3, a mapper for a given data block can be scheduled on any of the 3 different machines where that block is located. As for how Hadoop does block splits, it basically splits at the block size, regardless of natural record boundaries. The record readers in the map phase know how to retrieve records that were split.
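To illustrate the point about records that straddle a block boundary, here is a minimal sketch in Python. It is not Hadoop's actual record reader; `read_split` and `split_offsets` are hypothetical names, and the ownership rule used here (a split owns every line that begins inside it, and reads past its end to finish the last one) is a simplifying assumption in the same spirit.

```python
def split_offsets(total: int, block_size: int) -> list[tuple[int, int]]:
    """Cut a file of `total` bytes into (start, length) splits at fixed block size,
    ignoring record boundaries entirely."""
    return [(off, min(block_size, total - off)) for off in range(0, total, block_size)]


def read_split(data: bytes, start: int, length: int) -> list[bytes]:
    """Return the newline-delimited records owned by the split [start, start + length).

    Assumed rule: a split owns each record that begins inside it. A record cut by
    the start of the split belongs to the previous split, so it is skipped; a record
    cut by the end of the split is read to completion, past the split boundary.
    """
    end = min(start + length, len(data))
    pos = start

    # If the split begins mid-record, skip ahead to the next record start.
    if start > 0 and data[start - 1:start] != b"\n":
        newline = data.find(b"\n", start)
        if newline == -1:
            return []  # the rest of the file is the tail of an earlier record
        pos = newline + 1

    records = []
    while pos < end:
        newline = data.find(b"\n", pos)
        stop = len(data) if newline == -1 else newline
        records.append(data[pos:stop])  # may extend beyond `end`
        pos = stop + 1
    return records
```

With `data = b"alpha\nbravo\ncharlie\ndelta\n"` and a block size of 8, the four splits yield `[alpha, bravo]`, `[charlie]`, `[delta]` and `[]`: every record is read exactly once even though none of the cuts fall on a newline.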
@MohammadAdnanRaza 11 years ago
What a presentation. Very nice. Thanks for sharing.
@scottleber 11 years ago
Karthik, in general the fact that the data is replicated 3 times doesn't affect performance, since map/reduce processes each block only once in the map phase. But generally yes, the more data you have, and thus the more data which must be scanned by the mappers, the longer your map/reduce job will take to run. However, performance depends on many factors such as the size of your cluster, how busy the cluster is at the moment, etc.
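A toy sketch of why replication adds scheduling choices without adding map work, written in Python. The real Hadoop scheduler is far more involved; `schedule_mappers`, the greedy least-loaded rule, and the host names are all assumptions made for illustration.

```python
from collections import Counter


def schedule_mappers(block_replicas: dict[str, list[str]]) -> dict[str, str]:
    """Assign each block to exactly one host that holds a replica of it.

    Greedy rule (an assumption, not Hadoop's algorithm): pick the replica host
    with the fewest mappers assigned so far, breaking ties by host name.
    """
    load: Counter[str] = Counter()
    assignment: dict[str, str] = {}
    for block, hosts in block_replicas.items():
        host = min(hosts, key=lambda h: (load[h], h))
        assignment[block] = host
        load[host] += 1
    return assignment


# Three blocks, each stored on three hosts (replication factor 3).
replicas = {
    "block-0": ["node1", "node2", "node3"],
    "block-1": ["node1", "node2", "node4"],
    "block-2": ["node1", "node3", "node4"],
}
plan = schedule_mappers(replicas)
stored_copies = sum(len(hosts) for hosts in replicas.values())
print(f"stored copies: {stored_copies}, map tasks: {len(plan)}")
```

Nine block copies sit on disk, but only three map tasks run, one per logical block. In the terms of the question, 20 PB replicated three times occupies 60 PB of storage, yet a job still scans 20 PB.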
@psjrajarajan 12 years ago
Thank you so much, great overview of Hadoop.
@saikarthik16 11 years ago
Thanks for such an informative demo! I have a couple of questions. 1. The data itself is very big (for example, Google processes 20 PB of data per day). In Hadoop we replicate the data 3 times, so it becomes 60 PB of data. Won't that affect processing performance? I'm new to this, so if my perception is wrong, please correct me. 2. Can you please give me an example of how unstructured data is split into blocks and stored, and how it is queried? Thanks
@scottleber 12 years ago
The description now includes a link to the code samples on GitHub
@stholy32 12 years ago
super good vid !!! many thx !!!
@nebzero1990 12 years ago
Is the code available?
@scottleber 12 years ago
For some reason I am having a hard time pasting the actual URL and getting it to work properly (it keeps expanding into a bunch of hex characters). If you go to github.com / sleberknight then choose the project called basic-hadoop-examples that should get you there
@scottleber 12 years ago
The sample code is available on GitHub at github.com/sleberknight/basic-hadoop-examples
@mjshaheed 10 years ago
It's been more than 3 years since this video was uploaded, but in the MapReduce wordcount program, line 31 is unnecessary. 'word' is not used anywhere. The code would work just fine without that line!
@paderborner5213 9 years ago
mjshaheed You're right. Some guy in the audience noticed it as well @30:30 :)
@lambdafunc 12 years ago
Awesome :)
@ALAAMURAD 12 years ago
It would have been nice if you could patch some of the poor voice spots. But nice presentation!
@backlit01 13 years ago
Thank you for this video. It is very informative.
@piyushmishra1289 12 years ago
page is showing 404.
@nebzero1990 12 years ago
thanks!
@ningzhao569 11 years ago
thanks yo
@Sk99012 12 years ago
super like
@thomasbenny4202 10 years ago
You can easily configure single-node Hadoop using the blog below: hadoopcorner.blogspot.in/