Apache Spark Core Concepts 02 (Rdd /data frame/transformations/actions)

Рет қаралды 12,435

Күн бұрын

Пікірлер

@vikaskasaraneni6111 8 ай бұрын

map()- Narrow mappartition() - Narrow groupbyKey()- WideSpread reduceByKey() - WideSpread Join()- Narrow distinct() - WideSpread intersect()- WideSpread flatMap() - Narrow filter() - Narrow Union() - Narrow Please correct me if I am wrong.

@codjawan 6 ай бұрын

Join is a Very Big Wide transformation in Spark, how come you mentioned it under Narrow

@vishnuk-g1b Жыл бұрын

Great work ! your explanation is clear and excellent . I feel you like your content is a hidden gem.

@mmp9371 11 ай бұрын

very nice explanation, mam. Thank you.

@raghavendrareddy4765 2 жыл бұрын

Nice content but bit confusion is there @Bhawna while explanation

@kaushaldangi900 2 жыл бұрын

Hi Bhawna, very nice explanation, could you please share the notebook used during this exercise.

@vishalnasre1251 Жыл бұрын

Is this playlist focused on mainly on Scala ?

@gurramvarunchowdary5735 2 жыл бұрын

I like your content and very informative. Thank you. Could you please share those ppt's if possible?

@anithaanitha-g8b 10 ай бұрын

It is very understanding and great sessions , can you please provide the notebook for future reference purpose.

@TamizharasanL-sx9yn 6 ай бұрын

Maam I have a question to you . When you say action has stages and tasks etc then, What happens really happening behind the transformation ? Is it just computing and storing it as a dataframe ?

@jdisunil 2 жыл бұрын

Great content and Great delivery: Question: if RDDs are immutable, and next RDD is created on basis of previous. What happens to the previous RDDs, how many such rdds are kept to until its freed? I know I should bother about the latest one. but still.

@venkatakrishnaprasadk1214 2 жыл бұрын

The previous RDDs are by default deleted after successful generation of new RDD- unless we use persist method, in which case the RDD we want will be persisted in cache

@codjawan 6 ай бұрын

Yaa that's true if it fail at any step it can go back to previous step to recalculate the step again after successful it will delete the previous Rdd's