Apache Spark Core Concepts 02 (Rdd /data frame/transformations/actions)

  Рет қаралды 12,435

CloudFitness

CloudFitness

Күн бұрын

Пікірлер
@vikaskasaraneni6111
@vikaskasaraneni6111 8 ай бұрын
map()- Narrow mappartition() - Narrow groupbyKey()- WideSpread reduceByKey() - WideSpread Join()- Narrow distinct() - WideSpread intersect()- WideSpread flatMap() - Narrow filter() - Narrow Union() - Narrow Please correct me if I am wrong.
@codjawan
@codjawan 6 ай бұрын
Join is a Very Big Wide transformation in Spark, how come you mentioned it under Narrow
@vishnuk-g1b
@vishnuk-g1b Жыл бұрын
Great work ! your explanation is clear and excellent . I feel you like your content is a hidden gem.
@mmp9371
@mmp9371 11 ай бұрын
very nice explanation, mam. Thank you.
@raghavendrareddy4765
@raghavendrareddy4765 2 жыл бұрын
Nice content but bit confusion is there @Bhawna while explanation
@kaushaldangi900
@kaushaldangi900 2 жыл бұрын
Hi Bhawna, very nice explanation, could you please share the notebook used during this exercise.
@vishalnasre1251
@vishalnasre1251 Жыл бұрын
Is this playlist focused on mainly on Scala ?
@gurramvarunchowdary5735
@gurramvarunchowdary5735 2 жыл бұрын
I like your content and very informative. Thank you. Could you please share those ppt's if possible?
@anithaanitha-g8b
@anithaanitha-g8b 10 ай бұрын
It is very understanding and great sessions , can you please provide the notebook for future reference purpose.
@TamizharasanL-sx9yn
@TamizharasanL-sx9yn 6 ай бұрын
Maam I have a question to you . When you say action has stages and tasks etc then, What happens really happening behind the transformation ? Is it just computing and storing it as a dataframe ?
@jdisunil
@jdisunil 2 жыл бұрын
Great content and Great delivery: Question: if RDDs are immutable, and next RDD is created on basis of previous. What happens to the previous RDDs, how many such rdds are kept to until its freed? I know I should bother about the latest one. but still.
@venkatakrishnaprasadk1214
@venkatakrishnaprasadk1214 2 жыл бұрын
The previous RDDs are by default deleted after successful generation of new RDD- unless we use persist method, in which case the RDD we want will be persisted in cache
@codjawan
@codjawan 6 ай бұрын
Yaa that's true if it fail at any step it can go back to previous step to recalculate the step again after successful it will delete the previous Rdd's
@cusukanya
@cusukanya 2 жыл бұрын
Ma'am do you provide the ppts for reference??
@sunitabedi1230
@sunitabedi1230 2 жыл бұрын
👍👌
@caiyu538
@caiyu538 Жыл бұрын
👍
Apache Spark Core Concepts 01
24:03
CloudFitness
Рет қаралды 20 М.
Azure Databricks Workspace for DE-DS/SQL/ML
20:09
CloudFitness
Рет қаралды 23 М.
Trapped by the Machine, Saved by Kind Strangers! #shorts
00:21
Fabiosa Best Lifehacks
Рет қаралды 40 МЛН
Amazing remote control#devil  #lilith #funny #shorts
00:30
Devil Lilith
Рет қаралды 16 МЛН
What is dbfs? Databricks Filesystem
18:54
CloudFitness
Рет қаралды 13 М.
Databricks Cluster Creation and Configuration?
21:12
CloudFitness
Рет қаралды 27 М.
RDD, DataFrames and DataSets #spark #dataengineering #databricks
18:23
20.  Runtime Architecture of Spark In Databricks
19:41
CloudFitness
Рет қаралды 12 М.
Apache Spark Memory Management
23:09
Afaque Ahmad
Рет қаралды 13 М.
Read/Write Data from Sql Database using JDBC Connector
12:08
CloudFitness
Рет қаралды 25 М.
Read/Write Data from Snowflake in Databricks
10:00
CloudFitness
Рет қаралды 15 М.