[100% Interview Question] Cache and Persist in Spark

  Рет қаралды 5,569

Learnomate Technologies

Learnomate Technologies

Күн бұрын

Пікірлер: 9
@balajichandramohan9707
@balajichandramohan9707 6 ай бұрын
Hi sir, this is topic related to Oracle.
@DS-bo5wu
@DS-bo5wu 4 жыл бұрын
Hi Ankush, Thanks for the video, I have one query. suppose if I am using Persist(StorageLevel.DISK_ONLY), then how will it improve Spark application performance because if this application will need this data again then it will have to read from DISK only, so there will be more I/O operations with the disks and as we all know spark doesn't do unnecessary I/O operations with the disks and it is the main reason why Spark is better than MapReduce.
@learnomate
@learnomate 4 жыл бұрын
Simple example - you may have one relatively great RDD rdd1 and one smalled RDD rdd2. You want to store both of them. If you apply persist MEMORY_AND_DISK on both, then both of them will be spilled to disk resulting in slower reaed. But you may take a different approach - you may store rdd1 with DISK_ONLY. It may just so happen that thanks to this move you can store rdd2 right in the memory with cache() option and you will be able to read it faster.
@DS-bo5wu
@DS-bo5wu 4 жыл бұрын
@@learnomate Thanks for the clarification
@pardeep657
@pardeep657 4 жыл бұрын
Hi Ankush, how long the cached data will survive in memory, does it automatically gets removed when the session ends?
@Ady_Sr
@Ady_Sr Жыл бұрын
yes it does if you dont un cache it manually
@mani.kandan4020
@mani.kandan4020 4 жыл бұрын
Nice video bro ....... I'm from tamil nadu
@mani.kandan4020
@mani.kandan4020 4 жыл бұрын
Make hbase video bro
@rohinidhorje8269
@rohinidhorje8269 Жыл бұрын
Aws step function
[100% Interview Question] Broadcast Join Spark | Increase  Spark Join Performance
6:59
23. Databricks | Spark | Cache vs Persist | Interview Question | Performance Tuning
18:56
У вас там какие таланты ?😂
00:19
Карина Хафизова
Рет қаралды 21 МЛН
Real Man relocate to Remote Controlled Car 👨🏻➡️🚙🕹️ #builderc
00:24
Random Emoji Beatbox Challenge #beatbox #tiktok
00:47
BeatboxJCOP
Рет қаралды 55 МЛН
Elza love to eat chiken🍗⚡ #dog #pets
00:17
ElzaDog
Рет қаралды 22 МЛН
Oracle Dataguard Health Check Tricks and Tips
12:54
Learnomate Technologies
Рет қаралды 11 М.
Spark Performance Tuning | Handling DATA Skewness | Interview Question
16:08
Spark  - Repartition Or  Coalesce
10:02
Data Engineering
Рет қаралды 19 М.
Spark Performance Tuning | EXECUTOR Tuning | Interview Question
18:19
TechWithViresh
Рет қаралды 32 М.
У вас там какие таланты ?😂
00:19
Карина Хафизова
Рет қаралды 21 МЛН