Data Validation Using Pyspark || ColumnPositionComparision ||

  439 views

DataSpark

Days ago

How to develop a PySpark function or script that compares column positions (ColumnPosition) when loading data into the raw layer from the stage or source layer.
#pyspark
#databricks
#dataanalytics
#spark
#interviewquestions
#pythonprogramming
#dataengineering
linkedin :
/ lokeswar-reddy-valluru...
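
The video's own code is not reproduced on this page, so the following is only a minimal sketch of what a column-position check could look like. It relies on the fact that a PySpark DataFrame's `columns` attribute is a plain Python list of names in order, so the comparison itself needs no Spark session. The function name `compare_column_positions` and the sample column names are hypothetical.

```python
from typing import List, Optional, Tuple


def compare_column_positions(
    reference_columns: List[str], actual_columns: List[str]
) -> List[Tuple[int, Optional[str], Optional[str]]]:
    """Return (position, expected, actual) for every position that differs."""
    mismatches = []
    # Walk up to the longer list so extra or missing columns are reported too.
    for position in range(max(len(reference_columns), len(actual_columns))):
        expected = reference_columns[position] if position < len(reference_columns) else None
        actual = actual_columns[position] if position < len(actual_columns) else None
        if expected != actual:
            mismatches.append((position, expected, actual))
    return mismatches


# With PySpark DataFrames this would be called as (assumed variable names):
#   compare_column_positions(reference_df.columns, df.columns)
reference_columns = ["id", "name", "city"]   # hypothetical expected order
swapped_columns = ["id", "city", "name"]     # same names, two positions swapped

print(compare_column_positions(reference_columns, reference_columns))  # []
print(compare_column_positions(reference_columns, swapped_columns))
# [(1, 'name', 'city'), (2, 'city', 'name')]
```

An empty result means the incoming data has its columns in the expected positions; any non-empty result could be used to fail the load before it reaches the raw layer.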

Comments: 11
@MuzicForSoul 2 months ago
Why do we have to do ColumnPositionComparision? Shouldn't the column name comparison you did earlier catch this?
@vamshimerugu6184 3 months ago
Sir, can you make a video on how to connect ADLS to Databricks using a service principal?
@DataSpark45 3 months ago
Thanks for asking, will do that one for sure.
@DuyTran-tx5jq 6 months ago
Can you do an end-to-end portfolio project, please?
@DataSpark45 6 months ago
We don't have an Azure account as of now, bro. Once we create the account, we will do it for sure. Thank you.
@DuyTran-tx5jq 6 months ago
@DataSpark45 Sure! Btw, love your content so much.
@vinothkannaramsingh8224 6 months ago
Sort both the ref and df column names alphabetically and then compare the column names? Will that be sufficient?
@DataSpark45 6 months ago
Certainly, whatever order is given in reference_df is the correct order we expect. If we sort the DataFrames' column names alphabetically, there would be a chance of failure. Thank you.
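
To illustrate the exchange above: sorting both name lists before comparing discards the positions being checked, so a reordered DataFrame still passes. This is a small sketch with hypothetical column names, not code from the video.

```python
# Hypothetical column lists; in PySpark these would be reference_df.columns and df.columns.
reference_columns = ["id", "name", "city"]
incoming_columns = ["id", "city", "name"]  # same names, different positions

# Sorting both sides throws away the original order before comparing.
sorted_match = sorted(reference_columns) == sorted(incoming_columns)

# Comparing the lists as they are keeps each name tied to its position.
positional_match = reference_columns == incoming_columns

print(sorted_match)      # True
print(positional_match)  # False
```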
@MuzicForSoul 2 months ago
Sir, can you please also show us a failing run? You are only showing the passing case. When I tested by swapping the columns in the DataFrame, it still did not fail, because the set still has them in the same order.
@DataSpark45 2 months ago
The set values come from the reference df, so they are always constant.
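
The code discussed in this thread is not shown here, so the following is only a guess at the behaviour the commenter describes. A Python `set` carries no positional information, so a check built on set equality passes even when columns are swapped, while a list comparison fails as expected. Both helper names are hypothetical.

```python
def names_match_via_set(reference_columns, actual_columns):
    # Sets ignore order, so this only checks that the same names are present.
    return set(reference_columns) == set(actual_columns)


def positions_match_via_list(reference_columns, actual_columns):
    # Lists compare element by element, so order matters here.
    return list(reference_columns) == list(actual_columns)


reference_columns = ["id", "name", "city"]  # hypothetical reference order
swapped_columns = ["city", "name", "id"]    # a deliberately failing case

print(names_match_via_set(reference_columns, swapped_columns))       # True
print(positions_match_via_list(reference_columns, swapped_columns))  # False
```

Running a swapped case like this one is a quick way to confirm that a position check can actually fail.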