Handle Deduplication While Loading CSV File Into Snowflake | Ch-07 | Snowflake Data Loading Approach

  Рет қаралды 5,901

Data Engineering Simplified

Data Engineering Simplified

Күн бұрын

Пікірлер: 22
@ARUNKUMAR-qy2jt
@ARUNKUMAR-qy2jt 5 ай бұрын
how to achieve this in s3 bucket? we won't load duplicates from s3 bucket to staging area
@mayanknema3007
@mayanknema3007 2 жыл бұрын
Nice Video. Covered the in depth of duplication. Grear work..!!
@DataEngineering
@DataEngineering 2 жыл бұрын
Glad you liked it!
@arindammitra3975
@arindammitra3975 Жыл бұрын
Very nice presentation. Helped me a lot. Thank you.
@DataEngineering
@DataEngineering Жыл бұрын
Glad you linked the content... if you liked this.. you will also like this new series..end to end ETL using Snowpark kzbin.info/www/bejne/Z5umamuOhtx1kNk
@siddhipatodia473
@siddhipatodia473 11 ай бұрын
This approach works if we set up snowpipe as well?
@vvenakatesh
@vvenakatesh Жыл бұрын
i am loading same data and using different file name and same records, it will load and showing duplicate records in user table, how to avoid this kind of issue
@sanjeev2012delhi
@sanjeev2012delhi 2 жыл бұрын
Great learning plateform, really enjoyed😊😊
@DataEngineering
@DataEngineering 2 жыл бұрын
Glad you liked it
@pragyakhare1762
@pragyakhare1762 2 жыл бұрын
Great video! Any thoughts how can we check if data being loaded is not already in the table? and load only new data?
@DataEngineering
@DataEngineering 2 жыл бұрын
Would you like to check while loading new data set or you just want to check if data is loaded previously or not, for that you can check using copy history information schema. [Watch information schema chapter in my master playlist kzbin.info/www/bejne/oaK9g4ule8h-pZI]
@mohammedvahid5099
@mohammedvahid5099 10 ай бұрын
Excellent 👌 sir 🎉thnk u lot❤
@DataEngineering
@DataEngineering 10 ай бұрын
Most welcome
@hemaammu2337
@hemaammu2337 Жыл бұрын
Hi sir, this helped me alot..thank you so much...one question- suppose if there are any server issues while loading the file and the file is loaded partially...if we try to reload the same file again , will it take the remaining unloaded data in file..?? So how can we handle ..? please help me with this
@DataEngineering
@DataEngineering Жыл бұрын
if the data is loaded from a file, partially or fully.. unless you change the file. (the MD5 hash, that is being tracked by snowflake).. snowflake will not pick it up... and if you fix the problem.. the duplicate records must be handled manually... there is no automated way..
@Vishnugondi
@Vishnugondi 2 жыл бұрын
Is there any dump’s available for snow pro certification exam sir?
@DataEngineering
@DataEngineering 2 жыл бұрын
I am not sure about dump.. but you can go through this playlist to test your knowledge. Many folks cleared the exam by following this playlist along side snowflake documentation..hope this will also help. kzbin.info/aero/PLba2xJ7yxHB5X2CMe7qZZu-V4LxNE1HbF
@Vishnugondi
@Vishnugondi 2 жыл бұрын
@@DataEngineering Thanks for your kind support sir..❤️
@gujjarisravan1746
@gujjarisravan1746 2 жыл бұрын
Hi Sir this video is helped me a lot thank you for your effort could u upload unit testing for snowflake tables like we need to do unit test between before transformation table to after transformation table.
@DataEngineering
@DataEngineering 2 жыл бұрын
We will try
@mayanknema3007
@mayanknema3007 2 жыл бұрын
Just one quick question - Under select distinct command - @~/ch07/csv/01_user_data.csvWhat is 't' here? You can refer this @ 5.10 Min.
@DataEngineering
@DataEngineering 2 жыл бұрын
t is an alias.. you can have any name like tbl..
Validate Data Before Loading Into Snowflake | Ch-08 | Snowflake Data Loading Approach
22:46
Null Handling In Snowflake | Snowflake Data Loading Consideration | Ch-09
21:44
Data Engineering Simplified
Рет қаралды 6 М.
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН
So Cute 🥰 who is better?
00:15
dednahype
Рет қаралды 19 МЛН
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
Snowflake Snowpipe - Email Alert Mechanism
22:54
Knowledge Amplifier
Рет қаралды 6 М.
Snowflake Cost Optimization Strategies | 7 Tips To Reduce Your Snowflake Costs
26:09
Data Engineering Simplified
Рет қаралды 5 М.
Load CSV data from Azure data lake to Snowflake tables
10:46
The Education Machine
Рет қаралды 11 М.
Snowflake Cache Concepts | Sample Questions | SnowPro Certification
27:01
Data Engineering Simplified
Рет қаралды 18 М.
Best Practices For Loading Data Into Snowflake | Snowflake Tutorial | Ch-11
22:19
Data Engineering Simplified
Рет қаралды 5 М.
Discover How Primary and Unique Key Constraints Work in Snowflake?
20:45
Data Engineering Simplified
Рет қаралды 3,8 М.
ETL Workflow In Snowflake | Chapter-19 | Snowflake Hands-on Tutorial
1:01:33
Data Engineering Simplified
Рет қаралды 127 М.
Snowflake Data Loading | Csv Files | PSV Files in Snowflake
37:39
Praveen Kumar Bommisetty
Рет қаралды 316
Quilt Challenge, No Skills, Just Luck#Funnyfamily #Partygames #Funny
00:32
Family Games Media
Рет қаралды 55 МЛН