Databricks, Delta Lake and You

  Рет қаралды 19,206

SQLBits

SQLBits

3 жыл бұрын

Databricks, Lakes & Parquet are a match made in heaven, but explode with extra power when using Delta Lake. This session will dive into the details of how Databricks Delta works and how to make the most of it.
Speaker: Simon Whiteley SQLbits.com/speakers/Simon_Wh...
SQLbits.com/Sessions/Databrick...
Tags: Optimising,Developing,Managing,Cloud,Databricks,Python,Spark,Data Lake,Big data analytics,Modern Analytics,delta lake

Пікірлер: 29
@ericbegg8727
@ericbegg8727 Жыл бұрын
Simon - you couldn't be any better at explaining all these concepts. Thanks again
@nithints302
@nithints302 Жыл бұрын
Accidentally hopped on to your channel I can listen for the whole day
@Markttt5
@Markttt5 2 жыл бұрын
Hey Simon, if I was still based in the UK, I’d be knocking on your door and handing you a beer. Fantastic video (again) - this is going to help me help my organisation so much. You are by far, the best speaker, most passionate dude about data that I watch on KZbin. Many thanks.
@mdzakariabarbhuiya1608
@mdzakariabarbhuiya1608 3 жыл бұрын
This is one of the best explanation on Delta Lake!!
@andycarter9845
@andycarter9845 2 жыл бұрын
Deserves many more thousands of views. Fantastically clear.
@chandraxg1
@chandraxg1 Жыл бұрын
Simon... thank you so much for an excellent video...
@sunnysoni88
@sunnysoni88 3 жыл бұрын
I have never seen such a clear video for Delta Lake, This is just great stuff. My understanding of Delta Lake is so well now, Thanks for sharing your knowledge
@adityajakka9856
@adityajakka9856 2 жыл бұрын
Great job explaining the Delta Lake, Simon. I thought you did a fantastic job with your slides and working examples. That's exactly how I look to learn new data concepts. More power to you, mate :)
@RodrigoBocanegraCruz
@RodrigoBocanegraCruz 2 жыл бұрын
Great video. Thanks Simon!
@nimesharya909
@nimesharya909 2 жыл бұрын
awesome video, precise , clear and with easy to understand examples
@tj_lee
@tj_lee 2 жыл бұрын
Great content, help clarifies a lot on delta tables!
@Boompiee
@Boompiee 2 жыл бұрын
Great video as usual Simon, thank you very much!
@manideepatalukdar9201
@manideepatalukdar9201 2 жыл бұрын
Thanks you so much! This is such a clear explanation of Delta concepts!
@jeevanb8623
@jeevanb8623 2 жыл бұрын
Beautifully Explained...
@esoterictime
@esoterictime 3 жыл бұрын
Quality stuff. Subscribed!
@denermoreira15
@denermoreira15 Жыл бұрын
just amazing
@siddhu1076
@siddhu1076 2 жыл бұрын
Wonderful explanation 🙂👍
@simonheath8701
@simonheath8701 2 жыл бұрын
I'm new to DataBricks and found this as my first video when searching. What a Gem. Haven't bothered to watch any others as it was such a great journey. As someone who spent over 30 years using SQL and saw all the big data stuff from afar I was thinking they are basically using unindexed flat files with a 16 node server cluster... hmmn, that's not advancement. Seeing how they added SQL, journalling and transaction management - I wonder how long it will take them to add indexes and create a block structured database ;)
@SQLBits
@SQLBits 2 жыл бұрын
Lovely to hear, thank you simon! I am sure the team at kzbin.info/door/mRI-X6XoeH2dQE4BShRU9Q will love to hear this!
@mohammedsafiahmed1639
@mohammedsafiahmed1639 Жыл бұрын
hey simon, when you say block structured database, you mean in opposition to traditional rdbms like sql server which are page structured db, right?
@murtazajabalpurwala8124
@murtazajabalpurwala8124 2 жыл бұрын
Very nice video. One of the best videos for understanding the data lake related complex issues. One recommendation is sound audibility should be improved. Thanks again for the amazing video
@SQLBits
@SQLBits 2 жыл бұрын
Thank you for sharing your opinion! All these sessions are recorded LIVE at SQLBits in front of a crowd, so we do apologize if the audio isn't of the best quality!
@Knigh7z
@Knigh7z 2 жыл бұрын
The warehouse is also generally optimised for concurrent queries over many consumers which lake tools like Spark are not and is where Databricks SQL is closing the gap.
@kcbonzer
@kcbonzer 2 жыл бұрын
Hello Simon, this is one of the most lucid videos I have come across. Thank you conveying the message in a very simple manner. I am curious to know what you take is on Snowflake vs Databricks ! Ingestion, Storage, Architecture, Performance & Cost based comparison. A professional, unbiased & candid opinion, if you will :)
@SQLBits
@SQLBits 2 жыл бұрын
Hey! He has a video out on our channel about 'Databricks VS Synapse Analytics' if that's something your interested in (We'll let him know about Snowflake!) kzbin.info/www/bejne/fJvWn4mrmr2coLM
@bobhaffner5902
@bobhaffner5902 3 жыл бұрын
Hi Simon, great video! Hey, do you know if updating a delta table via Synapse Spark NB is supported?
@SQLBits
@SQLBits 2 жыл бұрын
Hey Bob, Simon has his own YouTUbe Channel here if it has any content you are interested in! - kzbin.info/door/mRI-X6XoeH2dQE4BShRU9Q
@RodrigoBocanegraCruz
@RodrigoBocanegraCruz 2 жыл бұрын
Hi, is it delta suitable for tracking data changes overtime, like for examples every day? Or is it more suitable for tracking transformation changes in a given dataset? I have read is more about the second but want to check. Thanks!
@mohammedsafiahmed1639
@mohammedsafiahmed1639 Жыл бұрын
deltra records every single transaction that happens to a table in the delta log. Every time a transaction happens like an update, insert delete or merge, it gets recorded as a json in the delta log. And it gets a version number. Updates and deletes do not physically update and delete the files, but just update the transaction log. This gives you the ability to travel back per transaction basis. Its pretty cool.
Azure Data Factory patterns and best practices
50:24
SQLBits
Рет қаралды 10 М.
The Azure Spark Showdown - Databricks VS Synapse Analytics
49:18
Best KFC Homemade For My Son #cooking #shorts
00:58
BANKII
Рет қаралды 62 МЛН
Как бесплатно замутить iphone 15 pro max
00:59
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 7 МЛН
A clash of kindness and indifference #shorts
00:17
Fabiosa Best Lifehacks
Рет қаралды 126 МЛН
Delta Live Tables A to Z: Best Practices for Modern Data Pipelines
1:27:52
Azure Databricks is Easier Than You Think
1:16:19
Atmosera
Рет қаралды 34 М.
Advancing Spark - Databricks Delta Change Feed
17:01
Advancing Analytics
Рет қаралды 14 М.
Data Warehouse, Data Lake and Lakehouse which is better??
14:14
CloudFitness
Рет қаралды 9 М.
Advancing Spark - Give your Delta Lake a boost with Z-Ordering
20:31
Advancing Analytics
Рет қаралды 27 М.
Azure Data Lake Design and Implementation Patterns
1:10:05
DesignMind
Рет қаралды 21 М.
$1 vs $100,000 Slow Motion Camera!
0:44
Hafu Go
Рет қаралды 28 МЛН
Samsung laughing on iPhone #techbyakram
0:12
Tech by Akram
Рет қаралды 4,1 МЛН
Опасность фирменной зарядки Apple
0:57
SuperCrastan
Рет қаралды 8 МЛН