Ask Databricks - about Delta Live Tables (DLT) with Michael Armbrust

  Рет қаралды 5,495

Advancing Analytics

Advancing Analytics

Күн бұрын

Ask Databricks about Delta Live Tables. What is it? Why should you use it? What are the best practices? How do you use certain features? How does it compare to other ELT/ETL tools? What’s coming next? And more!
Ask Databricks goes beyond conventional webinars, blogs, videos, and tutorials. This new collaborative endeavor taps into the collective knowledge and experience of Databricks experts and the broader community in a live, conversational environment. It doesn’t matter whether you’re new to Databricks or a seasoned user with years of experience. All questions are welcome!

Пікірлер: 9
@lukehoughton
@lukehoughton 10 ай бұрын
Great format. Lots of new and interesting things to think about. Looking forward to the next one.
@svenerikhaberg4146
@svenerikhaberg4146 10 ай бұрын
Really great initiative, giving insight into both the product and vision of the developers. Thank you so much for sharing and putting time and effort into publishing all your great videos 👌🙏
@JD-xd3xp
@JD-xd3xp 8 ай бұрын
I want to see how my architecture diagram will look like when I put Streaming, Materialized Views, Serverless, DLT etc.. out together for ingesting data like structured and semi-structured.
@shikokas
@shikokas 10 ай бұрын
some of my key takeaways : 1. 7:38 - “we want to eventually make it possible to run DLT locally outside of Databricks” - Very important good to know ! 2. 15:50 - “we are not the only project kind of in this space ... things like DBT” - there is an ongoing question regarding DLT and DBT are they the same and what are the differences - see answer at 40:29 3. 17:12 - DLT has better auto scaling then DBX workflows - to bad that they don’t take those capabilities into workflows and allow us to enjoy them 4. 20:07 - “now we can use Unity catalog with DLT”- well ..that isn’t that simple - the limitation list is very long and Managed locations/tables aren’t supported yet 5. 22:20 - medallion model is a cool idea but totally made up ... it should be used as a very useful vocabulary for data quality - so don’t obsess about it :slightly_smiling_face: 6. 26:09 - Enzyme engine is used for materialized view incremental work - doesn’t handle all types of quires - take that into account it will be supported in the long term but no due dates.. 7. 34:12 - always prefer using expressions vs UDF’s 8. DLT Serverless will have a lot of capabilities - Note - serverless isn’t supported on all cloud vendors yet 9. to this day there isn’t an option to Debug DLT (as within a workflow notebook) without needing to run the pipeline - per the answers in the Chat this should come in the future
@MartinIsti
@MartinIsti 10 ай бұрын
It was mentioned at 14:10 that SCD2 sounds conceptually simple to start with but can get so complicated that all examples around streaming SCD2 needed to be corrected in the docs. I wonder if there is a good conceptual guide about all the possible SCD2 scenarios (with sample source and expected outcome records). Besides the usual update / insert / delete there is "deleted row reinstated" and other interesting ones.
@GerardWolfaardt
@GerardWolfaardt 10 ай бұрын
Really excited about apply changes from snapshot! Is there a timeline for this feature? I know, I know, but I had to ask!
@MartinIsti
@MartinIsti 10 ай бұрын
I can support this, it would be so great to have it available soon at least in some kind of preview!
@radosawbrygaa7420
@radosawbrygaa7420 8 ай бұрын
Great video, great series. Please mute next time when ur guest is speaking, cus the echo is super annoying - no problem ;)
@Balquo_
@Balquo_ 9 ай бұрын
I'm using SCD2 apply changes. I would like the end date to be updated if there is a hard delete in the raw data. I could use WHEN NOT MATCHED BY SOURCE outside of DLT. But within DLT, it doesn't seem possible. Is it?.
Ask Databricks - About Unity Catalog with Paul Roome
48:33
Advancing Analytics
Рет қаралды 2 М.
Behind the Hype - The Medallion Architecture Doesn't Work
21:51
Advancing Analytics
Рет қаралды 26 М.
Finger Heart - Fancy Refill (Inside Out Animation)
00:30
FASH
Рет қаралды 14 МЛН
EVOLUTION OF ICE CREAM 😱 #shorts
00:11
Savage Vlogs
Рет қаралды 4,2 МЛН
Ask Databricks about Spark Structured Streaming with Ray Zhu
57:16
Advancing Analytics
Рет қаралды 1,4 М.
Delta Live Tables A to Z: Best Practices for Modern Data Pipelines
1:27:52
Databricks Delta Lake Change Data feed - Real World Use case
42:57
Why Databricks Delta Live Tables?
16:43
Bryan Cafferky
Рет қаралды 16 М.
How to implement Databricks Delta Live Tables in three easy steps?
25:28
Rajaniesh Kaushikk
Рет қаралды 9 М.
Accelerating Data Ingestion with Databricks Autoloader
59:25
Databricks
Рет қаралды 67 М.
Advancing Spark - Databricks Delta Live Tables First Look
33:20
Advancing Analytics
Рет қаралды 41 М.