Data Intelligence Day Hong Kong 2024
1:30
Пікірлер
@paulfunigga
@paulfunigga 31 минут бұрын
You found the best speaker for this talk...
@ame0589
@ame0589 10 сағат бұрын
Where can I find this example notebook?
@blindyogi4997
@blindyogi4997 19 сағат бұрын
isnt that just Redash with LLM search?
@Databricks
@Databricks 8 сағат бұрын
No, there's a lot of new things with Lakeview that weren't possible in Redash; performance and sharing being the biggest changes
@michelle4468
@michelle4468 22 сағат бұрын
21:45 how to get yourself in a good position to use Databricks Lakehouse: 1) store data in data lake, 2) pick a standardized open format (like parquet or ORC), 3) figure out one of the technologies that are building blocks for building lakehouses (like Open-Source Project Delta Lake)
@michelle4468
@michelle4468 22 сағат бұрын
20:33 once you can enable people to do online transaction processing directly on the lakehouse, that's gonna be a major technological breakthrough, and he'd be shocked if they weren't there in five years ~listening to this 3.5 years later, where is DB on this?
@michelle4468
@michelle4468 22 сағат бұрын
16:30 Next issue to solve: awareness of lakehouse paradigm. There's a big shift of people moving from on-prem to the cloud. "And a lot of them are tempted to rebuild the same architectural pattern they had in the on-prem, into the cloud." Which doesn't really buy that much. People need to be reeducated on the lakehouse paradigm, and it needs to be spread wide.
@michelle4468
@michelle4468 22 сағат бұрын
12:45 "If you look at data science and machine-learning tools, they're not built on top of SQL. So enable that downstream use-case." A data lakehouse = open-data lake + downstream data science + Business Intelligence (BI)
@michelle4468
@michelle4468 22 сағат бұрын
11:37 "many projects on top of data lakes were failing, they were pulling us in to do professional services to fix the problems they had with the data lakes. So we just wanted to sort of fix it, automate it with software ones once and for all.."
@michelle4468
@michelle4468 22 сағат бұрын
41:51 "we need to change our attitude from being gatekeepers of systems, to being shopkeepers keepers of data...how do we actually provide data to people with sufficient governance, for them to be able to use it with flexibility, but with a light enough touch that we're not over managing the system?"
@michelle4468
@michelle4468 23 сағат бұрын
44:16 Getting beyond techno-chauvinism and the importance of the personal aspects in data science.
@michelle4468
@michelle4468 23 сағат бұрын
27:19 Trying to drive people to a data-driven culture and to understand the language of data and it's getting better , but it's not perfect.
@michelle4468
@michelle4468 23 сағат бұрын
4:11 Someone in her network tested her on her analytical skills and when she proved a question not to be accurate, she was hired, she saw the hole in the question.
@Prashanth-yj6qx
@Prashanth-yj6qx Күн бұрын
Can anyone tell me why he reduced target size to 100 mB from 200mb
@zombieeplays3146
@zombieeplays3146 Күн бұрын
So good still I can't center the dashboard title 😥
@Databricks
@Databricks 8 сағат бұрын
Feature request raised 👍
@zombieeplays3146
@zombieeplays3146 8 сағат бұрын
@@Databricks yeah let it be like markdown in Jupyter Notebooks 😅
@Databricks
@Databricks 4 сағат бұрын
Sorry for the confusion: The titles are already markdown and I show it at 0:51 for a brief few seconds, but it's basic markdown so you can't centre it within a text panel. What you can do is centre the text panel so it's at least semi centred. Holly
@anandahs6078
@anandahs6078 Күн бұрын
Great feature, thanks for sharing
@lostfrequency89
@lostfrequency89 Күн бұрын
For notebooks should we use even integrate github or we can use dabs for that matter ? I’m kinda confused
@georges7298
@georges7298 2 күн бұрын
Fantastic DLT and pipeline training! well done!. Is there a github project with a complete version of the example codes shown in this video?
@gameversemaster
@gameversemaster 3 күн бұрын
cool
@SpartanPanda
@SpartanPanda 3 күн бұрын
Great video.. complete coverage of a real business usecase..well explained
@shaileshdhumma9096
@shaileshdhumma9096 3 күн бұрын
great job !
@gustyflores
@gustyflores 3 күн бұрын
great! thank you
@sheelstera
@sheelstera 4 күн бұрын
i dont see the system.compute tables at all in my Azure Databricks workspace.. what could be the reason?
@Databricks
@Databricks Күн бұрын
Two reasons I can think of, 1) you have to have Unity Catalog, it's where the compute comes from to deliver all the data, 2) you have to enable it with the API, to do that you'll need to be an Admin. Holly
@aliaksandr2336
@aliaksandr2336 4 күн бұрын
and why it better than usual %sql ?
@Databricks
@Databricks Күн бұрын
Hi Ali, %sql makes the language for the cell SQL, which is useful if you're switching between languages. However, if you want to be SQL the whole way through and not use python then you can use Execute Immediate to build your dynamic queries. Holly
@dusk4377
@dusk4377 8 сағат бұрын
@@Databricks why not just change the notebook to sql?
@Databricks
@Databricks 4 сағат бұрын
@@dusk4377 this is a SQL only feature and can be used to replace Python variables so your code can be 100% SQL. Hope that's clearer, Holly
@ericsims3368
@ericsims3368 4 күн бұрын
I love Databricks Assistant and use it pretty much every day at work. A few days ago I knocked something out in 15min that would have taken me more than an hour if I had done it on my own.
@samirelzein1095
@samirelzein1095 5 күн бұрын
great job!
@syndicatedmaps
@syndicatedmaps 5 күн бұрын
Do you have any map data examples?
@ravisaxena1599
@ravisaxena1599 5 күн бұрын
You sounds pretty as per playback speed 0.75x 😊
@toniolora9226
@toniolora9226 5 күн бұрын
Do you have an example on Azure?
@erukullasrikanth15
@erukullasrikanth15 5 күн бұрын
What is the additional advantage we get by using overwatch when compared using uc system tabled
@elziolima6918
@elziolima6918 5 күн бұрын
Data Lake is a component of Data Pipeline?
@elziolima6918
@elziolima6918 5 күн бұрын
First.
@stefanxhunga1681
@stefanxhunga1681 6 күн бұрын
✅Interesting posts by Databricks
@tarshmidha5879
@tarshmidha5879 6 күн бұрын
what's the link to data preparation video that was mentioned and worked on by data engineering person?
@LearnWithDummy
@LearnWithDummy 6 күн бұрын
strange, 1/ Bronze: Loading data from blob storage , and path is from S3? am i missing something here?
@zombieeplays3146
@zombieeplays3146 7 күн бұрын
Much needed feature. I had to use PySpark just to achieve this earlier.
@erkoo2554
@erkoo2554 7 күн бұрын
Jai Shree Ram
@goodstuff5666
@goodstuff5666 7 күн бұрын
Very nice tutorial! Could you guys share the slides? Thanks.
@shankhadeepghosal731
@shankhadeepghosal731 8 күн бұрын
I want do build a work flow where 2nd notebook should run only when a certain table count is more than 0 in notebook 1. How?
@dilipjha08
@dilipjha08 8 күн бұрын
Thanks for knowledge sharing to the technology user. It was very details about the dlt as well as streaming tables and comprison between it and demo of the topic was very perfect.
@vasanthbloginfo
@vasanthbloginfo 9 күн бұрын
Great talk and very useful info
@pritamdodeja
@pritamdodeja 9 күн бұрын
This is gold!
@stopthink9000
@stopthink9000 10 күн бұрын
Very interesting! I wonder, in the AutoML example at 19:47 why would they have used a "double" data type for age? The robot overlords must already be planning ahead lol!
@harshtrivedi700
@harshtrivedi700 12 күн бұрын
didnt see the notebooks where are the usually deployed?
@elenavi2016
@elenavi2016 13 күн бұрын
outdated, it is useless and just wasting time. delete the video
@NoahPitts713
@NoahPitts713 13 күн бұрын
Hoping to implement this on my current project soon!
@AmineHosni-
@AmineHosni- 13 күн бұрын
Can we use Delta Sharing on Materialized Views that were defined in Delta Live Tables?
@booyaaaaaaa
@booyaaaaaaa 14 күн бұрын
Can you add VIM support for notebooks?
@hasski
@hasski 14 күн бұрын
Marred by the loud pointless music
@hkjpotato
@hkjpotato 14 күн бұрын
why can’t we just show a visualization on the ui about this result?
@Databricks
@Databricks 14 күн бұрын
Hi There! Our experience is that most people want to be able to highly customise their visualisations. The tables are designed in a way to allow for flexibility. At 0:25 when you see the results, there's a + next to the table; that allows you to make a visualisation from this data, you can add it to a dashboard and then schedule and share it as frequently as you'd like - Holly
@zombieeplays3146
@zombieeplays3146 15 күн бұрын
Recently used these for Lineage and Audit, looks cool