Ask Databricks - About Unity Catalog with Paul Roome

  2,174 views

Advancing Analytics


Days ago

Comments: 5
@Markttt5, 1 year ago
These Ask Databricks sessions are great Simon. Content/subjects/questions being covered are spot on. Thanks for all your efforts.
@allthingsdata, 1 year ago
The main design flaw I see in UC is the handling of external vs managed tables. If I create a schema without an external storage location, tables are created as managed tables in the default UC location. If I create a schema with an external storage location and then create a table under it (without explicitly giving a location), I would assume that the table is created in the directory of the schema. But UC creates a managed table in subdirectories with random IDs, e.g. `schema_dir/__unitystorage/schemas//tables//`. This makes it impossible to know the location deterministically, and other tools that want to access the table need to look up the "managed" location. In Hive, tables were also created as managed, but in the schema directory. I can only speculate as to why Databricks introduced these random structures, but conceptually it doesn't make sense to me. I would expect all tables created under a schema with a storage location to be created as external tables under the very directory that I have already provided for the schema. They constantly talk about openness, but if every tool needs to go through UC to get the location of a table, it's not really open. And yes, I know that you can always simply provide the location during every table creation, but that is not (business) user-friendly, safe or intuitive. I wonder what @AdvancingAnalytics thinks about this. How would you use external tables and avoid always having to provide the location? Thanks.
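One possible way to avoid typing the location by hand for every table is a small helper that derives it from a base path by convention. The sketch below is only an illustration, not something from the video. The function name `external_table_ddl`, the storage path and the table names are all hypothetical. It also assumes the base path sits under a Unity Catalog external location where external tables are allowed. It only builds the SQL string; running it (for example via `spark.sql`) is left to the caller.

```python
def external_table_ddl(base_location, catalog, schema, table, columns):
    """Build a CREATE TABLE statement with a deterministic LOCATION.

    Hypothetical helper: the location is always <base_location>/<table>,
    so other tools can compute the path without asking the catalog.
    """
    # Normalise the base path so we never emit a double slash.
    location = f"{base_location.rstrip('/')}/{table}"
    # Render the column list from a {name: type} mapping (insertion order kept).
    cols = ", ".join(f"{name} {dtype}" for name, dtype in columns.items())
    return (
        f"CREATE TABLE IF NOT EXISTS {catalog}.{schema}.{table} ({cols}) "
        f"USING DELTA LOCATION '{location}'"
    )


if __name__ == "__main__":
    # Assumed example values; replace with your own catalog, schema and path.
    ddl = external_table_ddl(
        base_location="abfss://lake@example.dfs.core.windows.net/sales/",
        catalog="main",
        schema="sales",
        table="orders",
        columns={"order_id": "BIGINT", "amount": "DOUBLE"},
    )
    print(ddl)
```

Providing a `LOCATION` is what makes the table external, so the path stays predictable, at the cost of giving up managed-table features.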
@allthingsdata, 10 months ago
One reason the managed location has random IDs is the soft delete that enables the UNDROP feature.
1 year ago
Simon, please generate subtitles :) Thank you!
@AdvancingAnalytics, 1 year ago
They're generated now! It always takes a little while for YouTube to catch up :)