Databases Vs Data Warehouses Vs Data Lakes - What Is The Difference And Why Should You Care?

  Рет қаралды 81,730

Seattle Data Guy

Seattle Data Guy

Күн бұрын

Пікірлер: 58
@wilsonman8661
@wilsonman8661 Жыл бұрын
Hey, really appreciate this video. If I could summarize, it sounds like: - (transactional) databases are generally closer to the data generation source and tend to be closer to operations - data warehouses are further downstream of the transactional databases and have usually gone through some pre-processing to make it more accessible for downstream usage (ie: analytics, machine learning, etc.) - data lakes are kind of a catch all storage method for your data that may require a little more technical knowledge and effort to access
@SeattleDataGuy
@SeattleDataGuy 11 ай бұрын
Glad you liked it, hopefully people find your summary helpful
@Milhouse77BS
@Milhouse77BS 10 ай бұрын
7:05 might be time to mention Dr. Ralph Kimball’s contributions to dimensional data warehouse design.
@endpermia
@endpermia Жыл бұрын
Awesome video. I am prepping for an interview for my dream job and this helped me so much. Thank you!
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
glad it helped!
@sng9x
@sng9x 2 жыл бұрын
Great video to compare the differences among the 3 types and their general use cases; it is very helpful to help me identify which type I'm dealing with on my job. Their definitions have always been debatable because their use cases vary a lot by how companies define them for their projects.
@yves.dantas
@yves.dantas 2 жыл бұрын
Nice video! One thing that I noticed is none of the content creators (relates to data science) have been talking about technologies like Druid or Clickhouse. Im a telecom engineer and radio access network data is massive, we use Clickhouse to save performance counters and Presto+S3 for taking network configurations snapshots. Teams for other countries use druid, really nice tools not so mentioned here on youtube
@malikmudassarawan
@malikmudassarawan 10 ай бұрын
Boy I love the way you say Seattle data guy
@SeattleDataGuy
@SeattleDataGuy 9 ай бұрын
oh hahaah
@arahso
@arahso Жыл бұрын
data warehouses represent a centralized location for storing data assets from various other sources where the centralization allows data experts to answer business and analytics questions with a 360 view of data that the company has. Often the underlying format of the data is based on the analytical engine of the warehouse chosen. Whether your warehouse is row-based or columnar or just files is decision made by the engine responsible for handling load/insert/query operations. You can have a warehouse that doesn't leverage star schema or snowflake design and still call it a warehouse albeit probably not one that is efficient to analyze.
@BJTangerine
@BJTangerine 2 жыл бұрын
I always thought 'database' was just an umbrella term for referring to any storage thing which stores data, whether its a relational, non-relational, object, etc. type database.
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
I should have said transactional databases
@muzahmad2104
@muzahmad2104 Жыл бұрын
Nice video, might be useful to show examples of each at the end.
@MahmoudAziz
@MahmoudAziz 6 ай бұрын
You made it super easy, thanks heaps!
@SeattleDataGuy
@SeattleDataGuy 4 ай бұрын
You're welcome!
@bantuandproud8456
@bantuandproud8456 Жыл бұрын
Thank you for this great content. How to reach out if I have other questions? I just got certified data warehouse engineer, so, I'm new to this but I have a good knowledge of the whole concept.
@oyindamolavictor
@oyindamolavictor 2 жыл бұрын
Very interesting guide... Was stuck on a decision earlier on what approach to take but I guess my uncertainty was a result of the evolving use cases and requirements.... Awesome explanation here💯
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
Glad it was helpful!
@kaischmid9118
@kaischmid9118 2 жыл бұрын
What is the advantage of snapshots in a data warehouse instead of just saving a copy of the database each period? Also, you can use these separate copies for analytics without interfering with the transaction DB version.
@TJInTech10
@TJInTech10 5 ай бұрын
thx for breaking it down
@SeattleDataGuy
@SeattleDataGuy 5 ай бұрын
glad you found it helper!
@TJInTech10
@TJInTech10 5 ай бұрын
@@SeattleDataGuy yes, thx , I'm trying to understand how Knowledge graph/Vector DB's will integrate into this too, is it safe to assume both will be essential pieces of the enterprise ai layer/stack now being invested in heavily, or do you see one being more relevant in next 2-5 yrs?
@bwdelisle
@bwdelisle Ай бұрын
Great review.
@SeattleDataGuy
@SeattleDataGuy Ай бұрын
Glad you found it helpful!
@freddiepalmgren
@freddiepalmgren Жыл бұрын
So if you have a lot of document journals that you need to like archived but accessible for read access. Would you recommend a wear house instead of a lake?
@AnishBhola
@AnishBhola 2 жыл бұрын
Hey Ben! when you say row oriented data warehouse, it caught my attention and I tried to look it up on google but did not get any satisfactory results. Could you elaborate on this term? what are the use cases these address? Why do they exist in the first place?
@Milhouse77BS
@Milhouse77BS 10 ай бұрын
9:06 a well designed star schema aka dimensional model is quite easy to add new facts or dimensions. Opposite of rigid, if designed with shared dimensions in mind. See Kimball.
@mahmoudfadaly8074
@mahmoudfadaly8074 10 ай бұрын
i would appreciate it if u talk in much slower rate to be able to catch these valuable information, I tried to put the video sppeed on 0.75
@garynico9872
@garynico9872 2 жыл бұрын
what's your opinion on Databricks?
@SeattleDataGuy
@SeattleDataGuy 2 жыл бұрын
In terms of data lakehouses?
@milanthakkar9493
@milanthakkar9493 2 жыл бұрын
@@SeattleDataGuy not op but that would be great!
@jhonnafg
@jhonnafg Жыл бұрын
Can you tells how you switch from data analyst to data engineering in your 2 years of being a data analyst, what did you expose your self first into, is it going to be mastering python and SQL then etl? Thank you
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
THanks for the comment. By chance have you watched my video about this topic? kzbin.info/www/bejne/onjdmWCHmtOXgMU If this wasn't helpful happy to answer more questions
@gilbertoycosta
@gilbertoycosta Жыл бұрын
Great video.
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
Glad you liked it!
@andresdigi25
@andresdigi25 Жыл бұрын
At my company they treat data stores as the new shiny mirror. Nobody really knows what are the limits and the use cases for the different options
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
Its always the next shiny thing
@poizentv
@poizentv 10 ай бұрын
Thanks, can you become a Data Warehouse engineer without learning programming? I just want to learn SQL
@DP-md4jf
@DP-md4jf Жыл бұрын
Amamzing thank u
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
you're welcome!
@carlnascnyc
@carlnascnyc 2 жыл бұрын
Great and informative video, what about datalakehouses? Thanks!!
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
Great suggestion! I should do that one next
@ageektothepast2912
@ageektothepast2912 2 жыл бұрын
Listening to the data lake explanation all i could think about was the old AS400 XD
@jaradj876
@jaradj876 Жыл бұрын
If your company needs to process transactions quickly, but you also need reporting, then wouldn’t you have BOTH OLAP and OLTP databases?? Instead of picking one or the other??
@willi1978
@willi1978 Жыл бұрын
operational systems (crm, erp) are usually transactional databases. to make reporting more efficient and not bringing down the operational system with reporting a data warehouse is usually created. the data warehouse could be a olap database but i have seen oltp databases in use for that too
@brothermalcolm
@brothermalcolm 2 жыл бұрын
@12:00 data lakes
@SeattleDataGuy
@SeattleDataGuy 2 жыл бұрын
Thanks for the timestamp
@SeattleDataGuy
@SeattleDataGuy 2 жыл бұрын
If you guys want to learn more about data engineering, then sign up for my newsletter here seattledataguy.substack.com/ or join the discord here discord.gg/2yRJq7Eg3k
@willi1978
@willi1978 Жыл бұрын
the data warehouses i worked with were all not columnar
@SeattleDataGuy
@SeattleDataGuy 9 ай бұрын
yeah I have seen a lot of SQL servers used for DWs
@dn9416
@dn9416 2 жыл бұрын
#data #$$$ #analytics
@SeattleDataGuy
@SeattleDataGuy 2 жыл бұрын
#$$$ data
@nandinisugandhi748
@nandinisugandhi748 2 жыл бұрын
What is this code😶
@ryanrodriguez1234
@ryanrodriguez1234 Жыл бұрын
It’s like you’re speaking a different language 😅 I have no idea about whatever this is.
@SeattleDataGuy
@SeattleDataGuy Жыл бұрын
what would help it be clearer? Let me know!
@tejasudha
@tejasudha 2 ай бұрын
Wtf is this?
@mhalton
@mhalton 6 ай бұрын
What a shit definition of "warehouse" by Bill Inmon!
Data Engineering Course | Become A Data Engineer | Intellipaat
8:10:10
ТВОИ РОДИТЕЛИ И ЧЕЛОВЕК ПАУК 😂#shorts
00:59
BATEK_OFFICIAL
Рет қаралды 5 МЛН
When u fight over the armrest
00:41
Adam W
Рет қаралды 30 МЛН
Happy birthday to you by Secret Vlog
00:12
Secret Vlog
Рет қаралды 6 МЛН
Ice Cream or Surprise Trip Around the World?
00:31
Hungry FAM
Рет қаралды 20 МЛН
08 Understanding Word2Vec Part 1
23:40
Financial Data Analytics
Рет қаралды 4
Data Warehouse vs Data Lake vs Data Lakehouse
9:32
Jesper Lowgren
Рет қаралды 51 М.
Data Lake VS Data Warehouse VS Data Marts | CodeLearnX
12:07
CodeLearnX
Рет қаралды 28 М.
Why Everyone Cares About Snowflake
11:41
Seattle Data Guy
Рет қаралды 115 М.
Vocabulary for Data Engineers - Data Engineering 101
15:11
Seattle Data Guy
Рет қаралды 42 М.
ТВОИ РОДИТЕЛИ И ЧЕЛОВЕК ПАУК 😂#shorts
00:59
BATEK_OFFICIAL
Рет қаралды 5 МЛН