DataBasics: Streaming
4:58
9 ай бұрын
DataBasics: Modeling
4:22
9 ай бұрын
DataBasics: Data Lakes
5:29
10 ай бұрын
DataBasics: ETL & ELT
5:19
10 ай бұрын
DataBasics: Architectures
3:07
11 ай бұрын
DataBasics: OLTP and OLAP
5:12
11 ай бұрын
DataBasics: What is a database?
3:08
Choices Data Architects Must Make
7:43
Why you should have One Big Table
3:01
Which One Is Data Engineering?
5:33
The Reality of Scaling
4:46
Жыл бұрын
Don't Pick the Wrong Data Career
7:41
If ChatGPT Replaces Google....
7:32
Python For Data: Quick and Easy
6:33
Пікірлер
@firstshield9507
@firstshield9507 2 күн бұрын
Excellent video bro!
@HeadStronger-HS
@HeadStronger-HS 3 күн бұрын
I bet the h1-bs are still there.
@JonathanBiemond
@JonathanBiemond 5 күн бұрын
Really helpful, practical advice! Thank you
@luisresendez946
@luisresendez946 27 күн бұрын
Your content is great man, keep it up! you've helped a lot
@fishsauce7497
@fishsauce7497 Ай бұрын
What many fail to realise is that a bad data warehouse is not just bad table structure, but also, low documentation, redundant calculations, unnecessarily complicated ETL (mostly tech debt). All of which make the warehouse unusable and difficult to maintain. I also see wrong approach when creating a warehouse e.g. just looking at existing reports to create a data model, no data profiling, no business study, no articulation of data loading rules, heavy on undocumented assumptions. Eventually the new shiny warehouse by modellers is also discarded by analysts as it is not fit for purpose, because the same mistake is repated again and again.
@jeanchindeko5477
@jeanchindeko5477 Ай бұрын
4:55 while a good video introduction to Apache iceberg, there are a few point that needs to be clarified here. Delta Lake is not owned by Databricks but by the Linux Foundation since October 2019, unlike the 2 other table formats Hudi and Iceberg. Apache Kudu, Cassandra and Druid are not table formats so cannot and shouldn’t be compared to Apache Iceberg. Databricks don’t have a Delta engine. Delta, Hudi and Iceberg are all 3 available on AWS in AWS Amazon Glue, AWS EMR and Amazon Redshift Spectrum. It’s not just Iceberg which is supported there. All 3 table formats are fully open source, not owned by any company and community driven. They are all 3 supported by the same set of Query engine such as Spark, Flink, Presto, Trino and more. Delta lake is not just supported by Databricks (which you will hear a lot in the Iceberg community) but also by Google BigQuery and Snowflake. Microsoft Fabric platform is built on top of Delta Lake table format.
@splashoui3760
@splashoui3760 Ай бұрын
"that's it"
@delvallo9652
@delvallo9652 Ай бұрын
Everything is “AI” today because greedy people want to monetize it. Same thing happened with “Blockchain” around 2016. Seemingly overnight there were hundreds of startups selling “blockchain” solutions. Don’t buy they hype, literally
@ol880
@ol880 2 ай бұрын
I dunno man, our management identified that our business may be at risk if nuclear warfare breaks out, so we are building a truly resilient platform which replicates data on Mars. We call it "Storm Data Warehouse", because there's not enough water on Mars to use a Cloud. We're also expecting that every person, dead or yet unborn will be incarnated as an AI consciousness, and thus will be our user, so we have to build high-quality robust pipelines that ingest exabytes of data in real-time, so that the CEO can bask in the glow of twenty-four screens that erupt to life as he wakes up at 4:32am to look at the KPI dashboards to change the world.
@nullQueries
@nullQueries Ай бұрын
But you only have a $3000 budget, and it needs to be done by June
@saisathya7755
@saisathya7755 2 ай бұрын
Aio as an data engineer idk but several times i just laughed when you say moving data from one place to another 😂 but it was fun
@millionare1192
@millionare1192 2 ай бұрын
Wow..had a lot of fun and it removed anxiety
@user-vs2yl2up1l
@user-vs2yl2up1l 2 ай бұрын
“In a world deluged by irrelevant information, clarity is power.” Yuval Noah Harari
@TheR0yalBeast
@TheR0yalBeast 2 ай бұрын
Very clean
@georgechristy11
@georgechristy11 2 ай бұрын
Steps to make pipeline better 1. Good auditing and logging: error handling 2. Repeatable and identical 3. Self healing: finding a way to find the delta , log files and compare, add a data lake before data warehouse , add hash or water marks before compare 4. Decouple EL and T: Landon Rae formate, transform to Dwh, make reporting table clean, 5. Always available: trancate and load refresh faster than update. Or build semantic layer 6. CICD: coded, git connected, versioned , rollbacks
@Jerry-uc1pn
@Jerry-uc1pn 2 ай бұрын
Finally, a short and good guide.
@GARFEDEN
@GARFEDEN 2 ай бұрын
Great explanation!
@Mahshadn
@Mahshadn 2 ай бұрын
Spot on! That's what every data job seeker should understand. By the way, I'm a big fan of your videos, your channel deserves a lot more attention. Keep up the good work 🙌
@user-vs2yl2up1l
@user-vs2yl2up1l 2 ай бұрын
All of analytics and data engineering is just applied accounting. The more you know about finance, the stronger you are as an analytics professional. - Lauren Balik
@chasedoe2594
@chasedoe2594 2 ай бұрын
Glad to see you're back on YT.
@nullQueries
@nullQueries 2 ай бұрын
Thanks! Been a busy few months but I have a number of videos I'm working on
@samjoseph5175
@samjoseph5175 2 ай бұрын
Looking forward! :)
@bobbymbabu007
@bobbymbabu007 3 ай бұрын
Amazingly simplified with your explanation. Thanks!
@hammadyasir2174
@hammadyasir2174 3 ай бұрын
speaking speed too fast
@valentinloghin4004
@valentinloghin4004 3 ай бұрын
Nice presentation !! This solution has one down side , you pay 2 services.
@Salma-Ibrahim
@Salma-Ibrahim 3 ай бұрын
I like your videos, great explanation with clear and nice examples. I just need more time and effort to focus because of the loud music in the background. I hope to make it low in rest of your videos. thanks again for your explanation and your time.
@jimmmanuel7420
@jimmmanuel7420 3 ай бұрын
how did you learn especially your cloud skills?
@edidiongesu4035
@edidiongesu4035 3 ай бұрын
You have a great friend 🔥🔥🔥🔥✨✨
@kpsk1627
@kpsk1627 4 ай бұрын
It is all about problem solving. Unfortunately, companies are too consumed with hiring the best technical talent putting them through rigorous coding challenges. But, they forget to ask how you solve a problem, take us through your thought process when you are trying to solve something.
@priyojitbhattacharjee7552
@priyojitbhattacharjee7552 4 ай бұрын
You will grow in KZbin... I vouch for you 👍
@awakened514
@awakened514 4 ай бұрын
“That snake game” 😂😂😂😂
@vidak92
@vidak92 4 ай бұрын
Really, the best explanation.
@user-qm7be6nw9u
@user-qm7be6nw9u 4 ай бұрын
This all comes from universities creating too many types of majors
@fb-gu2er
@fb-gu2er 4 ай бұрын
You shouldn’t call yourself a data engineer if you are not a software developer yourself
@leonid123ful
@leonid123ful 4 ай бұрын
I love your videos, I see that you aren't trying to clickbait, hype them up or dumb them down to get more likes and views, mostly because KZbin isn't your income stream. As a result, we just get a great analysis of whats out there. Legend
@FelipeCampelo0
@FelipeCampelo0 4 ай бұрын
That was a great walkthrough
@bytedonor
@bytedonor 4 ай бұрын
Well explained in pictorial format. But there should be some use case or an example so the newbies can understand more easily.
@poizentv
@poizentv 5 ай бұрын
Hello. I hope you well. Is it possible to become a Data Warehouse developer without leaning any programming language?
@thghtfl
@thghtfl 5 ай бұрын
You are contradicting yourself. Inmon's model has a datamarts layer with denormalized and ready to consume data, so how should that slow down reporting queries?
@akshaykapoor8180
@akshaykapoor8180 5 ай бұрын
very nice explanation
@Milhouse77BS
@Milhouse77BS 5 ай бұрын
2:00 Using a dimensional model to build OBT May be okay, but seems that analysts should build this via pivot table query instead of data team.
@analystt99
@analystt99 5 ай бұрын
funny enough descriptive analysis falls under data science as well and many data analyst jobs keep you in meeting and creating quick dashboards with little to no analysis. To do well, have all skills under your belt just in case
@boyfromhelll
@boyfromhelll 5 ай бұрын
I have been saying this for 5 y already
@rhambo5554
@rhambo5554 5 ай бұрын
Good video (no apostrophe in 'lots' btw). Love - Grammar Police
@shwet2854
@shwet2854 5 ай бұрын
how to find that, when table is last refresh in snowflake for tables without date field
@cNamikz
@cNamikz 5 ай бұрын
That is so true. Thanks for the honesty
@LOFI_WORLD_SONG
@LOFI_WORLD_SONG 6 ай бұрын
I am actually really confused whether to go for data engineer or devops engineering.. less code , less responsibility
@sarthaksharma6897
@sarthaksharma6897 6 ай бұрын
One of the best videos
@notyourbusiness2672
@notyourbusiness2672 6 ай бұрын
Why so little views? Good stuff. Well made.
@LeoGodin
@LeoGodin 6 ай бұрын
This is pure gold. Too many DEs don't realize we exist to solve business problems. FYI, Your videos are excellent and separated from the tons and tons of videos that are just trying to sell a course.
@f4bglv
@f4bglv 6 ай бұрын
Excellent video. Thank you for sharing!
@ShadowD2C
@ShadowD2C 6 ай бұрын
sad for me as a data scientist
@BenOgorek
@BenOgorek 7 ай бұрын
Awesome video. Minor nitpick: the D in DAG is “directed” as in directed arrows. (End nitpick.) As popular as data engineering is getting, it’s really hard to find conceptual videos like this on the basics.