Рет қаралды 1,219
In this video I talk about data classification and the term "production data" as well as introducing some more suitable terminology to reduce any confusion about your data and systems. I cover machine learning, live data, test data, production data and experimental data.
This series accompanies the documents at github.com/davedoesdemos/Data... which aim to explain some of the governance and operationalisation aspects of data lakes.
0:00 - Introduction to the video
1:12 - Traditional systems
2:46 - Data cuts for QA
3:08 - New, less ambiguous terms
8:16 - How does test data fit in to this?
9:31 - Flow chart of data moving through your environment
11:11 - Some examples
16:02 - Wrap up
For all of my other demos, go to davedoesdemos.com or go straight to the GitHub page at github.com/davedoesdemos/Demo.... Also please subscribe to the channel to make sure the latest demos show up in your playlist!