The hidden labor cost of inherited data

  Рет қаралды 866

Cassie Kozyrkov

Cassie Kozyrkov

Күн бұрын

Although inherited data (a.k.a. secondary data) are cheaper to get than primary data, they're more expensive in terms of data scientist labor-hours. One reason is that you'll have to do lots of extra documentation and you'll also need to detective work to figure out the real story about how the inherited dataset was created. This video walks you through the extra documentation you'll be expected to produce.
At a bare minimum, if you're working with inherited data, I recommend adding this to all your project documents:
“Since our project team did not participate in planning the study or data collection, it is possible that we are missing crucial context which renders our conclusions invalid.”
Learn more on my blog: bit.ly/quaesita...
Don't forget to hit subscribe+notify! If you found this useful or enjoyable (amuseful?), the best way to say thank you is by sharing it.

Пікірлер: 7
@parrotraiser6541
@parrotraiser6541 Жыл бұрын
Was it Socrates who said that his wisdom consisted of realising how much he didn't know, or words to that effect?
@greensock4089
@greensock4089 Жыл бұрын
I hope nobody uses that disclaimer as a get out of jail free card for being data dangerous
@greensock4089
@greensock4089 Жыл бұрын
You're a bit quiet in this video
@jediyoda7338
@jediyoda7338 Жыл бұрын
Listing assumptions and caveats is must. Many thanks for these daily tips ….
@vikashkumar994
@vikashkumar994 Жыл бұрын
Again great advice.
@azarel7
@azarel7 Жыл бұрын
C.Y.A !
@brain_respect_and_freedom
@brain_respect_and_freedom Жыл бұрын
👍
How to work with inherited datasets
5:03
Cassie Kozyrkov
Рет қаралды 2,6 М.
The data scientist's guide to data documentation
3:56
Cassie Kozyrkov
Рет қаралды 1,6 М.
Nastya and balloon challenge
00:23
Nastya
Рет қаралды 69 МЛН
The Joker wanted to stand at the front, but unexpectedly was beaten up by Officer Rabbit
00:12
Do you choose Inside Out 2 or The Amazing World of Gumball? 🤔
00:19
Where does math impostor syndrome come from?
4:35
Cassie Kozyrkov
Рет қаралды 2,2 М.
Step-by-step guide to AI projects
9:46
Cassie Kozyrkov
Рет қаралды 3,5 М.
How to read a box plot (a.k.a. a box-and-whisker plot) - Nick Desbarats
6:53
Practical Reporting Inc.
Рет қаралды 71 М.
Optimize your life with decision science
3:05
Cassie Kozyrkov
Рет қаралды 2,1 М.
I Studied Data Job Trends for 24 Hours to Save Your Career! (ft Datalore)
13:07
Thu Vu data analytics
Рет қаралды 231 М.
Is It All About the Data? (Ina Fried, Cassie Kozyrkov) | DLD 24
21:21
DLD Conference
Рет қаралды 1,6 М.
How I'd Learn Data Analytics in 2024 (If I Had to Start Over)
14:08
CareerFoundry
Рет қаралды 806 М.
Judgment calls in data science
2:23
Cassie Kozyrkov
Рет қаралды 900
How I'd Learn Data Analytics in 2024 | 3 Month Plan
11:42
Rohan Adus
Рет қаралды 391 М.
Nastya and balloon challenge
00:23
Nastya
Рет қаралды 69 МЛН