Probability Distributions Made Easy: Top 3 to Know for Data Science Interviews

  Рет қаралды 8,159

Emma Ding

Emma Ding

Күн бұрын

Пікірлер: 11
@chihirobabuska4422
@chihirobabuska4422 2 жыл бұрын
Hi Emma, thanks for your wonderful video. In your Binomial example, I would like to point out that click through rate follows a normal distribution due to Central Limit Theorem. Assuming the total number of clicks follows a Binomial(n, p), which means that there are total n impressions in consideration, and whether each impression ends up as a click is a Bernoulli(p) variable. In other words, there are only two outcomes for each impression, and with probability p it ended up as a click. The click through rate is the average of the results of all the above n Bernoulli variables. By CLT, the average of all these Bernoulli variables follows a normal distribution. After all, click through rate is a continuous variable, while a Bernoulli distribution is a discrete distribution with only 2 outcomes.
@danielrad7991
@danielrad7991 2 жыл бұрын
In the first example (Avg time spent per user per day), the sample size is 10. Can we assume normality, given our sample size is too small?
@stella123www
@stella123www Жыл бұрын
same question. I think if the n=10, CLT doesn't apply. She probably meant when sample size is larger than 30, the samples' avg time spent per user per day is normally distributed
@yunyihuang9476
@yunyihuang9476 Жыл бұрын
n is 1000 in this example
@songsong2334
@songsong2334 2 жыл бұрын
Thanks, Emma for the great video! If we map the distribution to the AB test distribution, for binary outcomes it will be binomial distribution. At the same time, will other cases all be normal distribution according to the Central limit theorem? I do not have enough practical experience in AB Testing, would love to know how we decide how different distributions are used in the AB test. Why do we have to specify a T-test or a Z-test?
@beyondtheclouds95
@beyondtheclouds95 Жыл бұрын
the churn example is gold!
@emmysway96
@emmysway96 5 ай бұрын
I think the green and blue parameters are swapped for the normal distribution diagram.
@raghavmittal2397
@raghavmittal2397 6 ай бұрын
Hi Emma, I have a doubt - How would one calculate the average time spend per user per day? Say we select a random sample of 10 users as in your example in the video. For those 10 users we have data on the time spent per day for each of the users. Now a user might have multiple time spent per day values depending on if they were active on several days. So for a particular user we calculate the average time spent per day by that user and then take the average time spent for the 10 users using average of individual averages?
@raghavmittal2397
@raghavmittal2397 6 ай бұрын
Other way of approaching this question can be - randomly selecting say 10 dates, and calculating the average of the total time spent per user per day for those days, and repeating the process 1000 times. Will this method work?
@keshavgupta308
@keshavgupta308 2 жыл бұрын
Hii Mam Remember me I love the way you taught us everything 😍😍🤗🤗
@lenka4662
@lenka4662 Жыл бұрын
Hi Emma 你好 本土国内大学数学专业 未留过学的 希望竞争国外的数据科学家有希望吗
Acing the Statistics Interview for Data Science Jobs
9:56
Emma Ding
Рет қаралды 21 М.
How to whistle ?? 😱😱
00:31
Tibo InShape
Рет қаралды 12 МЛН
Kluster Duo #настольныеигры #boardgames #игры #games #настолки #настольные_игры
00:47
Это было очень близко...
00:10
Аришнев
Рет қаралды 2,9 МЛН
哈哈大家为了进去也是想尽办法!#火影忍者 #佐助 #家庭
00:33
火影忍者一家
Рет қаралды 126 МЛН
The 6 MUST-KNOW Statistical Distributions MADE EASY [4/13]
9:25
Andrew Jones
Рет қаралды 7 М.
Probability Top 10 Must Knows (ultimate study guide)
50:51
JensenMath
Рет қаралды 254 М.
The Main Ideas behind Probability Distributions
5:15
StatQuest with Josh Starmer
Рет қаралды 433 М.
1. Introduction to Statistics
1:18:03
MIT OpenCourseWare
Рет қаралды 2 МЛН
5 Concepts in Statistics You Should Know | Data Science Interview
20:48
Probability: Types of Distributions
7:24
365 Data Science
Рет қаралды 375 М.
How to whistle ?? 😱😱
00:31
Tibo InShape
Рет қаралды 12 МЛН