Tutorial 25- Probability Density function and CDF- EDA-Data Science

  Рет қаралды 154,145

Krish Naik

Krish Naik

Күн бұрын

Пікірлер: 176
@howtotipsandtricks4381
@howtotipsandtricks4381 Жыл бұрын
Institution of Data science claims that they have good content for freshers also but it is no there, i always have to come in your channel for many topics clarification which I could never learn from there. You have a great skill of teaching😊
@HarshitSharma-k3p
@HarshitSharma-k3p Жыл бұрын
you are much better than many online courses in market ...thank you please keep going.
@hadishaaben3665
@hadishaaben3665 2 жыл бұрын
Man you do not know how much i learned from you , your explanation is AWESOME
@jhondhelpago1638
@jhondhelpago1638 Жыл бұрын
He explained two topics in less than 10 minutes, yet its so clear and informative
@mischievousmaster
@mischievousmaster 4 жыл бұрын
Krish loves the word PARTICULAR a ton! 😁
@sunnyks108
@sunnyks108 3 жыл бұрын
PRETTY MUCH too :)
@srujankumar637
@srujankumar637 3 жыл бұрын
you are an awesome teacher; seems like completely dwelling over the concepts... simply i take a bow
@Ankurkumar14680
@Ankurkumar14680 4 жыл бұрын
Thanks for sharing sir, I always admire your teaching style, knowledge and helping nature. Small clarification, in normality plot, values on y-axis does not tell us area under the curve. In this way, y-axis corresponding to mean value on the x-axis will always be .5 but that is not the case. Actually, it is the gradient value of the CDF function (graph).
@tensorthug6802
@tensorthug6802 4 жыл бұрын
The y-axis of pdf is gradient of cdf, the higher the gradients, more the density is at that particular point. The y-axis of CDF is the percentage population of a particular point.
@MuhammadAbdullah-lr7sd
@MuhammadAbdullah-lr7sd 3 жыл бұрын
Yes, that is what I'm thinking but in the video it creates some confusion.
@t.saishodhanrao9519
@t.saishodhanrao9519 2 жыл бұрын
this comment should be pinned...... It creates lot of confusion for those who don't about this
@im_tanmay_g
@im_tanmay_g Жыл бұрын
Most simplest and non-confusing video on PDF & CDF. Thank you for the same.
@cherubyGreens
@cherubyGreens 3 жыл бұрын
Feeling amazing with Krish Naik!
@phalgunaa2157
@phalgunaa2157 2 жыл бұрын
Your explanation is too good bro
@nvsyashwanth918
@nvsyashwanth918 4 жыл бұрын
The way you explain concepts is amazing.
@livebiochemistry
@livebiochemistry 2 жыл бұрын
One of best video of PDF and CDF..thanks sir
@amiysrivastava1444
@amiysrivastava1444 4 жыл бұрын
best explanation of cdf when compared with other youtube videos.
@johan-mattias
@johan-mattias Жыл бұрын
learning about machine learning for google spreadsheets, this helped understand the CDF so much thank!
@Vidi_111
@Vidi_111 2 жыл бұрын
Thank you sir .. the way you explain is so easy to understand...
@sapnilpatel1645
@sapnilpatel1645 2 жыл бұрын
Amazing video sir. Thank you so much.
@bluestar2253
@bluestar2253 2 жыл бұрын
Excellent explanation of PDF and CDF
@VVV-wx3ui
@VVV-wx3ui 5 жыл бұрын
Simply explained. Good going Krish.
@oliullah.mahmud
@oliullah.mahmud 2 жыл бұрын
Thank you. I like your teaching style!
@TariqueMahmud313
@TariqueMahmud313 3 жыл бұрын
Too many clear concepts in just 7 minutes !!! Thanks man!!
@chaos8514
@chaos8514 2 жыл бұрын
hello are you also trying to learn data analysis?
@VVV-wx3ui
@VVV-wx3ui 5 жыл бұрын
Thanks Krish for sharing your knowledge. Please keep it going.
@bandhammanikanta1664
@bandhammanikanta1664 5 жыл бұрын
Thanks for this video Krish. We will be very happy to see atleast a single reply to any comments in youtube as well as issues in his github.
@krishnaik06
@krishnaik06 5 жыл бұрын
I usually see the comments and make a note on it to create videos...github videos will be coming up soon
@bandhammanikanta1664
@bandhammanikanta1664 5 жыл бұрын
@@krishnaik06 Thank you Krishna. Waiting to see your updates.
@himanshubansal2701
@himanshubansal2701 4 жыл бұрын
@@krishnaik06 sir what is benefits of cdf over pdf ? bcoz we will be analysing same precentage with pdf also.
@padduchennamsetti6516
@padduchennamsetti6516 6 ай бұрын
wow you are awesome,the best
@kabyabasu
@kabyabasu 3 жыл бұрын
Krish is the God of data science
@navinofficial5439
@navinofficial5439 10 ай бұрын
Crystal Clear!
@ahmedel-bahnihi346
@ahmedel-bahnihi346 4 жыл бұрын
When you explain the PDF, you said, it is the area under the curve till that point. I think this is the CDF, not PDF. Thanks a lot for your effort nd videos
@dharunsainath322
@dharunsainath322 4 жыл бұрын
that is correct...he is probably talking about CDF
@madhuprasath6193
@madhuprasath6193 4 жыл бұрын
A query,then how do you interpret a pdf?
@dattamalpote2005
@dattamalpote2005 4 жыл бұрын
i think krishna sir had explain it right.
@mitultandon5227
@mitultandon5227 4 жыл бұрын
@@madhuprasath6193 It basically gives you the probability of that point. PDF would answer a question like these:- What would be the chance of weight of a person to be 90kg?. Answer to this as per the above graph in the video would be "only 25% chance ( or 0.25 probability )". Basically PDF tells us the exact probability for every point.
@srujankumar637
@srujankumar637 3 жыл бұрын
area under the curve in particular (definite integral) range is pdf:: total must be unity
@santamsaha9415
@santamsaha9415 4 жыл бұрын
this video is a soul saver
@NR_Tutorials
@NR_Tutorials 4 жыл бұрын
nice videos thanks krish naik sir
@Ks-oj6tc
@Ks-oj6tc 3 жыл бұрын
Well explained, Thanks a lot Krish.
@brayansereno4249
@brayansereno4249 3 жыл бұрын
Hi Krish, thank you so much, I speak Spanish but I understand you, I really need an explanation of this topic and I don't found in Spanish, you're great bro
@mahalerahulm
@mahalerahulm 4 жыл бұрын
Excellent !! Very nice explanation.
@AJ-et3vf
@AJ-et3vf 2 жыл бұрын
awesome video sir! thank you!
@Emotekofficial
@Emotekofficial 4 жыл бұрын
As far as I know CDF is Cumulative Distribution function. It can also be calculated for Probability Mass Function. But you can say in this Scenario as Cumulative Distribution function of given Probability Density Function.
@saurabhtripathi62
@saurabhtripathi62 4 жыл бұрын
thanks your series is great , u made this very easy.
@jaheerkalanthar816
@jaheerkalanthar816 3 жыл бұрын
Thanks for the video sir, I learned lot of things in this video
@BalaguruGupta
@BalaguruGupta 4 жыл бұрын
I've commented on your other videos also, there I understood the love you had on the technology. From this video I understood that, actually you're so much passionate on teaching sir. The way you explained PDF and CDF is really amazing sir. Thank you so much. :)
@muhammadyasirbutt3631
@muhammadyasirbutt3631 4 жыл бұрын
very well brother your are great teachning
@sagarkhande4412
@sagarkhande4412 3 жыл бұрын
Ty sir your video was really helpful..👍
@vidyasurbhi3084
@vidyasurbhi3084 2 жыл бұрын
Very nice 👌.
@shadiyapp5552
@shadiyapp5552 2 жыл бұрын
Thank you sir ♥️
@kpratik5551000
@kpratik5551000 4 жыл бұрын
Very good explanation.
@tsaurav18
@tsaurav18 2 жыл бұрын
thankyou sir.
@DataAI_junction
@DataAI_junction 6 ай бұрын
thank you for the video
@aimenbaig6201
@aimenbaig6201 3 жыл бұрын
you are the best
@mohammedabdulahmed8808
@mohammedabdulahmed8808 3 жыл бұрын
Simply amazing explanation 😍😍 Thanks alot and keep doing sir!!!
@magedrefat1658
@magedrefat1658 2 жыл бұрын
Sir, your explanation is amazing ^_^
@xeysus5907
@xeysus5907 5 ай бұрын
Great one
@TheAl217
@TheAl217 4 жыл бұрын
Thank you for clarifying these functions.
@ArchnaVijay
@ArchnaVijay 3 жыл бұрын
amazing video
@aditisrivastava7079
@aditisrivastava7079 4 жыл бұрын
Thanks to wonderfull video..............i will simply add through pdf we can find the probababilty for a point or a range whereas cdf tell about the less than probability
@hbk788dbz
@hbk788dbz 2 жыл бұрын
PDF values can be greater than 1 ! (your y-axis can be greater than 1 ) . Area under the curve cannot be greater than 1 . Please correct me if im wrong @krish naik
@ranjanpal7217
@ranjanpal7217 2 жыл бұрын
Amazing...plz make a video on how to determine the distribution of a dataset using Python.
@PriyaAmar848
@PriyaAmar848 3 жыл бұрын
Why do I need to use this distribution ? In which cases of data it's helpful ? Also we have uniform distribution, binomial and poisson. Where to use these. Appreciate if practical examples are included. Great explanation with graphs. Keep up the enthusias,
@tsrnihar
@tsrnihar 3 жыл бұрын
Hi Krish - Appreciate your effort in putting together the videos. Want to redflag something. You mixed the concepts of PDF and CDF - You are using PDF but conveying the meaning of CDF. point on a PDF indicates probability of the point in the distribution. Whereas point on a CDF indicates the cumulative probability up to the point. This is also area under the graph.
@shrimaykher2978
@shrimaykher2978 3 жыл бұрын
PDF doesn't indicate probability of the point in the distribution. In fact, it is very troubling to believe but probability with which continuous variable takes exact value is zero. We always talk about probability of random variable falling into an interval using PDF by finding area under curve. Moreover as sir explained, i would like to make a little correction that pdf can go beyond 1. Y-axis of PDF is probability density not % distribution. PDF is not actually probability, but rather a density function which tells amount of probability per unit length, therefore it can go beyond 1 unlike PMF. Story of discrete and continuous variable is different and we cannot mix up the theory.
@equbalmustafa
@equbalmustafa 5 жыл бұрын
Nice one
@mukundsudharsan1294
@mukundsudharsan1294 4 жыл бұрын
In 3.08 you mentioned that the y-axis in the normal distribution represents the % of distribution below that point. If that statement holds true then shouldn't the graph be continuously increasing and it would be cdf? So what does the y-axis indicate for the normal distribution falling in the right half? Please correct me if I am mistaken, but would like to understand this better.
@kantafcb1
@kantafcb1 3 жыл бұрын
y axis shows the %age distribution of intervals
@muhammadsaqib2961
@muhammadsaqib2961 4 жыл бұрын
Good explanation
@schuf1738
@schuf1738 2 жыл бұрын
Thank you !!
@deepakbhaiya.shorts
@deepakbhaiya.shorts 3 ай бұрын
Old is gold
@KalyanGk0
@KalyanGk0 3 жыл бұрын
Great explanation krish😊 .please make video on practical implementation of this concepts using python.
@saddamshaikh9285
@saddamshaikh9285 5 жыл бұрын
Sir please make a video on navie byes algorithm.
@sudhirBhalekar007attaboy
@sudhirBhalekar007attaboy 4 жыл бұрын
Why do we calculate CDF as PDF is already giving you % of distribution for required data analysis, through this CDF, we are getting added (C.values) but what is the significance of this concept?
@ahmed96616
@ahmed96616 4 жыл бұрын
Excellent !
@nightowl1596
@nightowl1596 2 жыл бұрын
elite explanation, gg
@sharathchandrakarnati4615
@sharathchandrakarnati4615 3 жыл бұрын
We can use logistics regression right based upon cdf ?
@ibrahimahmethan586
@ibrahimahmethan586 5 жыл бұрын
thank u so much . god bless u
@nehasaroha2505
@nehasaroha2505 5 жыл бұрын
Very informative video....can you suggest some simpler kaggle datasets on which we can perform EDA using PDF,CDF and multivariate analysis. I have already done on iris, Titanic and Haberman's dataset, but was thinking about getting more practice.
@muhammadyasirbutt3631
@muhammadyasirbutt3631 4 жыл бұрын
hello pretty friend
@menakask6050
@menakask6050 Жыл бұрын
Hi krish, kindly let me know pls explain how do you say at point 130 in X axis with 90% in distribution in Y axis is "less than" since the CDF is straight it is increasing and you are mentioning less than 130kg is there in 90% of the dataset. How do you predict it is less or high using CDF?
@MScFabianoBriao
@MScFabianoBriao 3 жыл бұрын
Buenos! Do you have videos of real cases showing inferential statistics to test (validate) models?
@rakhijha8911
@rakhijha8911 4 жыл бұрын
Simply amazing i read so many articles on cdf but everyone was calculating the value no one explained it so well can I plzz connect you on LinkedIn
@lokeshchoraria6559
@lokeshchoraria6559 5 жыл бұрын
best explanation
@prachetade3163
@prachetade3163 4 жыл бұрын
Hi Krish, I think you meant to say that cdf is the value of integration of the pdf uptil a point x and not a summation of pdf as you explained at 4.47. Great videos btw, really appreciate your efforts.
@amolkabugade3728
@amolkabugade3728 4 жыл бұрын
integration is actually addition of those values
@amitmaurya6179
@amitmaurya6179 3 жыл бұрын
If you are smoothening the curve, why count changes to percentage of distribution.
@গোলামমোস্তফা-শ৮থ
@গোলামমোস্তফা-শ৮থ Жыл бұрын
But that area is not 1 according to your curve. Because if the interval is 10 and and suppose height of any bar is 0.25 then 10*0.25 > 1. May be The concept exactly came from CDF function and in pdf the values are called likelihood and these values are the slopes(tan@) of different points from cdf.
@naveenreddy7421
@naveenreddy7421 4 жыл бұрын
Sir it is clear but please make a video on same concept in jupyter note book
@Thanusree234
@Thanusree234 2 жыл бұрын
Sir is this subject exploratory data analysis and statistics subject the same sir please reply 🙏
@rakeshenjapuri3143
@rakeshenjapuri3143 4 жыл бұрын
how will calculate the above 60% in pdf and how will take the cumulative percentage in cdf give with mathematical explanation sir
@bavalpreetsingh4664
@bavalpreetsingh4664 4 жыл бұрын
do make video on all the distributions
@pushkarsaini7653
@pushkarsaini7653 5 жыл бұрын
sir may u please make video on hypothesis.
@karthikvijayasarathi89
@karthikvijayasarathi89 4 жыл бұрын
Small correction - Probability cannot go over 1 , but probability density function can go. Correct me if I am wrong
@wealth_developer_researcher
@wealth_developer_researcher 3 жыл бұрын
Amazing :)
@ratulghosh3849
@ratulghosh3849 4 жыл бұрын
Good going Sir keep up the good work :)
@ankitchakraborty1126
@ankitchakraborty1126 Жыл бұрын
Hi sir. Thanks for sharing awsome content like this. I have 1 question. Can we calculate percentile and median from CDF?
@SandeepGurjar-ko5ju
@SandeepGurjar-ko5ju 4 жыл бұрын
Hi, nice video but I think while you were explaining PDF you were actually talking about CDF.
@Nikhil-jj7xf
@Nikhil-jj7xf 5 жыл бұрын
Krish pls provide you're online course registration link
@RishikeshGangaDarshan
@RishikeshGangaDarshan 4 жыл бұрын
If data is not in form of gaussian distribution then the pdf or cdf will work or not
@seriouscoder1727
@seriouscoder1727 3 жыл бұрын
Cdf the chart that apple used in theiŕ comercial to show sales, Lying with statistic
@ujjwalmv9697
@ujjwalmv9697 Жыл бұрын
what if cdf over a period of time is not being constant and hits 90 degree and then goes constant, why is that straightline coming in cdf?
@adityapathania3618
@adityapathania3618 3 жыл бұрын
dont you think the cumulative total will go above 1? at 3rd 4th value itself ?as the probabilities are getting added ?
@anto1756
@anto1756 3 жыл бұрын
Nice 😁 could u do a comparison about survival function, inf and all other methods please
@lahari1512
@lahari1512 4 жыл бұрын
hai krish , ur vedios are really helping me to learn machine learning very easily , can u please upload svm and xg boost vedios please its a request
@snehalhon
@snehalhon 3 жыл бұрын
Krish sor plz send me playlist link for this tutorial series
@saurabhtripathi62
@saurabhtripathi62 4 жыл бұрын
please add a video that clears how to do this with python , practically
@shreyasaxena5169
@shreyasaxena5169 4 жыл бұрын
sns.distplot(df['weight])
@0505Arjun
@0505Arjun 5 жыл бұрын
What scenarios we will use CDF and PDF in machine learning..?
@gauravsaini728
@gauravsaini728 5 жыл бұрын
I have the same query. Can you please give any domain specific example and show is how PDF and CDF curves will help the data scientist to take certain decisions..
@alphonseinbaraj7602
@alphonseinbaraj7602 5 жыл бұрын
yeah ..same query me too
@rajatchaturvedi7379
@rajatchaturvedi7379 5 жыл бұрын
Don't know its application in ML yet, Since I have started learning recently but one use of it is in EDA Which is the exploratory data analysis.Exploratory means you don't know anything about the data set from before. Your task is to extract some basic yet critical information about the dataset before implementing ML algorithms.Its important to get an idea about the dataset .PDF, CDF various plots such as 2D, pair plot , etc are some aspects of EDA.There are other stuff also.You can google it to understand more .
@umesh789s
@umesh789s 4 жыл бұрын
can you please explain about T-score and T distribution
@dude5697
@dude5697 3 жыл бұрын
In 3.05, you shade the area even outside of the bell curve, but in 3.31 you shade the area only within the bell curve. I can't understand , Pls can you explain me Krish sir?
@pravalikamucherla139
@pravalikamucherla139 4 жыл бұрын
Hi You have used only weight one feature to determine pdf n CDf can we do with two or more features
@PriyaAmar848
@PriyaAmar848 3 жыл бұрын
Great question, expecting answer from any DS enthusiast
@kantafcb1
@kantafcb1 3 жыл бұрын
no, its univariate analysis
@ravitanwar9537
@ravitanwar9537 5 жыл бұрын
amazing as always
@enchanted_swiftie
@enchanted_swiftie 3 жыл бұрын
But sir, when I plot KDE plots with seaborn, I often get the values on the y-axis more than 1. What the interpretation will be then? Or KDE plots are different from Density Curves?
What is a Probability Density Function (pdf)? ("by far the best and easy to understand explanation")
9:46
Iain Explains Signals, Systems, and Digital Comms
Рет қаралды 156 М.
Какой я клей? | CLEX #shorts
0:59
CLEX
Рет қаралды 1,9 МЛН
Probability Distribution Functions (PMF, PDF, CDF)
16:17
zedstatistics
Рет қаралды 1,2 МЛН
Probability and Statistics: Overview
29:43
Steve Brunton
Рет қаралды 107 М.
Probability density functions | Probability and Statistics | Khan Academy
10:02
Tutorial 24-Z Score Statistics Data Science
11:59
Krish Naik
Рет қаралды 138 М.
End To End RAG Agent With DeepSeek-R1 And Ollama
12:47
Krish Naik
Рет қаралды 4,6 М.
Normal Distribution (PDF, CDF, PPF) in 3 Minutes
5:26
3-Minute Data Science
Рет қаралды 80 М.
Cumulative Distribution Functions and Probability Density Functions
11:02
The Organic Chemistry Tutor
Рет қаралды 697 М.
Какой я клей? | CLEX #shorts
0:59
CLEX
Рет қаралды 1,9 МЛН