Training a machine learning model with scikit-learn

  Рет қаралды 225,311

Data School

Data School

Күн бұрын

Пікірлер: 546
@dataschool
@dataschool 3 жыл бұрын
Having problems with the code? I just finished updating the notebooks to use *scikit-learn 0.23* and *Python 3.9* 🎉! You can download the updated notebooks here: github.com/justmarkham/scikit-learn-videos
@johnlim640
@johnlim640 3 жыл бұрын
This is hands down the best machine learning tutorial. Definition and concept is well-explained. THANK YOU SO MUCH!
@dataschool
@dataschool 3 жыл бұрын
Thank you for your kind words! 🙏
@mightyflamelord
@mightyflamelord 8 жыл бұрын
i appreciate the fact that you speak very slowly and express clearly!
@dataschool
@dataschool 8 жыл бұрын
Thanks, I try to make it easy for others to understand me! :)
@11folders
@11folders 4 жыл бұрын
I totally agree. I don't have to pause the video as frequently while taking notes.
@srivathsgondi191
@srivathsgondi191 2 жыл бұрын
Despite this being an old playlist, without a doubt still the best one I found on youtube so far...
@dataschool
@dataschool 2 жыл бұрын
Thank you so much!
@Guinhulol
@Guinhulol 11 ай бұрын
Oh yeah! It doesn't get better than that!
@gsk1740
@gsk1740 6 жыл бұрын
No words to Describe How awesome it is...after watching so many tutorials .
@dataschool
@dataschool 6 жыл бұрын
Thanks very much for your kind words!
@Aviel777Gergel
@Aviel777Gergel 3 жыл бұрын
It is so awesome, that you combine first class knowledge+ impressive pronunciation of a professional voice actor. It's super clear! Thank you for the series
@dataschool
@dataschool 3 жыл бұрын
Wow, thank you! 🙏 I really appreciate your truly kind words!
@hasyahaven
@hasyahaven 5 жыл бұрын
Loss of words! Ur explanation is with the purpose to answer every root question and with an aim so that one clearly understands. Thanks a lot.
@dataschool
@dataschool 5 жыл бұрын
You're welcome!
@pierrelaurent8284
@pierrelaurent8284 8 жыл бұрын
It's a real pleasure to follow this serie, clear, concise and so well teached. Being a non native English speaker, it's 100% understandable. Bravo !
@dataschool
@dataschool 8 жыл бұрын
Wow, thanks so much for your kind words! I really appreciate it.
@edmarkowitz9873
@edmarkowitz9873 6 жыл бұрын
You're videos are so far superior to the commercial products out there I just can't believe it. I wish I had found them before dumping a small fortune into the "pay-to-play courses." Thank you for sharing this information and be sure that I will join the Paetron group.
@dataschool
@dataschool 6 жыл бұрын
Wow! Thank you so much for your kind words! :) I look forward to having you in the Data School Insiders group... you can join here: www.patreon.com/dataschool
@Axle_Max
@Axle_Max 6 жыл бұрын
Your ability to explain this topic in simple terms is remarkable. Thank you so much for these videos.
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@yechihast
@yechihast 6 жыл бұрын
One of the best ML online tutor I have come across, very well thought, every minute in informative. AND I do support Kevin's slow speech pace; it makes it much easier to comprehend the complext concepts. Thank you Kevin.
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@spandanhetfield
@spandanhetfield 9 жыл бұрын
You have done an awesome job. I'm the TA for a course on Bioinformatics and I'll be using your videos to teach my students a short primer on getting started with ML just so that they can shed that fear and get down to work :)
@dataschool
@dataschool 9 жыл бұрын
+Spandan Madan That's awesome! Please let me know how it goes!
@Dockmark5
@Dockmark5 6 жыл бұрын
Not just educated, but a talented teacher. Fantastic combination
@dataschool
@dataschool 6 жыл бұрын
Thanks so much! I really appreciate it! :)
@kritikakamra22
@kritikakamra22 7 жыл бұрын
Thanks for making such lucid videos Kevin! You have no idea how helpful these videos are for a novice like me.
@dataschool
@dataschool 7 жыл бұрын
Excellent! That's very nice to hear!
@rayuduyarlagadda3473
@rayuduyarlagadda3473 6 жыл бұрын
This is the best explanation, I have gone through many videos but this video helped me a lot for better understanding.... Thank you markham.
@dataschool
@dataschool 6 жыл бұрын
You're very welcome! Glad it was helpful to you!
@kushalmiglani2691
@kushalmiglani2691 8 жыл бұрын
you are doing a very good job. The stuff and tutorials you are providing for free seriously shows your dedication for your work and how much you care for those who cant afford such expensive tutorials. Thanks
@dataschool
@dataschool 8 жыл бұрын
Thanks so much for your kind words! I'm really glad the tutorials have been helpful to you!
@atiflatif7233
@atiflatif7233 7 жыл бұрын
Thanks so much for making it so easy to understand. I have watch many videos on Machine Learning and have never felt so confident in applying the concepts. Well Done!
@dataschool
@dataschool 7 жыл бұрын
You are very welcome! I'm glad to hear my video was helpful to you!
@garriedaden4168
@garriedaden4168 9 жыл бұрын
Many thanks for this video series. I really like the way you develop the subject in manageable chunks and focus on what is really needed to master the subject.
@dataschool
@dataschool 9 жыл бұрын
Garrie Daden That is excellent to hear, and is exactly what I was trying to do! Thanks for your thoughtful comment.
@c00kiemonster247
@c00kiemonster247 8 жыл бұрын
This literally is best tutorial guide on the internet.. thank you so much
@dataschool
@dataschool 8 жыл бұрын
Wow! What a kind thing to say... thank you!
@aquaman788
@aquaman788 4 жыл бұрын
Me too!!!!!!
@KowsalyaSubramanian
@KowsalyaSubramanian 8 жыл бұрын
Thanks Kevin. I like the videos very much. Wish I had known about this series a month back. I dropped my ML course in this semester because the material was very overwhelming. Very useful videos and the material is presented in a very organized manner. Keep up the good work!
@dataschool
@dataschool 8 жыл бұрын
Thank you so much for your kind comments!
@aditi-ind
@aditi-ind 7 жыл бұрын
Overwhelming, I have been trying to learn these basics since a long time, and finally got this video series, Thank you so much for such clear presentation of such a complex (esp for me) topic.
@dataschool
@dataschool 7 жыл бұрын
You are very welcome! I'm so glad to hear it was helpful to you!
@victorekwueme3581
@victorekwueme3581 8 жыл бұрын
Your explanations in your videos are easy to understand and very or should I say extremely helpful. Keep it up....
@dataschool
@dataschool 8 жыл бұрын
+Victor Ekwueme Thanks! I spent a lot of time figuring out how to teach this material in the classroom, and so I thought it was important to spread the knowledge using videos as well :)
@SeaCreature_
@SeaCreature_ 7 жыл бұрын
One of the best channels. Nice to see someone speaking so coherent and educational, compared to other channels. Great job Kevin.
@artemkovera5785
@artemkovera5785 7 жыл бұрын
Totally agree with you. It's a great channel. I just published an e-book about machine learning with clustering algorithms. it's available for free for 5 days. Would you like to get a free copy?
@SeaCreature_
@SeaCreature_ 7 жыл бұрын
of course, thank you Artem
@artemkovera5500
@artemkovera5500 7 жыл бұрын
it's here www.amazon.com/dp/B076NX6KY7 You would really help me if you leave a little review on Amazon
@SeaCreature_
@SeaCreature_ 7 жыл бұрын
Great thank you, and will do. I dont use Amazon kindle but I try to figure out how to get around it. Thanks
@artemkovera5785
@artemkovera5785 7 жыл бұрын
You can easily create an account on Amazon if you don't have one (you don't necessarily need to enter your credit card). After that, you will be able to read free e-books on Amazon website through their cloud reader. I regularly read free e-books available on Amazon, and it's very convenient.
@lightningblade9347
@lightningblade9347 6 жыл бұрын
This is the best Machine learning video I've ever watched, amazing how you did break a complicated topic like machine learning into small sections accompanied with very, very clear explanations, thank you very much I hope you continue it's been a long time since you posted a video on KZbin.
@dataschool
@dataschool 6 жыл бұрын
Thanks so much for your kind comments! I really appreciate it :) P.S. I published 10 videos last month, and will have more in the future!
@lightningblade9347
@lightningblade9347 6 жыл бұрын
Wow thanks for the update, I'm gonna check them right now. bless you.
@tuvantran660
@tuvantran660 4 жыл бұрын
Wow, you're the best teacher I've learned so far. Easy to understand and the contents are well explained.
@ssagga
@ssagga 8 жыл бұрын
Wow, ML suddenly feels a lot less scary. Can't wait to watch the rest of the series.
@dataschool
@dataschool 8 жыл бұрын
Excellent! Here's a link to the entire video series, for others who are interested: kzbin.info/aero/PL5-da3qGB5ICeMbQuqbbCOQWcS6OYBr5A
@RicardoFerrazLeal
@RicardoFerrazLeal 9 жыл бұрын
Best series of machine learning tutorials out there!
@dataschool
@dataschool 9 жыл бұрын
Ricardo Ferraz Leal Wow, thank you! What a kind compliment. I really appreciate it!
@colmorourke4657
@colmorourke4657 5 жыл бұрын
Simply outstanding work. It's highly structured and clearly explained. I also greatly appreciate the excellent references you link for various sections.
@dataschool
@dataschool 5 жыл бұрын
Thank you so much for your kind words!
@pranjalkumar9378
@pranjalkumar9378 5 жыл бұрын
You choose your words very carefully. Awesome teaching 👏
@dataschool
@dataschool 5 жыл бұрын
Thanks so much! 🙌
@flamboyantperson5936
@flamboyantperson5936 7 жыл бұрын
Step by step explanation in a clear way. Just love it. Thank you so much.
@dataschool
@dataschool 7 жыл бұрын
You're very welcome!
@adityarajora7219
@adityarajora7219 6 жыл бұрын
Love your Speed and Clarity man.
@dataschool
@dataschool 6 жыл бұрын
Thank you!
@shobhitsrivastava4496
@shobhitsrivastava4496 6 жыл бұрын
You are one of the best teacher ever got taught !
@dataschool
@dataschool 6 жыл бұрын
Thanks so much! :)
@guptaachin
@guptaachin 6 жыл бұрын
You are the best Kevin. I always find the most relevant stuff in your videos.
@dataschool
@dataschool 6 жыл бұрын
Thanks Achin!
@dishonfano7599
@dishonfano7599 6 жыл бұрын
My friend...Thanks a lot..This is the best introduction to machine learning I have ever come across...Please do a deep learning tutorial...Again thanks a lot.
@dataschool
@dataschool 5 жыл бұрын
Thanks very much for your kind words! I really appreciate it.
@ankrish8692
@ankrish8692 6 жыл бұрын
this is the best speed to make a beginner understand the terminologies one by one ... i really appreciate and thankful yo you for this video . i have seen some videos and i was not able to get wat they were saying bcoz of the speed .... thanks...!!!!!!
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@robindong3802
@robindong3802 7 жыл бұрын
you are one of the best instructors online, thank you so much.
@dataschool
@dataschool 7 жыл бұрын
Wow, thanks so much for your kind comment! :)
@dataschool
@dataschool 6 жыл бұрын
*Note:* This video was recorded using Python 2.7 and scikit-learn 0.16. Recently, I updated the code to use Python 3.6 and scikit-learn 0.19.1. You can download the updated code here: github.com/justmarkham/scikit-learn-videos
@terryxie1929
@terryxie1929 6 жыл бұрын
thanks a lot for your job
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@a.n.7338
@a.n.7338 5 жыл бұрын
Hi i have trained my model using NN and model is saved so how can i use model to classify images?
@tomparatube6506
@tomparatube6506 5 жыл бұрын
I'm running the latest Anaconda 1.9.7 Jupyter Notebook server 5.7.8, Python 3.6.5, iPython 7.4. Upon hitting Run for the 1st two lines of code: "from IPython.display import IFrame IFrame('archive.ics.uci.edu/ml/machine-learning-databases/iris/iris.data', width=300, height=200)" Jupyter doesn't run and outputs the columns of numbers like in the video, but asks for "iris.data". What should I do? Your Pandas videos have helped so much. I'm enrolled at DataQuest but have been considering enrolling in yours too. Thanks Kevin.
@aquaman788
@aquaman788 4 жыл бұрын
@@dataschool Can we also have a lecture for TensorFlow?
@brunofazoli1
@brunofazoli1 7 жыл бұрын
Amazing explanation! I'm so excited to finish the series! Congrats!
@dataschool
@dataschool 7 жыл бұрын
Thanks very much! Glad you are enjoying the series :)
@saraths9044
@saraths9044 3 жыл бұрын
Please keep on making videos of the same quality. Thank you so much
@dataschool
@dataschool 3 жыл бұрын
Thanks!
@RohitShukla-mm3gz
@RohitShukla-mm3gz 4 жыл бұрын
Such an amazing video. I have never seen this type of clear video. I understand many things. Thanks a lot, Please make a video on unsupervised learning also.
@dataschool
@dataschool 4 жыл бұрын
Thanks for your suggestion!
@ralfmatulat
@ralfmatulat 8 жыл бұрын
This whole series is helpful and fun to watch. Thanks!
@dataschool
@dataschool 8 жыл бұрын
That's excellent to hear. Thanks for watching!
@srinidhibandi2313
@srinidhibandi2313 Жыл бұрын
It is because of these guys we are able to learn Machine Learning concepts so clearly and easily🎉🎉❤❤
@dataschool
@dataschool Жыл бұрын
Thank you so much!
@srinidhibandi2313
@srinidhibandi2313 Жыл бұрын
@@dataschool Thanks to you Sir!
@paolosalamon
@paolosalamon 7 жыл бұрын
One of the best tutorial I ever seen. I love your speech also.
@dataschool
@dataschool 7 жыл бұрын
Thanks! :)
@paolosalamon
@paolosalamon 7 жыл бұрын
Hi. Are you going to make some new paid course?
@dataschool
@dataschool 7 жыл бұрын
I am continuing to work on both free content and paid content. Stay tuned!
@satyakiguha415
@satyakiguha415 9 жыл бұрын
finding these tutorials very interesting.....do continue putting them up...thanks a lot
@dataschool
@dataschool 9 жыл бұрын
***** You're very welcome!
@sneharane2596
@sneharane2596 4 жыл бұрын
Very well explained, you are a great teacher! Loving this series !
@Abhay17291
@Abhay17291 7 жыл бұрын
Thank you for all these videos, Kevin! Very clear and easily understandable.
@dataschool
@dataschool 7 жыл бұрын
You're very welcome! :)
@talkingaboutitinaeasyway5067
@talkingaboutitinaeasyway5067 5 жыл бұрын
Thank you very much. Your videos really help me understand ML deeply.
@dataschool
@dataschool 5 жыл бұрын
That's great to hear! 🙌
@hemenboro4313
@hemenboro4313 4 жыл бұрын
its pretty clear and precise explanation. Thanks for making such videos and keep us educated @data school
@avasararate9271
@avasararate9271 6 жыл бұрын
You're just awesome...best videos in recent times...like your way of explanation and please do continue teaching and sharing your knowledge...peace..
@dataschool
@dataschool 6 жыл бұрын
Thanks so much for your kind words! :)
@alitanwir3372
@alitanwir3372 8 жыл бұрын
Kevin, your a great teacher, your explanations are top notch! Subbed on the channel and the news letter! Thanks a lot! :)
@dataschool
@dataschool 8 жыл бұрын
Wow, thanks so much! Great to hear :)
@victoreirekponor6052
@victoreirekponor6052 7 жыл бұрын
Mr Kevin, I really appreciate this tutorials. I hope to become as good as you are some day..
@dataschool
@dataschool 7 жыл бұрын
You're very welcome!
@tgbaozkn
@tgbaozkn 5 жыл бұрын
your pronuncation is awesome ,im really understand because of you thanks a lot teacher !
@dataschool
@dataschool 5 жыл бұрын
Thanks!
@arjunpukale3310
@arjunpukale3310 6 жыл бұрын
Thank u very much I wanted to start off with ML and tried many tutorials but all were very fast. But u explained each line very nicely
@dataschool
@dataschool 6 жыл бұрын
Great to hear!
2 жыл бұрын
¡Gracias!
@dataschool
@dataschool 2 жыл бұрын
Wow, thank you so much Luis! I truly appreciate it! 🙏
@omarnassor5259
@omarnassor5259 8 жыл бұрын
very simple and straight forward, thank you data school.
@dataschool
@dataschool 8 жыл бұрын
You're welcome!
@JoaoVitorBRgomes
@JoaoVitorBRgomes 3 жыл бұрын
Kevin, you said you don't know how well your model do on new data, but when you test your model with predict on the test data, I think it is standard to evaluate the accuracy (or any other metric) of your model.
@dataschool
@dataschool 3 жыл бұрын
To be clear, if we are talking about truly "new" data, meaning out-of-sample data, then you actually don't know the true target values, and thus there's no way to check how accurate your model was with those samples. Hope that helps!
@JoaoVitorBRgomes
@JoaoVitorBRgomes 3 жыл бұрын
@@dataschool Ah ok, thanks for elaborating. Yes, indeed, e g. a new client asking for a loan (default or not)
@rishabbamrara5072
@rishabbamrara5072 7 жыл бұрын
Very very good and easy to learn lectures. Thank you..
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@tomasemilio
@tomasemilio 8 жыл бұрын
This is great man, I am watching this in x2 speed, haha.
@dataschool
@dataschool 8 жыл бұрын
Great!
@saurabhkhodake
@saurabhkhodake 7 жыл бұрын
same here
@maxinteltech3321
@maxinteltech3321 5 жыл бұрын
Exactly that's why it is so understandable
@sandeepgautam2465
@sandeepgautam2465 5 жыл бұрын
it only worked when i used two square bracket knn.predict([[3,5,4,2]])
@dataschool
@dataschool 5 жыл бұрын
Right. See here for an explanation: www.dataschool.io/how-to-update-your-scikit-learn-code-for-2018/#only2ddataarrayscanbepassedtomodels
@hfsbhat
@hfsbhat 4 жыл бұрын
Thanks Sandeep
@yuvaraj2457
@yuvaraj2457 3 жыл бұрын
op = [[1.77, 2.55],] linreg.predict(op) this also works. it is expecting a 2d arrays but i dont know y. adding a comma next to a list makes sense
@udaymallam43
@udaymallam43 6 жыл бұрын
Great explanation, simple & effective, Big Thank you for the videos
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@JoeG2324
@JoeG2324 7 жыл бұрын
why would anyone dislike this video?
@dataschool
@dataschool 7 жыл бұрын
Ha! I ask myself that same question :)
@rmehdi5871
@rmehdi5871 7 жыл бұрын
probably some other e-learn teaching competitors :)
@TheAlderFalder
@TheAlderFalder 5 жыл бұрын
They hate-im, cus they ain't-im. ;)
@bardamu9662
@bardamu9662 5 жыл бұрын
@@dataschool Certainly because they are not using the appropriate classifier for high-end teaching videos :-) Congrats for your video series: very instructive, clearly articulated (pondering theory and examples) and with perfect emphasis in critical points. As a well-known French philosopher used to say: "whatever is well conceived is clearly said and the words to say it flow with ease". Bravo Kevin! Very talented teacher!
@aquaman788
@aquaman788 4 жыл бұрын
@@bardamu9662 very good ML lecture!!!!
@hectoralvarorojas1918
@hectoralvarorojas1918 7 жыл бұрын
This video series is the best I have watched about scikit-learn so far. By the time I finished watching all the videos; I will let you know my comments. At this time I am just wondering if you have in mind to something similar but based on the R-project platform. I mean, to go over the principal supervised and unsupervised machine learning methods but sin R-project. Are you planning to do this?
@dataschool
@dataschool 7 жыл бұрын
Glad you like the videos! To answer your question, I'm not planning on making any more videos about R at this time.
@hectoralvarorojas1918
@hectoralvarorojas1918 7 жыл бұрын
How about other machine learning models using python scikit-learn like tree models, cluster models, SVM model and Neural networks?
@dataschool
@dataschool 7 жыл бұрын
I do plan on covering more machine learning in Python in the future! :)
@hectoralvarorojas1918
@hectoralvarorojas1918 7 жыл бұрын
Great! I hope you can cover other algorithms like SVM, Decision Trees, Random forest and, Discriminant Analysis (DA). At the same time, by the time we use linear regression, Logistics regression and DA, for instance, sometimes we need to tune our models to check if they are in line with the assumptions. It would be nice if you can consider those topics in the next video series about machine learning and scikit-learn. How can we get the output graphs in all the models too? For instance: Tree graphs; ROC curve graph, etc. I am already working on it by myself (a lot of googling and reading work so far) but it would be great to have this add-in from you too. You are a great teacher, very precise and direct. Besides, you gave very good additional support in your notes that follows each video. So, it is very easy to follow your examples and recommendations. I hope you can get the new video series done soon. My best regards!
@dataschool
@dataschool 7 жыл бұрын
Thanks so much for your detailed suggestions, and your kind comments! I really appreciate it and will certainly consider your suggestions.
@MJ-em_jay
@MJ-em_jay 8 жыл бұрын
Very clear and easy to follow. Thanks!
@dataschool
@dataschool 8 жыл бұрын
Excellent! You're very welcome!
@Bena_Gold
@Bena_Gold 6 жыл бұрын
Just to clarify ... it's a "(n_samples, n_features) matrix" ... not a "feature matrix" as you simply put ... great video ... thumbs up ...
@dataschool
@dataschool 6 жыл бұрын
The scikit-learn documentation refers to it as a "feature matrix", thus I do as well. Calling it a "feature matrix" indicates that it's made up of features, and it's 2-dimensional, and it's implied that the other dimension is the samples.
@sravankumar5017
@sravankumar5017 4 жыл бұрын
Tq sir for your great explanation it will bring confidence in us that we can learn ml
@thisisgurkaran
@thisisgurkaran 7 жыл бұрын
Great video man. You are hero of humanity.
@dataschool
@dataschool 7 жыл бұрын
Wow, thank you! :)
@taor412
@taor412 9 жыл бұрын
Thank you a lot for giving such a great video for beginner! Very thanks!
@dataschool
@dataschool 9 жыл бұрын
+TA OR You're very welcome!
@transmatter99
@transmatter99 8 жыл бұрын
You're very helpful and intelligent. thank you for these very polished videos.
@dataschool
@dataschool 8 жыл бұрын
You're very welcome!
@harshitsharma589
@harshitsharma589 4 жыл бұрын
Thank you thank you .. I was having some doubts in concept and now it's cleared. I request you to please make some video on data normaliser
@aamirkhan7201
@aamirkhan7201 Жыл бұрын
00:04 Introduction to K Nearest Neighbors (KNN) classification model 02:13 K-nearest neighbors classification model works by selecting the nearest observations and using the most popular response value. 04:44 KNN is a simple machine learning model that predicts the response value based on the nearest neighbor. 07:26 The first step is to import the relevant class and the second step is to instantiate the estimator. 10:11 Training a machine learning model with scikit-learn 12:44 The predict method returns a numpy array with the predicted response value 15:05 Different models can be easily trained using scikit-learn 17:22 Understanding nearest neighbor algorithms and class documentation Crafted by Merlin AI.
@dataschool
@dataschool Жыл бұрын
Thanks for sharing! 🙌
@levon9
@levon9 4 жыл бұрын
Very helpful and clear - thank you, incl the updated notebooks. Toward the end (t=15:40), using the logreg.fit(X, y) function results in a "/usr/local/lib/python3.6/dist-packages/sklearn/linear_model/_logistic.py:940: ConvergenceWarning: lbfgs failed to converge (status=1): STOP: TOTAL NO. of ITERATIONS REACHED LIMIT." warning and a result of [0, 0] even with the updated code. Any suggestions? Changing max_iter to 500 gets rid of the warning, but still ends up with [0, 0] rather than [2. 0] as shown in the video. Any suggestions? I'm using Colab notebooks.
@dataschool
@dataschool 4 жыл бұрын
The default solver for LogisticRegression has changed from liblinear to lbfgs. If you change it back to liblinear, it will converge. Try: logreg = LogisticRegression(solver='liblinear') before fitting. Hope that helps!
@vishwass9491
@vishwass9491 8 жыл бұрын
clear cut and to the point .thanks.
@dataschool
@dataschool 8 жыл бұрын
+vishwas s You're welcome!
@HarshaXSoaD
@HarshaXSoaD 8 жыл бұрын
Very effective tutorial series
@dataschool
@dataschool 8 жыл бұрын
+HarshaXSoaD Thanks! Glad it's helpful to you.
@NazarTropanets
@NazarTropanets 6 жыл бұрын
You are great teacher! Thank you very much!
@dataschool
@dataschool 6 жыл бұрын
You're very welcome!
@khanhdo3988
@khanhdo3988 8 жыл бұрын
Keep up the amazing work!
@dataschool
@dataschool 8 жыл бұрын
Thanks!
@pondapelagechandima8186
@pondapelagechandima8186 6 жыл бұрын
Hello, Thanks for explaining such topics in clean manner. Could you please do a explanation for categorical and numerical NAN values imputations? ( How to handle NAN in Machine Learning ? )
@dataschool
@dataschool 6 жыл бұрын
Thanks for your suggestion!
@yashgos
@yashgos 7 жыл бұрын
Thanks for another great video. I have one conceptual question regarding implementing logistic regression as shown in the video. What I understand is logistic regression is used where the outcome is binary (for example, A or B), In Iris dataset the outcome can be from 3 categories so how does logistic regression work here.
@dataschool
@dataschool 7 жыл бұрын
That's really a mathematical question rather than a conceptual question. Logistic regression can be used for multiclass problems, and a few details are here: scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html You can learn more about this topic by searching for multinomial logistic regression. Hope that helps!
@oguzcanyavuz8069
@oguzcanyavuz8069 8 жыл бұрын
Hello again, your tutorials are awesome. I have an error at here: In [8]:knn.predict([3, 5, 4, 2]) /usr/local/lib/python3.5/dist-packages/sklearn/utils/validation.py:386: DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17 and willraise ValueError in 0.19. Reshape your data either using X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1) if it contains a single sample. DeprecationWarning) Out[8]:array([2]) But i still get the correct output. I think it is related to numpy. Where should i use the numpy and how exactly? Or should i just ignore it?
@dataschool
@dataschool 8 жыл бұрын
You bring up a great point! It's a long explanation: The 0.17 release of scikit-learn included the following change: "Passing 1D data arrays as input to estimators is now deprecated as it caused confusion in how the array elements should be interpreted as features or as samples. All data arrays are now expected to be explicitly shaped (n_samples, n_features)." Here's what that means: When you pass data to a model (an "estimator" in scikit-learn terminology), it is now required that you pass it as a 2D array, in which the number of rows is the number of observations ("samples"), and the number of columns is the number of features. In this example, I make a prediction by passing a Python list to the predict method: knn.predict([3, 5, 4, 2]). The problem is that the list gets converted to a NumPy array of shape (4,), which is a 1D array. Because I wanted scikit-learn to interpret this as 1 sample with 4 features, it now requires a NumPy array of shape (1, 4), which is a 2D array. There are three separate ways to fix this: 1. Explicitly change the shape to (1, 4): import numpy as np X_new = np.reshape([3, 5, 4, 2], (1, 4)) knn.predict(X_new) 2. Tell NumPy that you want the first dimension to be 1, and have it infer the shape of the second dimension to be 4: import numpy as np X_new = np.reshape([3, 5, 4, 2], (1, -1)) knn.predict(X_new) 3. Pass a list of lists (instead of just a list) to the predict method, which will get interpreted as having shape (1, 4): X_new = [[3, 5, 4, 2]] knn.predict(X_new) Solution #2 is scikit-learn's suggested solution. Solution #3 is the simplest, but also the least clear to someone reading the code.
@oguzcanyavuz8069
@oguzcanyavuz8069 8 жыл бұрын
Thanks for the explanation. I think i will use second option. Which one you are using? :)
@dataschool
@dataschool 8 жыл бұрын
If I'm writing code for myself, I use option #3, otherwise I use option #2.
@carlomott
@carlomott 8 жыл бұрын
Hi thanks for these video, they are amazing! One thing I noticed: it turns out that with the 0.17 release if you just type >>knn.predict(X_new) nothing will be output. My workaround is to type >>print knn.predict(X_new) >>[2] But I am not sure it is the best solution...
@dataschool
@dataschool 8 жыл бұрын
Glad the videos are helpful to you! I'm using scikit-learn 0.17, and I'm not seeing the behavior you are describing. Are you sure you're running exactly the same code, in exactly the same order I'm running it?
@khushalvyas5633
@khushalvyas5633 5 жыл бұрын
You are awesome! Thanks to you for making these videos.
@dataschool
@dataschool 5 жыл бұрын
You're very welcome!
@vishalbhosle6570
@vishalbhosle6570 6 жыл бұрын
Thank you, u teach in very simple words, I have a question when should we use "Decision Tree Classifier" and when should we use K Neighbors Classifier ?
@dataschool
@dataschool 6 жыл бұрын
This might be helpful: www.dataschool.io/comparing-supervised-learning-algorithms/
@dimiro1
@dimiro1 9 жыл бұрын
Very good explanation. Thank you.
@dataschool
@dataschool 9 жыл бұрын
***** You're very welcome!
@alanjoseph3190
@alanjoseph3190 5 жыл бұрын
Thank you sir for your vedio on machine learning.Sir,i got an erro saying expected 2d array,got 1d array instead when using knn.predict. Nb: got the outpu when it is knn.predict( [ [ 3,5,4,2 ] ] ). 2 [ is used
@alanjoseph3190
@alanjoseph3190 5 жыл бұрын
@Salma FRIKHA use 2 brackets as given in my comment.it worked for me.but i dont know how he got.maybe he used numpy
@dataschool
@dataschool 5 жыл бұрын
See this blog post for a detailed explanation: www.dataschool.io/how-to-update-your-scikit-learn-code-for-2018/#only2ddataarrayscanbepassedtomodels
@danniecutts6221
@danniecutts6221 11 ай бұрын
Great communicator! Thanks!
@dataschool
@dataschool 11 ай бұрын
Thank you!
@ashutoshsingh-de5gp
@ashutoshsingh-de5gp 8 жыл бұрын
Thanks for providing such a good tutorial..... But it is helpful if you increase the font size of python code because it is hard to read if one view these videos on mobile... So please increase the font size...... Thanks
@dataschool
@dataschool 8 жыл бұрын
Thanks for the suggestion. I'll consider it for the future!
@dheerajsonawane6738
@dheerajsonawane6738 7 жыл бұрын
Great videos, as great talent and easy method to teach, thanks!
@dataschool
@dataschool 7 жыл бұрын
You're welcome!
@MrMikeWyn
@MrMikeWyn 8 жыл бұрын
Great series of video. Thanks.
@dataschool
@dataschool 8 жыл бұрын
You're welcome! Glad you are enjoying them :)
@jbowater
@jbowater 8 жыл бұрын
Really enjoying this series! Thanks for creating it. Do you know where I might find the code for making one of the lovely classification maps you show at e.g. 4.15?
@dataschool
@dataschool 8 жыл бұрын
I'm sorry, I don't know how those classification maps were made! If you find a way, feel free to let me know :)
@jbowater
@jbowater 8 жыл бұрын
OK, will do! Thanks for letting me know!
@jbowater
@jbowater 8 жыл бұрын
This solution: scikit-learn.org/stable/auto_examples/neighbors/plot_classification.html works straight out of the box for Iris data, though sadly struggling to adapt it to my dataset.
@dataschool
@dataschool 8 жыл бұрын
Thanks for sharing! Keep in mind that this technique will only work with two features.
@MohammedTamali
@MohammedTamali 8 жыл бұрын
Thanks for the helpful videos It's really a good tutorial for learn safe machine learning.
@dataschool
@dataschool 8 жыл бұрын
+Mohammed Tamali You're very welcome! Good luck with your studies.
@alialsaady5
@alialsaady5 6 жыл бұрын
Hi, thank you for your explanation it's very clear. But there is something I don't really understand. At 4:00 you say that the data is represented by 2 numerical features, so you have two axes X and Y. But what if there were more features like the iris dataset. How does NNB work in that case? Is it taking the same steps as you explain in this video, but not on a 2D graph but 4D graph?
@dataschool
@dataschool 6 жыл бұрын
I don't know how to explain this briefly, I'm sorry!
@alialsaady5
@alialsaady5 6 жыл бұрын
@@dataschool That is very unfortunate to hear. Is it possible if we make an appointment via skype? I do not get out and need this information for my thesis. I hope you can help me with it.
@dataschool
@dataschool 6 жыл бұрын
I don't work with anyone one-on-one, I'm sorry! However, you are welcome to join Data School Insiders and ask a question during a live webcast or on the private forum: www.patreon.com/dataschool - I prioritize answering questions from Insiders because they are investing in me.
@GenzoVandervelden
@GenzoVandervelden 8 жыл бұрын
These video's are really great, thanks!
@dataschool
@dataschool 8 жыл бұрын
You're welcome!
@EyeIn_The_Sky
@EyeIn_The_Sky 7 жыл бұрын
Hi Kevin, Great explanation and step by step guide. One thing that I can't fully grasp is that the "y" is supposed to be the vector but I don't get how the ".fit" works. I can understand that the X is the matrix with the column headings but I don't get what y is for in the fit.(X,y) as y is the "answer" or label/target.
@dataschool
@dataschool 7 жыл бұрын
The fit method learns the relationship between the input (X) and the output (y). Hope that helps!
@subodhmantri8365
@subodhmantri8365 7 жыл бұрын
Hi Kevin, you are amazing teacher and possess excellent voice modulation, which keep me continuing on the series. Thanks a ton! I have a question though, Is machine learning related only to extracting data from computers? What I mean is, with advent of IoT, lot of devices are getting into human life, hence lots of data accompanied. What about this data? Not necessary all these data get transferred to computers!
@dataschool
@dataschool 7 жыл бұрын
Thanks for your kind comments! Regarding your question, any kind of data can be used for machine learning, regardless of how it is generated. Hope that helps!
@sjljc2019
@sjljc2019 4 жыл бұрын
Hi, I am getting an error as "STOP: TOTAL NO. of ITERATIONS REACHED LIMIT." and the predictions is coming as array([0, 0]). Any help would be appreciated.
@alenaosipova4660
@alenaosipova4660 4 жыл бұрын
try logreg = LogisticRegression(solver = 'liblinear') instead
@thiennguyen9186
@thiennguyen9186 4 жыл бұрын
@@alenaosipova4660 Thanks a lot, that helps.
@dataschool
@dataschool 4 жыл бұрын
Exactly!
@eldert1735
@eldert1735 5 жыл бұрын
Thank you for the video. It is really well-explained. I got several future warnings with LogisticRegression. When I used the fit method, it says that the default solver would change to 'lbfgs' and that I should specify a solver. Also, I got a warning that the default multi_class is also going to change to 'auto' and that I have to specify the multi_class myself. Even after I specify these two, I get a ConvergenceWarning, claiming that lbfgs failed to converge. I am new to machine learning, and I don't know what to do. Can you please tell me what I can do to solve these warnings?
@dataschool
@dataschool 5 жыл бұрын
You're doing the right thing! I would just try a different solver. Sorry, I know these warnings can be hard to understand.
@sreekrishnanr1812
@sreekrishnanr1812 6 жыл бұрын
Superb explanation thank u 😊😊
@dataschool
@dataschool 6 жыл бұрын
Thanks!
@bevansmith3210
@bevansmith3210 7 жыл бұрын
Thanks Kevin, these are really great!
@dataschool
@dataschool 7 жыл бұрын
You are very welcome!
@jerinjohn9518
@jerinjohn9518 7 жыл бұрын
Clear to understand. Thanks kevin
@dataschool
@dataschool 7 жыл бұрын
You're welcome!
@Dynamite_mohit
@Dynamite_mohit 4 жыл бұрын
Thankyou , very helpfull resources. You are awsome
@dataschool
@dataschool 4 жыл бұрын
Thank you!
@elilavi7514
@elilavi7514 9 жыл бұрын
Thanks for material ! One question : In the video you try to solve classification problem with regression model , if I recall correctly from previous video , the regression models are good for regression problems and classification models for classification problems . Is there any criteria when I can choose with confidence regression model to solve classification problem ?
@dataschool
@dataschool 9 жыл бұрын
Eli Lavi Actually, "logistic regression" is a classification model (not a regression model), despite its name! That's why I used logistic regression in this case. There are some limited circumstances in which regression models can be used to solve classification problems, but it usually doesn't make sense. I wouldn't worry about it for now... that's a very advanced technique. Does that answer your question?
@memensalihi4456
@memensalihi4456 4 жыл бұрын
can you explain the Dynamic time wraping for time series with example please
Comparing machine learning models in scikit-learn
26:42
Data School
Рет қаралды 187 М.
How do I encode categorical features using scikit-learn?
27:59
Data School
Рет қаралды 140 М.
Sigma Kid Mistake #funny #sigma
00:17
CRAZY GREAPA
Рет қаралды 30 МЛН
When you have a very capricious child 😂😘👍
00:16
Like Asiya
Рет қаралды 18 МЛН
AI Is Making You An Illiterate Programmer
27:22
ThePrimeTime
Рет қаралды 247 М.
A/B Testing in R with Elea McDonnell Feit
1:50:10
R-Ladies Philly
Рет қаралды 5 М.
Jake VanderPlas: Machine Learning with Scikit Learn
1:34:37
PyData
Рет қаралды 45 М.
ROC Curves and Area Under the Curve (AUC) Explained
14:06
Data School
Рет қаралды 494 М.
But what is a neural network? | Deep learning chapter 1
18:40
3Blue1Brown
Рет қаралды 18 МЛН
Getting started in scikit-learn with the famous iris dataset
15:26
Data School
Рет қаралды 254 М.
Selecting the best model in scikit-learn using cross-validation
35:54
How to find the best model parameters in scikit-learn
27:46
Data School
Рет қаралды 154 М.