How to do the Titanic Kaggle Competition

75,712 views

Aladdin Persson

Days ago

This video is for those who want to get started doing #kaggle.
✨ Free Resources that are great:
NLP: web.stanford.e...
CV: cs231n.stanford...
Deployment: fullstackdeepl...
FastAI: www.fast.ai/
GitHub Repository:
github.com/ala...

Comments: 80
@OSShubho 3 years ago
Thanks for sharing this simple, elegant, beginner-friendly code. Your approach is very clear and understandable.
@grantfoster2663 2 years ago
Really helpful to see someone work through a really simple solution, as someone moving to Python from R!
@suhass6628 3 years ago
As for the 100% people, rumour has it that some of them got the passenger info from the actual Titanic records, which are publicly available. So that would obviously give 100%.
@AladdinPersson 3 years ago
Makes sense!
@viralmedia.007 9 months ago
So are Kaggle competitions genuine? I always wonder how people get 100% correct predictions, or is this specific to this competition only? Moreover, they come with such huge prize pools.
@suhass6628 9 months ago
@@viralmedia.007 Yes, the actual competitions that have prize money are very genuine. The rigged ones are usually very basic, or ones for which the data is already publicly available.
@ccuuttww 3 years ago
100% means overfitting. Of course you can do more to boost your performance: PCA, boost sampling, cross validation, even prior parameters.
@AladdinPersson 3 years ago
I agree, you can try a lot more to make it even better. For this one I tried to keep it minimal and simple.
@adilsonmedronha706 2 years ago
Actually, it is not overfitting, because this accuracy was measured on the test set (unseen data), not the train set.
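Cross validation, mentioned in the thread above, is one way to estimate accuracy on unseen data without spending a leaderboard submission. A minimal sketch, assuming scikit-learn is installed; the data is synthetic (generated with make_classification as a stand-in, not the Titanic set), and the 7 features and 5 folds are arbitrary choices for illustration:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

# Synthetic stand-in data (NOT the Titanic set): 200 rows, 7 numeric features.
X, y = make_classification(n_samples=200, n_features=7, random_state=0)

clf = LogisticRegression(max_iter=1000)

# 5-fold cross validation: each fold is held out once while the model
# is trained on the remaining four folds.
scores = cross_val_score(clf, X, y, cv=5, scoring="accuracy")

print("per-fold accuracy:", scores)
print("mean accuracy:", scores.mean())
```

A large gap between training accuracy and the mean fold accuracy is the usual sign of overfitting; a high score on held-out data alone is not.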
@shaikhkashif9973 1 year ago
Bro, for *Embarked* you should go for nominal encoding, not label encoding, because the values are names of ports.
@hailongnguyen7176 1 year ago
Yeah, you're right.
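The nominal (one-hot) encoding suggested above could look roughly like this. It is a sketch on toy rows, assuming pandas is installed and that Embarked holds the port codes S, C and Q; it is not the code from the video:

```python
import pandas as pd

# Toy rows using the three port codes; not the real dataset.
df = pd.DataFrame({"Embarked": ["S", "C", "Q", "S"]})

# One indicator column per port, so the model sees no artificial
# ordering such as C < Q < S, which integer labels would imply.
encoded = pd.get_dummies(df, columns=["Embarked"], prefix="Embarked")

print(encoded.columns.tolist())
```

Each row ends up with exactly one active indicator column, at the cost of a few extra features.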
@gauravms6681 3 years ago
Remember me when this channel hits big : )
@jose3538 3 years ago
Remember me too!
@suhass6628 3 years ago
Well done, mate! Thanks for this. Hopefully you will do more Kaggle stuff. Will follow everything.
@AladdinPersson 3 years ago
Yeah, for sure. Got another video coming soon on a slightly more advanced competition.
@jayz4581 2 years ago
People get 100% because this dataset is such a classic, and they keep finding the best features or maybe use ensemble methods. But your intro makes it so straightforward for me to start on Kaggle. Thanks!
@yuliusharjoseputro2069 3 years ago
Hi, thanks for your tutorial. I've implemented your code, but why is the accuracy I got different from yours?
@fentazimohamedreadh5274 1 year ago
Thank you so much!!! It was really helpful to get started in Kaggle competitions^^
@JoaoPecorella 21 days ago
After 17 submissions, I'll watch your video, sir!
@JoaoPecorella 21 days ago
Around 75% accurate, best so far is 78!
@aexairkeys 3 years ago
awesome job! love the simplicity. keep going!
@mukundkrishna2789 2 years ago
For logistic regression, isn't it necessary to do feature scaling before training? When I searched online, it said that we should do feature scaling for logistic regression.
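On the scaling question: logistic regression does not strictly require scaled features, but scikit-learn's LogisticRegression applies L2 regularization by default and its solver tends to converge more easily on standardized inputs, so scaling is a common default. A minimal sketch, assuming scikit-learn is installed; the data is synthetic and the inflated first column is only there to mimic features on very different scales:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in data (NOT the Titanic set).
X, y = make_classification(n_samples=200, n_features=7, random_state=0)
X[:, 0] *= 1000.0  # exaggerate one feature's scale for illustration

# The pipeline learns the scaling statistics during fit and reuses
# them unchanged whenever predict is called.
model = make_pipeline(StandardScaler(), LogisticRegression())
model.fit(X, y)
preds = model.predict(X)

print(preds[:10])
```

Wrapping the scaler in a pipeline keeps the test data from influencing the scaling statistics.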
@deepudeepak1390 3 years ago
I took the same approach when I started my Kaggle journey 😀 ... A request from my side: please make some videos on transfer learning in natural language processing. Thank you.
@LeonidasParigoris 1 year ago
Thanks for this! I have a question: at 3:45, how are you able to avoid writing the whole directory of the file and just say "train.csv", instead of writing the whole snake of the directory, e.g. "C:\\Users\\etcetc\\Python\\titanic\\train.csv"?
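A bare filename such as "train.csv" is a relative path, and Python resolves it against the current working directory, which for a Jupyter notebook is normally the folder the notebook was started in. That is presumably why the short form works in the video. A small standard-library sketch for checking where a relative path actually points:

```python
from pathlib import Path

# Relative paths are resolved against the current working directory.
print("working directory:", Path.cwd())

csv_path = Path("train.csv")

# The absolute location Python would look in for this file.
print("resolves to:", csv_path.resolve())

# False means the file is not in the working directory, so either move
# the file there or pass the full path instead.
print("file found:", csv_path.exists())
```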
@classicemmaeasy2292 1 year ago
Very short, simple and explanatory, but you use machine learning techniques all the way through; you don't really explore and visualize the data. This video is awesome by the way, and beginner-friendly.
@adayinthelife5496 1 year ago
I think your code is excellent, but it freaks me out how many data scientists only see their accuracy as a result. Understanding and presenting the results in a meaningful way is key to any science. So... who was likely to survive??
@ouhjnadmacabenta3054 2 years ago
Hi bro, how did you set up the CSV file in Jupyter? My CSV file was not defined. Thanks.
@danilomontalvo5756 3 years ago
Everything else works for me except predictions. When getting to 14:43 it just says "AttributeError: 'function' object has no attribute 'predict'".
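The thread does not say what caused this AttributeError, so the following is only one plausible reproduction: the model variable ends up bound to a function instead of a fitted model, for example because parentheses were left off a call. TinyModel and build_model are hypothetical names made up for this sketch:

```python
class TinyModel:
    """Hypothetical stand-in for a classifier; predicts the majority label."""

    def fit(self, X, y):
        self.majority_ = max(set(y), key=y.count)
        return self

    def predict(self, X):
        return [self.majority_ for _ in X]


def build_model():
    # Train on three toy rows and return the fitted model.
    return TinyModel().fit([[0], [1], [2]], [0, 1, 1])


clf = build_model  # bug: no parentheses, so clf is the function itself
try:
    clf.predict([[3]])
except AttributeError as err:
    message = str(err)
    print(message)

clf = build_model()  # fix: call the function so clf is the fitted model
predictions = clf.predict([[3]])
print(predictions)
```

Printing type(clf) just before the failing line is a quick way to confirm what the variable really holds.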
@gurudevdatta3960 3 years ago
I'm getting an error while splitting the data. Can you help me? Or, if you don't mind, can you send your number, please, so I can send a screenshot to you?
@Honest_Reply900 2 years ago
Well done. Thanks for your efforts! 100% accuracy? I am sure they have cheated :)
@AIPlayerrrr 3 years ago
Planning to do more real ones in the future?
@AladdinPersson 3 years ago
yes
@AladdinPersson 3 years ago
Got any ideas of some you think would be useful?
@AIPlayerrrr 3 years ago
I've watched a lot of your videos and I think you are very likely to place high, as you are really knowledgeable. You explain things very well. I think you can try the recent human protein competition. It's a fun weakly supervised classification problem.
@talha_anwar 3 years ago
Upvoted
@Borzacchinni 2 years ago
Thanks for the video! Do you happen to be from Norway perhaps?
@minhlong1920 3 years ago
Thank you sm dude
@pranjalsingh1389 1 year ago
Why did we not use fit_transform on the test set?
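A likely answer to the fit_transform question: fitting learns the mapping from the training data, and the test data should be pushed through that same mapping. Refitting on the test column can silently assign different codes to the same value. A sketch with scikit-learn's LabelEncoder on toy port codes (assuming scikit-learn is installed; the values are made up for illustration):

```python
from sklearn.preprocessing import LabelEncoder

train_ports = ["S", "C", "Q", "S"]
test_ports = ["Q", "S"]

le = LabelEncoder()
# fit_transform learns the mapping from the training column: C->0, Q->1, S->2
train_codes = le.fit_transform(train_ports)
# transform reuses that mapping, so Q and S keep the codes 1 and 2
test_codes = le.transform(test_ports)

# Refitting on the test column alone learns a different mapping: Q->0, S->1
refit_codes = LabelEncoder().fit_transform(test_ports)

print(list(train_codes), list(test_codes), list(refit_codes))
```

With the refit version the model would read "Q" in the test set as the code it learned for "C" during training.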
@arnelecleir4876 2 years ago
In this case (using a regression), is it possible to just use Stata? I feel like most of the actions performed here would have been easier/quicker in Stata… I'm asking this since I know how to work with Stata and am currently learning data science via DataCamp/Kaggle and want to compare some tools :)
@RpSKhaira 1 year ago
Noob here, question: why did you clean your data through a function? Why not just run those exact commands outside of the function?
@timgen-iu1qo 1 year ago
I think it's because he had 2 tables with input data, and it's easier to write 1 function and call it 2 times than to write the algorithm 2 times and change something for each table.
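The point about reusing one function for both tables can be sketched like this. The cleaning steps, column choices and toy rows below are assumptions for illustration (assuming pandas is installed), not necessarily what the video does:

```python
import pandas as pd


def clean(data: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical cleaning steps, applied identically to any table."""
    data = data.drop(columns=["Name"])                       # drop a text column
    data["Age"] = data["Age"].fillna(data["Age"].median())   # fill missing ages
    data["Embarked"] = data["Embarked"].fillna("U")          # mark unknown port
    return data


# Toy stand-ins for train.csv and test.csv.
train = pd.DataFrame({
    "Name": ["a", "b", "c"],
    "Age": [22.0, None, 30.0],
    "Embarked": ["S", None, "C"],
    "Survived": [0, 1, 1],
})
test = pd.DataFrame({
    "Name": ["d", "e"],
    "Age": [None, 40.0],
    "Embarked": ["Q", "S"],
})

# One definition, two calls: both tables get exactly the same treatment.
train = clean(train)
test = clean(test)

print(train)
print(test)
```

If a cleaning step changes later, it only has to be edited in one place, which avoids the two tables drifting apart.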
@jinks6887 2 years ago
Thanks, I've subscribed. Very simple yet informative content.
@maitrijain7758 1 year ago
Your code gives an error when we predict on x test
@udbhavprasad3521 3 years ago
Can you make a video about XGBoost? There are not many resources for that.
@nabshieshty 11 months ago
nice vid, did my assignment with this
@AbdulRehman-nu2pb 2 years ago
Thank you so much for sharing this elegant, simple, beautifully written code. For a beginner, your code is a holy grail!!!
@LameGamerYT 2 years ago
GOD LEVEL VIDEO THANKS SO MUCH!
@sanskarram992 3 years ago
Very helpful for beginners... Thanks for such content.
@newkamphora 2 years ago
Thank you, very helpful ;)
@gezahagnnegash9740 2 years ago
Thanks a lot. As a beginner, it's helpful for me!
@maxvettel7337 1 year ago
This is what I really need as a beginner
@vishalgoklani 3 years ago
LogisticRegression??? where's the neural network? :)
@AladdinPersson 3 years ago
In the moment it felt like it would be overkill, in retrospect I regret it :3
@karlagonzalez6808 3 years ago
Do you know how to find the most popular name among male Titanic passengers?
@krnl1304 3 years ago
The one with the maximum frequency should be the most popular. So use count() and max().
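The count-and-max idea from the reply above might look like this in pandas. The sketch assumes names follow the "Surname, Title. Given names" pattern and uses a handful of toy rows rather than the full dataset; the regular expression is an illustrative choice and will miss unusual name formats:

```python
import pandas as pd

# Toy rows in the "Surname, Title. Given names" format.
df = pd.DataFrame({
    "Name": [
        "Braund, Mr. Owen Harris",
        "Kelly, Mr. James",
        "Moran, Mr. James",
        "Heikkinen, Miss. Laina",
    ],
    "Sex": ["male", "male", "male", "female"],
})

males = df[df["Sex"] == "male"]

# Capture the first given name that follows the title ("Mr.", "Master.", ...).
first_names = males["Name"].str.extract(r",\s*\w+\.\s*(\w+)", expand=False)

counts = first_names.value_counts()   # frequency of each first name
top_name = counts.idxmax()            # the name with the highest frequency
top_count = int(counts.max())

print(top_name, top_count)
```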
@mtk-0_0 1 year ago
Appreciate the good effort!
@abdoali-nl2yt 1 year ago
Thank you
@teamsonnyliston 2 years ago
Thanks a lot man, you helped me
@kefahelhelou9418 1 year ago
Thanks a lot
@MDEMANURRAHAMAN- 2 years ago
That was really helpful. Thanks
@magikarp1743 1 year ago
Can someone please help me out here? At 14:55, on running it shows "value error: X has 8 features per sample; expecting 7"
@timgen-iu1qo 1 year ago
I have the same error. Have you solved it yet?
@magikarp1743 1 year ago
@@timgen-iu1qo Yeah, I found my mistake... in the 2nd cell I wrote test = pd.read_csv("train.csv") instead of test = pd.read_csv("test.csv")... silly of me
@timgen-iu1qo 1 year ago
@@magikarp1743 IMAGINE, same mistake... Thanks 😂😂
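Loading train.csv twice presumably leaves the Survived column in the "test" table, which would explain the one extra feature. A cheap guard is to compare the column lists before predicting. check_test_columns is a hypothetical helper written for this sketch (assuming pandas is installed), and the frames are toy data:

```python
import pandas as pd


def check_test_columns(train, test, target="Survived"):
    """Raise early if `test` does not look like the unlabeled test table."""
    if target in test.columns:
        raise ValueError(
            f"'{target}' found in test data; was train.csv loaded twice?"
        )
    expected = [c for c in train.columns if c != target]
    if list(test.columns) != expected:
        raise ValueError(
            f"column mismatch: expected {expected}, got {list(test.columns)}"
        )


# Toy stand-ins for the two CSV files.
train = pd.DataFrame({"Pclass": [3, 1], "Age": [22.0, 38.0], "Survived": [0, 1]})
test = pd.DataFrame({"Pclass": [3], "Age": [34.5]})

check_test_columns(train, test)  # correct pairing: passes silently

try:
    check_test_columns(train, train)  # the mistake described in the thread
except ValueError as err:
    caught = str(err)
    print(caught)
```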
@sena1663 2 years ago
That was easy and helpful :) Thanx!!
@SussyBaka-ci5xi 1 year ago
helped a lot! thank you!
@kyoujinko 2 years ago
This made so much sense, thank you.
@Leopar525 2 years ago
I really like your style of thinking and explaining. Could you please advise on any (free or not) courses/articles or anything you believe is good for beginners?
@danasharon4752 2 years ago
Thank you!
@denisvoronov6571 2 years ago
Perfect for the beginner!
@juan.forero_ 3 years ago
Thank you bro!!
@Ajay_Pathak_ 3 years ago
I'm having errors while fitting the model. It says Float() must be str or .... Not method
@nevilleodhiambo1370 2 years ago
How did you handle this?
@maitrijain7758 1 year ago
I also get the same error
@prannayhemachandran966 1 year ago
Got a solution for this????
@Ajay_Pathak_ 1 year ago
I have no idea what I did after this error, I might have even left it entirely 😂, sorry guys!
@mehermanoj45 3 years ago
Please speedrun datasets like games 😂
@AladdinPersson 3 years ago
How do you mean? :P
@mehermanoj45 3 years ago
@@AladdinPersson Pick a random dataset and see how fast you can go from downloading to inference.