Titanic Dataset Analysis (Classification) | Kaggle | Machine Learning | Python

  Рет қаралды 23,447

Hackers Realm

Hackers Realm

Күн бұрын

⭐️ Content Description ⭐️
In this video, I have explained about titanic survival prediction dataset analysis using python. This is one of the basic kaggle project every beginner should try. I have done various techniques to explore the data and submitted the results to kaggle. This will be a end to end project for any competition.
Text-based Tutorial: www.hackersrealm.net/post/tit...
GitHub Code Repo: bit.ly/mlcoderepo
Dataset link: www.kaggle.com/c/titanic/data
🌐 Website: www.hackersrealm.net
🔔 Subscribe: bit.ly/hackersrealm
🗓️ 1:1 Consultation with Me: calendly.com/hackersrealm/con...
📷 Instagram: / aswintechguy
🔣 Linkedin: / aswintechguy
🎯 GitHub: github.com/aswintechguy
🎬 Share: • Titanic Dataset Analys...
⚡️ Data Structures & Algorithms tutorial playlist: bit.ly/dsatutorial
😎 Hackerrank problem solving solutions playlist: bit.ly/hackerrankplaylist
🤖 ML projects tutorial playlist: bit.ly/mlprojectsplaylist
🐍 Python tutorial playlist: bit.ly/python3playlist
💻 Machine learning concepts playlist: bit.ly/mlconcepts
✍🏼 NLP concepts playlist: bit.ly/nlpconcepts
🕸️ Web scraping tutorial playlist: bit.ly/webscrapingplaylist
Make a small donation to support the channel 🙏🙏🙏:-
🆙 UPI ID: hackersrealm@apl
💲 PayPal: paypal.me/hackersrealm
🕒 Timeline
00:00 Introduction to Titanic Dataset Analysis
05:47 Import Modules
07:00 Loading the Titanic Dataset
11:05 Exploratory Data Analysis
25:43 Data Preprocessing
37:27 Correlation Matrix of Titanic Data
43:15 Drop unnecessary columns
45:00 Label Encoding
47:06 Train Test Split
50:30 Model Training
01:01:46 Get Predictions from Test Data
01:04:51 Submit Predictions to Kaggle
01:11:51 End
#titanicdatasetanalysis #machinelearning #hackersrealm #classification #titanic #deeplearning #datascience #model #project #artificialintelligence #beginner #analysis #python #tutorial #aswin #ai #dataanalytics #data #bigdata #programming #datascientist #technology #coding #datavisualization #computerscience #pythonprogramming #analytics #tech #dataanalysis #iot #programmer #statistics #developer #ml #business #innovation #coder #dataanalyst

Пікірлер: 44
@anamaposadar
@anamaposadar Жыл бұрын
I really liked your explanation, everything was so clear!
@HackersRealm
@HackersRealm Жыл бұрын
Glad you liked it!!!😄
@syedhaideralizaidi1828
@syedhaideralizaidi1828 2 жыл бұрын
waiting !!!
@nidhik5270
@nidhik5270 Жыл бұрын
Thankyou so much..... It was so helpful ❤️
@HackersRealm
@HackersRealm Жыл бұрын
Happy to h3lp!!!
@Johan_S4
@Johan_S4 Жыл бұрын
55:56 i'm wondering why you delete the cv=5 on cross_val_score ? is it no longer neccesary? as few second before that it still exist. btw great video, i'm learning much from it, and oddly enough, when i tried your logistic regression it work just fine.
@anilsailakhinana94
@anilsailakhinana94 2 жыл бұрын
waiting
@alexnieto5036
@alexnieto5036 Жыл бұрын
very good work !
@HackersRealm
@HackersRealm Жыл бұрын
Glad you liked it!!!
@SaniaShaikh9202
@SaniaShaikh9202 10 ай бұрын
sns.countplot(train['Sex']) i get an error saying that it cannot convert string to float: 'male' and also during the exploratory data analysis i am only getting output graph for one value as in Axes: ylabel='count' it does not show the xlabel pls help
@taibaarain2000
@taibaarain2000 9 ай бұрын
Same happened with me in survived graph did you find any solution?
@digitalshaji2636
@digitalshaji2636 9 ай бұрын
when I am using a sns for survival visual graph it returns with and a graph filled with only blue color
@quotez4943
@quotez4943 3 ай бұрын
Same Issus Did you understand what is the probles
@patrick.bateman.psy1
@patrick.bateman.psy1 Ай бұрын
Yes same issue
@Akashvardhan._
@Akashvardhan._ Ай бұрын
Actually the problem is that the method he did is kind of old where it automatically detects 'X' axis to be 'Survived' column but when we try to do it now it won't work instead in the function call itself we should mention as sns.countplot(x=train['Survived']) ...('We are now manually mentioning that the X axis should be of the specified column')
@jitendratrivedi7889
@jitendratrivedi7889 2 жыл бұрын
Can you please explain why you use lower case (model x , y ) for CV whereas for LR ( model , X, y).
@HackersRealm
@HackersRealm 2 жыл бұрын
Both are in same case only, Please check!!!
@ST-nu6ib
@ST-nu6ib 2 жыл бұрын
How to fill NaN in age with respect to Survived column. Suppose if I want to fill age of a survived human with mean age of all survived humans, and fill age of a not survived human with age of not survived human, then how to do it?
@HackersRealm
@HackersRealm 2 жыл бұрын
you can use groupby function or pivot table function to get the mean of the category. you can check bigmart sales or black friday sales project videos for the methods
@jerryvictor3583
@jerryvictor3583 11 ай бұрын
Thank you for this. Pls I have a little challenge, when I get to LogisticRegression, I get error, it says ' X is not defined'.... Pls how do I solve this?
@HackersRealm
@HackersRealm 11 ай бұрын
try to run the cells in order to avoid any errors!!!
@tooba5611
@tooba5611 2 жыл бұрын
My 'Age' data is float too but when I am trying to solve it it cant execute. The error shows- TypeError: unsupported operand type(s) for +: 'float' and 'method' Can anyone help?
@HackersRealm
@HackersRealm 2 жыл бұрын
what method you are trying to use on age?
@shubhankarsharma4094
@shubhankarsharma4094 5 ай бұрын
Fare null should be replaced with median
@abhishekprakash9803
@abhishekprakash9803 2 жыл бұрын
ks stats kab perfrom krte h
@adithyabm9182
@adithyabm9182 11 ай бұрын
why are you dropping the survived column in the last
@HackersRealm
@HackersRealm 11 ай бұрын
before the model training part?
@tejaswinimunumudi8440
@tejaswinimunumudi8440 2 жыл бұрын
Sir I am getting values of passengers in 800 's sir what can I do now
@HackersRealm
@HackersRealm 2 жыл бұрын
I didn't get you, value of passengers means?
@daniecookie505
@daniecookie505 Жыл бұрын
im getting an error i dont understand why TypeError: Singleton array array(891) cannot be considered a valid collection.
@HackersRealm
@HackersRealm Жыл бұрын
Are you using the same code?
@patrick.bateman.psy1
@patrick.bateman.psy1 Ай бұрын
When I run the sns.countplot(train['Survived']) then only the blue graph is showing with value of 800.... What's the solution of this or this is the bug from the dataset itself..
@HackersRealm
@HackersRealm Ай бұрын
Could you double check the column values whether it has 2 classes
@patrick.bateman.psy1
@patrick.bateman.psy1 Ай бұрын
No it has only 1 class
@HackersRealm
@HackersRealm Ай бұрын
@@patrick.bateman.psy1 it should be 2 classes, survived and not survived, I think some issue in the dataset
@patrick.bateman.psy1
@patrick.bateman.psy1 Ай бұрын
@@HackersRealm Yes it's the issue with the dataset, Is anyway to fix it)
@HackersRealm
@HackersRealm Ай бұрын
@@patrick.bateman.psy1 try the dataset in my github or use the kaggle link to download it. You could also try from other websites as well
@Sumer-zt5cu
@Sumer-zt5cu Жыл бұрын
Input Nan Contains NaN ... How to deal with it brother plzz explain
@HackersRealm
@HackersRealm Жыл бұрын
drop the null values or replace the null values with some stats or negatives
@HackersRealm
@HackersRealm Жыл бұрын
you can also check this video kzbin.info/www/bejne/fHa0oZedpK2Spq8
Exploratory Data Analysis with Pandas Python
40:22
Rob Mulla
Рет қаралды 439 М.
I wish I could change THIS fast! 🤣
00:33
America's Got Talent
Рет қаралды 122 МЛН
ОСКАР vs БАДАБУМЧИК БОЙ!  УВЕЗЛИ на СКОРОЙ!
13:45
Бадабумчик
Рет қаралды 4,3 МЛН
Каха и суп
00:39
К-Media
Рет қаралды 2,9 МЛН
Beginner Kaggle Data Science Project Walk-Through (Titanic)
38:16
Stanford's FREE data science book and course are the best yet
4:52
Python Programmer
Рет қаралды 672 М.
How to do the Titanic Kaggle Competition
18:28
Aladdin Persson
Рет қаралды 72 М.
XGBoost in Python from Start to Finish
56:43
StatQuest with Josh Starmer
Рет қаралды 218 М.
I wish I could change THIS fast! 🤣
00:33
America's Got Talent
Рет қаралды 122 МЛН