Train Test Split with Python Machine Learning (Scikit-Learn)

  Рет қаралды 30,586

Ryan & Matt Data Science

Ryan & Matt Data Science

Күн бұрын

Пікірлер: 37
@RyanAndMattDataScience
@RyanAndMattDataScience 5 ай бұрын
Hey guys I hope you enjoyed the video! If you did please subscribe to the channel! Join our Data Science Discord Here: discord.com/invite/F7dxbvHUhg If you want to watch a full course on Machine Learning check out Datacamp: datacamp.pxf.io/XYD7Qg Want to solve Python data interview questions: stratascratch.com/?via=ryan I'm also open to freelance data projects. Hit me up at ryannolandata@gmail.com *Both Datacamp and Stratascratch are affiliate links.
@mikabenson315
@mikabenson315 Ай бұрын
Very simple yet with so much insights, thanks Ryan
@RyanAndMattDataScience
@RyanAndMattDataScience Ай бұрын
No problem
@tengsolomon
@tengsolomon Жыл бұрын
Thanks so much for your video. So simple and easy to follow.
@RyanAndMattDataScience
@RyanAndMattDataScience Жыл бұрын
No problem. I try to keep all my vids simple and straight to the point
@rifqimaruf
@rifqimaruf 9 ай бұрын
im starting learning machine learning cause my duty on college, this video explain with ease, thank you Ryan, keep it up.
@RyanAndMattDataScience
@RyanAndMattDataScience 9 ай бұрын
Np check out my other ML vids
@onurdatascience
@onurdatascience Жыл бұрын
Important topic, great content!
@RyanAndMattDataScience
@RyanAndMattDataScience Жыл бұрын
Thanks!
@gajendrakc813
@gajendrakc813 3 ай бұрын
Thank you Ryan. Learning so much from you.
@RyanAndMattDataScience
@RyanAndMattDataScience 3 ай бұрын
No problem join our discord also! We will be hosting trainings and office hours in the future
@user-hb2df6nn2y
@user-hb2df6nn2y 2 ай бұрын
DENNIS SHA PO VA LOV!!!
@henry-o8i
@henry-o8i 10 ай бұрын
Thanks for the great content. I wonder if you discuss data leakage in your later videos/project. I was confused on when I should do train_test_split in a project. like should i do the pre-processing data first or train_test_split first
@RyanAndMattDataScience
@RyanAndMattDataScience 10 ай бұрын
Hey may cover this way later this year. Focusing on Ai vids next few months
@darks_
@darks_ 7 ай бұрын
Thanks a lot! Small question, what should I do if I want to have a stratified splitting with the same database?
@jencinas8586
@jencinas8586 8 ай бұрын
Hello , do you recommend learning sql first ,before starting with ML ?
@RyanAndMattDataScience
@RyanAndMattDataScience 8 ай бұрын
Sql and Python which I have vids on my channel
@N1246-c2f
@N1246-c2f 11 ай бұрын
Thanks this makes so much sense! I'm running a multiple regression on some stock data but my r2 value is coming out pretty low.. do you know how i can improve the model? or do u have a vid on it?
@RyanAndMattDataScience
@RyanAndMattDataScience 11 ай бұрын
No problem and ye check out my Kaggle projects. I go over different techniques. Try different models and hyper parameters. Optuna also may help
@henry-o8i
@henry-o8i 10 ай бұрын
Ryan - Great content- thanks but wonder if you can provide a road map for the playlists. I think that will be really helpful.
@RyanAndMattDataScience
@RyanAndMattDataScience 10 ай бұрын
the playlist is in order + has a few projects along the way. I do plan on adding to it later this year
@henry-o8i
@henry-o8i 10 ай бұрын
Thank You. Thanks for the great content.- I been giving up on studying data science after attending bootcamp 2 years ago. I found your videos been really helpful for me to refresh/studying data science again.@@RyanAndMattDataScience
@Reshmakrishnan21
@Reshmakrishnan21 2 ай бұрын
What would happen if you used 50% of the data for testing rather than 20%?
@dvdlog
@dvdlog Ай бұрын
I guess the model is less accurate cause 50% data for training only
@mehdismaeili3743
@mehdismaeili3743 Ай бұрын
Excellent.
@RyanAndMattDataScience
@RyanAndMattDataScience Ай бұрын
Thanks
@frankdearr2772
@frankdearr2772 10 ай бұрын
great topic thanks 👍
@RyanAndMattDataScience
@RyanAndMattDataScience 10 ай бұрын
Thank you
@epicmemesandanime329
@epicmemesandanime329 4 ай бұрын
at 3:01 why y=df["HOF"]?
@gajendrakc813
@gajendrakc813 3 ай бұрын
That is assigning HOF column as y. X ( Rest of the columns ) is input and y is output. We are using X ( rest of the columns ) to determine y ( output). Hope that made sense.
@kimaudreymagan484
@kimaudreymagan484 3 ай бұрын
Thank you!
@RyanAndMattDataScience
@RyanAndMattDataScience 3 ай бұрын
No problem
@subhanjalpant8824
@subhanjalpant8824 3 ай бұрын
Where is the data????
@curiousworm1632
@curiousworm1632 3 ай бұрын
Github
@casonpark
@casonpark 3 ай бұрын
on his github
@subhanjalpant8824
@subhanjalpant8824 3 ай бұрын
@@casonpark link?
@shahinurrahman6309
@shahinurrahman6309 Күн бұрын
Need data
Python Feature Scaling in SciKit-Learn (Normalization vs Standardization)
11:59
Ryan & Matt Data Science
Рет қаралды 17 М.
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН
黑天使被操控了#short #angel #clown
00:40
Super Beauty team
Рет қаралды 61 МЛН
Сестра обхитрила!
00:17
Victoria Portfolio
Рет қаралды 958 М.
Train Test Split | Training and Testing data | Machine Learning
10:07
How I'd learn ML in 2025 (if I could start over)
16:24
Boris Meinardus
Рет қаралды 142 М.
Learn Machine Learning Like a GENIUS and Not Waste Time
15:03
Infinite Codes
Рет қаралды 385 М.
Simplify Data Preprocessing with Python's Column Transformer: A Step-by-Step Guide
13:52
ML Was Hard Until I Learned These 5 Secrets!
13:11
Boris Meinardus
Рет қаралды 357 М.
All Machine Learning algorithms explained in 17 min
16:30
Infinite Codes
Рет қаралды 521 М.
Normalization Vs. Standardization (Feature Scaling in Machine Learning)
19:48
Stanford's FREE data science book and course are the best yet
4:52
Python Programmer
Рет қаралды 714 М.
REAL or FAKE? #beatbox #tiktok
01:03
BeatboxJCOP
Рет қаралды 18 МЛН