Train, Test, & Validation Sets | How to Train Machine Learning Models (Properly!!!)

  Рет қаралды 21,228

Greg Hogg

Greg Hogg

2 жыл бұрын

The Notebook: colab.research.google.com/dri...
Thank you for watching the video! You can learn data science FASTER at mlnow.ai!
Master Python at mlnow.ai/course-material/python/!
Learn SQL & Relational Databases at mlnow.ai/course-material/sql/!
Learn NumPy, Pandas, and Python for Data Science at mlnow.ai/course-material/data...!
Become a Machine Learning Expert at mlnow.ai/course-material/ml/!
Don't forget to subscribe if you enjoyed the video :D

Пікірлер: 28
@GregHogg
@GregHogg 11 ай бұрын
Take my courses at mlnow.ai/!
@djjiang3718
@djjiang3718 2 жыл бұрын
Just want to say Thank you, Greg! love your content always good stuff! Keep up with hard work! Thanks for teaching and sharing !
@GregHogg
@GregHogg 2 жыл бұрын
I really appreciate this, thanks so much and you're very welcome 😄😄
@tartlord7061
@tartlord7061 Жыл бұрын
Great and easy to follow video, thanks!
@natureclipsreal
@natureclipsreal Жыл бұрын
If I am using cross validation during the training process, can I still use a validation set after training the model?
@panagiotisgoulas8539
@panagiotisgoulas8539 2 жыл бұрын
Cool. thanks for that.
@arsheyajain7055
@arsheyajain7055 2 жыл бұрын
Oooh awesome!!
@PeterPan-xp2gl
@PeterPan-xp2gl Жыл бұрын
Hey Greg, thanks for your video! Could you explain why you convert the dataframe to a numpy matrix? Since the whole procedure also works with data frames?
@ismailucuran4523
@ismailucuran4523 Жыл бұрын
probably its because he didn't use scikitlearn treain_test_split
@kshitijalawandi4443
@kshitijalawandi4443 2 жыл бұрын
This is awesome....
@GregHogg
@GregHogg 2 жыл бұрын
Thank you!
@mayreyes4402
@mayreyes4402 Жыл бұрын
when I try to find mean absolute error for linear_test_ preds the result when I run it is "ValueError: could not convert string to float: 'KS'" what should I do with that?
@itowindsor
@itowindsor 2 ай бұрын
One of your feature might be a categorical one. Therefore your models can't work with it. One way to overcome it is to one hot encode this feature. It will increase the dimension of your features spaces (as you will have one more dimension for each category) but every algorithm will understand it
@emekaanammah9207
@emekaanammah9207 2 жыл бұрын
Where did you download the california housing test and train dataset from?
@GregHogg
@GregHogg 2 жыл бұрын
Google Colab. Free cloud env
@nayliafiqah4072
@nayliafiqah4072 Жыл бұрын
Thank you so much;
@GregHogg
@GregHogg Жыл бұрын
You're very welcome!!
@user-ip9zo8zp4c
@user-ip9zo8zp4c 10 ай бұрын
Is it possible that you wrote the arguments of 'mean_absolute_error' in the wrong order?
@jyotikapatil2354
@jyotikapatil2354 Жыл бұрын
Only video explaining actual hyperparameter tuning using validation test.
@diazjubairy1729
@diazjubairy1729 2 жыл бұрын
How to set the proportion of train, val and test of our dataset ?
@GregHogg
@GregHogg 2 жыл бұрын
How technically, or what numbers should you use?
@diazjubairy1729
@diazjubairy1729 2 жыл бұрын
@@GregHogg train : val : test = 50 : 20 : 30
@GregHogg
@GregHogg 2 жыл бұрын
@@diazjubairy1729 is this a question?
@diazjubairy1729
@diazjubairy1729 2 жыл бұрын
@@GregHogg that is the proportion that i want, how to split the dataset like that ?
@GregHogg
@GregHogg 2 жыл бұрын
@@diazjubairy1729 sklearn train test split (do it twice), or shuffle a pandas DataFrame and get split indices
@pogiribhanuteja4650
@pogiribhanuteja4650 2 жыл бұрын
When will be the next live session..?
@GregHogg
@GregHogg 2 жыл бұрын
I think I'll be busy this weekend unfortunately. If not, this weekend, otherwise probably the weekend after.
@mehediazad1780
@mehediazad1780 Жыл бұрын
you confused me by your variable name
Train Test Split | Training and Testing data | Machine Learning
10:07
Самый Молодой Актёр Без Оскара 😂
00:13
Глеб Рандалайнен
Рет қаралды 3,7 МЛН
LOVE LETTER - POPPY PLAYTIME CHAPTER 3 | GH'S ANIMATION
00:15
Русалка
01:00
История одного вокалиста
Рет қаралды 6 МЛН
154 - Understanding the training and validation loss curves
27:47
DigitalSreeni
Рет қаралды 103 М.
Training Data Vs Test Data Vs Validation Data| Krish Naik
14:41
Krish Naik Hindi
Рет қаралды 48 М.
Train, Test, & Validation Sets explained
6:58
deeplizard
Рет қаралды 202 М.
Selecting the best model in scikit-learn using cross-validation
35:54