Sergey Feldman: You Should Probably Be Doing Nested Cross-Validation | PyData Miami 2019

10,240 views

PyData

Comments: 7
@iancherabier5920 7 months ago
Thanks a lot, an extremely clear explanation of nested CV!
@BeckCaesar-r8l 17 days ago
Davis Timothy Anderson Michael White Donna
@QIQIWU-fd1xz 1 year ago
This is really helpful! Thanks for sharing. One question, at 10:55, when running the 5-fold CV, shouldn't we use X_train_val instead of X_train? Because the splitting is done by sklearn, thus we don't need to hold out a validation set.
@sergey_of_fields 1 year ago
Yes! Sorry that was a bug in the code.
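For readers following this exchange, here is a minimal sketch of what the corrected call might look like. It is not the code from the talk: the synthetic dataset, the `LogisticRegression` model, and the `C` grid are all assumptions chosen for illustration. The only point it shows is that `GridSearchCV` receives the whole train-plus-validation portion, because it does the 5-fold splitting itself.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic data stands in for the dataset used in the talk (assumption).
X, y = make_classification(n_samples=300, n_features=20, random_state=0)

# Hold out a test set; the remainder is train + validation combined.
X_train_val, X_test, y_train_val, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0, stratify=y
)

# GridSearchCV performs the 5-fold splitting internally, so it is given
# all of X_train_val; no separate validation set is carved out first.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),        # illustrative model (assumption)
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},  # illustrative grid (assumption)
    cv=5,
)
search.fit(X_train_val, y_train_val)

# The untouched test set is scored once, with the refit best model.
test_accuracy = search.score(X_test, y_test)
print(search.best_params_, round(test_accuracy, 3))
```

Passing only `X_train` here would leave the validation rows unused during tuning, which is the bug the comment above points out.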
@bryanparis7779 1 year ago
THANK YOU so helpful! so interesting so so so :)
@BulkySplash169 2 years ago
Nice, thx!
@nespereira 2 months ago
Very useful! One question: in many medical datasets, especially in single-group research settings, the sample sizes are more around 100 or less (being in the thousands is rare). With this number of subjects, one worry is that putting away subjects for testing removes samples in a context where there really is not much data to begin with. Then you need to think about how many features you can afford etc... Don't get me wrong, I'm all in for nested cross-validation, but I am curious to hear your thoughts on this type of scenario, where getting data is really expensive.
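One point that may bear on the small-sample worry: in nested cross-validation no subject is set aside permanently, since each one lands in an outer test fold exactly once and is used for training in every other outer fold. The sketch below illustrates this on 100 synthetic samples. The data, the model, the grid, and the fold counts are assumptions for illustration, not a recommendation from the talk.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold, cross_val_score

# 100 synthetic "subjects" stand in for a small medical dataset (assumption).
X, y = make_classification(
    n_samples=100, n_features=10, n_informative=4, random_state=0
)

# Inner loop tunes hyperparameters; outer loop estimates performance.
inner_cv = StratifiedKFold(n_splits=3, shuffle=True, random_state=0)
outer_cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)

tuned_model = GridSearchCV(
    LogisticRegression(max_iter=1000),   # illustrative model (assumption)
    param_grid={"C": [0.1, 1.0, 10.0]},  # illustrative grid (assumption)
    cv=inner_cv,
)

# Each outer fold refits the whole grid search on its own training part.
outer_scores = cross_val_score(tuned_model, X, y, cv=outer_cv)

# Count how often each sample is used for testing in the outer loop.
test_counts = [0] * len(y)
for _, test_idx in outer_cv.split(X, y):
    for i in test_idx:
        test_counts[i] += 1

print(outer_scores.mean(), set(test_counts))
```

With so few samples the per-fold scores can vary a lot, so the spread of `outer_scores` is worth reporting alongside the mean.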