
Machine Learning Lecture 27 "Gaussian Processes II / KD-Trees / Ball-Trees" -Cornell CS4780 SP17

  28,882 views

Kilian Weinberger

1 day ago

Comments: 48
@bharasiva96 4 years ago
KD-Trees begins at 28:50
@yuanchia-hung8613 4 years ago
The best explanation for Gaussian Process ever!
@mlst3rg 4 years ago
this series is a work of art. needs way more views.
@rajupowers 4 years ago
Most intuitive explanation of the topics in classroom
@clementpeng 4 years ago
Love this. Probably the clearest explanation i have seen on GP online.
@deltasun 4 years ago
Thank you very much! I've tried a couple of times to understand GPs, but always gave up. Now I think they're much clearer to me. Very, very grateful.
@raedbouslama2263 3 years ago
The previous video and this one are the best material I have watched on Gaussian Processes! Wonderful :)
@peterhojnos6705 3 years ago
Definitely! I have seen many, but this is one of the best.
@abhinav9561 3 years ago
Prof Kilian killin' it! Thanks, Prof, for all the lectures. This course should be everyone's first introduction to the machine learning world.
@chamaleewickrama3276 3 years ago
Omg. I love this lecture material. To the point, clear and the best!
@saikumartadi8494 4 years ago
Awesome simulation of a beautiful application!
@Illinoise888 4 years ago
This helps me with my exam preparation, thank you.
@isaacbuitrago2370 4 years ago
You make it look easy! Thanks for the clear explanation of GPs.
@atagomes_lncc_br 3 years ago
Best and simplest explanation of GPR.
@AlexPadula 5 years ago
Thank you very much, these lectures are really useful.
@vaaal88 4 years ago
this is such a great lesson. Thanks!
@udiibgui2136 3 years ago
Thank you for the lecture, very clear! Just one question: how does the Bayesian optimisation already have a mapped surface?
@kilianweinberger698 3 years ago
Initially it is just a flat surface, which is an uninformed prior.
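
(For readers wondering what that flat prior looks like in practice, here is a minimal numpy sketch, assuming a zero-mean GP with an RBF kernel; all names are illustrative, not from the lecture.)

import numpy as np

def rbf_kernel(A, B, lengthscale=1.0):
    # squared-exponential kernel: k(a, b) = exp(-||a - b||^2 / (2 * lengthscale^2))
    sq = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-sq / (2 * lengthscale**2))

X = np.linspace(0, 10, 100)[:, None]   # candidate hyperparameter settings
K = rbf_kernel(X, X)
prior_mean = np.zeros(len(X))          # the "flat surface": mean 0 everywhere
# uncertainty is also uniform before any evaluations: diag(K) == 1 everywhere
samples = np.random.multivariate_normal(prior_mean, K + 1e-8 * np.eye(len(X)), 3)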
@chaowang3093 3 years ago
This guy is brilliantly funny.
@sarvasvarora 3 years ago
Living for that "YAY" 😂😂
@Biesterable 5 years ago
Hm, isn't there maybe a way to do a low-dimensional egg-search? If it's a manifold there should always be some main directions. So to start, just make it an ellipsoid in one dimension, and for comparisons distort the space so that the ellipsoid you're comparing with becomes a sphere... hm.
@andreariboni4242 1 month ago
dead mouse got me
@TeoChristopher 4 years ago
To clarify, for 26:19: for a Gaussian process, each data point on the x-axis would be a queried test point, the grey region would be the standard deviation, and the points that we have not "queried" would be fitted according to their respective distributions, each itself a Gaussian with its own mean and s.d.?
@kilianweinberger698 4 years ago
Exactly :-)
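
(A minimal numpy sketch of that picture, assuming an RBF kernel and nearly noise-free observations; variable names are illustrative. The posterior mean interpolates the queried points, and the grey band is mean ± 2 standard deviations.)

import numpy as np

def rbf(A, B, ls=1.0):
    sq = np.sum(A**2, 1)[:, None] + np.sum(B**2, 1)[None, :] - 2 * A @ B.T
    return np.exp(-sq / (2 * ls**2))

Xtrain = np.array([[1.0], [3.0], [6.0]])   # the queried test points
ytrain = np.array([0.5, -0.2, 1.0])
Xplot = np.linspace(0, 8, 200)[:, None]    # every other point on the x-axis

K = rbf(Xtrain, Xtrain) + 1e-6 * np.eye(len(Xtrain))  # jitter for stability
Ks = rbf(Xplot, Xtrain)

mean = Ks @ np.linalg.solve(K, ytrain)                    # posterior mean
cov = rbf(Xplot, Xplot) - Ks @ np.linalg.solve(K, Ks.T)   # posterior covariance
std = np.sqrt(np.clip(np.diag(cov), 0.0, None))           # grey band: mean ± 2*std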
@mertkurttutan2877 2 years ago
Question: regarding hyperparameter search via GPs, I recall that the earlier steps of a hyperparameter search involve determining the scale of the hyperparameter. How should we determine the scale? Should we use a GP for both the scale and the minimizing value at that scale, or use grid search to determine the scale and then a GP to find the value of the hyperparameter? Thanks for lectures that are both rigorous and enjoyable :)
@akshaygrao77 1 year ago
You keep running Bayesian optimization, which uses Gaussian processes; with more iterations it converges to smaller scales by itself.
@ayushmalik7093 2 years ago
Hi Prof, for the Bayesian optimizer I assume the algorithm whose best hyperparameters we are trying to find should be costly enough to run; otherwise it would not make sense to use a GP on top of another algorithm.
@kevinshao9148 7 months ago
9:30: so my test point y_test has 1) its own variance, and 2) n correlations with respect to all observed data y1...yn; how do we then determine the distribution of y_test? How did you get the conclusion at 11:06? Thanks!
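
(For reference, the standard Gaussian conditioning identity gives the test distribution: stack y_* with the observed y's into one joint Gaussian built from the kernel, then condition. With K the n-by-n kernel matrix of the observed data, k_* the vector of the n correlations mentioned above, and k_{**} the test point's own variance:)

\begin{bmatrix} \mathbf{y} \\ y_* \end{bmatrix}
\sim \mathcal{N}\left(\mathbf{0},
\begin{bmatrix} K & \mathbf{k}_* \\ \mathbf{k}_*^\top & k_{**} \end{bmatrix}\right)
\quad\Rightarrow\quad
y_* \mid \mathbf{y} \sim \mathcal{N}\left(\mathbf{k}_*^\top K^{-1}\mathbf{y},\;
k_{**} - \mathbf{k}_*^\top K^{-1}\mathbf{k}_*\right)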
@chenwang6684 4 years ago
Awesome lecture! One question: are the projects available to the public? I have found the homeworks but no coding projects.
@kilianweinberger698 4 years ago
Sorry, I cannot post them. The projects are still used at Cornell University, and if they were public someone would certainly post solutions somewhere and spoil all the fun. :-(
@rajupowers 4 years ago
Important @8:00
@LauraJoana 3 years ago
THANKS!
@ehfo 5 years ago
Are the homeworks available to the public?
@kilianweinberger698 4 years ago
www.dropbox.com/s/tbxnjzk5w67u0sp/Homeworks.zip?dl=0
@sankalpthakuravi 4 years ago
Kilian Weinberger you must be an angel
@imblera6571 4 years ago
For the hyperparameter search, wouldn't the Bayesian optimization approach be more likely to get stuck at a local minimum?
@kilianweinberger698 4 years ago
No, Bayesian optimization is global. The exploration component makes sure that you don’t get stuck.
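
(A sketch of why exploration prevents getting stuck, using a GP-UCB-style acquisition function as one illustrative choice; the lecture's exact acquisition rule may differ.)

import numpy as np

def ucb(mean, std, beta=2.0):
    # Score each candidate by predicted value plus an uncertainty bonus.
    # Regions the search has ignored keep a large std, so their score
    # eventually beats a well-explored local optimum.
    return mean + beta * std

# pick the next hyperparameter setting to evaluate:
# next_x = candidates[np.argmax(ucb(post_mean, post_std))]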
@salahghazisalaheldinataban5632 2 years ago
It seems from your explanation that the covariance matrix is a simple kernel/distance matrix that does not take variable importance into account. (1) Does that cause any issues if there are variables with no significant predictive value? (2) Does it mean we have to be careful about variable selection? And (3) is there a way to incorporate feature importance into the kernel?
@kilianweinberger698 2 years ago
For the linear kernel that's not an issue (as your algorithm becomes identical to linear regression where you learn a weight for each dimension), however for non-linear kernels that can indeed be a problem. One common trick is to multiply each feature dimension by a non-negative weight, and also learn these weights as part of the kernel parameters.
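
(A sketch of that trick for an RBF kernel: each feature dimension d gets a non-negative weight w[d], learned along with the other kernel parameters; driving w[d] toward zero effectively switches feature d off. GP libraries usually call this an ARD kernel.)

import numpy as np

def weighted_rbf(A, B, w):
    # w: non-negative per-dimension weights; w[d] -> 0 removes feature d
    Aw, Bw = A * w, B * w
    sq = np.sum(Aw**2, 1)[:, None] + np.sum(Bw**2, 1)[None, :] - 2 * Aw @ Bw.T
    return np.exp(-0.5 * sq)

# example with three features, the second judged unimportant:
# K = weighted_rbf(Xtrain, Xtrain, w=np.array([1.0, 0.01, 0.7]))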
@sandeshhegde9143 5 years ago
KD Tree starts from kzbin.info/www/bejne/eKure2hthqiXjNE
@vatsan16 4 years ago
One thing I would like to ask is: what's the catch? The algorithm seems great, but where would we not want to use GPR? Is it in situations where we would like to actually know what the function is? Or are there situations where GPR won't work well?
@kilianweinberger698 4 years ago
Well, I wouldn’t recommend them for data that is very high dimensional (e.g. bag-of-words vectors, or images in pixel space). Also, when features are sparse, splitting along features becomes tedious and too restrictive, as almost all samples always have zeros in all dimensions.
@giraffaelll 3 years ago
He clears his throat a lot
@thecelavi 5 years ago
Is it possible to use a B/B+ tree instead of a simple binary tree?
@hassanshakeel854 5 years ago
Are all these lectures dependent on previous ones?
@kilianweinberger698 5 years ago
Some more than others... but generally yes.