8.2 David Thompson (Part 2): Nearest Neighbors and the Curse of Dimensionality

  12,223 views

caltech

Comments: 7
@caseyli5580 6 years ago
The explanation of why KNN is problematic in high dimensions starts at 8:53. Super helpful, surprised this doesn't have more views. Thanks for putting this up!
@ollie-d 3 years ago
Agreed.
@coldbrewed8308 1 year ago
I'm surprised he could deliver a 16-minute explanation without a single full stop in his sentences. Loads of information packed in here.
@thequantartist 3 years ago
Great video! I understand the curse of dimensionality much better now.
@ahmedsaied8373 3 years ago
Which sorting algorithm can sort in sublinear time (< O(n))?
@tupoiu 10 months ago
None, but a balanced binary search tree has cheap inserts (O(log n) per insert), and you can run binary search on it efficiently. I think the idea is to preprocess the data into a useful structure first, and then search it for nearest neighbours.
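A minimal sketch of the "preprocess first, then search" idea from the reply above, restricted to one-dimensional data. It swaps in a sorted list plus Python's `bisect` module for the binary tree mentioned in the comment; the function names `build_index` and `nearest` are made up for illustration and are not from the lecture.

```python
import bisect

def build_index(values):
    """Preprocess once: sorting costs O(n log n)."""
    return sorted(values)

def nearest(index, query):
    """Return the value in the sorted index closest to query (O(log n) search)."""
    if not index:
        raise ValueError("index is empty")
    pos = bisect.bisect_left(index, query)
    # The nearest value sits just left or just right of the insertion point.
    candidates = index[max(pos - 1, 0):pos + 1]
    return min(candidates, key=lambda v: abs(v - query))

index = build_index([7.0, 1.5, 9.2, 4.0, 3.3])
print(nearest(index, 3.9))
```

The sort is paid once, and every later query costs only a binary search. This trick relies on a total order, which is why it does not carry over directly to high-dimensional points.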
@spamspamer3679 1 year ago
Another way of thinking about this is the central limit theorem. Squared Euclidean distance is a sum of per-dimension squared differences, so you can picture it (only conceptually) as a kind of mean taken over all dimensions. The problem is that, as dimensions are added, the distances become more and more like a Gaussian/normal distribution whose spread is small relative to its mean. The points therefore all seem more or less equidistant, and the distances lose their meaning. I don't think this covers the whole hypersphere/hypercube effect, but I thought it might offer a new perspective.
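A rough way to check the concentration effect described in the comment above, assuming points drawn uniformly from the unit hypercube (an assumption of this sketch, not something stated in the video). The hypothetical helper `relative_spread` measures the standard deviation of the distances from one query point to a random sample, divided by their mean.

```python
import math
import random
import statistics

def relative_spread(dim, n_points=200, seed=0):
    """Std/mean of Euclidean distances from one query to random points."""
    rng = random.Random(seed)
    # Assumption: coordinates are independent and uniform on [0, 1).
    query = [rng.random() for _ in range(dim)]
    dists = []
    for _ in range(n_points):
        point = [rng.random() for _ in range(dim)]
        dists.append(math.dist(query, point))
    return statistics.pstdev(dists) / statistics.mean(dists)

for dim in (2, 10, 100, 1000):
    print(dim, round(relative_spread(dim), 3))
```

The printed ratio should shrink as the dimension grows, which is the "everything looks equidistant" effect: the nearest and farthest neighbours end up at nearly the same distance.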