LSH.9 Locality-sensitive hashing: how it works

  Рет қаралды 54,768

Victor Lavrenko

Victor Lavrenko

Күн бұрын

Пікірлер: 16
@ayushsinghal28
@ayushsinghal28 7 жыл бұрын
how do we determine a good value of 'L' i.e. the number of tables. Is there some logic to get it?
@mariusm5187
@mariusm5187 9 жыл бұрын
Finally understand LSH
@carles5601
@carles5601 8 жыл бұрын
Great explanation. Thanks!
@AnkitSharmaKumar
@AnkitSharmaKumar 8 жыл бұрын
Great explanation... Thanks for sharing!
@gaaligadu148
@gaaligadu148 7 жыл бұрын
I don't understand how we found near duplicates inside a bucket.You can't compare using all the D-dimensions because obviously it will be different because they are near duplicates.Don't we use subsections of the D-dimensions like you said it before to check near duplicates ?
@dani0qiu0china
@dani0qiu0china 7 жыл бұрын
very good complexity analysis !
@MrBertmsk
@MrBertmsk 8 жыл бұрын
it's little unclear how to eliminate duplicates? Each "table" (bucket?) contains different hash ids for the data. Should I do comparasions within one bucket or against all buckets? How to combine resulting hash then?
@renzocoppola4664
@renzocoppola4664 7 жыл бұрын
if you compare against all buckets then you would be comparing aganst all points
@federicomagliani1
@federicomagliani1 7 жыл бұрын
Have you been understood? I repeated the hash process and I only concatenate the results.
@gauravmenghani4
@gauravmenghani4 7 жыл бұрын
You do comparisons within the buckets to remove false positives. You repeat the process with new random hyperplanes to consider points which were false negatives in the previous iteration.
@renzocoppola4664
@renzocoppola4664 7 жыл бұрын
I suppose you could take the adventage that the neraby buckets have 1 bit difference.
@alihusen111
@alihusen111 9 жыл бұрын
would you pleas tell me what do you mean when you said we do the same comparison to eliminate d ??
@harshgoyal5694
@harshgoyal5694 7 жыл бұрын
What do u mean by D dimensional document??? Thanks in advance :)
@RobertoMartin1
@RobertoMartin1 7 жыл бұрын
D is the size of the dictionary. you represent each document with words from the dictionary, so a D dimensional document will have at most d unique words. Usually, each document will contain less than D unique words, but they're still represented with D dimensions, just that some of the dimensions have zero as values.
@rahulat85
@rahulat85 7 жыл бұрын
8:39
@gaaligadu148
@gaaligadu148 7 жыл бұрын
Hi harsh, each document can be represented numerically by D-dimensions. For ex: it can be image whose D-dimension vector would be all of it's pixel values
LSH.10 False positive and negative errors of LSH
10:42
Victor Lavrenko
Рет қаралды 12 М.
Locality Sensitive Hashing   Part 1, Jeffrey D  Ullman
50:09
EIT Digital
Рет қаралды 46 М.
Brawl Stars Edit😈📕
00:15
Kan Andrey
Рет қаралды 52 МЛН
Running With Bigger And Bigger Lunchlys
00:18
MrBeast
Рет қаралды 70 МЛН
Expectation Maximization: how it works
10:39
Victor Lavrenko
Рет қаралды 279 М.
The Boundary of Computation
12:59
Mutual Information
Рет қаралды 1 МЛН
Bayes theorem, the geometry of changing beliefs
15:11
3Blue1Brown
Рет қаралды 4,4 МЛН
Similarity & MinHash
37:33
Ben Langmead
Рет қаралды 13 М.
5 Simple Steps for Solving Any Recursive Problem
21:03
Reducible
Рет қаралды 1,2 МЛН
What's a Tensor?
12:21
Dan Fleisch
Рет қаралды 3,6 МЛН
Куда пропал Kodak?
1:01
MOTIVESSION
Рет қаралды 921 М.
CRAZY KEYBOARD CHALLENGE 😮How fast could you type?
0:41
Hipyo Tech
Рет қаралды 22 МЛН
iPhone 16 Pro - ПЕРВЫЙ обзор!
0:59
808
Рет қаралды 141 М.
iPhone 16 в 2012 году!
0:38
ÉЖИ АКСЁНОВ
Рет қаралды 1,5 МЛН
📱🛠️Молотижи VS 15 pro MAX #diy #iphone #tools
0:57
Короче, ВИ
Рет қаралды 496 М.
Распаковка 16 iPhone pro max
0:50
KERRY CATT
Рет қаралды 195 М.