C5W3L04 Refining Beam Search

20,409 views

DeepLearningAI


Comments: 10
@maximdemey3322 4 years ago
Thanks for explaining the length normalization of beam search! I was already wondering at the end of your last video what would happen if in some branches you predict an EOS token.
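As a rough illustration of the length normalization mentioned here, the sketch below divides a hypothesis's summed log-probability by its length raised to a power `alpha`. The function name `normalized_score`, the default `alpha=0.7`, and the example probabilities are assumptions made for illustration, not values taken from this page.

```python
import math

def normalized_score(token_probs, alpha=0.7):
    """Length-normalized log-probability of one hypothesis.

    token_probs: per-token conditional probabilities (hypothetical values).
    alpha: softening exponent; alpha=0 gives the raw log-probability,
           alpha=1 gives the plain per-token average. 0.7 is an assumed default.
    """
    log_p = sum(math.log(p) for p in token_probs)
    return log_p / (len(token_probs) ** alpha)

# Hypothetical hypotheses: a short one that hit EOS early and a longer one.
short = [0.5, 0.5]
long = [0.6, 0.6, 0.6, 0.6]

# Without normalization the shorter hypothesis wins simply because it
# multiplies fewer probabilities below 1.
raw_prefers_short = normalized_score(short, alpha=0) > normalized_score(long, alpha=0)

# With normalization the longer hypothesis is no longer penalized for length.
normalized_prefers_long = normalized_score(long) > normalized_score(short)
```

With these made-up numbers, both flags come out `True`: the ranking flips once the score is normalized by length.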
@vinitabaniwal685 3 years ago
Thank you so much for this explanation!!
@AbhishekKumar-wf6io 1 year ago
Voice of the century :D
@sandipansarkar9211 3 years ago
nice explanation
@trexmidnite 3 years ago
Sexiest voice in AI
@luck3949 6 years ago
I guess the beam width can be made dynamic. For example, if the NN says that the next letter is Z with p = 1, then we can safely take width = 1, and if at some step the NN has no idea what should come next, then it's better to use a bigger width. Right?
@zhifengyang1850 6 years ago
Even in the extreme case where one word's probability equals 1, you still need a fixed beam width. Suppose the beam width is 3 and the RNN outputs A, B, and C at time step 1 with probabilities 0.1, 0.2, and 0.3 respectively. When A is fed in as the input at time step 2, the output is the word Z with probability 1; feeding in B or C yields other outputs. After time step 2 you still have to compare all the outputs that follow from the three words A, B, and C.
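The example in this reply can be sketched as a single beam-search step. The step-1 probabilities for A, B, and C and the probability 1 for Z after A come from the comment; the continuations after B and C, the token names X and Y, and the helper `beam_step` are hypothetical and only serve to make the comparison concrete.

```python
import heapq
import math

def beam_step(beams, next_probs, beam_width):
    """Expand every hypothesis, then keep the beam_width best joint scores."""
    candidates = []
    for prefix, log_p in beams:
        # next_probs maps the last token to its (hypothetical) next-token distribution.
        for token, p in next_probs[prefix[-1]].items():
            if p > 0.0:
                candidates.append((prefix + (token,), log_p + math.log(p)))
    # Candidates from all prefixes compete in one pool.
    return heapq.nlargest(beam_width, candidates, key=lambda c: c[1])

# Time step 1, as in the comment: A, B, C with probabilities 0.1, 0.2, 0.3.
beams = [(("A",), math.log(0.1)), (("B",), math.log(0.2)), (("C",), math.log(0.3))]

# Time step 2: Z follows A with probability 1 (from the comment);
# the distributions after B and C are assumed for illustration.
next_probs = {
    "A": {"Z": 1.0},
    "B": {"X": 0.6, "Y": 0.4},
    "C": {"X": 0.5, "Y": 0.5},
}

top = beam_step(beams, next_probs, beam_width=3)
kept = {prefix for prefix, _ in top}
```

Under these assumed numbers the joint probability of "A Z" is 0.1 × 1 = 0.1, which is lower than "C X" and "C Y" (0.15 each) and "B X" (0.12), so "A Z" falls out of the beam even though Z was certain given A. That is why the candidates from all three prefixes have to be compared together.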
@luck3949 6 years ago
Zhifeng Yang, thank you, now I see it. I did a little googling and found that there actually are some papers about dynamic width or dynamic pruning, and it improves the speed of the search by approximately 10% with the same quality.
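One simple way such pruning could work is to keep only the candidates whose joint log-probability lies within a fixed margin of the best one, so the effective width shrinks when the model is confident and grows when it is not. The sketch below is an assumption-laden illustration of that idea, not the method of any particular paper; `prune_by_margin`, `max_width`, `margin`, and all the probabilities are hypothetical.

```python
import math

def prune_by_margin(candidates, max_width, margin):
    """Keep at most max_width candidates whose score is within margin of the best.

    candidates: list of (hypothesis, joint_log_prob) pairs pooled from all prefixes.
    margin: allowed gap in log-probability below the best candidate (assumed knob).
    """
    ranked = sorted(candidates, key=lambda c: c[1], reverse=True)[:max_width]
    if not ranked:
        return []
    best = ranked[0][1]
    return [c for c in ranked if c[1] >= best - margin]

# Keep candidates at most 10x less probable than the best one.
margin = math.log(10)

# Confident step: one candidate dominates, so the beam collapses to width 1.
confident = [("Z", math.log(0.98)), ("Q", math.log(0.01)), ("K", math.log(0.01))]
narrow = prune_by_margin(confident, max_width=3, margin=margin)

# Uncertain step: the candidates are close together, so all three survive.
uncertain = [("X", math.log(0.40)), ("Y", math.log(0.35)), ("W", math.log(0.25))]
wide = prune_by_margin(uncertain, max_width=3, margin=margin)
```

The pruning is applied to the pooled candidates after expansion, so it stays consistent with the point made in the previous reply: hypotheses from every prefix are still compared against each other before any are dropped.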
@rohanvarma7777 5 years ago
At 4:05, you say that a log of a probability is
@nikhilbalwani5556 3 years ago
exactly