C5W3L04 Refining Beam Search

  Рет қаралды 20,409

DeepLearningAI

DeepLearningAI

Күн бұрын

Пікірлер: 10
@maximdemey3322
@maximdemey3322 4 жыл бұрын
Thanks for explaining the length normalization of beam search! I was already wondering at the end of your last video what would happen if in some branches you predict an EOS token.
@vinitabaniwal685
@vinitabaniwal685 3 жыл бұрын
Thank you so much for this explanation!!
@AbhishekKumar-wf6io
@AbhishekKumar-wf6io Жыл бұрын
Voice of the century :D
@sandipansarkar9211
@sandipansarkar9211 3 жыл бұрын
nice explanation
@trexmidnite
@trexmidnite 3 жыл бұрын
Sexiest voice in AI
@luck3949
@luck3949 6 жыл бұрын
I guess beam width can be made dynamic, for example, if NN tells that with p = 1 next letter is Z, than we can safely take width=1, and if on some step NN has no idea what should be next, than it's better to use bigger width. Right?
@zhifengyang1850
@zhifengyang1850 6 жыл бұрын
Even in the extreme case that one word's p is equal to 1, you still need to use fixed beam width. Suppose beam width is 3, RNN outputs A, B, C at time step 1, with prob 0.1, 0.2, 0.3 respectively. When we choose A as the input of the time step 2, we get the output word Z with prob 1, and we will get other outputs when we choose other words. But after RNN outputs in time step 2, you still need to compare all outputs from the 3 words A, B and C.
@luck3949
@luck3949 6 жыл бұрын
Zhifeng Yang thank you, now I see it. I did a little googling, and I found that there actually are some papers about dynamic width or dynamic puring, and it improves speed of the search by approximately 10%, with same quality.
@rohanvarma7777
@rohanvarma7777 5 жыл бұрын
At 4:05, you say that a log of a probability is
@nikhilbalwani5556
@nikhilbalwani5556 3 жыл бұрын
exactly
C5W3L05 Error Analysis of Beam Search
9:44
DeepLearningAI
Рет қаралды 9 М.
C5W3L03 Beam Search
11:55
DeepLearningAI
Рет қаралды 84 М.
No empty
00:35
Mamasoboliha
Рет қаралды 10 МЛН
Smart Sigma Kid #funny #sigma #comedy
00:40
CRAZY GREAPA
Рет қаралды 33 МЛН
CHOCKY MILK.. 🤣 #shorts
00:20
Savage Vlogs
Рет қаралды 16 МЛН
Spain Was a Warning
16:30
Economics Explained
Рет қаралды 1 МЛН
Researchers thought this was a bug (Borwein integrals)
17:26
3Blue1Brown
Рет қаралды 3,4 МЛН
SHA: Secure Hashing Algorithm - Computerphile
10:21
Computerphile
Рет қаралды 1,2 МЛН
The Traveling Salesman Problem: When Good Enough Beats Perfect
30:27
CS 152 NN-23:  Generating Sequences: Beam Search
17:30
Neil Rhodes
Рет қаралды 3,6 М.
The moment we stopped understanding AI [AlexNet]
17:38
Welch Labs
Рет қаралды 860 М.
Dijkstra's Algorithm - Computerphile
10:43
Computerphile
Рет қаралды 1,3 МЛН
C5W3L02 Picking the most likely sentence
8:57
DeepLearningAI
Рет қаралды 35 М.
C5W3L06 Bleu Score (Optional)
16:26
DeepLearningAI
Рет қаралды 112 М.