Self-classifying MNIST Digits (Paper Explained)

13,198 views

Yannic Kilcher



Comments: 33
@CristianGarcia 4 years ago
Thanks Yannic! I found it odd that they used L2 instead of cross-entropy with label smoothing; it should have the same regularizing effect of not pushing the logits to infinity.
@YannicKilcher 4 years ago
Good point!
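
To make the point above concrete, here is a minimal sketch comparing the three losses on a single example. Everything in it is an assumption chosen for illustration and is not taken from the paper: 10 classes, a smoothing factor `eps = 0.1`, logits parameterized by a single "gap" between the true class and the rest, and an L2 loss applied directly to the raw outputs against one-hot targets.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a plain list of floats.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def smoothed_targets(num_classes, true_class, eps):
    # Standard label smoothing: eps / K on every class, plus (1 - eps) on the true class.
    base = eps / num_classes
    return [base + (1.0 - eps if k == true_class else 0.0) for k in range(num_classes)]

def cross_entropy(logits, targets):
    probs = softmax(logits)
    return -sum(t * math.log(p) for t, p in zip(targets, probs))

def l2_loss(outputs, targets):
    return sum((o - t) ** 2 for o, t in zip(outputs, targets))

def logits_with_gap(num_classes, true_class, gap):
    # Hypothetical parameterization: true-class logit = gap, all others = 0.
    return [gap if k == true_class else 0.0 for k in range(num_classes)]

K, y, eps = 10, 3, 0.1          # illustrative values, not from the paper
one_hot = smoothed_targets(K, y, 0.0)
smooth = smoothed_targets(K, y, eps)
gaps = [1.0, 2.0, 4.0, 8.0, 16.0]

ce_hard = [cross_entropy(logits_with_gap(K, y, g), one_hot) for g in gaps]
ce_smooth = [cross_entropy(logits_with_gap(K, y, g), smooth) for g in gaps]
l2 = [l2_loss(logits_with_gap(K, y, g), one_hot) for g in gaps]

# Gap at which the softmax output equals the smoothed target exactly.
optimal_gap = math.log((1.0 - eps + eps / K) / (eps / K))

for g, a, b, c in zip(gaps, ce_hard, ce_smooth, l2):
    print(f"gap={g:5.1f}  CE(one-hot)={a:.4f}  CE(smoothed)={b:.4f}  L2={c:.1f}")
print(f"smoothed CE is minimized at gap = {optimal_gap:.3f}")
```

Under these assumptions, the one-hot cross-entropy keeps decreasing as the gap grows, so the optimizer is always rewarded for larger logits. The smoothed cross-entropy and the L2 loss both have a finite optimum (a gap of about 4.5 and exactly 1, respectively), which is the shared regularizing effect the comment refers to.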
@edoardoguerriero2464 4 years ago
It seems to be really sensitive to the image size. If you draw a digit as big as the window, the cells have a hard time finding an equilibrium.
@albertwang5974 4 years ago
Great, it's a mini implementation of the Thousand Brains theory.
@JuanColonna 4 years ago
Great video and explanations. Thanks
@etiennetiennetienne 4 years ago
Isn't this just a convolutional RNN? If so, can't they pass a hidden state bounded between -1 and 1 instead of the decoded logits?
@YannicKilcher 4 years ago
Yes, there's certainly a lot in common.
@etiennetiennetienne 4 years ago
If it is an RNN, I don't think this L2 stuff is needed; otherwise people would do the same for standard RNN training. Either detach the prediction from the state, or feed back the probabilities after a softmax?
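
The suggestion in this thread, keeping whatever is fed back between steps bounded, can be illustrated with a toy recurrence. This is a hedged sketch only: the 1D grid, the neighbour averaging, the `gain` value, and the function names are all hypothetical and are not the paper's model. It just contrasts a residual update that passes raw values on with the same update squashed through `tanh`.

```python
import math

def neighbour_mean(state, i):
    # Average of a cell and its two neighbours; cells beyond the edge count as 0.
    left = state[i - 1] if i > 0 else 0.0
    right = state[i + 1] if i < len(state) - 1 else 0.0
    return (left + state[i] + right) / 3.0

def step(state, gain, squash):
    # Residual update: each cell adds a scaled message from its neighbourhood.
    updated = [s + gain * neighbour_mean(state, i) for i, s in enumerate(state)]
    # Optionally bound the state to (-1, 1) before it is passed to the next step.
    return [math.tanh(u) for u in updated] if squash else updated

def run(state, steps, gain=0.5, squash=False):
    for _ in range(steps):
        state = step(state, gain, squash)
    return state

initial = [0.1, 0.2, 0.3, 0.2, 0.1]   # arbitrary illustrative starting state
raw = run(initial, steps=50, squash=False)
bounded = run(initial, steps=50, squash=True)

print("largest raw value:    ", max(abs(v) for v in raw))
print("largest bounded value:", max(abs(v) for v in bounded))
```

In this toy setting the unsquashed state grows by a constant factor per step, while the `tanh` version settles inside [-1, 1]. Whether that is enough to replace the L2 loss in the actual model is the open question the comment raises.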
@patf9770 3 years ago
There's something about this that is very reminiscent of Geoffrey Hinton's GLOM.
@shairuno 4 years ago
Wow, I realize that a transformer in NLP is also similar to the message-passing mechanism. But for a transformer, the number of messages passed is limited.
@potatooflife8603 4 years ago
It's so cute.
@rickandelon9374 4 years ago
I like your name :~)
@RickeyBowers 4 years ago
It definitely needs a "this is not a number" option.
@sayakpaul3152 4 years ago
I see many grounds to connect this to implicit neural representations.
@herp_derpingson 4 years ago
9:30 It's like the children's game called Chinese whispers? 10:30 In non-trivial problems, we might run into some bandwidth issues. 17:30 Cells deteriorating over time. Is this analogous to ageing? 26:40 Is that what I think it is? :)
@YannicKilcher 4 years ago
It's like Chinese whispers with backprop, whatever that translates to :D The analogy to ageing might be pretty weak, but maybe :) something like being more and more set in your ways over time ;)
@qw4316 4 years ago
Hi, your explanation is good. Is there any paper about CNNs?
@АлексейТучак-м4ч 4 years ago
That recurrent residual convolution resembles the Neural ODE paper.
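
The resemblance can be written out in one line, assuming the cell update has the residual form below. The symbols are introduced here for illustration only: $s_t$ is the cell-state grid at step $t$ and $f_\theta$ is the learned convolutional update.

```latex
s_{t+1} = s_t + f_\theta(s_t)
\;\Longleftrightarrow\;
\frac{s_{t+\Delta t} - s_t}{\Delta t} = f_\theta(s_t), \quad \Delta t = 1
\qquad\xrightarrow{\;\Delta t \to 0\;}\qquad
\frac{\mathrm{d}s}{\mathrm{d}t} = f_\theta\bigl(s(t)\bigr)
```

Read this way, each residual step is a forward-Euler step of size 1 on an ODE whose right-hand side is the learned update, which is the connection to Neural ODEs that the comment points at.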
@004307ec 4 years ago
What if the signal-processing part used localized convolutions (with some constraint to prevent the kernels from being too different from each other)?
@1998sini 4 years ago
How and where do you find these interesting papers?
@TheMazyProduction 4 years ago
Follow a bunch of the researchers on Twitter.
@herp_derpingson 4 years ago
Also the Discord group.
@mohamedosman6740 4 years ago
@herp_derpingson group name?
@priyamdey3298 4 years ago
Also keeping an eye on arxiv-sanity, I guess.
@herp_derpingson 4 years ago
@mohamedosman6740 It's in the description.
@Marcos10PT 4 years ago
19:44 Freudian slip
@default632 4 years ago
urinal instead of neural haha
@DanFrederiksen 4 years ago
Romanticizing without benefit?
@spsharan2000 4 years ago
Which tablet and app are you using?
@YannicKilcher 4 years ago
OneNote on an old Surface
@not_a_human_being 4 years ago
now do this with brain-cells and we're done! :D
@zhangcx93 4 years ago
Looks like a GCN in some sense...
@default632 4 years ago
Isn't that AMD graphics architecture?