Thanks Yannic! I found it odd that they used L2 instead of cross-entropy with label smoothing; it should have the same regularizing effect of not pushing the logits to infinity.
@YannicKilcher 4 years ago
Good point!
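A minimal sketch of the comparison in the comment above (PyTorch is assumed here, and the variable names are placeholders, not the paper's code): both an L2 loss on the logits and cross-entropy with label smoothing give bounded targets, so neither rewards pushing the logits toward infinity.

```python
import torch
import torch.nn.functional as F

# Toy setup: 8 cells voting over 10 digit classes (placeholder shapes).
logits = torch.randn(8, 10, requires_grad=True)
targets = torch.randint(0, 10, (8,))

# L2 loss against one-hot targets (the choice discussed in the video):
# the targets are bounded, so there is no incentive to grow the logits.
one_hot = F.one_hot(targets, num_classes=10).float()
l2_loss = ((logits - one_hot) ** 2).mean()

# Cross-entropy with label smoothing (the alternative raised in the comment):
# the smoothed target distribution also makes the optimal logits finite,
# giving a similar regularizing effect. Requires PyTorch >= 1.10.
ce_smooth = F.cross_entropy(logits, targets, label_smoothing=0.1)
```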
@edoardoguerriero2464 4 years ago
It seems to be really sensitive to the image size. If you draw a digit as big as the window, the cells have a hard time finding an equilibrium.
@albertwang5974 4 years ago
Great, it's a mini implementation of the Thousand Brains theory.
@JuanColonna 4 years ago
Great video and explanations. Thanks
@etiennetiennetienne 4 years ago
Isn't this just a convolutional RNN? If so, can't they pass a hidden state bounded between -1 and 1 instead of the decoded logits?
@YannicKilcher 4 years ago
Yes, there's certainly a lot in common.
@etiennetiennetienne 4 years ago
If it is an RNN, I don't think this L2 stuff is needed, otherwise people would do the same for standard RNN training. Either detach the prediction from the state or feed back the probabilities after a softmax?
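A minimal sketch of what this comment suggests, assuming a PyTorch-style convolutional RNN cell (illustrative only, not the paper's architecture): the recurrent state is bounded with tanh, and the class logits are decoded from it separately rather than fed back in raw.

```python
import torch
import torch.nn as nn

class ConvRNNCell(nn.Module):
    """Minimal convolutional RNN cell sketching the suggestion above:
    keep the hidden state bounded in (-1, 1) with tanh and decode logits
    from it separately, instead of feeding raw logits back into the state."""
    def __init__(self, channels=16, n_classes=10):
        super().__init__()
        self.update = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.readout = nn.Conv2d(channels, n_classes, kernel_size=1)

    def forward(self, state):
        # Bounded state update: tanh keeps every channel in (-1, 1),
        # so nothing in the recurrence can blow up over many steps.
        state = torch.tanh(state + self.update(state))
        # The prediction is read out from the state but never fed back,
        # so no extra regularization of the logits is needed.
        logits = self.readout(state)
        return state, logits
```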
@patf9770 3 years ago
There's something about this that's very reminiscent of Geoffrey Hinton's GLOM.
@shairuno 4 years ago
Wow, I realize that a transformer in NLP is also similar to this message-passing mechanism. But for a transformer, the number of message-passing steps is limited.
@potatooflife8603 4 years ago
It's so cute.
@rickandelon9374 4 years ago
I like your name :~)
@RickeyBowers 4 years ago
It definitely needs a "this is not a number" option.
@sayakpaul3152 4 years ago
I see many grounds to connect this to implicit neural representations.
@herp_derpingson 4 years ago
9:30 It's like the children's game called Chinese whispers?
10:30 In non-trivial problems, we might run into some bandwidth issues.
17:30 Cells deteriorating over time. Is this analogous to ageing?
26:40 Is that what I think it is? :)
@YannicKilcher 4 years ago
It's like Chinese whispers with backprop, whatever that translates to :D The analogy to ageing might be pretty weak, but maybe :) something like being more and more set in your ways over time ;)
@qw4316 4 years ago
Hi, your explanation is good. Is there any paper about CNNs?
@АлексейТучак-м4ч 4 years ago
That recurrent residual convolution resembles the Neural ODE paper.
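A small sketch of the resemblance, under the usual reading (illustrative, not from either paper): iterating a residual convolution x ← x + f(x) is a fixed-step Euler discretization of dx/dt = f(x), which is the ODE that the Neural ODE paper integrates with an adaptive solver instead of fixed steps.

```python
import torch
import torch.nn as nn

# f plays the role of the residual update; shapes are placeholders.
f = nn.Conv2d(16, 16, kernel_size=3, padding=1)

x = torch.randn(1, 16, 28, 28)
dt = 1.0  # fixed step size of the recurrent/residual formulation
for _ in range(10):
    # Euler step of dx/dt = f(x); with dt = 1 this is exactly x + f(x).
    x = x + dt * f(x)
```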
@004307ec 4 years ago
What if the signal-processing part used localized convolutions (with some constraint to prevent the kernels from being too different from each other)?
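A rough sketch of that idea (hypothetical, PyTorch assumed, not from the paper): a locally connected layer with one kernel per spatial position, plus a penalty that keeps the per-position kernels close to their mean.

```python
import torch
import torch.nn as nn

class LocalConv2d(nn.Module):
    """Locally connected layer: every spatial position gets its own 3x3 kernel,
    with a similarity penalty that constrains the kernels toward their mean."""
    def __init__(self, channels=16, height=28, width=28, k=3):
        super().__init__()
        self.h, self.w, self.k = height, width, k
        # One kernel per output location: (H*W, out_channels, in_channels*k*k)
        self.weight = nn.Parameter(
            torch.randn(height * width, channels, channels * k * k) * 0.01)

    def forward(self, x):
        # Extract k x k patches around every position: (B, C*k*k, H*W)
        patches = nn.functional.unfold(x, self.k, padding=self.k // 2)
        # Per-location matrix multiply: (B, H*W, out_channels)
        out = torch.einsum('bcl,loc->blo', patches, self.weight)
        return out.permute(0, 2, 1).reshape(x.shape[0], -1, self.h, self.w)

    def similarity_penalty(self):
        # The constraint from the comment: discourage kernels from drifting
        # too far apart by penalizing deviation from the mean kernel.
        mean_kernel = self.weight.mean(dim=0, keepdim=True)
        return ((self.weight - mean_kernel) ** 2).mean()
```

This penalty would be added to the training loss with some weight; setting that weight very high recovers an ordinary weight-tied convolution.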
@1998sini 4 years ago
How and where do you find these interesting papers?