From Deep Learning of Disentangled Representations to Higher-level Cognition

  Рет қаралды 73,625

Microsoft Research

Microsoft Research

Күн бұрын

Пікірлер: 61
@flamingxombie
@flamingxombie 6 жыл бұрын
The intuition for why we current speech models can't produce good unconditional samples (see wavenet) is simply mindblowing. Phonemes occupy a small number of bits as compared with the overall signal (~10/s as compared with 16 k/s)!
@mjParetoQuant
@mjParetoQuant 3 жыл бұрын
I liked that point very much. However, it is also quite obvious: we don't analyse the signal with our brain, we analyse the sentenses and the meaning.
@johntanchongmin
@johntanchongmin Жыл бұрын
Interpolating in abstract space, exactly what Stable Diffusion is doing. This idea is really impactful.
@itaybenou
@itaybenou 6 жыл бұрын
Wonderful video. You can't help but admire his approach for what is AI, and the way he manages to convey these concepts. Brilliant!
@catsaresocute650
@catsaresocute650 2 жыл бұрын
In Lexes Podcast just the mention of the example of speech as becoming unrecognicable gibberish (due to the amount of data) but when you seperate the gibberish to get a baisc feel for intonation or sound and speach as vocilisation of cerain tones that humans think of as speach you get a functional way to work it out would have totally suffieced to get the thing
@tempvariable
@tempvariable 5 жыл бұрын
In 12:07, are cognitive states low dimensional if that is the case are they sparse? If they are both sparse and low dimensional it contradicts with what he said in his MSS talk in 2012, where he states high dimensional and sparse is better than low dimensional
@maloxi1472
@maloxi1472 Жыл бұрын
He's allowed to change his opinion and improve his theories over time. That's (interestingly enough) the kind of stuff that general intelligence allows
@juliocardenas4485
@juliocardenas4485 2 жыл бұрын
The camera work negatively affects a wonderful lecture
@bingeltube
@bingeltube 4 жыл бұрын
Thanks for the interesting talk! Please post the slides as well!
@johntanchongmin
@johntanchongmin Жыл бұрын
In 1:09:33, there was a question on gradual change in hypothesis space from very few samples - theory revision. I feel like neural nets may be quite ill-suited for fast change of learnt knowledge as the weights take a long time to change by backpropagation. What I believe is necessary, will be to imbue some form of learnable external memory bank on which we draw our knowledge from (in addition to neural nets), so we can just change that knowledge bank and learn new concepts instantly.
@silberlinie
@silberlinie 6 жыл бұрын
Deepening of learning into a higher cognitive level: Very good. What and where are the works, who is working on this approach?
@siarez
@siarez 6 жыл бұрын
Who is the gentleman at 1:09:35 asking a question, and bringing up gradual learning?
@DeadRabbittt
@DeadRabbittt 6 жыл бұрын
Its Patrice Simard from Microsoft Research
@muckvix
@muckvix 6 жыл бұрын
Anyone has a link to the slides? And come on camera people, it's not a beauty pageant, it's ok if you show slides instead of the speaker's face :)
@oudarjyasensarma4199
@oudarjyasensarma4199 4 жыл бұрын
@@ProfessionalTycoons 404 not found!!! Can you share a revised link? Thanks in advance!!!
@reidalbecker3830
@reidalbecker3830 4 жыл бұрын
medium.com/@SeoJaeDuk/archived-post-from-deep-learning-of-disentangled-representations-to-higher-level-cognition-b848fdc0de2c
@nguyenngocly1484
@nguyenngocly1484 4 жыл бұрын
You can turn artificial neural networks inside-out by using fixed dot products (weighted sums) and adjustable (parametric) activation functions. The fixed dot products can be computed very quickly using fast transforms like the FFT. Also the number of overall parameters required is vastly reduced. The dot products of the transform act as statistical summary measures. Ensuring good behavour. See Fast Transform (fixed filter bank) neural networks. The variance equation for linear combinations of random variables is very useful for understanding dot products in neural networks especially in conjunction with cosine angle. Also ReLU is a switch. The electricty in your house is a sine wave. Turn on a switch and the output is f(x)=x. Again the same sine wave as the input. Off(x)=0. A ReLU neural network then is a switched composition of dot products. If the switch states are known then there is a linear mapping between the input vector and the output vector which you can check out with various metrics.
@rahuldeora5815
@rahuldeora5815 5 жыл бұрын
Someone should write a detailed blog explaining stuff in this
@scose
@scose 6 жыл бұрын
Sampling rate * bit depth is a big overestimate of the amount of information in speech audio signals - look at the compression ratios that audio codecs can achieve
@ewfq2
@ewfq2 4 жыл бұрын
I want to talk with the guy talking about barycentres and wasserstein distance!
@ewfq2
@ewfq2 4 жыл бұрын
around 1:06:00
@ewfq2
@ewfq2 4 жыл бұрын
What's the neurips optimal transport tutorial mentioned?
@jonabirdd
@jonabirdd 5 жыл бұрын
51:00 I like the idea of a two-level system but disagree with the mutual information criterion.
@nathanbittner1452
@nathanbittner1452 5 жыл бұрын
Interesting. Could you say a bit more on this?
@ProfessionalTycoons
@ProfessionalTycoons 5 жыл бұрын
amazing talk.
@dr.mikeybee
@dr.mikeybee 6 жыл бұрын
Doesn't translation into an abstract space necessitate a loss of information?
@tomm7273
@tomm7273 6 жыл бұрын
Yes, but the benefits of dimensionality reduction far outweigh that. You don't need to consider every pixel of a picture to reason about the objects contained within that picture and their features.
@zachundisclosed6706
@zachundisclosed6706 5 жыл бұрын
There is also information stored in the decoder.
@nauy
@nauy 3 жыл бұрын
No, not if the information content is low dimensional to begin with. Consider a circle of radius r is rendered at location (x, y) on a bit map. The information in pixel space is high dimensional - the number of pixels in the bit map. But the same circle can be transformed into a 3-dimensional parameter space representation - (x, y , r) with no loss of information. The same circle in pixel space can always be regenerated using the parameter space representation.
@dr.mikeybee
@dr.mikeybee 3 жыл бұрын
@@nauy Thanks. I've been learning about matrix transformations and PCA lately. It took me a few years to get here.
@runvnc208
@runvnc208 5 жыл бұрын
Sounds right to me. But why do they assume that the traditional neural net and deep learning are the best or only possible fundamental structures and processes for a system with these capabilities of disentangled abstractions working together with granular representations?
@moisesfelipe9596
@moisesfelipe9596 5 жыл бұрын
Good point. I'd like to hear something in that sense. I fell the current popularization of NN and DL has lead a lot of people to not consider any other alternatives and then losing other ways to solve problems and useful insights.
@moisesfelipe9596
@moisesfelipe9596 5 жыл бұрын
For example, when I heard the term disentangling I cannot stop thinking on this as a fancy (and potentially more sophisticated) way to refer to blind source separation.
@yeodongyoun6780
@yeodongyoun6780 2 жыл бұрын
Wow the subtitles are terrible :( ... GAN -> gown, k means -> keys lol
@MartinLichtblau
@MartinLichtblau 6 жыл бұрын
Humans use fuzzy approaches, while computers use precise numbers. Which one can work in this complex world?
@tomm7273
@tomm7273 6 жыл бұрын
Computers can use fuzzy approaches as well. Almost all modern machine learning techniques are fuzzy.
@MartinLichtblau
@MartinLichtblau 6 жыл бұрын
​@@tomm7273 If you mean Deep Learning I'd say: yeah the direction seems ok. But the way computers work, and any representation they use is, quantitive and they are absolutely precise with those numbers. While humans think in qualitative terms, like "this rough concept is very similar to that one". Indeed they can't quantify things precisely, but that is what makes humans more capable to deal with all this ambiguous complexity.
@ahilanpalarajah3159
@ahilanpalarajah3159 5 жыл бұрын
@@MartinLichtblau Why do you think fuzziness has to contradict precise numbers? I'm not arguing it doesn't, I'm just asking because we can fuzzify and work with vagueness to eliminate as much of the search space as possible.
@MartinLichtblau
@MartinLichtblau 5 жыл бұрын
@@ahilanpalarajah3159 It's complicated, but in it's basic sense it doesn't. I just couldn't find simple terms to tell them apart. Perhpas better say Accurate vs. Approximate or rigid vs. flexible...
@mikepict9011
@mikepict9011 5 жыл бұрын
Humans have emotion , humans care .... robots will never care
@ahmadchamseddine6891
@ahmadchamseddine6891 6 жыл бұрын
He is genius
@zlh
@zlh 5 жыл бұрын
46:00 re: attention as gating the conscious and unconscious thoughts - can you imagine a machine which can widen and narrow its aperture of attention to accomplish different tasks?
@rahuldeora5815
@rahuldeora5815 5 жыл бұрын
Someone should write a detailed blog explaining stuff in this
Humans, Machines, and Work: The Future is Now
1:15:25
Microsoft Research
Рет қаралды 2,6 М.
Geoffrey Hinton: The Foundations of Deep Learning
28:22
Elevate
Рет қаралды 130 М.
When u fight over the armrest
00:41
Adam W
Рет қаралды 26 МЛН
What type of pedestrian are you?😄 #tiktok #elsarca
00:28
Elsa Arca
Рет қаралды 17 МЛН
PIZZA or CHICKEN // Left or Right Challenge
00:18
Hungry FAM
Рет қаралды 16 МЛН
Research in Focus: Deep Learning Research and the Future of AI
26:48
Microsoft Research
Рет қаралды 51 М.
The Thousand Brains Theory
1:30:07
Microsoft Research
Рет қаралды 45 М.
Top Minds in AI Explain What’s Coming After GPT-4o | EP #130
25:30
Peter H. Diamandis
Рет қаралды 102 М.
When u fight over the armrest
00:41
Adam W
Рет қаралды 26 МЛН