Efficiently Modeling Long Sequences with Structured State Spaces - Albert Gu | Stanford MLSys #46

  Рет қаралды 20,220

Stanford MLSys Seminars

Stanford MLSys Seminars

Күн бұрын

Пікірлер: 14
@tarepan_YT
@tarepan_YT 3 жыл бұрын
Impressive works, clear presentation, and intriguing discussions! Thanks for sharing great seminar.
@BREAKDRS
@BREAKDRS 2 жыл бұрын
Very well organized and easy to follow. Thanks!
@salehgholamzadeh3368
@salehgholamzadeh3368 2 жыл бұрын
A really Great Talk. Can S4 be integrated into Reinforcement Learning?
@brandomiranda6703
@brandomiranda6703 2 жыл бұрын
How do you think it will compare with memorizing transformers?
@m.d.4979
@m.d.4979 8 ай бұрын
Hello! Great talk! I am currently studying your SSM-related works. They are amazing! Please share your ideas, challenges, and outcomes for implementing your MAMBA model into human(sports athlete) action forecasting. Thank you for your kind reply!
@halilibrahimakgun7569
@halilibrahimakgun7569 Ай бұрын
can yu share slides
@sabrango
@sabrango 7 ай бұрын
Amazing
@gebob19
@gebob19 2 жыл бұрын
really great talk
@stergiosbachoumas2476
@stergiosbachoumas2476 Жыл бұрын
With regards to the stability question and I repeat: "Are Hippo A matrices stable?": The answer is that they are not stable or Hurwitz as we say in Control Theory because their eigenvalues are outside the unit circle. This is trivial to show as they are Lower triangular and therefore their eigenvalues are sitting on the diagonal. Thus the eigenvalues are 1,2,...,n+1 for an (nxn) matrix. Unfortunately, the organizers did not let Albert share his screen to show the form of the A matrix again. With this information now it would be very interesting to talk again about stability because Albert said that they are stable, well in what sense? Also, it's very interesting that other stable matrices do not lead to good learning.
@jonathanballoch
@jonathanballoch Жыл бұрын
what are the implications of this instability
@stergiosbachoumas2476
@stergiosbachoumas2476 Жыл бұрын
@@jonathanballoch What my comment above says is all wrong, the HiPPO matrix is stable because the eigenvalues are -1,-2,...,-n+1 all in the Left hand plane (i.e. negative). I forgot to come back and delete this comment but I will leave it here to remind myself that I must be more careful next time.
@simonl1938
@simonl1938 11 ай бұрын
I'm trying to implement the S4 myself in C right now and have the issue of the state exploding, I don't see how the matrix is stable at all. Do you have any suggestions on what I should look into?
@swfsql
@swfsql 10 ай бұрын
@@stergiosbachoumas2476 Thanks for your update. Just a question, by being on the left hand plane are you referring to the root places in space state control theory?
@rohanasokan7338
@rohanasokan7338 8 ай бұрын
​@@simonl1938 To add to Stergios. It seems Gu keeps the matrix form exploding by keeping the matrix in the left hand plane and he is doing that by limiting the real part of the diagonal to -1/2. There are some ablations he does to this in his dissertation if you are interested. And because it is on the left hand plane, the entire formulation will transform to the complex unit circle. In positive real space, you will always have the state explosion problem.
Data Science for Infrastructure w Pixie CEO Zain Asgar | Stanford MLSys #47
55:24
Stanford MLSys Seminars
Рет қаралды 1,3 М.
Monarch Mixer: Making Foundation Models More Efficient - Dan Fu | Stanford MLSys #86
56:32
To Brawl AND BEYOND!
00:51
Brawl Stars
Рет қаралды 17 МЛН
It works #beatbox #tiktok
00:34
BeatboxJCOP
Рет қаралды 41 МЛН
Quando A Diferença De Altura É Muito Grande 😲😂
00:12
Mari Maria
Рет қаралды 45 МЛН
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - 693
57:25
The TWIML AI Podcast with Sam Charrington
Рет қаралды 6 М.
Hardware-aware Algorithms for Sequence Modeling - Tri Dao | Stanford MLSys #87
1:19:06
Training LLMs at Scale - Deepak Narayanan | Stanford MLSys #83
55:59
Stanford MLSys Seminars
Рет қаралды 10 М.
The Most Important Algorithm in Machine Learning
40:08
Artem Kirsanov
Рет қаралды 546 М.
Text2SQL: The Dream versus Reality - Laurel Orr | Stanford MLSys #89
57:05
Stanford MLSys Seminars
Рет қаралды 6 М.
Notes on AI Hardware - Benjamin Spector | Stanford MLSys #88
1:16:48
Stanford MLSys Seminars
Рет қаралды 5 М.
The State Space Model Revolution, with Albert Gu
1:42:16
Cognitive Revolution "How AI Changes Everything"
Рет қаралды 2,4 М.