XLSTM - Extended LSTMs with sLSTM and mLSTM (paper explained)

  4,937 views

AI Bites

days ago

Comments: 13
@lenhobach3162 6 months ago
I really like the way you explain the paper. It touched on a lot of concepts I had been confused about, but I wish the block parts had been explained in more detail, like why the modules are used the way they are in those blocks. Anyway, thank you so much for the video, +1 subscriber, and I hope to see more from you in the future.
@AIBites 3 months ago
Sure. So are you more interested in papers and theory? Or would you like more hands-on content on LLMs, RAG, etc.? Just trying to understand the audience better. :)
@ariisaac5111 2 months ago
@AIBites I'm more interested in the research papers and theories, and any insightful implications that you can contribute along the way. What you did here is a nice baseline. Thanks!
@thesimplicitylifestyle 6 months ago
It's so much fun looking under the hood. Thanks for explaining it so well! 😎🤖
@AIBites 3 months ago
My pleasure :)
@yuanyuan4985 5 months ago
Thank you so much for providing this video!!!!!
@AIBites 3 months ago
My pleasure, Yuan! 🙂
@newbie8051 3 months ago
Well, the graphs at 2:18 are incorrect: sigmoid and tanh have different ranges, so the output gate should have range -1 to 1 (tanh).
@AIBites 3 months ago
That's a great spot. A copy-pasting oversight, I guess 🙂 I will pay more attention while making the videos on attention. Thank you 😀
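For reference, a minimal Python sketch of the two ranges discussed in this thread. It only evaluates the two functions at a few sample points; the sample values are arbitrary. In the standard LSTM formulation the gates use sigmoid (values between 0 and 1), while tanh (values between -1 and 1) is applied to the cell input and the cell state. Which curve the video labels at 2:18 cannot be checked from the comments alone.

```python
import math


def sigmoid(x):
    """Logistic sigmoid: maps any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


# Arbitrary sample points, chosen only to show both tails and the midpoint.
xs = [-6.0, -2.0, 0.0, 2.0, 6.0]
sigmoid_values = [sigmoid(x) for x in xs]
tanh_values = [math.tanh(x) for x in xs]

for x, s, t in zip(xs, sigmoid_values, tanh_values):
    # sigmoid stays positive; tanh is symmetric around zero.
    print(f"x={x:+.1f}  sigmoid={s:.4f}  tanh={t:+.4f}")
```

At x = 0 the difference is easiest to see: sigmoid sits at the middle of its range (0.5), while tanh sits at the middle of its own range (0).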
@newbie8051 6 months ago
Could only grasp the sLSTM on the first read. So the exponential activation pushes everything up, and we use log to bring every activation into a smaller range? Damn, pretty interesting.
@AIBites 3 months ago
Thank you. Yes, whenever I don't understand equations, I plug in numbers to push values to the extremes. This way, it paints a clearer picture! :)
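The "plug in extreme numbers" idea can be tried on the sLSTM stabilization discussed in this thread. The sketch below is a scalar, single-step illustration following the stabilizer state described for sLSTM in the xLSTM paper: m_t = max(log f_t + m_{t-1}, log i_t), with the gates rescaled by exp(-m_t). The function name `slstm_step`, the scalar setup, and the choice of an exponential forget gate are assumptions made for illustration; this is not the authors' implementation.

```python
import math


def slstm_step(c, n, m, i_pre, f_pre, z, o):
    """One stabilized scalar sLSTM-style step (illustrative sketch).

    c, n, m      : previous cell state, normalizer state, stabilizer state
    i_pre, f_pre : gate pre-activations; both gates are exponential here,
                   so log(i_t) = i_pre and log(f_t) = f_pre
    z, o         : cell input and output gate value, assumed already activated
    """
    log_i = i_pre
    log_f = f_pre
    # Stabilizer: track the largest exponent seen so far, in log space.
    m_new = max(log_f + m, log_i)
    # Rescaled gates: both exponents are <= 0, so exp() cannot overflow.
    i_stab = math.exp(log_i - m_new)
    f_stab = math.exp(log_f + m - m_new)
    c_new = f_stab * c + i_stab * z
    n_new = f_stab * n + i_stab
    # The common scale factor cancels in the ratio c_new / n_new.
    h = o * (c_new / n_new)
    return c_new, n_new, m_new, h


# Extreme input-gate pre-activation: math.exp(1000.0) alone would overflow.
c, n, m, h = slstm_step(0.0, 0.0, 0.0, i_pre=1000.0, f_pre=0.0, z=0.5, o=1.0)
print(c, n, m, h)
```

Because the same factor rescales both the cell state and the normalizer, the hidden output matches the unstabilized recurrence whenever the latter is computable, which is why the log-space trick changes the numerics but not the result.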
@JAYWRITE-h3e 6 months ago
🎉🎉🎉🎉🎉🎉🎉
@AIBites 3 months ago
🙂🙂🙂