Comments
@MatzeU-r8q
@MatzeU-r8q 1 day ago
One of the BEST tutorials/explanations I have ever seen, thank you very much for that ❤
@AICoffeeBreak
@AICoffeeBreak 20 hours ago
Much appreciated!
@pranavb9768
@pranavb9768 1 day ago
you’re cute
@ahsan4life2020
@ahsan4life2020 4 days ago
This is a very valuable video for me. Thank you so much
@AICoffeeBreak
@AICoffeeBreak 4 days ago
For a second I thought you were talking about the three minutes of uncomfortable silence. In the comment notification I don't see which video a comment was made on. 😅 So I'm glad it is about this one.
@erongjoni3464
@erongjoni3464 5 days ago
Lost it on "Watch the full video in the link below"
@AICoffeeBreak
@AICoffeeBreak 5 days ago
And that is just the finest 3-minute selection from the full 50-minute awkward-silence video. 😏
@foreignconta
@foreignconta 6 days ago
What exactly happened here? 🤔
@AICoffeeBreak
@AICoffeeBreak 6 days ago
My editor had fun cutting out moments of awkward silence from my next (1h+) video.
@bmebri1
@bmebri1 7 days ago
I love it; the most unawkward facet of you yet 😊
@AICoffeeBreak
@AICoffeeBreak 6 days ago
@DaNa-px7ol
@DaNa-px7ol 7 days ago
Wow, I didn’t know I could enjoy “awkward” silence 😮
@AICoffeeBreak
@AICoffeeBreak 7 days ago
😂
@Thomas-gk42
@Thomas-gk42 7 days ago
Waiting for the PhD result...?
@AICoffeeBreak
@AICoffeeBreak 7 days ago
Haha, fortunately, that one is clear. 😅🎉🤗
@outliier
@outliier 7 days ago
Some would upload this as ASMR
@AICoffeeBreak
@AICoffeeBreak 7 days ago
🎙️
@WhatsAI
@WhatsAI 7 days ago
Haha, a classic when recording!
@DerPylz
@DerPylz 7 days ago
I like the part where she doesn't say anything.
@AICoffeeBreak
@AICoffeeBreak 7 days ago
😅
@azmathmoosa4324
@azmathmoosa4324 15 days ago
Good, concise explanation.
@sohambit9393
@sohambit9393 19 days ago
Damn, you like Neffex ❤ Neffex is like 10% of my life.
@enicay7562
@enicay7562 22 days ago
Thank you, Miss Coffee Bean!
@AICoffeeBreak
@AICoffeeBreak 21 days ago
@kristoferkrus
@kristoferkrus 22 days ago
Cool! The simple but effective ideas are the most interesting. Have you checked out Nvidia's normalized transformer (nGPT)? It seems to be one of those cases. The differential transformer (arXiv: 2410.05258) also seems like it could be interesting.
@davide0965
@davide0965 25 days ago
Terrible
@DerPylz
@DerPylz 25 days ago
If you don't like her videos, why do you keep coming back to them just to comment that you didn't like them? Just watch something else.
@AICoffeeBreak
@AICoffeeBreak 26 days ago
03:23 It looks like the MLP takes the representation of JUST the 8th layer, and not also of the previous ones. I just found this out in a quick email exchange with the authors.
@bethany-rp2tq
@bethany-rp2tq 26 days ago
CONGRATULATIONS Letitia!!!! Is there a PDF version where I can read your PhD thesis? Always amazed to learn new things from you :)
@AICoffeeBreak
@AICoffeeBreak 26 days ago
Yes, there is! archiv.ub.uni-heidelberg.de/volltextserver/35753/ Thanks for your interest!
@davidrichards1302
@davidrichards1302 29 days ago
Unison's unique features offer promising possibilities for enhancing the FunSearch operational model:
- Content-addressed code: Unison's hash-based identification of code could streamline FunSearch's program generation and evaluation process. Each generated program would have a unique hash, simplifying tracking and caching of results.
- Immutable codebase: Unison's immutable data structure approach to codebases could enhance FunSearch's evolutionary process. It would allow for efficient storage and retrieval of program versions without conflicts.
- No builds: Unison's ability to parse and typecheck definitions once, storing results in a cache, could significantly speed up FunSearch's evaluation phase. This would reduce overhead in assessing generated programs.
- Easy distributed computing: Unison's content-addressed nature facilitates distributed computation, which could enhance FunSearch's parallelization capabilities. This could allow for more efficient scaling of the search process across multiple machines.
- First-class documentation: Unison's approach to documentation as executable code could improve FunSearch's ability to generate and evaluate self-documenting programs, potentially leading to more interpretable solutions.
- Strong typing with inference: Unison's type inference could help FunSearch generate type-safe programs more efficiently, potentially reducing the number of invalid programs generated.
These features could make Unison an excellent choice for implementing core components of the FunSearch system, particularly in program generation, evaluation, and distributed processing.
@duzx4541
@duzx4541 1 month ago
Hmm, the only thing I don't really understand is why we have to use sin and cos instead of only one of them D:
@AICoffeeBreak
@AICoffeeBreak 1 month ago
They are the same curve with just a phase shift. Rotary embeddings take the idea of phase shifting and turn it up a notch.
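The phase-shift point can be made concrete with a tiny sketch (entirely my own illustration; the function names and dimensions are made up). Storing each frequency as a (sin, cos) pair means that shifting the position by k positions is a fixed 2D rotation of each pair, independent of the position itself; with sin alone, that property is lost. This is exactly the structure rotary embeddings build on:

```python
import numpy as np

def pos_encoding(pos, d_model=8, base=10000.0):
    # Classic sinusoidal encoding: one frequency per (sin, cos) pair.
    i = np.arange(d_model // 2)
    freqs = base ** (-2 * i / d_model)
    angles = pos * freqs
    return np.stack([np.sin(angles), np.cos(angles)], axis=-1)  # (d_model/2, 2)

def rotate(pairs, k, d_model=8, base=10000.0):
    # Rotate each (sin, cos) pair by the angle a shift of k positions adds.
    i = np.arange(d_model // 2)
    theta = k * base ** (-2 * i / d_model)
    c, s = np.cos(theta), np.sin(theta)
    sin_p, cos_p = pairs[:, 0], pairs[:, 1]
    # sin(a+t) = sin(a)cos(t) + cos(a)sin(t); cos(a+t) = cos(a)cos(t) - sin(a)sin(t)
    return np.stack([sin_p * c + cos_p * s, cos_p * c - sin_p * s], axis=-1)

# The encoding of position 7 is just the encoding of position 5 rotated by 2:
assert np.allclose(rotate(pos_encoding(5.0), 2.0), pos_encoding(7.0))
```

Because the rotation depends only on the offset k, a model can learn relative-position behaviour from these pairs, which a single sinusoid would not allow.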
@katorea
@katorea 1 month ago
Loved your explanation! Thank you very much!! :D
@AICoffeeBreak
@AICoffeeBreak 1 month ago
@solsospecial
@solsospecial 1 month ago
Okay, the approach is “guessing”. Got it.
@KnowledgeSynthesizer
@KnowledgeSynthesizer 1 month ago
Can I ask what app/software tools you use for recording and editing audio and video?
@AICoffeeBreak
@AICoffeeBreak 26 days ago
Hi, it is good old PowerPoint for all visualisations. Ms. Coffee Bean comes in during editing in Adobe Premiere. :)
@floriankowarsch8682
@floriankowarsch8682 1 month ago
It would definitely be interesting to train end to end with a contrastive loss, because right now it is more like a distillation task that is limited by the teacher model's capability and domain.
@dylancope
@dylancope 1 month ago
"wait a few years"... 9 months later with Genie 2 😂
@AICoffeeBreak
@AICoffeeBreak 26 days ago
Years in ML pass very quickly. 🤣🤣🤣
@rogerthat7190
@rogerthat7190 1 month ago
This was very helpful, thank you!
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thanks for the kind words!
@hjups
@hjups 1 month ago
It's a clever idea, but the paper has many methodological issues, which may amount to more hype than substance. The results are good, but the major gains come from stacking contributions. And of course you have to worry about the extra training cost (DINO is very expensive), generalization, etc. The peak accuracy shift is also worrisome, indicating that removing the driving force would likely result in an expensive shift / performance degradation when finetuning.
@fast_harmonic_psychedelic
@fast_harmonic_psychedelic 1 month ago
That's such a good idea, very simple... I would not have thought to do this. I have added CLIP loss terms on the last layer, but not DINO on the 8th layer lol.
@AICoffeeBreak
@AICoffeeBreak 26 days ago
Yes, indeed. 😅 The last layers still need to focus on reconstruction, or rather on predicting the noise. Injecting into the first layers helps the network "conceptualise" what the denoised result needs to become; the following layers can then focus on predicting that noise.
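A hedged sketch of the mechanism discussed in this thread (function names, shapes, and the loss weight are my own assumptions, not the paper's code): an auxiliary term pulls projected intermediate features (e.g. from the 8th layer) toward frozen DINO features, while the usual denoising objective stays in charge of the output layers.

```python
import numpy as np

def cosine_align_loss(h_proj, dino_feats):
    # Negative mean cosine similarity: minimising it pulls each projected
    # hidden-state token toward the corresponding (frozen) DINO token.
    num = np.sum(h_proj * dino_feats, axis=-1)
    den = (np.linalg.norm(h_proj, axis=-1)
           * np.linalg.norm(dino_feats, axis=-1) + 1e-8)
    return -np.mean(num / den)

def total_loss(pred_noise, true_noise, h_proj, dino_feats, lam=0.5):
    # Standard denoising MSE plus the alignment term on an early layer;
    # lam is a hypothetical trade-off weight.
    denoise = np.mean((pred_noise - true_noise) ** 2)
    return denoise + lam * cosine_align_loss(h_proj, dino_feats)

rng = np.random.default_rng(0)
h = rng.normal(size=(2, 4, 8))   # toy stand-in for projected layer-8 tokens
# Perfect alignment drives the auxiliary term to its minimum of -1:
assert np.isclose(cosine_align_loss(h, h), -1.0)
```

The division of labour follows from where the term is attached: only the early layers feel the alignment gradient directly, so the later layers remain free to specialise in noise prediction.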
@EkShunya
@EkShunya 1 month ago
I missed you, glad to see you back with your great explainers
@AICoffeeBreak
@AICoffeeBreak 1 month ago
@circuitbreaker7860
@circuitbreaker7860 1 month ago
I'm curious how they might scale this with additional external representations. How would one have to change the training approach for, e.g., 3 sources? Could one simply co-optimise for all three losses + image-gen, or would they interfere with each other? Can't wait to read the follow-up papers exploring such questions.
@m_ke
@m_ke 1 month ago
It's not a long-term approach; autoregressive generative vision-language models are the future.
@LoFiLatentSpaceTools
@LoFiLatentSpaceTools 1 month ago
lol. JEPA says otherwise 😂
@m_ke
@m_ke 1 month ago
@ JEPA is not a generative method. I'm saying diffusion models will get replaced by autoregressive ones.
@hjups
@hjups 1 month ago
That probably depends on the application goal. In general, AR isn't a good fit for a modality that is best described by a continuous space with translation invariance. There is even some indication that diffusion / flow-matching models may work better for NLP, since they are more parallel and can perform self-correction. If anything, I would say GNN-diffusion hybrid models are probably the future.
@m_ke
@m_ke 1 month ago
@ Look up HART, VAR, Infinity and Switti, all recent works that use AR to match or beat diffusion models while being faster to train and much faster at inference. With a decoder transformer you also get a much more natural integration of modalities and easy conditioning / in-context learning.
@m_ke
@m_ke 1 month ago
@ Another great benefit of AR VLMs for generation is that you get to benefit from all the advances in multimodal LLMs and share feature representations, instead of training a huge denoising model that's hard to use for other tasks.
@theshow3376
@theshow3376 1 month ago
Why did nobody think of this earlier!?
@Ali-wf9ef
@Ali-wf9ef 1 month ago
You are absolutely amazing, thanks for this explanation. I read the paper and understood nothing!
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thanks a lot for your appreciation!
@realbenjoyo
@realbenjoyo 1 month ago
This was really great; I never really understood query, key and values before.
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you!
@nwokebugoodness4819
@nwokebugoodness4819 1 month ago
Congrats! You're inspiring.
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you
@gettingdatasciencedone
@gettingdatasciencedone 1 month ago
Great explanation -- loving your videos. The time codes for specific topics are really useful.
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you!
@davide0965
@davide0965 1 month ago
The beginning was clear, then it became very obscure 👎
@muhammadbilalawais2976
@muhammadbilalawais2976 1 month ago
Awesome content!
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you!
@davide0965
@davide0965 1 month ago
Terrible explanation
@KhazeemasaleemKhazeemasale-o8s
@KhazeemasaleemKhazeemasale-o8s 1 month ago
Hey, Khazeema Saleem Al here
@GaryGan-US
@GaryGan-US 1 month ago
Very concise; what an amazing video.
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you!
@martinkunev9911
@martinkunev9911 1 month ago
I disagree that people are judged more harshly for being bad at math than at other subjects. Compare "I'm not interested in history" or "I don't like reading" with "I'm bad at math".
@wapsyed
@wapsyed 1 month ago
UMAP rocks! The only problem I see is the explainability of this high-dimensionality reduction, which you easily get with PCA. In other words, with PCA you can find the variables that best explain the clustering, which is important when you are focusing on variable selection. What do you think?
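The explainability contrast mentioned here can be illustrated with a toy example (entirely my own sketch, not from the video): PCA's principal axes are linear combinations of the original variables, so their loadings directly rank variable importance, whereas UMAP's nonlinear embedding exposes no analogous per-variable weights.

```python
import numpy as np

rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 1))
# Three observed variables: 0 and 2 carry the latent structure, 1 is noise.
X = np.hstack([
    3.0 * latent + 0.1 * rng.normal(size=(200, 1)),   # var 0: informative
    0.1 * rng.normal(size=(200, 1)),                  # var 1: pure noise
    -2.0 * latent + 0.1 * rng.normal(size=(200, 1)),  # var 2: informative
])
Xc = X - X.mean(axis=0)

# PCA via SVD: rows of Vt are the principal axes, i.e. the loadings.
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
loadings = Vt[0]                   # first principal component
print(np.abs(loadings).round(2))   # vars 0 and 2 dominate, var 1 is near zero
```

Reading off `loadings` is the variable-selection step PCA gives you for free; for UMAP one would have to bolt on a separate attribution method.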
@LinkhManu
@LinkhManu 1 month ago
You’re the best 👏👏👏
@OlgaIvina
@OlgaIvina 1 month ago
Thank you very much for this thorough, well-curated, and comprehensive review of MAMBA.
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thank you for your appreciation! I just saw you on LinkedIn; let's stay connected!
@alexkubiesa9073
@alexkubiesa9073 1 month ago
How are exploration and exploitation abilities or forms of intelligence? To me they seem more like competing actions, like going to the shops vs going to the cinema. I am still capable of both actions.
@luise.suelves8270
@luise.suelves8270 1 month ago
Sooo well explained, brilliant!
@AICoffeeBreak
@AICoffeeBreak 1 month ago
Thanks!
@Jupiter-Optimus-Maximus
@Jupiter-Optimus-Maximus 2 months ago
Another great video, as usual! This little bean mutant of yours always puts a smile on my face ☺ Is it possible that it is actually an AI? For example, a transformer that converts language information into the facial expressions of the animated bean. That would be so cool 😎 I have a question: I am looking for training methods that are not based on backpropagation. Specifically, I want to avoid running backwards through the network again after the forward pass. Do you know of any algorithms like this? Already 2^10 * thanks in advance 😄
@moeinhasani8718
@moeinhasani8718 2 months ago
Thanks!
@AICoffeeBreak
@AICoffeeBreak 2 months ago
Wow, thank you!