[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

3,388 views

AI Coffee Break with Letitia

1 day ago

Comments: 20
@serta5727 4 months ago
Cool 😎 Your explanation was very understandable.
@MikeAirforce111 4 months ago
Congrats Doctor!! :-) Looking forward to your future work!
@theosalmon 4 months ago
Thank you Dr. Letitia.
@DerPylz 4 months ago
Thanks for sharing your work! Always great to see what you're up to!
@AICoffeeBreak 4 months ago
Much appreciated!
@Thomas-gk42 4 months ago
Congratulations on your doctorate 🖖
@alexkubiesa9073 3 months ago
This sounds very useful! LLM users tend to assume that because an LLM writes like a human, it can introspect and reason about its own thought processes, which is of course not a given. But it's great to see progress on measuring this ability (or at least self-consistency) so that newer models can be more ergonomic.
@MaxShawabkeh 4 months ago
Congrats on the PhD! This is really valuable work! I'm currently trying to squeeze as much reasoning capability as I can out of small LLMs (7-15B) for my company's product, and I'd love a longer video or recorded talk going into the details of your findings: any patterns you've found that improve or reduce self-consistency, or any insights on which existing models or training corpora result in better self-consistency and reasoning capabilities. If you have any pointers, I'd appreciate it!
@AICoffeeBreak 3 months ago
As far as we can see from this paper's experiments, RLHF helps improve self-consistency, but we don't yet have any hints about what else has this effect. Size might matter, but for the model sizes we *could* test on our infrastructure we did not measure an effect; it might still be there, we just couldn't test far enough.
@MaxShawabkeh 3 months ago
@AICoffeeBreak Thanks!
@beatrixcarroll8144 4 months ago
Congrats Dr. Letitia!!!! Wow, YOU ROCK!!!!!!! :-D :-) P.S. We missed you!!
@fingerstyledojo 4 months ago
Yay, new video! Thanks for letting me pass yesterday lol
@AICoffeeBreak 3 months ago
Wow, you have a channel! It's amazing, just checked it out! 🤩
@nitinss3257 4 months ago
1 minute ago for non-members ... good to see ya
@naromsky 4 months ago
🎉
@Ben_D. 4 months ago
No ASMR? 😟
@AICoffeeBreak 4 months ago
The whole take was a blooper. Next time for sure. 😅
@anluifb 3 months ago
So you came up with a method, didn't have time to explain the method to us, and didn't show us that it works. Great. If you still have time before Bangkok, I would suggest re-recording and focusing on the implementation and interpretation of the results rather than on the context and wordy descriptions.
@AICoffeeBreak 3 months ago
Thanks for your feedback. The method is in the video, just not the tiny details:
1. Interpret both the prediction and the explanation with SHAP (mentioned in the video).
2. Measure their alignment (mentioned), after:
- normalisation, to bring the values into the same range (mentioned; we did not mention that SHAP's properties make the values very different between output tokens with different probabilities), and
- aggregation, to collect the many values from the many outputs (mentioned; we did not mention that we use the mean for this).
For the results, I've synthesized what we see in words, together with the main takeaways; for the lengthy tables, please check the paper and its appendix. I don't know what you mean by the video not showing that the method works: I also showed an individual example before the takeaways. The problem that there is no ground truth exists for us just as it does for previous work. But for the first time in the literature, we *compare* existing tests to each other, and our method to them. This is why the context is important: our paper's contribution is to evaluate and clarify the state of the field, with the new method as a follow-up contribution that addresses the shortcomings of existing tests.
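A minimal sketch of the pipeline described in this reply, assuming per-output-token SHAP attributions over the input tokens have already been computed. The unit-L1 normalisation and the cosine-similarity alignment measure are illustrative stand-ins; the reply above only states that the mean is used for aggregation.

```python
# Sketch of the self-consistency comparison described above.
# Rows of each attribution matrix = output tokens, columns = input tokens.
import numpy as np

def normalize(attr: np.ndarray) -> np.ndarray:
    """Scale each output token's attribution vector to unit L1 norm, so
    tokens with very different output probabilities become comparable
    (illustrative choice of normalisation)."""
    norms = np.abs(attr).sum(axis=-1, keepdims=True)
    return attr / np.where(norms == 0.0, 1.0, norms)

def aggregate(attr: np.ndarray) -> np.ndarray:
    """Collapse per-output-token attributions into one score per input
    token via the mean over output tokens (the mean is stated above)."""
    return attr.mean(axis=0)

def alignment(pred_attr: np.ndarray, expl_attr: np.ndarray) -> float:
    """Cosine similarity between the aggregated input attributions of the
    prediction and of the explanation (illustrative alignment measure)."""
    a = aggregate(normalize(pred_attr))
    b = aggregate(normalize(expl_attr))
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

# Stand-in data: SHAP values for a 12-token input, with a 5-token
# prediction and an 8-token explanation.
rng = np.random.default_rng(0)
pred_attr = rng.normal(size=(5, 12))
expl_attr = rng.normal(size=(8, 12))
print(f"self-consistency score: {alignment(pred_attr, expl_attr):.3f}")
```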