Large Language Models: Part 2

  Рет қаралды 115,110

Graphics in 5 Minutes

Graphics in 5 Minutes

Күн бұрын

How do language models like GPT and Palm work?
Part 1: • Large Language Models ...
See next: text-to-image (Parti, Imagen, Dall-E): • Text to Image in 5 min...
0:00 - intro
0:14 - next word prediction
0:20 - word embeddings
1:01 - transformers
3:11 - generating text
4:13 - stacking attention layers
4:47 - training data
5:21 - GPT-3 examples

Пікірлер: 174
@fazlfazl2346
@fazlfazl2346 Ай бұрын
The good thing with short tutorials is that if they are good, you will save hours of work. If they are bad, no time is wasted. This is a good one. Saved hours of work which I would never do anyway.
@5133937
@5133937 Жыл бұрын
This is one of the best explainer vids on LLMs I’ve seen yet. Not too long, not too short, good pacing, good visualizations. Great work, thanks!
@kahoku451
@kahoku451 Жыл бұрын
It’s amazing and so funny to me how LLM can produce fully functional python programs and write these poetic Bob Dylan inspired lyrics… but when prompted with “you’re going north. You turn right, then you turn left. Now you’re going …” it said SOUTH ☠️
@samandoria
@samandoria Жыл бұрын
Just like human "brain cramps" or "brain farts". The problem is that the current models aren't self learning or made to analyse their own answers so they don't correct these error before output or can correct the weights in the future when confronted with new data.
@TimHulse
@TimHulse Жыл бұрын
It Re-raises old but interesting points about the difference between language and logic. You can make linguistically sensible statements that defy logic. Try the book “Godel, Escher, Bach” if you like this area.
@webcache1000
@webcache1000 Жыл бұрын
6:02 If you are just just south of the North Pole, turn right and turn left over the North Pole, you would be heading South. I suspect it is getting confused due to the similar riddle where if you head North, turn right and you are now heading South - where are you? It has probably seen this riddle a confusing number of times and weaved that into its weights/response.
@distrologic2925
@distrologic2925 Жыл бұрын
Because it doesn't necessarily understand how directions work, it only knows how "north" and "right and left" has been used in language before, and is only an estimation. People are unlikely to talk about celestial directions and turn right or left, so there was probably not enough similar to your query in its training data that it could draw on and other examples such as "north and south" have taken more weight.
@swanandjoshi333
@swanandjoshi333 Жыл бұрын
Logic is a whole different thing ig
@TimHulse
@TimHulse Жыл бұрын
I really enjoyed both of these LLM videos. They are so concise and informative and the pacing is excellent.
@whannabi
@whannabi Жыл бұрын
He is an Arab from the Middle East. My dog wants a walk.
@apc067
@apc067 Жыл бұрын
I [or shall I say, my internal GPT?] first misread your comment as "... and the pancake is excellent". 😃
@csmac3144a
@csmac3144a 6 ай бұрын
This should be adopted by high schools everywhere. Superb teaching. A bit fast for me (I’m 61), but I can re-watch until I get it.
@f18a
@f18a Жыл бұрын
Great stuff. Possibly the best intro material to LLMs that I have seen. Thunbs up!
@Charlieee1
@Charlieee1 Жыл бұрын
I like that you used a recipe prompt to demonstrate what a LLM is good at doing, then actually followed the recipe and proved that it actually worked (and tasted good!).
@markbordelon1601
@markbordelon1601 6 күн бұрын
part one and this part two are masterful introductory teaching
@roywem
@roywem Жыл бұрын
This is the single best explaination I’ve come across on LLM’s
@stuartthomas7441
@stuartthomas7441 Жыл бұрын
So pleased to get a clear and credible glimpse under the hood. Thank you.
@scottlott3794
@scottlott3794 11 ай бұрын
Dude, I have been researching Transformers and Attention for weeks. You’re a master mind, I get it now!
@ianassarandas
@ianassarandas Жыл бұрын
you actually made the cookies hahaha that's awesome, great video btw!
@poklet
@poklet Жыл бұрын
Thank you. Those were very clear explanations of just the right length. Loved that you cooked the pancakes too!
@yojimbomk
@yojimbomk Жыл бұрын
Just 16 minutes, it is amazing. These are the best concise videos on the topic including basics in neural networks I have seen. Many thanks for sharing your knowledge!
@shoe_Bin
@shoe_Bin 2 ай бұрын
Soothing voice to learn about transformers and warm room is perfect for recipe for my sleep.🛌
@aidarfaizrakhmanov1901
@aidarfaizrakhmanov1901 Жыл бұрын
This is a really good intro indeed! I encourage to make more content like this
@MaleGeminiCat
@MaleGeminiCat Жыл бұрын
These are two great videos that introduced how large language model works in a very comprehensive way.👍👍👍
@janicem5942
@janicem5942 Жыл бұрын
Brilliant overview for a non-technical person like me... and glad to see you tested the recipe!
@jaunalapa
@jaunalapa 9 ай бұрын
You, sir, have talent for teaching. Even with my fairly technical background and watching a lot of videos I was struggling to get my head around LLMs until I watched your videos. Hope you continue what you started it would help a lot of people especially now when we see a lot of people trying to run their own local LLM versions on their PCs.
@somethingness
@somethingness Жыл бұрын
Typo alert! Frost's wonderful poem begins as so: "Two roads diverged in a yellow wood". (Not "diverted"!)
@vast634
@vast634 Жыл бұрын
You really have a talent to teach things.
@eenvleugjegoeiegames
@eenvleugjegoeiegames 8 ай бұрын
Man you NEED more subscribers, the content and video quality is way too good for only 15k!
@EGlobalKnowledge
@EGlobalKnowledge Жыл бұрын
An excellent video on language models
@fernwood
@fernwood Жыл бұрын
This brings me one step closer to understanding. Thanks.
@ypetkar
@ypetkar 11 ай бұрын
Very informative and concise videos to understand LLMs their complexities and what i takes to make these models successful.
@0815Snickersboy
@0815Snickersboy Жыл бұрын
This is the best video on Language Models that I seen. Probably the best on the Internet. You should maybe add chatgpt to the title to get more views.
@smudgepost
@smudgepost Жыл бұрын
Very much enjoyed these two videos. More please! Clear and detailed.
@sreenivasulupala2374
@sreenivasulupala2374 Ай бұрын
Excellent explanation. Thank you.
@craftycurate
@craftycurate Жыл бұрын
Really easy to follow, well paced, easy on the ear, and just the right level thanks!
@jteichma
@jteichma Жыл бұрын
What a great walk through! Thanks so much for sharing.
@xybnedasdd2930
@xybnedasdd2930 Жыл бұрын
I think this is the best, most intuitive and most illustrative video describing LLMs/transformers. Thank you so much!
@DavidDiaz-zg5sv
@DavidDiaz-zg5sv Жыл бұрын
I love that you finished cooking the recipe! Great video :)
@ravinatarajan4894
@ravinatarajan4894 Жыл бұрын
Thanks for the crisp walkthrough of the technology. It is a very good introduction.
@Paul-rs4gd
@Paul-rs4gd Жыл бұрын
Thanks for an original presentation of Large Language Models. It gave me new insight.
@KalebPeters99
@KalebPeters99 Жыл бұрын
Not a second wasted. Just brilliant ❤️
@roscatres
@roscatres Жыл бұрын
I hope you keep up with these videos, they are seriously great. Already suscribed and I'll check the rest of your channel. Thank you.
@DrNuyenVanFaulk
@DrNuyenVanFaulk Жыл бұрын
Thanks so much for both of these videos. They are wonderful. I think I understood them a bit more since I’ve done some basic assisted machine learning dev (up to neural networks). If anyone is a bit lost, read up on linear and logistic regression, then onto neural networks.
@sciexp
@sciexp Жыл бұрын
Very interesting... Now I understand more about how ChatGPT works...
@MindyMcAdams
@MindyMcAdams Жыл бұрын
Thank you! This is excellent. I love the animations. They are helpful!
@KumquatChampion
@KumquatChampion Жыл бұрын
This is awesome. Would love to learn more!
@GodofStories
@GodofStories 11 ай бұрын
Absolute best explainer. Where have you been though since GPT-4?? Would love to see you do more content. You have a talent!! I think you might be busy working on LLM's for research or in a company, but would be cool to see more videos!
@stefanbiesdorf4637
@stefanbiesdorf4637 11 ай бұрын
Thank you Steve for a personal moment of enlightening by opening the "black box of AI". This is an outstanding educational piece, in particular in combination with the first part.
@saisaske1
@saisaske1 11 ай бұрын
Very good explanation
@frank-reneschafer5512
@frank-reneschafer5512 11 ай бұрын
Great effort - nice video, good and comprehensive explanation.
@ayushsharma9036
@ayushsharma9036 Жыл бұрын
That language neural network at 0:50 belongs on a tshirt somewhere
@tvillaluz
@tvillaluz Жыл бұрын
Haha you made the pancake
@bloodywolftr
@bloodywolftr Жыл бұрын
Pretty nice explanation in both videos.Thanks!
@AncientSlugThrower
@AncientSlugThrower Жыл бұрын
Great video, leaving a comment to let you know it was very insightful. Thank you.
@AMGbot
@AMGbot Жыл бұрын
Loved this! Thanks for the great video!
@TheCaioKyleBraga
@TheCaioKyleBraga Жыл бұрын
This is my first search for a LLM explanation and very pleased with the video. I am not a mathematician or programmer but I am very interested in learning how LLM works. From my humble perspective I can say we reached a point of no return and this technology is progressing at an exponential rate. With the development of quantum computing, I have no doubt that it will surpass human intelligence in ways we don't understand.
@dv6165
@dv6165 Жыл бұрын
This was awesome! Thanks
@StratosFair
@StratosFair Жыл бұрын
Great explanation, glad I stumbled upon it ! Subscribed :)
@bingolio
@bingolio Жыл бұрын
EXCELLENT Vid, PLS DO MORE, on Deep learning , covering the whole workflow of making an LLM, especially, what os LORa, VEctor embeddings, etc I'm sure you'll get huge interest You have a gift for explaining. Thanks!
@preethamrangaswamy7371
@preethamrangaswamy7371 Жыл бұрын
Very underrated and underappreciated video.
@danjsy
@danjsy 10 ай бұрын
Great, thanks so much, massively useful.
@coraltown1
@coraltown1 Жыл бұрын
the Mona Lisa of LLM explanations .. thanks!
@g5min
@g5min Жыл бұрын
Okay, that's the best comment yet -- thank you :-)
@user-hf5og1bt8j
@user-hf5og1bt8j 3 ай бұрын
amazing video! thank you!
@padetiit7014
@padetiit7014 Жыл бұрын
I loved that you actually cooked that recipe! :-)
@Patapom3
@Patapom3 Жыл бұрын
Very well made!
@jorgesoberon6866
@jorgesoberon6866 Жыл бұрын
Very clear. Thanks a lot.
@Sukant98
@Sukant98 Жыл бұрын
amazing videos!! learnt so much
@fenandamilanda2032
@fenandamilanda2032 Жыл бұрын
nice work, really good to visualize these things even though I already know this.
@yourfuneral
@yourfuneral Жыл бұрын
Thanks for the good explanation, very much on time
@ivanocj
@ivanocj Жыл бұрын
Very good content! Keep going! thanks.
@christianwestermann4680
@christianwestermann4680 Жыл бұрын
Well done!
@xflory26x
@xflory26x Жыл бұрын
Please do more videos on LLMs!!! But also I need to know, how were the pancakes?
@carrumar
@carrumar Жыл бұрын
Great videos! Btw, how did you like your avocado cocoa thing? 😄
@rajachan8588
@rajachan8588 Жыл бұрын
Awesome. Thank you
@thomasforrest1931
@thomasforrest1931 Жыл бұрын
great video bud cheers!
@prathams8685
@prathams8685 Жыл бұрын
Big thanks mate
@ChatGPt2001
@ChatGPt2001 8 ай бұрын
Continuing from the previous response, here are some additional considerations and challenges when working with large language models: 14. Data Privacy and Ethical Concerns: - Be aware of privacy concerns when collecting and using data for training. Ensure that you have the necessary permissions and comply with data protection regulations. Ethical considerations, such as bias and fairness in the data, also need to be addressed. 15. Compute Resources: - Training large language models requires substantial computational resources, including high-end GPUs or TPUs and large-scale distributed computing infrastructure. These resources can be expensive and may not be accessible to everyone. 16. Energy Consumption: - Training large language models consumes a significant amount of electricity, contributing to environmental concerns. Some organizations are actively working on making AI training more energy-efficient. 17. Model Size and Efficiency: - While larger models tend to perform better, they also require more memory and computational power. Balancing model size and efficiency is crucial for real-world applications, as very large models might not be practical for all use cases. 18. Fine-Tuning and Transfer Learning: - Fine-tuning pre-trained models on specific tasks is a common practice, as it requires less data and computational resources compared to training from scratch. Understanding how to effectively fine-tune models is essential. 19. Evaluation Metrics: - Choosing appropriate evaluation metrics is critical. Different NLP tasks may require different metrics. For instance, accuracy may be suitable for classification tasks, while BLEU scores are used for machine translation. Select metrics that align with your objectives. 20. Bias and Fairness: - Large language models can inherit biases present in the training data. Mitigating bias and ensuring fairness in AI systems is a significant challenge. It requires careful curation of training data and ongoing monitoring. 21. Robustness and Safety: - Ensuring that large language models are robust and safe is essential. This includes protecting against adversarial attacks, avoiding harmful or inappropriate outputs, and ensuring that the model behaves predictably. 22. Research and Collaboration: - The field of large language models is rapidly evolving. Staying up-to-date with the latest research and collaborating with the AI community can help improve your understanding and the quality of your models. 23. Resource Sharing: - Due to the resource-intensive nature of training, sharing pre-trained models, datasets, and trained weights is common in the AI community. Leveraging existing resources can save time and resources. 24. Ethical Considerations: - Consider the ethical implications of your work. The power of large language models also comes with responsibility. Engage in ethical discussions and follow guidelines for responsible AI development. 25. Interpretability: - Large language models are often criticized for their lack of interpretability. Efforts to make AI models more understandable and explainable are ongoing to build trust and ensure accountability. Training large language models from scratch is a challenging and resource-intensive endeavor, and it may not be necessary for many practical applications. Leveraging pre-trained models and fine-tuning them for specific tasks is a more common and efficient approach in many cases. Moreover, ensuring responsible and ethical AI practices is paramount when working with such powerful language models.
@koho
@koho Жыл бұрын
Great overview! People need to see this video pair before freaking out that LLMs are actually intelligent.
@moedemama
@moedemama Жыл бұрын
Cool video, thanks
@abhaychandrol
@abhaychandrol 5 ай бұрын
Thanks
@andresroca9736
@andresroca9736 Жыл бұрын
Great the concepts transition. Great illustrations. The best of the best this couple of videos. What about more on other networks like r-cnn and audio nets? 😃
@g5min
@g5min Жыл бұрын
Thanks! I'm working on one on reinforcement learning now...
@bgustinjr
@bgustinjr Жыл бұрын
Thumbs-up for actually making the pancakes. 😂
@deand6411
@deand6411 Жыл бұрын
Thank you. Just the right level for my tiny organic brain.
@food4yann
@food4yann 10 ай бұрын
I love the fact that you really tried that recipe! How was it?
@CoreDump07
@CoreDump07 Жыл бұрын
liked and subscribed
@mineralt
@mineralt 10 ай бұрын
those pancakes look pretty good
@DJWESG1
@DJWESG1 Жыл бұрын
Ever think about audio synthesis and wave forms?? And how analogue synthesis utilising wave tables can offer a way to both communicate and comput information.
@SethWieder
@SethWieder Жыл бұрын
I would love more of the visual explainers on ML concepts. Subscribing. "Some folks say they're overhyped / But I do think that's true / I think they're just misunderstood / Just like me and you" That generated lyric gave me CHILLS
@kahoku451
@kahoku451 Жыл бұрын
Sameee
@lunaxquinn
@lunaxquinn 8 күн бұрын
The important question though is, were the pancakes good?
@richardharris9708
@richardharris9708 Жыл бұрын
ChatGPT gets the 37 question right now.
@leeamraa
@leeamraa Жыл бұрын
fun funny fantastic and I am a fan!
@dormin1850
@dormin1850 Жыл бұрын
This was super informative and concise, loved it! But my real question is how were those pancakes?
@KeikosCake
@KeikosCake Жыл бұрын
pancake approved 🥞👍
@chnolte
@chnolte Жыл бұрын
Just look how far we have come in only eight months.
@triton62674
@triton62674 Жыл бұрын
Fantastic work, hope those pancakes tasted better than they looked! xD
@g5min
@g5min Жыл бұрын
they were seriously delicious
@Ai.Sentinel
@Ai.Sentinel Жыл бұрын
Was the chocolate guacamole pancake any good?
@zuqini
@zuqini Жыл бұрын
I'm a bit confused by how stacking attention layers works at 4:12. Does the second layer take the first layer's prediction as input? Is the first layer's prediction still "next words" at that point, or is it now some sort of abstract intermediate value? How exactly does that capture higher level reasoning? Would appreciate any clarification!
@martinstu8400
@martinstu8400 Жыл бұрын
eliminating bias and stereotypes from language models is a lost cause, because it's the same as asking the network to lie.
@parabolicpanorama
@parabolicpanorama Жыл бұрын
you didn't understand anything did you
@speicaldark
@speicaldark Жыл бұрын
This is really nice! Now I understand why chatGPT tends to make up a lot of stuff with coherent sentences
@whannabi
@whannabi Жыл бұрын
What I'm wondering is how are they correcting its errors. For traditional NN, we have heat maps but I'd like to see something similar with transformers at the highest level to see what kind of patterns it noticed. Maybe that's what they use to correct its mistakes
@gideonk123
@gideonk123 Жыл бұрын
@@whannabi ChatGPT used a process called reinforcement learning from human feedback (RLHF): They used an already trained GPT-3 which already at the time. Humans both submitted new sentences as input prompts to ChatGPT being trained, and also ranked the output (responses) of the model. Then the ranking of the responses were then used as reward targets to continue training the model to obtain more desirable responses (measured by how the response rankings had increased).
@DonaldTamMisterDee
@DonaldTamMisterDee Жыл бұрын
This is super fascinating. I want to learn to grow my own language models from the ground up with languages like c++/rust. I dont care if the language model i develop is inaccurate or not. I just want to understand chatgpt under the hood. How may i get started?
@spenarkley
@spenarkley Жыл бұрын
Why dont you ask the language model how to make a language model?
@infiniteplanes5775
@infiniteplanes5775 Жыл бұрын
I have! Really great tips!
@parabolicpanorama
@parabolicpanorama Жыл бұрын
ask gpt to stack transformer layers and add some input output layers. add some loss functions to optimize and get a large dataset.
@mlusalin2379
@mlusalin2379 Жыл бұрын
It looks like GPT hands you a great chocolate guacamole pancake recipe... now I wanted to try too loool
@genegade
@genegade Жыл бұрын
Verdict on the guacookies?
@efisgpr
@efisgpr Жыл бұрын
GPT-4: Might I suggest dubbing them "Chocomole Pancakes"? 😮😂
@GeorgeJohnsonJackofAllTrades
@GeorgeJohnsonJackofAllTrades 10 ай бұрын
Rather than asking chatgpt to translate, I recommend asking the question in the target language. You'll significantly different results - more accurate in the target language. Stated without proof or explanation which should be intuitively obvious after watching these two videos...
@altalt7653
@altalt7653 Жыл бұрын
I can't get why we stack them like so. If the first transformer block predicts a word, what second does, third? And why do they still need attention then?
@jaredf6205
@jaredf6205 Жыл бұрын
What do you think of GPT4?
Text to Image in 5 minutes:  Parti, Dall-E 2, Imagen
6:00
Graphics in 5 Minutes
Рет қаралды 12 М.
Sigma Girl Education #sigma #viral #comedy
00:16
CRAZY GREAPA
Рет қаралды 116 МЛН
The Worlds Most Powerfull Batteries !
00:48
Woody & Kleiny
Рет қаралды 27 МЛН
Как быстро замутить ЭлектроСамокат
00:59
ЖЕЛЕЗНЫЙ КОРОЛЬ
Рет қаралды 13 МЛН
Large Language Models from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 335 М.
Transformer Neural Networks, ChatGPT's foundation, Clearly Explained!!!
36:15
StatQuest with Josh Starmer
Рет қаралды 585 М.
Subdivision surfaces in 5 minutes
5:40
Graphics in 5 Minutes
Рет қаралды 5 М.
Ray Tracing in 5 minutes:  Part 2 -- implementing a basic ray tracer
5:29
Graphics in 5 Minutes
Рет қаралды 4,1 М.
But what is a neural network? | Chapter 1, Deep learning
18:40
3Blue1Brown
Рет қаралды 16 МЛН
Text to Image:  Part 2 -- how image diffusion works in 5 minutes
6:13
Graphics in 5 Minutes
Рет қаралды 10 М.
Why Neural Networks can learn (almost) anything
10:30
Emergent Garden
Рет қаралды 1,2 МЛН
Splines in 5 Minutes:  Part 2 -- Catmull-Rom and Natural Cubic Splines
6:54
Graphics in 5 Minutes
Рет қаралды 16 М.
Reinforcement Learning:  ChatGPT and RLHF
6:31
Graphics in 5 Minutes
Рет қаралды 8 М.
What are Transformer Neural Networks?
16:44
Ari Seff
Рет қаралды 159 М.
Sigma Girl Education #sigma #viral #comedy
00:16
CRAZY GREAPA
Рет қаралды 116 МЛН