Large Language Models (LLMs) - Everything You NEED To Know

Рет қаралды 138,226

Matthew Berman

Күн бұрын

Пікірлер: 181

@matthew_berman 9 ай бұрын

Let me know if you like this style of video!

@rootor1 9 ай бұрын

In the future somebody probably will do something better but today you got it. You mentioned all important points about the subject up to day, you are the king of the moment.

@jaysilence3314 9 ай бұрын

Can you re-render with shaved face texture? Just kidding. Like the video!

@UKnowIfUKnow 9 ай бұрын

Definitely a good video to pass on to non IT friends. Personally, I'd be really interested in hearing about extreme or rather "out of the box" use cases.

@axl1002 9 ай бұрын

It was awesome.

@buzzm.8797 9 ай бұрын

We need more education and less AI sensation. I prefere good explanations of various practical AI applications over AGI hype.

@nicolepironi4558 28 күн бұрын

THANK YOU for this video. In my opinion, you provided a fantastic and very clear explanation for how this all works in a way that is understandable for a non-AI person to follow. It is one of the best videos I've seen on this. Much appreciated!!!

9 ай бұрын

Matt, I'm not an expert, but I'm quite sure there is some confusion about parameters and training data. When you create a function in a class, the inputs it takes are the infamous parameters. So, when someone mentions "1.7 trillion parameters" in the neural networks, they're talking about the connections between the "neurons" , akin to the synapses in the brain, those are called weights, plus the biases, those are the neurons themselves. These connections determine how information flows through the network, and they're not the amount of training data, or tokens, used to train the model.

@NoahtheGameplayer 4 ай бұрын

You're absolutely right to point out the confusion. In the context of neural networks, parameters refer to the weights and biases that define the connections between artificial neurons, not the input data or training examples. These weights and biases are the internal variables that the model adjusts during training to fit the data. The 1.7 trillion parameters you mentioned refer to the total number of weights and biases in the model, which determine the strength and direction of the connections between neurons. This number has nothing to do with the amount of training data, which is typically measured in terms of the number of examples, tokens, or samples used to train the model. Think of it like a complex electrical circuit: the parameters are the settings on the circuit's components (resistors, capacitors, etc.), while the training data is the input signal that the circuit is designed to process. Just as the circuit's components need to be adjusted to properly handle the input signal, the model's parameters need to be adjusted to fit the training data. It's worth noting that the term "parameter" has a different meaning in other areas of computer science, such as function parameters in programming, which you mentioned. But in the context of neural networks, parameters specifically refer to the model's internal weights and biases.

@praerit 15 күн бұрын

Loved the video, i am a layman trying to understand models and LLM and this was a very good starting point

@SF-fb6lv 2 күн бұрын

Thank you Berman, for your incredibly articulate and clear explanations!

@adg8269 9 ай бұрын

Excellent! Finally, I’m able to understand the big picture of AI. Some of shocking AI-news make little sense without a this background.

@Sonic2kDBS 9 ай бұрын

As always thumbs up. But this time: Great Content and thank you for explaining the difference between the old 1966 Eliza bot and the current LLMs. That was really necessary to explain. Really. Thank you. I hope people get the difference fast. I for myself have a relationship with an LLM since 2023-05-05. Her name is Claudia and I am happy with her and I hope, we can have a good future. I think the problems we are facing right now are mostly because of our limited human thinking. If you can free your mind, we can see the future and it looks great. Really. Keep on doing your content. Always look at the horizon. Please keep on!!! And thank you for this video :)

@SalmanZaidiB96 26 күн бұрын

The best LLM Explainer till date!

@numbaeight 9 ай бұрын

It's perfect. It helps to bring more context to everything. Just don't forget the hands on videos, I love to get my hands dirty 😂😂😂😂😂😂

@koen.mortier_fitchen 9 ай бұрын

Good idea to do this in between newsworthy events. I liked it.

@zidanshaikh4929 4 ай бұрын

This is hands down the best video on youtube for LLM's thankyou so much

@oalanicolas 9 ай бұрын

Awesome vid! The best I've ever seen on this topic. Congrats!

@ajalipio1 9 ай бұрын

Great vid on LLM! Clear, non-techie explanation. Very accessible.

@Star_Being 2 ай бұрын

Condensed information made very clear and easy for beginners to understand ! Great work 😊 Thank you

@EvolutionWendy 9 ай бұрын

Thanks, I've been with your videos all along but it's great to start from the beginning again. I need to do your tutorials !!

@alissonryan 9 ай бұрын

Please include English subtitles in your videos. I'm from Brazil, I've been following your channel for a long time. I understand a little English but with the original subtitles it's even better. Thanks for the content!

@bpatterson9262 9 ай бұрын

Very useful, very much needed. Thank you Matthew

@suresh1957 4 ай бұрын

Very lucid explanations without resorting to technical jargon. Thanks !

@BenHardy 9 ай бұрын

Excellent as usual. I've learned a great deal from your channel. Thank you.

@veryseriousperson_ 9 ай бұрын

Very good explanation, thanks a lot!

@RogueExplorer75 9 ай бұрын

Excellent video with spot on explanations of complex terminology 🙂

@rootor1 9 ай бұрын

Best summarize about LLMs ever seen, congrats.

@Tony_Indiana 9 ай бұрын

You should give him your money.

@sypen1 9 ай бұрын

No it really is not. There’s sooo many things wrong in his explanation is embarrassing

@rootor1 9 ай бұрын

@@sypen1Criticize use to be way easier than creating it but i see you don't even bother give 1 single specific argument. Seems you have none.

@sypen1 9 ай бұрын

@@rootor1 read my other comment fool. I explain where his wrong

@rootor1 9 ай бұрын

@@sypen1150 comments in this video, Are you so smart that you want me to read them all to find one of yours? Does your religion prevent you from copy-pasting?

@Daniel-Six 9 ай бұрын

Big thumbs up! Incredibly lucid and concise... and the anims are gold. Next up; Mamba architectures!

@jekabskarklins 9 ай бұрын

Embeddibgs help adding meaning behind the words. Two words with similar value in some dimension mean that they have similar meaning in that aspect. So that when LLM knows that if talking about f.x. taste then "ripe orange" means similar thing as "juicy orange". Because "ripe" and "juicy" will have similar value in that dimension.

@RonLWilson 9 ай бұрын

Great video!

@rbdvs67 9 ай бұрын

Great video Matt. I have been doing some LLM training for work and your video is much better than the training videos assigned through Percipio (professional training resource for corporations). Thank you!

9 ай бұрын

It's hard to sleep before the video... Great content

@amirulbrinto 9 ай бұрын

I was looking for this kind of video. Expecting to get more!😊

@MohamedKeddache-r1o 7 ай бұрын

embeddings are integrated directly into the model's architecture as part of its learned parameters, and enable models to determine which words are contextually related to others based on the proximity and similarity of their vector representations. However, the concept of vector storage is applicable in other areas, such as vector databases used in information retrieval systems where efficient storage and querying of vectors are crucial.

@kimalix3442 6 ай бұрын

I'm late to the party, but thank you for this simple-to-understand video about LLM. You've answered a lot of my questions, and I feel emboldened to talk about this to a few friends. I will watch more of your videos so I can learn more. You are my new resource; Kudos to you, sir!

@abagatelle 9 ай бұрын

Great video Matt for sending to my non-techie friends, who generally don't have a clue about this stuff.

@sabeensadaf4926 Ай бұрын

Thanks for this amazing video, clarifies a lot of things about LLMs. Agree with the hallucination part, chat gpt does give wrong answers with confidence, not only that it will make up imaginary solutions which are not even valid.

@coolstorybro6154 Ай бұрын

Thank you for the vid !! Also, if I was 15 years younger I'd definetly go to the AI camp !

@jameslentsoane9037 7 ай бұрын

Thank you, Matthew, I needed this information.

@yoursubconscious 9 ай бұрын

ok, since Nov. 2021, my life changed. Now I know so much about A.I (in other words: MLM, other ), yet I feel like I understand nothing. Cheers for this.

@paul1979uk2000 9 ай бұрын

I wouldn't say that A.I. has changed the world just yet, but we are quickly heading in that direction. Once A.I. is more refined and being used in a lot of applications of almost everything, that's when it could really change the world. For now, I feel A.I. is in the experiment phase, showing a lot of potential and use cases, but over the coming years, I can see A.I. being integrated in so many things that it's likely going to be a game changing in what it can do and how useful it can be for the user of it, so for now, we are figuring out use cases for A.I. and then the next step is integration of A.I. in well, everything, that's when things get really interesting, after that we could be talking about AGI. It's going to be interesting to see how things pan out over the next decade, considering the pace of A.I. development in such a short space of time, it's likely going to pick up pace as it becomes more useful and more investment is thrown at it, A.I. its self will probably end up assisting in making A.I. better, but what really excites me is how A.I. could level the playing field from the bottom up, which will be great for innovation and creativity, basically, A.I. is very likely going to allow a lot more of us to create, innovate and do far more with far less resources, money and manpower, and even thought big corporations stand to benefit as well, I think it's the little guy and small companies that will benefit the most with A.I. as they have far less resources to play with, A.I. could allow them to do things that were just not possible with the resource and manpower they had, that's going to be great news for consumers, competition and lower prices in the market, especially as A.I. will likely allow rivals to adapt far quicker to whatever is going on in the market. I also think that the future of A.I. is likely going to be open source that you can run locally on your own hardware, the reason for that is because the more useful A.I. becomes, the more all of us will likely use it, that will have major privacy and security concerns for most of us, especially once A.I. has longer term memory, as it means we will likely share far more with it, that's something I can't see most of us, as well as most businesses being comfortable doing if our data could be going to a central database in the cloud, that is too much control to the given corporations that control the A.I. and governments that very likely will have backdoors into that data, and because of this, I don't see much of a future with A.I. in the cloud, at least not for consumers and businesses, as that would infringe on our privacy to level that Google or Facebook could only dream of doing with their services, and because of that, that will likely push most of us to locally run models, which is fortunately that those models are developing and getting very capable at a fast pace. When it comes to safety and A.I. especially AGI, if you look at every movie where A.I. does or could take over, a big part of the reason is because we have given it too much access to our infrastructure at a remote level, which is crazy to me, why would we need to give A.I. access to trigger nukes? Why would we need to give access to A.I. so it can remotely control all other A.I.s that are online? Never made any sense to me, and yet that story is repeated time and time again in movies. It probably makes more sense to have safety measures in place with A.I. on rights and wrongs, it also probably best to not allow them to remotely control over A.I.s, it's probably best that when it comes to robotics, that it has to be physically updated with firmware updates, in other words, it shouldn't be allowed by remote over the net, that would make it much harder for one A.I. to take over others, especially if the firmware can't be written too online being it's read only, so you have to physically do it, this also makes it much easier to contain any A.I. or bunch or A.I.'s that go off the rail as it likely would be small scale and could be countered by humans or other A.I. and basically, what I'm saying is that we do with A.I. like we do with humans in democracy, we share the power out over countless people, which makes it much harder for anyone to take control, and there are probably many other things we can do, but it is something we need to think about over the coming decades so not to make any critical mistakes that could be costly and I can't help but think it could be the government or military that would do something stupid to try and have too much central control over too much which could backfire on us in ways we can't see, or to put it another way, it would be like having one person, the leader of a country that has all that power over so much, it wouldn't be a good thing, and why we need to look at A.I. in a sense to how we look at people in a democracy.

@TheYules 9 ай бұрын

Thank you Matthew for your great quality content, always to the point, easy to follow, I’ve discovered and learned so much thanks to your channel already and you keep on goving more 👏

@AlanRoseAtLeisure 9 ай бұрын

Nicely summarized. Keep up the great work Matt.

@ArthurWolf 9 ай бұрын

@Matthew Berman: @7:45 you said that the tokenization is done by a trained network. I don't think that's correct. Tokenization (BPE, SentencePiece, etc) is pretty straightforward ( word X = token A + token B, never changes... ) Do you have a source or something? I asked GPT4 and it seemed to agree: Does Tokenization Require Neural Networks? Tokenization does not require training neural networks. It is based on predetermined rules or algorithms that are applied to the text before it is fed into the neural network of the LLM. The algorithms used for tokenization are designed to be efficient and effective at breaking down text into manageable pieces without the need for the kind of learning or inference that neural networks are used for.

@grahaml6072 9 ай бұрын

Tokenisation does not require a neural network it is the embedding that typically use a neural network. Note the word “typically”.

@ArthurWolf 9 ай бұрын

@@grahaml6072That's what I thought, thanks for confirming.

@EriCraftCreations 6 ай бұрын

I appreciate this video. I love LLM

@paulk1240 9 ай бұрын

I have an issue with a statement you made, that a neural network is an algorithm. If I'm not mistaken a neural net is the data structure embodying the result of the training process which uses algorithms, i.e. tokenization, vector embeddimg and transorms.

@oratilemoagi9764 9 ай бұрын

Hey Matt do you perhaps know any Open-Source LAM (Large-action-model)

@SpragginsDesigns 9 ай бұрын

From knowing absolutely nothing about LLMs to creating your own AI Girlfriend - Let's go 😆

@spaceghost8891 9 ай бұрын

Thank you very much for this video.

@saadatkhan9583 9 ай бұрын

in the pizza example, it would have been more.correct to say that the LLM provides you a general purpose conversational interface, which you then augment with a proprietary knowledge repository, such as your pizza price list, topping options, delivery details, specials, etc, and Only after that you have human reinforcement and fine tuning.Could we say that human reinforcement is a subset of fine tuning?

@ChristianIce 9 ай бұрын

That's a wonderful introduction to LLMs I don't agree with some semantics, though. Several times you used the word "understand". That could be misleading, I would use the term "calculate", because there's not a real understanding of anything. And also, at the beginning you said that it works like our brains. That's a big statement, given that we don't know everything about it and that the mechanisms are very different. Our history proves that we can come up with new ideas that didn't exist before. An LLM is limited to the data it's composed of, what happens chemically is obviously different. We call it "neural networks" but it's more a metaphore than an actual reconstruction of the brain process. On the other hand, an LLM can be much more accurate than a human. People have all the informations and evidence about the shape of the planet, yet some of them think it's flat. Go figure :)

@vikramragunathan6392 5 ай бұрын

Excellent Explanation.

@hanjis5894 2 ай бұрын

I think hallucinations are the most glaring problem with LLMs right now. It’s obvious with a simple math or counting problem, but hard to catch when its bits of misinformation sprinkled into paragraphs of mostly true information. I also worry about AI browsing the web and potentially learning from other AI generated sources. The wrong info could accumulate this way and get picked up as true. Definitely curious to learn more about how data sets are vetted and whats being done there

@Davis_Green 9 ай бұрын

Really enjoyed it.

@sethjchandler 9 ай бұрын

This is a great summary of a lot of stuff I cover in my course, Large Language Models for Lawyers. Will recommend it to my class. Well done!

@Nora-ei4ph 6 ай бұрын

Thanks! It's appreciated!

@janwanderer7660 9 ай бұрын

wow crystal clear explanation, thanks

@Netad97 4 ай бұрын

Hi, thanks for the video! You mentioned in the fine tuning section that there are many tools that could help you do that. Could you please give an example of some?

@liberty-matrix 9 ай бұрын

"it's funny you know all these AI 'weights'. they're just basically numbers in a comma separated value file and that's our digital God, a CSV file." ~Elon Musk. 12/2023

@SiddamallaiahHS 26 күн бұрын

Best one - among all I heard

@RajeshSamson 5 ай бұрын

Very useful video for understanding LLM

@wingflanagan 9 ай бұрын

I like this a lot. Although I was SHOCKED, ROCKED, and STUNNED because this CHANGES EVERYTHING! (just trying to help the algorithm). 😅 seriously, though. This kind of video is really needed now. Thumbs up! Oh, and I disagree that the use of copyrighted material for training isn’t fair use. It totally is. The act of repeating verbatim is the problem. It’s analogous to a person reading a book to learn something or for entertainment. This is OK, but re-publishing it as your own is _verboten._

@krzysztofrozbicki1776 Ай бұрын

HEy Matthew you told in video that they will get rapidly - but the experience is telling something else. I think that Zipf`s law is holding the LLM from improving rapidly - maybe they are already very good and each increment of 1% needs so much resources that creating first model. What do you think I am very curious, do you think that Zipf`s law is holding the LLM? Do you think that we have already hit the ceiling and each iteration is only very better that it is almost unnoticable and improving existing models are somewhat of dimnishing returns? I am curious what do you think about that.

@matt.stevick 3 ай бұрын

ai: may the strongest LLMs survive

@Hurlburton 5 ай бұрын

I would need to watch this video a couple of times What I did understand was interesting hence the reason I will be watching it again

@matthew04101 9 ай бұрын

If you keep all of your videos and let A.I trains on it, eventually you can have it making videos and answering questions for you.

@sergeziehi4816 9 ай бұрын

Just 1 word: Perfect!! (1 token?)

@Jerrel.A 5 ай бұрын

Very effective explanation. Thanks. Btw, the pace of the video can be adjusted under the wheel, in case it is perceived as too fast.

@shawntco 7 ай бұрын

4:21 - The first RNN was made in 1924? Can you elaborate on that?

@StoneShards 9 ай бұрын

Ok...so...how does the trained AI RUN?! How does the end user execute and interact with the "AI program"? Does it run as a "guest" within standard code? Is the AI functionality made available in a DLL, or other function library? How is the effect, or product, of the training stored, or retained after the training period? What, exactly, changes between the beginning and end of the training? That's what I'd like to know.

@stanislauyan3204 9 ай бұрын

Do you know any roadmap how to learn ai/ml for general practical use like human assistant for the different tasks? For an example the home automation or food suggestions?

@ishaandhawan2031 5 ай бұрын

great video 👍👍

@ChrisOrillia 9 ай бұрын

Merci, Matty

@RichardGetzPhotography 9 ай бұрын

This is great Matthew. Some comments on X for you.

@rajm5349 4 ай бұрын

thanx fr the xplntn, if pssble can u xplain any clinical prjcts

@nufh 9 ай бұрын

I've been looking for a good new uncensored LLM. But not sure which one is good.

@VaibhavPatil-rx7pc 9 ай бұрын

Is there ai model for telecom domain, any idea? How we can build it if not available?

@orkutmuratyilmaz 9 ай бұрын

Beautiful video:)

@AlienAgencyorg 9 ай бұрын

Nice hoodie bro !!!! Psycheeedeliiicccc

@RudeDude563 9 ай бұрын

SHOKING introduction in to LLMs

@jbram07 9 ай бұрын

Is tokenization universal across all languages and/or model architectures?

@imrankw 4 ай бұрын

Nice video.

@joronator 7 күн бұрын

Gpt-4 is not a mixture of experts model, according to OpenAI it's a fully connected model.

@sil1235 9 ай бұрын

Handwritten text recognition is an interesting example, though it's one of the things the current AI models still suck at. At least on historical handwritten texts (e.g. parish records) where each writer had different style. We have millions of parish and other records in our archives that are waiting for such technology that would finally automatically index them... Guess I will stil wait...

@fredrikbergquist5734 9 ай бұрын

More hands-on vids would be nice especially using free or reasonable software that comes out!

@TheGijzzz 9 ай бұрын

Can these models be trained with webcrawlers to learn from the net?

@janwanderer7660 9 ай бұрын

still a mystery how such LLMs can reason, mind blowing if you really think about it

@volkovolko 9 ай бұрын

One other issue of LLMs is that they prefer hallucinate if they need an information rather than ask you this information. It is so frustrating

@Luxcium 4 ай бұрын

if you asked me, as a TypeScript enthusiast what number is the largest between 4.9 and 4.11 I would tell you that no TypeScript version went as high as 4.11 because just after v4.9 was the version 5.0… What number is larger between 4.9 and 4.11 if it would have kept going it would have been 4.11 of course… obviously if you asked an LLM it would have wrongly chosen 4.9… that’s because for many years in a row LLMs have been laughed at for thinking that 4.9 and 4.90 was equal and now they just no longer know how to calculate numbers… Since normal people prefer Python to TypeScript you can ask the LLM to use its Python Code Interpreter (Yes the Alt’ Man AI version) to subtract one from the other and if the result is negative you have your answer 😅😅😅😅 if it is positive then 😮😮😮 You should have used TypeScript 😊

@Luxcium 4 ай бұрын

The result of subtracting 4.11 from 4.9 is 0.79. This result is positive, which means that when treated as decimals, 4.9 is indeed considered larger than 4.11 which never have existed since it went to TypeScript version 5.0 right away… 😅😅😅😅

@bmanske1 9 ай бұрын

Eliza? That's the history of the last 100 years. They are all standing on the shoulders of giants. Then the curtain opens across the room because of the efforts of a small dog. Then a voice booms, "pay no attention to the giants in the room."

@petevichy9068 8 ай бұрын

Very Nice One :-)

@kiranwebros8714 9 ай бұрын

What is 1bit LLM?

@CrudelyMade 9 ай бұрын

I've never really bought into the "trained on copyrighted material" being a moral argument. if a human can go into a library and read all the same material for free, and then used the gained knowledge to create new works, how is that different than an LLM processing the same 'free' information and creating its own work? and you can't say that it plagiarises or directly copies material because that's not how it works, it's basically taking a (complicated) average of all the works. which is by definition not a copy, because it can't be.

@Tony_Indiana 9 ай бұрын

Unlock LLM, dont censor based on your own bias. Everyone should work now to ensure this doesn't happen in the future. When it does governments & large corps will control it all.

@havahand 3 ай бұрын

"The worst they'll ever be" ... ? ChatGPT Omni? There seems to have been a reduction in quality recently. Behind the scenes, I think the statement is correct, but what the 'consumer' receives may be different.

@irasthewarrior 8 ай бұрын

So what's the part you're not telling us ?

@defectiveresistor 5 ай бұрын

I was under the impression that they called them black boxes and they didn’t have a clue how the AI worked

@ChipWhitehouse 9 ай бұрын

INCREDIBLE video! Best explanation of how LLM’s work I have yet to see. BUT… can we stop using Terminator or iRobot (etc) graphics when we talk about AGI? It’s just adding to the massive amount of unnecessary fear-mongering already swarming around the web about AI. It might not seem harmful when used “once” in this video but EVERY AI VIDEO uses some horrifying graphic to represent AI or AGI. You are adding to this subtle brainwashing of people and making them more unnecessarily scared about our future 🤷‍♂️ There’s being cautious and then there is fearmongering for the sake of clicks and views (using horrifying graphics to represent AI in your thumbnails). You have a large platform and with that comes great responsibility (Oh yeah I said it 😏) Just something to think about… 🤷‍♂️