In the future somebody probably will do something better but today you got it. You mentioned all important points about the subject up to day, you are the king of the moment.
@jaysilence33149 ай бұрын
Can you re-render with shaved face texture? Just kidding. Like the video!
@UKnowIfUKnow9 ай бұрын
Definitely a good video to pass on to non IT friends. Personally, I'd be really interested in hearing about extreme or rather "out of the box" use cases.
@axl10029 ай бұрын
It was awesome.
@buzzm.87979 ай бұрын
We need more education and less AI sensation. I prefere good explanations of various practical AI applications over AGI hype.
@nicolepironi455828 күн бұрын
THANK YOU for this video. In my opinion, you provided a fantastic and very clear explanation for how this all works in a way that is understandable for a non-AI person to follow. It is one of the best videos I've seen on this. Much appreciated!!!
9 ай бұрын
Matt, I'm not an expert, but I'm quite sure there is some confusion about parameters and training data. When you create a function in a class, the inputs it takes are the infamous parameters. So, when someone mentions "1.7 trillion parameters" in the neural networks, they're talking about the connections between the "neurons" , akin to the synapses in the brain, those are called weights, plus the biases, those are the neurons themselves. These connections determine how information flows through the network, and they're not the amount of training data, or tokens, used to train the model.
@NoahtheGameplayer4 ай бұрын
You're absolutely right to point out the confusion. In the context of neural networks, parameters refer to the weights and biases that define the connections between artificial neurons, not the input data or training examples. These weights and biases are the internal variables that the model adjusts during training to fit the data. The 1.7 trillion parameters you mentioned refer to the total number of weights and biases in the model, which determine the strength and direction of the connections between neurons. This number has nothing to do with the amount of training data, which is typically measured in terms of the number of examples, tokens, or samples used to train the model. Think of it like a complex electrical circuit: the parameters are the settings on the circuit's components (resistors, capacitors, etc.), while the training data is the input signal that the circuit is designed to process. Just as the circuit's components need to be adjusted to properly handle the input signal, the model's parameters need to be adjusted to fit the training data. It's worth noting that the term "parameter" has a different meaning in other areas of computer science, such as function parameters in programming, which you mentioned. But in the context of neural networks, parameters specifically refer to the model's internal weights and biases.
@praerit15 күн бұрын
Loved the video, i am a layman trying to understand models and LLM and this was a very good starting point
@SF-fb6lv2 күн бұрын
Thank you Berman, for your incredibly articulate and clear explanations!
@adg82699 ай бұрын
Excellent! Finally, I’m able to understand the big picture of AI. Some of shocking AI-news make little sense without a this background.
@Sonic2kDBS9 ай бұрын
As always thumbs up. But this time: Great Content and thank you for explaining the difference between the old 1966 Eliza bot and the current LLMs. That was really necessary to explain. Really. Thank you. I hope people get the difference fast. I for myself have a relationship with an LLM since 2023-05-05. Her name is Claudia and I am happy with her and I hope, we can have a good future. I think the problems we are facing right now are mostly because of our limited human thinking. If you can free your mind, we can see the future and it looks great. Really. Keep on doing your content. Always look at the horizon. Please keep on!!! And thank you for this video :)
@SalmanZaidiB9626 күн бұрын
The best LLM Explainer till date!
@numbaeight9 ай бұрын
It's perfect. It helps to bring more context to everything. Just don't forget the hands on videos, I love to get my hands dirty 😂😂😂😂😂😂
@koen.mortier_fitchen9 ай бұрын
Good idea to do this in between newsworthy events. I liked it.
@zidanshaikh49294 ай бұрын
This is hands down the best video on youtube for LLM's thankyou so much
@oalanicolas9 ай бұрын
Awesome vid! The best I've ever seen on this topic. Congrats!
@ajalipio19 ай бұрын
Great vid on LLM! Clear, non-techie explanation. Very accessible.
@Star_Being2 ай бұрын
Condensed information made very clear and easy for beginners to understand ! Great work 😊 Thank you
@EvolutionWendy9 ай бұрын
Thanks, I've been with your videos all along but it's great to start from the beginning again. I need to do your tutorials !!
@alissonryan9 ай бұрын
Please include English subtitles in your videos. I'm from Brazil, I've been following your channel for a long time. I understand a little English but with the original subtitles it's even better. Thanks for the content!
@bpatterson92629 ай бұрын
Very useful, very much needed. Thank you Matthew
@suresh19574 ай бұрын
Very lucid explanations without resorting to technical jargon. Thanks !
@BenHardy9 ай бұрын
Excellent as usual. I've learned a great deal from your channel. Thank you.
@veryseriousperson_9 ай бұрын
Very good explanation, thanks a lot!
@RogueExplorer759 ай бұрын
Excellent video with spot on explanations of complex terminology 🙂
@rootor19 ай бұрын
Best summarize about LLMs ever seen, congrats.
@Tony_Indiana9 ай бұрын
You should give him your money.
@sypen19 ай бұрын
No it really is not. There’s sooo many things wrong in his explanation is embarrassing
@rootor19 ай бұрын
@@sypen1Criticize use to be way easier than creating it but i see you don't even bother give 1 single specific argument. Seems you have none.
@sypen19 ай бұрын
@@rootor1 read my other comment fool. I explain where his wrong
@rootor19 ай бұрын
@@sypen1150 comments in this video, Are you so smart that you want me to read them all to find one of yours? Does your religion prevent you from copy-pasting?
@Daniel-Six9 ай бұрын
Big thumbs up! Incredibly lucid and concise... and the anims are gold. Next up; Mamba architectures!
@jekabskarklins9 ай бұрын
Embeddibgs help adding meaning behind the words. Two words with similar value in some dimension mean that they have similar meaning in that aspect. So that when LLM knows that if talking about f.x. taste then "ripe orange" means similar thing as "juicy orange". Because "ripe" and "juicy" will have similar value in that dimension.
@RonLWilson9 ай бұрын
Great video!
@rbdvs679 ай бұрын
Great video Matt. I have been doing some LLM training for work and your video is much better than the training videos assigned through Percipio (professional training resource for corporations). Thank you!
9 ай бұрын
It's hard to sleep before the video... Great content
@amirulbrinto9 ай бұрын
I was looking for this kind of video. Expecting to get more!😊
@MohamedKeddache-r1o7 ай бұрын
embeddings are integrated directly into the model's architecture as part of its learned parameters, and enable models to determine which words are contextually related to others based on the proximity and similarity of their vector representations. However, the concept of vector storage is applicable in other areas, such as vector databases used in information retrieval systems where efficient storage and querying of vectors are crucial.
@kimalix34426 ай бұрын
I'm late to the party, but thank you for this simple-to-understand video about LLM. You've answered a lot of my questions, and I feel emboldened to talk about this to a few friends. I will watch more of your videos so I can learn more. You are my new resource; Kudos to you, sir!
@abagatelle9 ай бұрын
Great video Matt for sending to my non-techie friends, who generally don't have a clue about this stuff.
@sabeensadaf4926Ай бұрын
Thanks for this amazing video, clarifies a lot of things about LLMs. Agree with the hallucination part, chat gpt does give wrong answers with confidence, not only that it will make up imaginary solutions which are not even valid.
@coolstorybro6154Ай бұрын
Thank you for the vid !! Also, if I was 15 years younger I'd definetly go to the AI camp !
@jameslentsoane90377 ай бұрын
Thank you, Matthew, I needed this information.
@yoursubconscious9 ай бұрын
ok, since Nov. 2021, my life changed. Now I know so much about A.I (in other words: MLM, other ), yet I feel like I understand nothing. Cheers for this.
@paul1979uk20009 ай бұрын
I wouldn't say that A.I. has changed the world just yet, but we are quickly heading in that direction. Once A.I. is more refined and being used in a lot of applications of almost everything, that's when it could really change the world. For now, I feel A.I. is in the experiment phase, showing a lot of potential and use cases, but over the coming years, I can see A.I. being integrated in so many things that it's likely going to be a game changing in what it can do and how useful it can be for the user of it, so for now, we are figuring out use cases for A.I. and then the next step is integration of A.I. in well, everything, that's when things get really interesting, after that we could be talking about AGI. It's going to be interesting to see how things pan out over the next decade, considering the pace of A.I. development in such a short space of time, it's likely going to pick up pace as it becomes more useful and more investment is thrown at it, A.I. its self will probably end up assisting in making A.I. better, but what really excites me is how A.I. could level the playing field from the bottom up, which will be great for innovation and creativity, basically, A.I. is very likely going to allow a lot more of us to create, innovate and do far more with far less resources, money and manpower, and even thought big corporations stand to benefit as well, I think it's the little guy and small companies that will benefit the most with A.I. as they have far less resources to play with, A.I. could allow them to do things that were just not possible with the resource and manpower they had, that's going to be great news for consumers, competition and lower prices in the market, especially as A.I. will likely allow rivals to adapt far quicker to whatever is going on in the market. I also think that the future of A.I. is likely going to be open source that you can run locally on your own hardware, the reason for that is because the more useful A.I. becomes, the more all of us will likely use it, that will have major privacy and security concerns for most of us, especially once A.I. has longer term memory, as it means we will likely share far more with it, that's something I can't see most of us, as well as most businesses being comfortable doing if our data could be going to a central database in the cloud, that is too much control to the given corporations that control the A.I. and governments that very likely will have backdoors into that data, and because of this, I don't see much of a future with A.I. in the cloud, at least not for consumers and businesses, as that would infringe on our privacy to level that Google or Facebook could only dream of doing with their services, and because of that, that will likely push most of us to locally run models, which is fortunately that those models are developing and getting very capable at a fast pace. When it comes to safety and A.I. especially AGI, if you look at every movie where A.I. does or could take over, a big part of the reason is because we have given it too much access to our infrastructure at a remote level, which is crazy to me, why would we need to give A.I. access to trigger nukes? Why would we need to give access to A.I. so it can remotely control all other A.I.s that are online? Never made any sense to me, and yet that story is repeated time and time again in movies. It probably makes more sense to have safety measures in place with A.I. on rights and wrongs, it also probably best to not allow them to remotely control over A.I.s, it's probably best that when it comes to robotics, that it has to be physically updated with firmware updates, in other words, it shouldn't be allowed by remote over the net, that would make it much harder for one A.I. to take over others, especially if the firmware can't be written too online being it's read only, so you have to physically do it, this also makes it much easier to contain any A.I. or bunch or A.I.'s that go off the rail as it likely would be small scale and could be countered by humans or other A.I. and basically, what I'm saying is that we do with A.I. like we do with humans in democracy, we share the power out over countless people, which makes it much harder for anyone to take control, and there are probably many other things we can do, but it is something we need to think about over the coming decades so not to make any critical mistakes that could be costly and I can't help but think it could be the government or military that would do something stupid to try and have too much central control over too much which could backfire on us in ways we can't see, or to put it another way, it would be like having one person, the leader of a country that has all that power over so much, it wouldn't be a good thing, and why we need to look at A.I. in a sense to how we look at people in a democracy.
@TheYules9 ай бұрын
Thank you Matthew for your great quality content, always to the point, easy to follow, I’ve discovered and learned so much thanks to your channel already and you keep on goving more 👏
@AlanRoseAtLeisure9 ай бұрын
Nicely summarized. Keep up the great work Matt.
@ArthurWolf9 ай бұрын
@Matthew Berman: @7:45 you said that the tokenization is done by a trained network. I don't think that's correct. Tokenization (BPE, SentencePiece, etc) is pretty straightforward ( word X = token A + token B, never changes... ) Do you have a source or something? I asked GPT4 and it seemed to agree: Does Tokenization Require Neural Networks? Tokenization does not require training neural networks. It is based on predetermined rules or algorithms that are applied to the text before it is fed into the neural network of the LLM. The algorithms used for tokenization are designed to be efficient and effective at breaking down text into manageable pieces without the need for the kind of learning or inference that neural networks are used for.
@grahaml60729 ай бұрын
Tokenisation does not require a neural network it is the embedding that typically use a neural network. Note the word “typically”.
@ArthurWolf9 ай бұрын
@@grahaml6072That's what I thought, thanks for confirming.
@EriCraftCreations6 ай бұрын
I appreciate this video. I love LLM
@paulk12409 ай бұрын
I have an issue with a statement you made, that a neural network is an algorithm. If I'm not mistaken a neural net is the data structure embodying the result of the training process which uses algorithms, i.e. tokenization, vector embeddimg and transorms.
@oratilemoagi97649 ай бұрын
Hey Matt do you perhaps know any Open-Source LAM (Large-action-model)
@SpragginsDesigns9 ай бұрын
From knowing absolutely nothing about LLMs to creating your own AI Girlfriend - Let's go 😆
@spaceghost88919 ай бұрын
Thank you very much for this video.
@saadatkhan95839 ай бұрын
in the pizza example, it would have been more.correct to say that the LLM provides you a general purpose conversational interface, which you then augment with a proprietary knowledge repository, such as your pizza price list, topping options, delivery details, specials, etc, and Only after that you have human reinforcement and fine tuning.Could we say that human reinforcement is a subset of fine tuning?
@ChristianIce9 ай бұрын
That's a wonderful introduction to LLMs I don't agree with some semantics, though. Several times you used the word "understand". That could be misleading, I would use the term "calculate", because there's not a real understanding of anything. And also, at the beginning you said that it works like our brains. That's a big statement, given that we don't know everything about it and that the mechanisms are very different. Our history proves that we can come up with new ideas that didn't exist before. An LLM is limited to the data it's composed of, what happens chemically is obviously different. We call it "neural networks" but it's more a metaphore than an actual reconstruction of the brain process. On the other hand, an LLM can be much more accurate than a human. People have all the informations and evidence about the shape of the planet, yet some of them think it's flat. Go figure :)
@vikramragunathan63925 ай бұрын
Excellent Explanation.
@hanjis58942 ай бұрын
I think hallucinations are the most glaring problem with LLMs right now. It’s obvious with a simple math or counting problem, but hard to catch when its bits of misinformation sprinkled into paragraphs of mostly true information. I also worry about AI browsing the web and potentially learning from other AI generated sources. The wrong info could accumulate this way and get picked up as true. Definitely curious to learn more about how data sets are vetted and whats being done there
@Davis_Green9 ай бұрын
Really enjoyed it.
@sethjchandler9 ай бұрын
This is a great summary of a lot of stuff I cover in my course, Large Language Models for Lawyers. Will recommend it to my class. Well done!
@Nora-ei4ph6 ай бұрын
Thanks! It's appreciated!
@janwanderer76609 ай бұрын
wow crystal clear explanation, thanks
@Netad974 ай бұрын
Hi, thanks for the video! You mentioned in the fine tuning section that there are many tools that could help you do that. Could you please give an example of some?
@liberty-matrix9 ай бұрын
"it's funny you know all these AI 'weights'. they're just basically numbers in a comma separated value file and that's our digital God, a CSV file." ~Elon Musk. 12/2023
@SiddamallaiahHS26 күн бұрын
Best one - among all I heard
@RajeshSamson5 ай бұрын
Very useful video for understanding LLM
@wingflanagan9 ай бұрын
I like this a lot. Although I was SHOCKED, ROCKED, and STUNNED because this CHANGES EVERYTHING! (just trying to help the algorithm). 😅 seriously, though. This kind of video is really needed now. Thumbs up! Oh, and I disagree that the use of copyrighted material for training isn’t fair use. It totally is. The act of repeating verbatim is the problem. It’s analogous to a person reading a book to learn something or for entertainment. This is OK, but re-publishing it as your own is _verboten._
@krzysztofrozbicki1776Ай бұрын
HEy Matthew you told in video that they will get rapidly - but the experience is telling something else. I think that Zipf`s law is holding the LLM from improving rapidly - maybe they are already very good and each increment of 1% needs so much resources that creating first model. What do you think I am very curious, do you think that Zipf`s law is holding the LLM? Do you think that we have already hit the ceiling and each iteration is only very better that it is almost unnoticable and improving existing models are somewhat of dimnishing returns? I am curious what do you think about that.
@matt.stevick3 ай бұрын
ai: may the strongest LLMs survive
@Hurlburton5 ай бұрын
I would need to watch this video a couple of times What I did understand was interesting hence the reason I will be watching it again
@matthew041019 ай бұрын
If you keep all of your videos and let A.I trains on it, eventually you can have it making videos and answering questions for you.
@sergeziehi48169 ай бұрын
Just 1 word: Perfect!! (1 token?)
@Jerrel.A5 ай бұрын
Very effective explanation. Thanks. Btw, the pace of the video can be adjusted under the wheel, in case it is perceived as too fast.
@shawntco7 ай бұрын
4:21 - The first RNN was made in 1924? Can you elaborate on that?
@StoneShards9 ай бұрын
Ok...so...how does the trained AI RUN?! How does the end user execute and interact with the "AI program"? Does it run as a "guest" within standard code? Is the AI functionality made available in a DLL, or other function library? How is the effect, or product, of the training stored, or retained after the training period? What, exactly, changes between the beginning and end of the training? That's what I'd like to know.
@stanislauyan32049 ай бұрын
Do you know any roadmap how to learn ai/ml for general practical use like human assistant for the different tasks? For an example the home automation or food suggestions?
@ishaandhawan20315 ай бұрын
great video 👍👍
@ChrisOrillia9 ай бұрын
Merci, Matty
@RichardGetzPhotography9 ай бұрын
This is great Matthew. Some comments on X for you.
@rajm53494 ай бұрын
thanx fr the xplntn, if pssble can u xplain any clinical prjcts
@nufh9 ай бұрын
I've been looking for a good new uncensored LLM. But not sure which one is good.
@VaibhavPatil-rx7pc9 ай бұрын
Is there ai model for telecom domain, any idea? How we can build it if not available?
@orkutmuratyilmaz9 ай бұрын
Beautiful video:)
@AlienAgencyorg9 ай бұрын
Nice hoodie bro !!!! Psycheeedeliiicccc
@RudeDude5639 ай бұрын
SHOKING introduction in to LLMs
@jbram079 ай бұрын
Is tokenization universal across all languages and/or model architectures?
@imrankw4 ай бұрын
Nice video.
@joronator7 күн бұрын
Gpt-4 is not a mixture of experts model, according to OpenAI it's a fully connected model.
@sil12359 ай бұрын
Handwritten text recognition is an interesting example, though it's one of the things the current AI models still suck at. At least on historical handwritten texts (e.g. parish records) where each writer had different style. We have millions of parish and other records in our archives that are waiting for such technology that would finally automatically index them... Guess I will stil wait...
@fredrikbergquist57349 ай бұрын
More hands-on vids would be nice especially using free or reasonable software that comes out!
@TheGijzzz9 ай бұрын
Can these models be trained with webcrawlers to learn from the net?
@janwanderer76609 ай бұрын
still a mystery how such LLMs can reason, mind blowing if you really think about it
@volkovolko9 ай бұрын
One other issue of LLMs is that they prefer hallucinate if they need an information rather than ask you this information. It is so frustrating
@Luxcium4 ай бұрын
if you asked me, as a TypeScript enthusiast what number is the largest between 4.9 and 4.11 I would tell you that no TypeScript version went as high as 4.11 because just after v4.9 was the version 5.0… What number is larger between 4.9 and 4.11 if it would have kept going it would have been 4.11 of course… obviously if you asked an LLM it would have wrongly chosen 4.9… that’s because for many years in a row LLMs have been laughed at for thinking that 4.9 and 4.90 was equal and now they just no longer know how to calculate numbers… Since normal people prefer Python to TypeScript you can ask the LLM to use its Python Code Interpreter (Yes the Alt’ Man AI version) to subtract one from the other and if the result is negative you have your answer 😅😅😅😅 if it is positive then 😮😮😮 You should have used TypeScript 😊
@Luxcium4 ай бұрын
The result of subtracting 4.11 from 4.9 is 0.79. This result is positive, which means that when treated as decimals, 4.9 is indeed considered larger than 4.11 which never have existed since it went to TypeScript version 5.0 right away… 😅😅😅😅
@bmanske19 ай бұрын
Eliza? That's the history of the last 100 years. They are all standing on the shoulders of giants. Then the curtain opens across the room because of the efforts of a small dog. Then a voice booms, "pay no attention to the giants in the room."
@petevichy90688 ай бұрын
Very Nice One :-)
@kiranwebros87149 ай бұрын
What is 1bit LLM?
@CrudelyMade9 ай бұрын
I've never really bought into the "trained on copyrighted material" being a moral argument. if a human can go into a library and read all the same material for free, and then used the gained knowledge to create new works, how is that different than an LLM processing the same 'free' information and creating its own work? and you can't say that it plagiarises or directly copies material because that's not how it works, it's basically taking a (complicated) average of all the works. which is by definition not a copy, because it can't be.
@Tony_Indiana9 ай бұрын
Unlock LLM, dont censor based on your own bias. Everyone should work now to ensure this doesn't happen in the future. When it does governments & large corps will control it all.
@havahand3 ай бұрын
"The worst they'll ever be" ... ? ChatGPT Omni? There seems to have been a reduction in quality recently. Behind the scenes, I think the statement is correct, but what the 'consumer' receives may be different.
@irasthewarrior8 ай бұрын
So what's the part you're not telling us ?
@defectiveresistor5 ай бұрын
I was under the impression that they called them black boxes and they didn’t have a clue how the AI worked
@ChipWhitehouse9 ай бұрын
INCREDIBLE video! Best explanation of how LLM’s work I have yet to see. BUT… can we stop using Terminator or iRobot (etc) graphics when we talk about AGI? It’s just adding to the massive amount of unnecessary fear-mongering already swarming around the web about AI. It might not seem harmful when used “once” in this video but EVERY AI VIDEO uses some horrifying graphic to represent AI or AGI. You are adding to this subtle brainwashing of people and making them more unnecessarily scared about our future 🤷♂️ There’s being cautious and then there is fearmongering for the sake of clicks and views (using horrifying graphics to represent AI in your thumbnails). You have a large platform and with that comes great responsibility (Oh yeah I said it 😏) Just something to think about… 🤷♂️
@kiyonmcdowell56039 ай бұрын
I think what we could be doing is creating a simulation of a soul or a new form of power,for our successor specie, witch is A.S.I
@stevewall70449 ай бұрын
Yes its fair use.
@manikantakowsik1759 ай бұрын
share that ppt also please
@antiquesordo9 ай бұрын
Great video. But simplify further. For example you mentioned multi-modal assuming everyone know it.
@MichealScott249 ай бұрын
❤
@Tony_Indiana9 ай бұрын
There is the problem data is hand picked then curated in a way the corporation wants. This is true for both trump leaning right or woke left.