What Makes Large Language Models Expensive?

  Рет қаралды 75,504

IBM Technology

IBM Technology

Күн бұрын

Пікірлер: 78
@KP-sg9fm
@KP-sg9fm 11 ай бұрын
Can you make a video talking about smaller more effecient models (Orca, Phi II, Gemini Nano, etc) Do they have a future, and if so, what does it look like? Will more sota models leverage the techniques used by smaller models to become more effiecient? Or will they always remain separate?
@teleprint-me
@teleprint-me 11 ай бұрын
There are pros and cons to each approach. Larger models are scaled in a way that makes their capabilities proportional to their parameters. So, larger models are smarter and that will always be the case. Both techniques feed off of one another, so improvements in one will lead to improvements in another. It's cheaper and easier and faster to iterate over smaller models and any gains made throughout the process are applied to larger models. Not sure if this helps. Anyone can feel free to correct me if I misrepresented any information.
@imanrezazadeh
@imanrezazadeh 11 ай бұрын
Excellent explanation! A minor note: the analogy of curtain makes sense, but then you mentioned fine-tuning makes structural changes to the parameters, which is not accurate. It just changes the values of the parameters.
@aymerico11
@aymerico11 10 ай бұрын
How does it change the value ? Is it token change ? Basically it means that once you've tuned your model f(x) no longer equals y but actually z right ?
@aqynbc
@aqynbc 11 ай бұрын
Another excellent videos that makes you understand the fundamentals of an otherwise complicated subject.
@jediTempleGuard
@jediTempleGuard 11 ай бұрын
I think customized language models will become more important over time. Companies will want artificial intelligence applications specific to their fields of activity, and individuals will want artificial intelligence applications specific to their special interests. Not to sound like I'm telling fortunes, but with improvements in cost, customized smaller models may become more dominant in the market.
@Cahangir
@Cahangir 11 ай бұрын
what types of AI apps would individuals want apart from personal assistants that would need customizing?
@Anurag_Hansda
@Anurag_Hansda 10 ай бұрын
I very much agree with you... Google could be much more efficient by giving specific detail.
@RajeshR-bz3nj
@RajeshR-bz3nj 2 ай бұрын
@@CahangirIndustry specific LLMs. If I am a pancreatic cancer research company, I don’t want to know about Renaissance in Europe
@webgpu
@webgpu 11 ай бұрын
anyone noticed she kept on talking * while * writing ? women are real multitaskers - i swear to God my brain is 100% monotask and i could never Ever: write AND do anything else. The apex of my manly monotaskiness is to be able to talk while i'm driving (but i can only talk about light subjects, if you talk about anything a little more involved, i will just not follow you.
@shaniquedasilva1856
@shaniquedasilva1856 11 ай бұрын
Great video Jessica and so informative!! I’m working on a project now implementing Gen AI (gen fallback, generators). Identifying proper use cases are so important to yield the best results while thinking about the # of LLM calls.
@unclenine9x9
@unclenine9x9 9 ай бұрын
Yes, we need to select the suitable LLMs for pickings up the request with cost effective way. Thus the cost of operation should be lowered.
@ameliarose6833
@ameliarose6833 2 ай бұрын
absolutely love this video. You really answers so many questions to a person who had to know how thing work from the very beginning in order to learn a new skill. Thank you so much.
@carkawalakhatulistiwa
@carkawalakhatulistiwa 11 ай бұрын
And PHI-2 with 2,7 B billion parameters. proves that we have spent a lot of time and money on computerization that is wasted because of bad data. with better data PHI-2 LLM can be equivalent to gpt 3 175 billion parameters . and there is still the possibility to reduce LLM to 1 billion parameters with the same capabilities
@akj3344
@akj3344 11 ай бұрын
There are 1B models on huggingface made for RAGs.
@Murat-hh4hu
@Murat-hh4hu 11 ай бұрын
For a moment I thought she is AI generated)
@CYBERPOX
@CYBERPOX 11 ай бұрын
Truth
@Beny123
@Beny123 11 ай бұрын
Don’t blame you . Pretty
@MohitSharma-dv7mg
@MohitSharma-dv7mg 11 ай бұрын
Yeah and looked finely tuned!
@Alice8000
@Alice8000 8 ай бұрын
nope u didn't
@uduakedet2861
@uduakedet2861 13 күн бұрын
😂😂😂
@bastabey2652
@bastabey2652 11 ай бұрын
I once attended a whole day IBM sales presentation in Delhi for telco CRM/Billing system.. it was an educational experience more than sales.. IBM sales is really good
@renanmonteirobarbosa8129
@renanmonteirobarbosa8129 11 ай бұрын
There are mistakes with the information provided. PEFT and Lora are separate things model size is influenced mostly by numerical choice and how you compile the GPU kernel. ...
@fasteddylove-muffin6415
@fasteddylove-muffin6415 11 ай бұрын
You walk into a dealership & ask a salesperson how much a vehicle will cost. Answer: This vehicle will cost you whatever you're willing to pay.
@attainconsult
@attainconsult 11 ай бұрын
this is a great start to costing running models, I think you need to think/explain more along the lines of business i.e. adding in all biz file/google/365 docs, biz emails, other biz data sales cash flow, stock usage, forecasting usage of consumables lettuces coffee... all the things biz work off
@emil8367
@emil8367 11 ай бұрын
Very interesting and useful. Thanks for explaining so many topics !
@saikatnextd
@saikatnextd 10 ай бұрын
Thanks Jessica for this video, really eye opening and introspective at the same time.......
@luciengrondin5802
@luciengrondin5802 11 ай бұрын
Stumbled upon this and feel like asking : how did IBM miss the LLM train? Watson was very impressive IMHO. Very much ahead of its time. How could IBM not capitalize on it? Why was it OpenAI that ended up with the language model breakthrough? Which innovation openAI had that IBM could not think of? Was it RLHF?
@VoltLover00
@VoltLover00 11 ай бұрын
You can easily google the answer to your question
@silberlinie
@silberlinie 11 ай бұрын
They used an interesting technique to record the video.
@benthiele
@benthiele 11 ай бұрын
Incredibly helpful video. Please make more!
@teresafarrer1252
@teresafarrer1252 11 ай бұрын
Great video: really clear and professional (unlike a couple of the saddos commenting). Thanks!
@mohsenghafari7652
@mohsenghafari7652 9 ай бұрын
hi. please help me. how to create custom model from many pdfs in Persian language? tank you.
@seanlee2002
@seanlee2002 11 ай бұрын
Excellent explanation. A great understanding of how AI works
@oieieio741
@oieieio741 11 ай бұрын
Excellent explanation. A solid understanding of how AI works. Thanks IBM
@jhaimp.sullivan5618
@jhaimp.sullivan5618 2 ай бұрын
Bot? There is another comment saying the exact same thing. Interesting.. I'm noticing a pattern.. just noticed this on another video. Not knocking whoever's behind doing this. But if your going through the trouble of using different accounts why use the same exact comment? Anyways. I'm just halfway curious. Don't really care tbh. I have other reasons behind my curiosity not necessarily bad .. just couldn't resist but to address and pry to a degree not to expose but . Eh idk. Do not wish to further elaborate.
@sambistabeauty
@sambistabeauty 5 күн бұрын
Why LLMs Cost So Much (noun clause) NO question mark / vs Why DO LLMs cost so much? (Question form)
@LeonButler-b8r
@LeonButler-b8r 11 ай бұрын
Great explanation Jessica
@team-m2
@team-m2 11 ай бұрын
Great and concise, thanks! But ... is she writing from the right to the left? 🤔
@gihan5812
@gihan5812 11 ай бұрын
How can i speak to someone at IBM about working together.
@ChrisJSnook
@ChrisJSnook 10 ай бұрын
What software solution powers this mirrored whiteboard in front of you? It’s awesome and I want to use it?
@djembello
@djembello 5 ай бұрын
I think it can be simple done by rotating/fliping the video itself :)
@markfitz8315
@markfitz8315 11 ай бұрын
very good - thanks
@Alice8000
@Alice8000 8 ай бұрын
Daaaaamn woman. Good explanation.
@aymerico11
@aymerico11 10 ай бұрын
Very good video thanks a lot !
@johnnyalam7301
@johnnyalam7301 11 ай бұрын
Very nicely and intelligently explained 3:49 pm ( Christmas Day 2023)
@gamingbeast710
@gamingbeast710 11 ай бұрын
awsome , 100% focued :D thx for the professionalisme :D
@mrd6869
@mrd6869 11 ай бұрын
Small and powerfulmodels will win out.Phi 2 and Orca2 are some good examples.
@AdamSioud
@AdamSioud 11 ай бұрын
Great video
@reazulislam8446
@reazulislam8446 11 ай бұрын
So precise..
@wzqdhr
@wzqdhr 11 ай бұрын
Does IBM have anything to do with this AI booming?
@potatodog7910
@potatodog7910 11 ай бұрын
How much of this can be done with GPTs?
@scottyb3b7
@scottyb3b7 11 ай бұрын
A GPT is just one type of an LLM
@cleansebob1
@cleansebob1 11 ай бұрын
Looks like it all depends...
@rursus8354
@rursus8354 11 ай бұрын
If you cannot find the best man, take the next best.
@jameshopkins3541
@jameshopkins3541 10 ай бұрын
LLM IS BLA BLA BLAAAAA??????
@jameshopkins3541
@jameshopkins3541 10 ай бұрын
THEN A COMMON PERSON CAN'T DO A LLM FROM SCRATCH???
@joung-joonlee1037
@joung-joonlee1037 11 ай бұрын
I think, that LLM or GAI Look like Spread-Sheet if concern the facts that this type of engine inject By SELF toward tokens and Spell Out tokens..!! AND This type of tokens look like iterated by LLM or GAI, because that is also programs using Computer Iterations...! AND The LLM or GAI's using cost can be acquired using calculations over Time/Number of Tokens/Weight of Meaning.... But, I know that this calculations is just approximation by User. Thank you for NICE Video! and I'm korean.
@joung-joonlee1037
@joung-joonlee1037 11 ай бұрын
😗
@potatodog7910
@potatodog7910 11 ай бұрын
Nice
@jameshopkins3541
@jameshopkins3541 10 ай бұрын
She is 36 years old Isn't it?
@NisseOhlsen
@NisseOhlsen 11 ай бұрын
What makes them so expensive? Simple. Their Architecture is not right.
@SiegelBantuBear
@SiegelBantuBear 7 ай бұрын
🙏🏼
@ciphore
@ciphore 11 ай бұрын
Nancy Pi did it first 😤
@Free-pp8mr
@Free-pp8mr 11 ай бұрын
It is not intelligent to pay for AI! It’s simply marketing!
@markmaurer6370
@markmaurer6370 11 ай бұрын
1:19 So IBM does not believe consumers need to have their data protected.
@reninj
@reninj 2 ай бұрын
How is it that, these videos still give such basic generic examples? Use cases for example. She couldn't find different use cases that an enterprise might have? She had to give the example of a car dealership???
@thierry-le-frippon
@thierry-le-frippon 10 ай бұрын
People will pay for that 😅😅😅 ???
@Canadainfo
@Canadainfo 11 ай бұрын
amazon bedrock!!
@Drunrealer
@Drunrealer 8 ай бұрын
Drink from de bottle
@michaelm8460
@michaelm8460 2 ай бұрын
anthropomorphism makes you forget you have another (abiet sophisticated ) search engine. Worse is the Model can enforce that idea by using personal pronouns
@aprilmeowmeow
@aprilmeowmeow 8 ай бұрын
so sad that people cant even write a speech anymore.
@bobanmilisavljevic7857
@bobanmilisavljevic7857 11 ай бұрын
🦾🥳
@ashishsehrawat_007
@ashishsehrawat_007 11 ай бұрын
Kinda boring explanation.
Ten Everyday Machine Learning Use Cases
7:07
IBM Technology
Рет қаралды 45 М.
Why Large Language Models Hallucinate
9:38
IBM Technology
Рет қаралды 209 М.
Do you love Blackpink?🖤🩷
00:23
Karina
Рет қаралды 23 МЛН
小路飞和小丑也太帅了#家庭#搞笑 #funny #小丑 #cosplay
00:13
家庭搞笑日记
Рет қаралды 10 МЛН
SIZE DOESN’T MATTER @benjaminjiujitsu
00:46
Natan por Aí
Рет қаралды 7 МЛН
Become a value creator with generative AI
14:38
IBM Technology
Рет қаралды 54 М.
Visualizing transformers and attention | Talk for TNG Big Tech Day '24
57:45
4 Methods of Prompt Engineering
12:42
IBM Technology
Рет қаралды 162 М.
AI can't cross this line and we don't know why.
24:07
Welch Labs
Рет қаралды 1,3 МЛН
Microsoft Ignite 2024: Everything Revealed in 15 Minutes
15:03
Generative AI in a Nutshell - how to survive and thrive in the age of AI
17:57
Large Language Models explained briefly
8:48
3Blue1Brown
Рет қаралды 698 М.
Do you love Blackpink?🖤🩷
00:23
Karina
Рет қаралды 23 МЛН