Large Language Models: How Large is Large Enough?

14,863 views

IBM Technology

1 day ago

Comments: 24
@dominiquecoladon8343 1 year ago
Well-done video. Get Kip to do more of these, please.
@worldwar_two2894 2 months ago
excellent insight! luv this! Kip, keep it up! 🎊🎊
@MrTehRave 2 days ago
Video clean as hell. Great stuff
@thatdudewiththething 1 year ago
These videos are fantastic! Thank you so much for making them available :D
@ttjordan81 1 year ago
Thank you, this is the information I was searching for. I was explaining the concept in theory to someone. The idea was to use smaller models that are trained for specific domains. By eliminating or reducing all the other domains, the model should perform better and produce fewer messy results.
@LaurenFrazier-ch4kn 1 year ago
Great video, super informative!
@YvesNewman 1 year ago
Great video Kip! At the moment it seems that bigger equals better. Time to change that perception accordingly
@sherpya 1 year ago
already the trend, see mixture of experts concept
@IsaacFoster.. 1 year ago
My llm's so large, it reaches almost every 1 and 0 it can write on; you can literally call it a "wipe"
@gjjakobsen 1 year ago
The MBA in me says, beyond some point, the trade-off isn't worth it. Then again, that's probably what they said about the Apollo mission.
@donson3326 6 months ago
0:16 🤣 You tell me.
@di-egohumilde4515 3 months ago
I had EXACTLY the same thought LMAO
@7rich79 1 year ago
Thank you, that was informative. One question I have is how you determine domain specificity, and perhaps potential lost opportunity? For example, using financial services tasks as in your example. If you ask someone working in finance about what insights they'd be looking for, tax or perhaps transfer pricing may not be what they consider as part of their domain. However, transfer pricing and tax could have a huge impact on what finance should consider when taking decisions. How do you ensure the domain specificity is not too narrow?
@julioberas2106 11 months ago
I believe anything remotely related to the domains should be included in the training data. He didn't talk about the training data size, but I believe it should still be very big (but smaller than a general one)
@Alice8000 9 months ago
5:38 THANK YOU BRO. Definitely feel more confident after hearing that.
@nirmal7103 1 year ago
How can we find domain-specific models, or how do we train them?
@ttjordan81 1 year ago
I think that's the next business idea, lol... At this point, pick an industry and create a domain-specific model! It's a race! Also, domain-specific vector databases will be needed!
@einstein_god 1 year ago
It really depends
@tyrojames9937 1 year ago
INTERESTING. 😀
@aberobwohl 1 year ago
I see no point whatsoever in comparing a domain-specific fine-tuned model to a non-fine-tuned model in order to draw conclusions or suggest any insights.
@warsin8641 1 year ago
The Bloke
@deathlife2414 1 year ago
Let's go, Phi. chroot chroot chroot
@Alice8000 9 months ago
bro you got worse handwriting than me!!! Good info though. lol
@TheBiffsterLife 10 months ago
Kip, that’s a very poor analogy.