No more Fine-Tuning: Unsupervised ICL+

  Рет қаралды 4,600

code_your_own_AI

code_your_own_AI

Күн бұрын

A new Paradigm of AI, Unsupervised In-Context Learning (ICL) of Large Language Models (LLM). Advanced In-Context Learning for new LLMs w/ 1 Mio token context length open up new possibilities for NEW Autonomous LEARNING of LLMs. Google DeepMind published new research on advanced ICL: ICL+.
In this video we go from few-shot ICL (In-Context Learning) to many-shot ICL to improve the performance of our LLM (w/ 1M token) significantly. But note: it is domain specific and sensitive to causal reasoning patterns learned during pre-training of the LLM.
A new Paradigm of AI, Unsupervised In-Context Learning (ICL).
Forget Fine-Tuning: Unsupervised ICL Reinforced
00:00 1Mio token Context Length LLM ICL
05:26 From Few-Shot to MANY-Shot ICL
06:58 Performance jump
10:55 Benchmark data on ICL+
16:05 Max Performance ICL
18:34 Reinforced and Unsupervised ICL
26:51 Autonomous Learning LLM by advanced ICL
#airesearch #aieducation #ai

Пікірлер: 18
@davidbarton3361
@davidbarton3361 Ай бұрын
This is really interesting, but until commercial LLMs don't charge for input tokens how viable is this? I can only imagine my OpenAI rate limits and credit card smoking if I decided to pump that many input tokens through for a single query. Even 32k tokens per request burns money fast. OTOH fine tuning LLMs is really low cost. If you were to fine tune that many examples with Mixtral, the performance might improve a lot more than that. And then using Lorax to have multiple fine tunes running on the same hardware makes this a lot more cost effective.
@user-hd9or3dp1t
@user-hd9or3dp1t Ай бұрын
I did a 2 month I took a two-month break, and so much has happened! You've been putting out a ton of content too. How am I supposed to catch up with all of this haha
@umbertosurricchio5365
@umbertosurricchio5365 Ай бұрын
Amazing topic❤ please let's do more videos about It and I am interested about particolar context where special words and concepts are used , like Healtcare, Finance, Legal context. Thank You so much in Advance.🙏
@BradleyKieser
@BradleyKieser Ай бұрын
Fascinating.
@mohamedfouad1309
@mohamedfouad1309 Ай бұрын
This is amazing 😂
@gileneusz
@gileneusz Ай бұрын
8:15 this will increase prompt evaluation time to the roof
@johnytheripper
@johnytheripper Ай бұрын
and cost per inference call
@kengonzo1640
@kengonzo1640 Ай бұрын
This is interesting I'm trying to imagine how parsing principles of the of thought into subdivided components are then rearranged to create a different outcome much like order of operations affects math equations
@markopancic6060
@markopancic6060 10 күн бұрын
Could it be that the wider variety of problems suggested in the unsupervised icl is just activating math related attention heads allowing it to solve the problems. where as QA might be less varied and cause more of a pigeonhole effect? I feel like this has been seen with some of deepminds RL work where less prescriptive performs better that prescriptive work.
@chituyiwakhusama9944
@chituyiwakhusama9944 Ай бұрын
ICL many-shot inference will give you a better-grounded output. The tradeoff is cost and latency (Groq can save us on this 😊 but not on that')
@explorer945
@explorer945 Ай бұрын
I haven't finished watching the video yet, but what about the cost?
@code4AI
@code4AI Ай бұрын
I haven't finished reading your complete question yet, but π?
@explorer945
@explorer945 Ай бұрын
@@code4AI sorry I don't understand your pi reference..later I finished it. You didn't mention cost, security issues with long context. Anyways, please concise videos. There are way too many videos to watch.
@daryladhityahenry
@daryladhityahenry Ай бұрын
@@explorer945 lol.. Please don't shorten the video. I like his teaching because it's great to study, easy to understand even though this isn't my field. :D:D.
@explorer945
@explorer945 Ай бұрын
@@daryladhityahenry got it. Ok, don't change anything. I am not the type of audience as I have to consume 10s of videos and can't afford to have each 30min. Sometimes, I feel like I can do my own reading, tell me the gist and high level points. Happens when you are dealing with so many updates. Not sure how many channels you follow . It's really difficult. You can see why @code4ai doesn't have as many subscriptions as others for same reasons.
@daryladhityahenry
@daryladhityahenry Ай бұрын
@@explorer945 Yes yes I can understand your point very well.. hahahah. I follow so much channel, but I didn't watch all of them. Just a couple ( less then ten a day I think ). Anyway, I do really understand your point and also about his long video format that makes him not getting much subs. But... I happen to be one of the one that likes it :D:D:D.
@pi5549
@pi5549 Ай бұрын
Can you try to summarise the innovation really early on in the video? My experience watching this is like trying to ride a bicycle too slowly. I just can't focus on anything.
In-Context Learning: EXTREME vs Fine-Tuning, RAG
21:42
code_your_own_AI
Рет қаралды 3,4 М.
RAG explained step-by-step up to GROKKED RAG sys
59:31
code_your_own_AI
Рет қаралды 3,4 М.
FOOTBALL WITH PLAY BUTTONS ▶️❤️ #roadto100million
00:20
Celine Dept
Рет қаралды 14 МЛН
IS THIS REAL FOOD OR NOT?🤔 PIKACHU AND SONIC CONFUSE THE CAT! 😺🍫
00:41
New Discovery: LLMs have a Performance Phase
29:51
code_your_own_AI
Рет қаралды 12 М.
BEST RAG you can buy: LAW AI (Stanford)
19:12
code_your_own_AI
Рет қаралды 2,5 М.
LLM - Reasoning SOLVED (new research)
47:51
code_your_own_AI
Рет қаралды 12 М.
New Trick for Fine-Tuning LLMs #airesearch
27:23
code_your_own_AI
Рет қаралды 2,4 М.
TransformerFAM: Feedback attention is working memory
37:01
Yannic Kilcher
Рет қаралды 34 М.
MAMBA from Scratch: Neural Nets Better and Faster than Transformers
31:51
Algorithmic Simplicity
Рет қаралды 120 М.
Understand DSPy: Programming AI Pipelines
28:21
code_your_own_AI
Рет қаралды 3,2 М.
"okay, but I want Llama 3 for my specific use case" - Here's how
24:20
Очень странные дела PS 4 Pro
1:00
ТЕХНОБЛОГ ГУБАРЕВ СЕРГЕЙ
Рет қаралды 469 М.
iPhone 12 socket cleaning #fixit
0:30
Tamar DB (mt)
Рет қаралды 30 МЛН
How much charging is in your phone right now? 📱➡️ 🔋VS 🪫
0:11
Непробиваемый телевизор 🤯
0:23
FATA MORGANA
Рет қаралды 73 М.
ПРОБЛЕМА МЕХАНИЧЕСКИХ КЛАВИАТУР!🤬
0:59
Корнеич
Рет қаралды 3,8 МЛН