Fine Tuning Large Language Models with InstructLab

  Рет қаралды 29,675

IBM Technology

IBM Technology

Күн бұрын

Пікірлер: 42
@cloudnativecedric
@cloudnativecedric Ай бұрын
Many thanks for the opportunity!
@murunatan6661
@murunatan6661 Ай бұрын
This is really good. Easy to understand and implement.
@eduardoguevara7880
@eduardoguevara7880 2 күн бұрын
8 minutes to start tapping on open source potential - great content, thanks 🫡
@taygundogan
@taygundogan Ай бұрын
Yeeeey Cedric, more videos from Cedric please please!!
@khairullahhabib3982
@khairullahhabib3982 Ай бұрын
Very good delivery! I can watch this guy explain stuff all day. Keep it up! I can't believe all this knowledge is just free out here
@josephmyalla3611
@josephmyalla3611 18 күн бұрын
Finally there is an open source tool for llm fine-tuning this amazing.
@ryansun8256
@ryansun8256 9 күн бұрын
A google search returns multiple opensource fine tuning tools like axolotl, llama factory. What makes this one different?
@stanTrX
@stanTrX Ай бұрын
Thanks. Does it work the same with ollama?
@volkovolko
@volkovolko 23 күн бұрын
Great Tool, I was waiting a full well made video of this tool and here it is ! A collab notebook would be great if possible 😉
@learnbydoingwithsteven
@learnbydoingwithsteven Ай бұрын
Very clear. Gonna try it.
@gokcerbelgusen1062
@gokcerbelgusen1062 Ай бұрын
I will try this, thank you
@SameerJatoi-w3e
@SameerJatoi-w3e 7 күн бұрын
What if I want to fine tune it on a language other than English ?
@PB-kx4vv
@PB-kx4vv Ай бұрын
InstructLab presentations lead me to fantasize about training a model to shorten the learning curve for large open source projects. For example, the code-aster finite element package, with huge amounts of documentation and many documented test cases can many structural and dynamic and even thermal mechanical systems. However, the combinations of features which work compatibly with each other feels to a beginner like a fractal landscape. It is ok to go through an example, but it is easy to loose footing at near adjacencies. It would be nice to talk to a model about strategies to construct a new model, which can reference particular documents and examples, and identify prospective strategies as self conflicting. But when I imagine mapping this problem to instruct lab, I imagine it to be a more daunting task than just working with the program and gaining experience, and reading a lot.
@maneeshs3876
@maneeshs3876 Ай бұрын
Nice video !
@ml00000
@ml00000 Ай бұрын
Excellent presenter!
@justwanderin847
@justwanderin847 Ай бұрын
I was just wondering how they really train AI. This helps.
@광광이-i9t
@광광이-i9t 29 күн бұрын
Thanks!!
@kingshukbasak7363
@kingshukbasak7363 4 күн бұрын
Where is the InstructLab channel?
@ZakinAbdul
@ZakinAbdul Ай бұрын
That was awesome, and I was wondering, can we fine-tune that model with an RAG chatbot-like, chat with it and feed it new info through our chats?
@andrewcameron4172
@andrewcameron4172 Ай бұрын
What version of ilab were you running in this demo?
@cloudnativecedric
@cloudnativecedric Ай бұрын
Ah, so this was InstructLab v.17 when we recorded :)
@AmeerHamza-cy6km
@AmeerHamza-cy6km Ай бұрын
how can I train it on PHP programming language, and some php projects.
@aganithshanbhag
@aganithshanbhag Ай бұрын
question answer set (vast training material on php programming)
@munawwarkhan1926
@munawwarkhan1926 Ай бұрын
This is a great video and a good intro to an amazing tool. Just one suggestion, it does need some knowledge and background of computer science and data structures. I don't think it is for people with zero knowledge or background as the video suggestsin the beginning. Amazing content IBM, learning a lot here.
@cloudnativecedric
@cloudnativecedric Ай бұрын
Thank you very much for the feedback! That is true, there are some basics that are helpful in doing this, as well as terminal usage skills, but what we're working on as well is a user interface for the upstream InstructLab project, so it's essentially a simple form to include Q&A pairs, source documents, and attribution! Then the rest of the process like data generation and training is automated :)
@Pregidth
@Pregidth Ай бұрын
@@cloudnativecedric If I understand correctly, by providing exact Q&A pairs during the fine-tuning process, we are effectively guiding the LLM to produce specific, deterministic answers to certain questions. Does this mean we are reducing the inherent randomness in the answers that LLMs typically generate based on their pre-trained weights? If so, wouldn’t this approach limit the model’s flexibility to incorporate its broader pre-trained knowledge into the context of the fine-tuned domain?
@LoVe-iu9rd
@LoVe-iu9rd Ай бұрын
May I know what is your laptop spec?
@gauravmodi12
@gauravmodi12 Ай бұрын
How much data it need to do proper fine tuning ?
@george_davituri
@george_davituri Ай бұрын
impressive, need to try cool stuff
@ajaykumarpandey7327
@ajaykumarpandey7327 25 күн бұрын
Which laptop is being used here
@nadoiz
@nadoiz 18 күн бұрын
You say that you have to link the data you created to a GitHub link, and then a pull is done. Is this mandatory?
@activewire-web5710
@activewire-web5710 Ай бұрын
What about hallucinations or guardrails
@rajavemula3223
@rajavemula3223 Ай бұрын
Can fine tuning can be done with cpu? I mean without gpu?
@philtoa334
@philtoa334 Ай бұрын
Nice.
@jacquesgastebois
@jacquesgastebois Ай бұрын
I want to do the same with a tiny model please
@vdpoortensamyn
@vdpoortensamyn Ай бұрын
Our Granite models are quite tiny. 😊
@nazarmohammed5681
@nazarmohammed5681 29 күн бұрын
Plz share the Github repo
@Jobfox645
@Jobfox645 3 күн бұрын
Sorry but this is PEFT with Lora, not fine tuning the LLm to create a new base model
@NhatNguyen-bq6jj
@NhatNguyen-bq6jj Ай бұрын
Quantum AI
@godfather_2001
@godfather_2001 28 күн бұрын
Kate Winslet, Anne hathaway 🤭
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 394 М.
RAG vs. Fine Tuning
8:57
IBM Technology
Рет қаралды 130 М.
How Strong Is Tape?
00:24
Stokes Twins
Рет қаралды 96 МЛН
Transformers (how LLMs work) explained visually | DL5
27:14
3Blue1Brown
Рет қаралды 4,5 МЛН
Llama: The Open-Source AI Model that's Changing How We Think About AI
8:46
Let's fine-tune an LLM using the InstructLab project
8:16
InstructLab
Рет қаралды 1,9 М.
From Idea to AI: Building Applications with Generative AI
7:13
IBM Technology
Рет қаралды 18 М.
Can AI Think? Debunking AI Limitations
9:01
IBM Technology
Рет қаралды 21 М.
What is a Context Window? Unlocking LLM Secrets
11:31
IBM Technology
Рет қаралды 8 М.
Zuckerberg DROPS AI BOMBSHELL: The End Of Software Engineers
19:41
"okay, but I want Llama 3 for my specific use case" - Here's how
24:20