Fine Tuning Large Language Models with InstructLab

Рет қаралды 29,675

IBM Technology

Күн бұрын

Пікірлер: 42

@cloudnativecedric Ай бұрын

Many thanks for the opportunity!

@murunatan6661 Ай бұрын

This is really good. Easy to understand and implement.

@eduardoguevara7880 2 күн бұрын

8 minutes to start tapping on open source potential - great content, thanks 🫡

@taygundogan Ай бұрын

Yeeeey Cedric, more videos from Cedric please please!!

@khairullahhabib3982 Ай бұрын

Very good delivery! I can watch this guy explain stuff all day. Keep it up! I can't believe all this knowledge is just free out here

@josephmyalla3611 18 күн бұрын

Finally there is an open source tool for llm fine-tuning this amazing.

@ryansun8256 9 күн бұрын

A google search returns multiple opensource fine tuning tools like axolotl, llama factory. What makes this one different?

@stanTrX Ай бұрын

Thanks. Does it work the same with ollama?

@volkovolko 23 күн бұрын

Great Tool, I was waiting a full well made video of this tool and here it is ! A collab notebook would be great if possible 😉

@learnbydoingwithsteven Ай бұрын

Very clear. Gonna try it.

@gokcerbelgusen1062 Ай бұрын

I will try this, thank you

@SameerJatoi-w3e 7 күн бұрын

What if I want to fine tune it on a language other than English ?

@PB-kx4vv Ай бұрын

InstructLab presentations lead me to fantasize about training a model to shorten the learning curve for large open source projects. For example, the code-aster finite element package, with huge amounts of documentation and many documented test cases can many structural and dynamic and even thermal mechanical systems. However, the combinations of features which work compatibly with each other feels to a beginner like a fractal landscape. It is ok to go through an example, but it is easy to loose footing at near adjacencies. It would be nice to talk to a model about strategies to construct a new model, which can reference particular documents and examples, and identify prospective strategies as self conflicting. But when I imagine mapping this problem to instruct lab, I imagine it to be a more daunting task than just working with the program and gaining experience, and reading a lot.

@maneeshs3876 Ай бұрын

Nice video !

@ml00000 Ай бұрын

Excellent presenter!

@justwanderin847 Ай бұрын

I was just wondering how they really train AI. This helps.

@광광이-i9t 29 күн бұрын

Thanks!!

@kingshukbasak7363 4 күн бұрын

Where is the InstructLab channel?

@ZakinAbdul Ай бұрын

That was awesome, and I was wondering, can we fine-tune that model with an RAG chatbot-like, chat with it and feed it new info through our chats?

@andrewcameron4172 Ай бұрын

What version of ilab were you running in this demo?

@cloudnativecedric Ай бұрын

Ah, so this was InstructLab v.17 when we recorded :)

@AmeerHamza-cy6km Ай бұрын

how can I train it on PHP programming language, and some php projects.

@aganithshanbhag Ай бұрын

question answer set (vast training material on php programming)

@munawwarkhan1926 Ай бұрын

This is a great video and a good intro to an amazing tool. Just one suggestion, it does need some knowledge and background of computer science and data structures. I don't think it is for people with zero knowledge or background as the video suggestsin the beginning. Amazing content IBM, learning a lot here.

@cloudnativecedric Ай бұрын

Thank you very much for the feedback! That is true, there are some basics that are helpful in doing this, as well as terminal usage skills, but what we're working on as well is a user interface for the upstream InstructLab project, so it's essentially a simple form to include Q&A pairs, source documents, and attribution! Then the rest of the process like data generation and training is automated :)

@Pregidth Ай бұрын

@@cloudnativecedric If I understand correctly, by providing exact Q&A pairs during the fine-tuning process, we are effectively guiding the LLM to produce specific, deterministic answers to certain questions. Does this mean we are reducing the inherent randomness in the answers that LLMs typically generate based on their pre-trained weights? If so, wouldn’t this approach limit the model’s flexibility to incorporate its broader pre-trained knowledge into the context of the fine-tuned domain?