Finetuning Open-Source LLMs

  Рет қаралды 26,273

Sebastian Raschka

Sebastian Raschka

7 ай бұрын

This video offers a quick dive into the world of finetuning Large Language Models (LLMs). This video covers
- common usage scenarios for pretrained LLMs
- parameter-efficient finetuning
- a hands-on guide to using the 'lit-GPT' open-source repository for LLM finetuning
#FineTuning #LargeLanguageModels #LLMs #OpenAI #DeepLearning
Useful links to resources discussed in this video:
Code for the LLM classifier: github.com/rasbt/LLM-finetuni...
Lit-GPT repository: github.com/Lightning-AI/lit-gpt
NeurIPS LLM efficiency challenge: llm-efficiency-challenge.gith...
My latest articles on LLM research: magazine.sebastianraschka.com/

Пікірлер: 12
@Dom-zy1qy
@Dom-zy1qy 23 сағат бұрын
Very much appreciate this video, fine-tuning seemed like a somewhat amorphous concept to me for sometime, but the diagrams you showed really made it easier to understand how people finetune.
@SebastianRaschka
@SebastianRaschka 22 сағат бұрын
Thanks so much, glad these diagram were helpful and helped clarifying!
@captinbo1
@captinbo1 5 ай бұрын
Thanks! Great overview
@mysticaltech
@mysticaltech 4 ай бұрын
Awesome, thank you!
@prakhargurawa
@prakhargurawa 4 ай бұрын
Thank you :)
@nadranaj
@nadranaj 3 ай бұрын
Thanks
@Mayur7Garg
@Mayur7Garg 6 ай бұрын
One of the approaches I have experimented with, which is both manual labor, time and compute expensive but more reliable, is as follows: - Use a LLM to query for outputs. Use RAG and prompt engineering to get the best possible results. - Generate chat logs for each query. The log should include everything - the prompt, the retrieved info if any and the model output. Any special symbol such as to denote the system prompt or anything else should also be left in. This is because LLMs are text generation models with no concept of chat. - Manually update the model outputs to better reflect the expected output. This is a data creation task. - Fine tune a copy of the same LLM using PEFT using the updated chat logs. This can also be done iteratively as long the chat logs are generated initially by a model which hasn't been fine-tuned yet. Like a sort of A/B experiment. Some use cases are served the original model that generates the data for fine-tuning while the other are served the fine-tune model whose outputs are not used for any further fine-tuning. Expensive but over time, your model would work better for realistic inputs.
@lalmuansangachhakchhuak4927
@lalmuansangachhakchhuak4927 7 ай бұрын
Cool
@mohammadkad
@mohammadkad 6 ай бұрын
Amazing, Thanks
@zjffdu
@zjffdu Ай бұрын
Thanks for the video, very helpful for me to understand different kinds of finetunning. BTW, what kind of finetunnig is huggingface belong to?
@SebastianRaschka
@SebastianRaschka Ай бұрын
Glad that it was helpful! HF itself has different tools for finetuning. Similarly, the LitGPT library I help developing supports full finetuning, LoRA, QLoRA, etc.
@muhammadanas7698
@muhammadanas7698 7 ай бұрын
Time saw you here on YT! Hope you remember me.!
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
Рет қаралды 223 М.
"okay, but I want Llama 3 for my specific use case" - Here's how
24:20
Which one will take more 😉
00:27
Polar
Рет қаралды 82 МЛН
Зомби Апокалипсис  часть 1 🤯#shorts
00:29
INNA SERG
Рет қаралды 6 МЛН
QLoRA-How to Fine-tune an LLM on a Single GPU (w/ Python Code)
36:58
Fine-tune LLMs - Line by line code example
8:21
Scientific Coding
Рет қаралды 2,4 М.
Should You Use Open Source Large Language Models?
6:40
IBM Technology
Рет қаралды 329 М.
[1hr Talk] Intro to Large Language Models
59:48
Andrej Karpathy
Рет қаралды 1,8 МЛН
GPT-4o Deep Dive: the AI that CRUSHES everything
28:11
AI Search
Рет қаралды 53 М.
Insights from Finetuning LLMs with Low-Rank Adaptation
13:49
Sebastian Raschka
Рет қаралды 3,7 М.
Let's build GPT: from scratch, in code, spelled out.
1:56:20
Andrej Karpathy
Рет қаралды 4,2 МЛН
APPLE УБИЛА ЕГО - iMac 27 5K
19:34
ЗЕ МАККЕРС
Рет қаралды 94 М.
Внутренности Rabbit R1 и AI Pin
1:00
Кик Обзор
Рет қаралды 2,1 МЛН
3D printed Nintendo Switch Game Carousel
0:14
Bambu Lab
Рет қаралды 2 МЛН