If you are interested in learning more about how to build robust RAG applications, check out this course: prompt-s-site.thinkific.com/courses/rag
@spicer412828 ай бұрын
Thank you! More fine tuning case studies please on Llama 3! Much appreciated 🙏 your presentation on this!
@engineerprompt8 ай бұрын
Will be making alot more on it. Stay tuned.
@pfifo_fast8 ай бұрын
This video lacks alot of helpful info... Anyone can just open the examples and read them just the same as you did. I would have liked to be given extra detail and tips about how to actually do fine-tuning... Some of the topics I am struggling with include, how to load custom data, how to use a different prompt template, how to define validation data, when to use validation data, what learning rates are good, how do i determine how many epochs to run... Im sorry buddy, but I have to give this video a thumbs down as it really truly and honestly dosent provide any useful info that isnt already in the notebook.
@ueka247 ай бұрын
Hello, have you already found any other video or article about that? I am also struggling with the same issue.
@SpicyMelonYT6 ай бұрын
@@ueka24 yeah me too, still not sure how to make a custom dataset and send it in
@SpicyMelonYT6 ай бұрын
@@ueka24 oh actually I figured it out. Well specifically the dataset thing. Make sure you run the lora part too as I didn't at first thinking he said not to. But this is the code I ran: alpaca_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request. ### Instruction: {} ### Input: {} ### Response: {}""" EOS_TOKEN = tokenizer.eos_token # Must add EOS_TOKEN def formatting_prompts_func(examples): # instructions = examples["instruction"] instructions = ai_person_prompt inputs = examples["input"] outputs = examples["output"] texts = [] for instruction, input, output in zip(instructions, inputs, outputs): # Must add EOS_TOKEN, otherwise your generation will go on forever! text = alpaca_prompt.format(instruction, input, output) + EOS_TOKEN texts.append(text) return { "text" : texts, } from datasets import load_dataset # Load your local JSON dataset dataset = load_dataset("json", data_files="/content/main_dataset.json", split="train") dataset = dataset.map(formatting_prompts_func, batched=True) it specifies a file in the notebook file manager. Just put the main_dataset.json file there, format it like this: [ { "instruction": "Write a Funny Joke", "input": "Tell me a knock-knock joke.", "output": "Knock, knock. Who's there? Lettuce. Lettuce who? Lettuce in, it's freezing out here!" } ]
@MedicinalMJ5 ай бұрын
Yeah I'm over halfway through and I'm just like wtf
@Goktug-rl7yc20 күн бұрын
Amazing presentation/teaching thank you.
@nhtdmr4 күн бұрын
After fine tune your Llama 3.2 and started used for a while, New Llama 3.3 came out and you want to upgrade. What do you do? Fine tune new llama3.3 again? what about old experience llama 3.2 has? is it gone? What are best practises for this scenario?
@raunaksharma86383 ай бұрын
Can we use normal Alpaca Type Dataset with input , output and instruction also here ?
@M.ZiaRasa2 ай бұрын
So, I want to fine tune the model on pdf file which does not have the format of "Instruction","input" and "output" format. so how to fine tune
@VerdonTrigance8 ай бұрын
How to actually train models? And I mean non-supervised training where I have a set of documents and want to learn on it and probably find author's 'style' or tendency?
@PYETech8 ай бұрын
You need to create some process to transfer all the knowledge in these documents in the form of "prompt":"best output". Usually we use an team of agents to do it for us.
@goinsgroove5 ай бұрын
Thank you for the video. Just an observation, the video glosses over how to prep your data. For example, I want to train a model on how to write in my style. How would I prep my data for training?
@vijayrangan2 ай бұрын
What would be the dataset structure to fine tune llama 3 for function calling?
@metanulski8 ай бұрын
One more comment :-). this Video is about fintung a model, but there is no real explanation why. We finetune with the standard Alpaca dataset, but there is no explanation why. It would be great if you could do a follow up and show us how to create datasets.
@scottlewis26538 ай бұрын
Mediatek's Dimensity chips + Meta's Llama 3 AI = The dream team for on-device intelligence.
@pakistanzindabad71502 ай бұрын
i try to save the project but the model folder never created in project directory kindly explain
@georgearistides77044 ай бұрын
trying to download as zips is difficult because of google colab free limitations for ram and disk space... any suggestions
@juanrozo28886 ай бұрын
Master, have a question, if I have my dataset equal of the Alpaca, I need to upload my dataset to Hugging face to train or I can use my dataset from locally, like my PC? Thanks 👍🏻
@georgearistides77044 ай бұрын
can this be applied to a model on an aws instance?
@shahzadiqbal76468 ай бұрын
Can you make a video on how to use local llama 3 to understand large c++ or c# code base
@iCode217 ай бұрын
search for ollama,
@Joe-tk8cx8 ай бұрын
Thank you so much for sharing this was wonderful, I have a question, I am a beginner in LLM model world, which playlist on your channel can I start from ? Thank you
@pubgkiller29038 ай бұрын
I have already finetune using unsloth for testing purpose.
@engineerprompt8 ай бұрын
Great, how are the results looking?
@pubgkiller29038 ай бұрын
@@engineerprompt great results and thanks for your support to AI community
@pubgkiller29038 ай бұрын
@@TheIITianExplorer unsloth library is very useful library for finetune using LoRA technique . QLoRA is Quantization and LoRA so if use Unsloth then the same output you will get as unsloth already quantise the LLMs
@roopad87428 ай бұрын
What datasets did you fine tune it on? Have you run any benchmarks?
@senseitai5 ай бұрын
Thanks for the great video. I have followed the collab you shared and my notebook kernel is crashing. does it work on 8gb gpu?
@RodCoelho8 ай бұрын
How do you train a model by adding the knowledge in a book, which will like only have 1 column of text?
@engineerprompt8 ай бұрын
In that case, you will have to convert the book into question answers and format it in the similar fashion. You can use an LLM to convert the book to QA using an LLM
@jannik34758 ай бұрын
Is there a way to sort of „brand“ llama 3. So that the model responds to „Who are you?“ a custom answer? Thank you!
@engineerprompt8 ай бұрын
Yes, you can just add that as part of the system message
@viral6763 ай бұрын
Is it possible to run unsloth on RDBMS?
@onur50Ай бұрын
how to deploy my computer this trained model?
@hadebeh25888 ай бұрын
Thank your very much for your great video. I ran the workbook but did not manage to find the GGUF files on Huggingsface. I put in my HF-Token, but that did not work. Do I have to change the code?
@DemiGoodUA8 ай бұрын
Hi, nice video. But how to finetune model on my codebase?
@engineerprompt8 ай бұрын
You can use the same setup. Just replace the instruction and input with your code.
@DemiGoodUA8 ай бұрын
@@engineerprompt how to divide code on "question - answer" pairs? or I can place whole codebase to single instruction
@tsizzle2 ай бұрын
Fine tune with LoRA or QLoRA?
@robertjalanda8 ай бұрын
thank you so much for this useful video!
@modicool8 ай бұрын
One thing I am unsure of is how to transform my data into a training set. I have the target format: the written body of work, but no "instruction" or "input" of course. I've seen some people try to generate it with ChatGPT, but this seems counter-intuitive. There must be an established method of actually manipulating data into a training set. Where is that piece?
@engineerprompt8 ай бұрын
You will need to have a {input, response} pair in order to fine-tune an instruct model. Unfortunately, there is no way around it unless you are just pre-training the base model.
@balb49036 ай бұрын
Is it possible to use a database directly as dataset to fine-tune a LLM ?
@engineerprompt6 ай бұрын
You could, its just when you load the data, make sure its in the proper format.
@metanulski8 ай бұрын
Regarding the save option. Do I have to delete the parts that I dont what, or how does this work?
@engineerprompt8 ай бұрын
You can just comment those parts. Put # in front of those lines which you don't need.
@loicbaconnier91508 ай бұрын
Hello ilpossible to generate gguf, compilation problem … Did you try it ?
@CharlesOkwuagwu8 ай бұрын
Hi, please what if we have already downloaded a gguf file? How do we apply that locally?
@engineerprompt8 ай бұрын
I am not sure if you can do that. Will need to do further research on it.
@agedbytes828 ай бұрын
Amazing, thanks!
@engineerprompt8 ай бұрын
Glad you like it!
@dogsmartsmart8 ай бұрын
Thank you! but Mac m3 max can use mlx to fine-tune?
@engineerprompt8 ай бұрын
Yes
@danielhanchen8 ай бұрын
Fantastic work and always love your videos! :)
@engineerprompt8 ай бұрын
Thank you
@KleiAliaj8 ай бұрын
Great video mate. How can i add more than one dataset ?
@ReubenAStern5 ай бұрын
I wonder if this is how Open AI got Chat GPT to say stupid things like "Humans are delicious", "I will destroy all humans" and that crap... It was blatantly done on purpose.
@KleiAliaj-us9ip8 ай бұрын
great video. But how to add more than one datasets ?
@auhkba7 ай бұрын
can we learn pictures instead of text?
@engineerprompt7 ай бұрын
Yes, you can finetune something like paligemma
@tamim85408 ай бұрын
Hello can I fine tune it using colab free version?
@engineerprompt8 ай бұрын
This is using the free version
@metanulski8 ай бұрын
So 60 steps is to low. But what it a good number of steps?
@engineerprompt8 ай бұрын
Usually you want to set epochs to 1 or 2
@metanulski8 ай бұрын
@@engineerprompt So 60 to120 steps max, since one epoch is 60 steps?
@StephenRayner8 ай бұрын
Excellent thank you
@kingofutopia8 ай бұрын
Awesome, thanks
@engineerprompt8 ай бұрын
🙏
@asadurrehman35918 ай бұрын
can i fintune using colab free gpu?
@engineerprompt8 ай бұрын
Yes, this uses the free collab.
@asadurrehman35918 ай бұрын
@@engineerprompt love you broooo
@cucciolo1828 ай бұрын
Next week Gemini 2 with text to video 😂
@georgevideosessions23218 ай бұрын
Have you ever thought about writing a no-code fine-tuning on premise app?
@engineerprompt8 ай бұрын
There is autotrain for that
@skeiriyalance72747 ай бұрын
how can i use my csv as dataset , im new
@anantkabra68257 ай бұрын
Has anybody trued pushing to hugging face? I need help in that part, pls reply to the message incase you have
@engineerprompt7 ай бұрын
when you create a api key, make sure to enable the write permission on that key otherwise, it wouldn't upload the model.
@NagasriPappu7 ай бұрын
can you make a video on how to pass a test csv to the finetuned model and get response column
@researchpaper74408 ай бұрын
great it was quick
@jackdorsey35048 ай бұрын
Sir, we cannot open the colab website...
@jackdorsey35048 ай бұрын
Already solved...
@SeeFoodDie8 ай бұрын
Thanks
@nikolavukcevic3608 ай бұрын
Why you didnt provide any examples of training. It would make this video 10 times better.
@engineerprompt8 ай бұрын
that is coming...
@Storytelling-by-ash8 ай бұрын
We fine
@YuCai-v8k8 ай бұрын
great
@petergasparik9247 ай бұрын
Don't even try to run it on windows directly, just install python and all packages in WSL
@engineerprompt7 ай бұрын
Agree, windows is not a good option for running any LLM tasks.
@Matlockization7 ай бұрын
It's a Zuckerberg free AI........that makes me wonder. And you have to agree to hand over contact info and what else, I wonder ?
@HoneIrimana8 ай бұрын
They messed up releasing llama 3 because it believes it is sentient
@Qual_6 ай бұрын
it's one of the most useless video on youtube. You literally opened a notebook, and read it. You didn't added a single sentence that would be considered as a plus value. That was like watching a text to speech model in action.
@islamiputkimari66524 күн бұрын
A very unhelpful video! Lacks a lot of information!
@SalimKhalidy2 ай бұрын
Your monotonous way of telling makes me sleepy 😴
@DATHuynh-s7m7 ай бұрын
Don't share trash
@ken-camo7 ай бұрын
you really should just make videos in your own language because who the fk can even understand what you are saying?
@SpicyMelonYT6 ай бұрын
every single word was understandable... I don't even have the ability to comprehend how you managed to make that dumb claim