Fine Tune PaLM 2 + How to make a dataset

14,413 views

Sam Witteveen

Comments: 36
@paulocoronado2376 · 1 year ago
Great video, Sam!! You are awesome!! I’ve got some cool ideas for future videos: 1. Advanced examples of PaLM2 + Langchain (expression language, custom tools etc.) 2. Langsmith (very useful for managing and troubleshooting models)
@samwitteveenai · 1 year ago
Certainly will make some LangSmith vids and some more with the Expression Language.
@Pure_Science_and_Technology · 1 year ago
Thank you, Sam, for putting this video together.
@fontenbleau · 1 year ago
Also, they would never allow downloading, because PaLM 2 can be used to operate robots via camera and much more, as people said at the LLM Bootcamp convention.
@sergeyscorin · 3 months ago
Hi, cool video. Could you please tell me: is it possible to use a book as training data so the model learns it, and then ask questions about the book to get instant general answers from it?
@권소예대학원생-컴퓨 · 10 months ago
Hi. Thank you for your work. Where can I get the 'reduced_output.json' file?
@EstebanAstudillo · 1 year ago
Thank you Sam, very useful as usual! I'm wondering how much money you ended up spending on the Vertex services? Or did you have a trial? (I missed that.) Thanks!
@eddiehaug · 1 year ago
Regarding the number of training steps, according to the docs: "Train steps: Enter the number of steps to run for model tuning." The batch size varies by tuning location: us-central1 has a batch size of 8, and europe-west4 has a batch size of 24. If there are 240 examples in a training dataset, it takes 240 / 24 = 10 steps in europe-west4 to process the entire dataset once, and 240 / 8 = 30 steps in us-central1.
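(To make the arithmetic above concrete, here is a small sketch in plain Python; the batch sizes are the ones quoted from the docs and the helper name is hypothetical.)

```python
# Steps needed to cover a dataset once (one epoch) in each tuning region,
# using the per-region batch sizes quoted in the docs above.
BATCH_SIZE_BY_REGION = {"us-central1": 8, "europe-west4": 24}

def train_steps_per_epoch(num_examples: int, region: str) -> int:
    """Number of train steps to process the whole dataset once in `region`."""
    batch_size = BATCH_SIZE_BY_REGION[region]
    return -(-num_examples // batch_size)  # ceiling division

print(train_steps_per_epoch(240, "europe-west4"))  # 10
print(train_steps_per_epoch(240, "us-central1"))   # 30
```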
@sebastiansanchez7331 · 1 year ago
Hello Sam, is this supposed to take a while once I click the "Start tuning" button? I don't know if it's just me, but after I click it, nothing shows up and it doesn't look like it's training.
@oxytic · 1 year ago
Great tutorial, thank you for doing this. Kindly create an autonomous agent tutorial using LLMs, please.
@ahmadzaimhilmi · 1 year ago
Fine-tuned OpenAI models cost more to run inference on than the base ones. How about this?
@samwitteveenai · 1 year ago
Currently, from everything I have seen, Google isn't charging more for the fine-tuned models. This may change though.
@sakshatkatyarmal2303 · 1 year ago
Awesome video. Do you know how to make a dataset for chat-bison for fine-tuning on GCP?
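(This question isn't answered in the thread. As a rough sketch only: my understanding of the Vertex AI chat-model tuning format at the time was JSONL with a context plus a list of messages per line; verify the field names against the current GCP docs before relying on this.)

```python
import json

# Hedged sketch of a chat-bison tuning dataset: one conversation per JSONL line,
# with "context" and "messages" fields. Field names are a recollection of the
# Vertex AI docs at the time -- double-check before using.
examples = [
    {
        "context": "You are a helpful support agent.",
        "messages": [
            {"author": "user", "content": "How do I reset my password?"},
            {"author": "assistant", "content": "Go to Settings > Account > Reset password."},
        ],
    },
]

with open("chat_tuning_data.jsonl", "w") as f:
    for example in examples:
        f.write(json.dumps(example) + "\n")
```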
@MuaazOsaid · 10 months ago
Hi Sam, thank you for the video. I'm facing this issue: "Invalid file extension. Supported extension: 'jsonl'" on the vertex-pipelines-prompt-validation node. Any help will be appreciated.
@samwitteveenai · 10 months ago
I think they have changed this since I made the video. Something similar for Gemini is coming so I will make some new vids for that
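(For anyone hitting the same error: a minimal sketch of one workaround, assuming your dataset is currently a single JSON array such as the reduced_output.json file from the video, rewritten as JSON Lines.)

```python
import json

# Convert a JSON array of training records into JSON Lines (.jsonl),
# which is the extension the tuning pipeline's validation step expects.
with open("reduced_output.json") as f:
    records = json.load(f)  # expects a list of dicts

with open("reduced_output.jsonl", "w") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")
```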
@pranaymehta7958 · 10 months ago
I followed the steps for the emoticon fine-tuning, and step 12/19 has been running for over 20 minutes. The job is large-language-model-tuner. I wonder when this will be over; the dataset is so small, yet the time taken to tune the model is so high.
@pranaymehta7958 · 10 months ago
Finally finished after 1 hour. Thank you for the video, this was super helpful and I got a good idea of the internal workings. Let's see how costly this one comes out to be.
@dedekj26 · 11 months ago
Can we just use the fine-tuning feature from Google Colab?
@samwitteveenai · 11 months ago
Not for these models. We can FT the open source models in Colab. We can call the APIs to FT from Colab though.
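(Roughly, calling the tuning API from a notebook looked like the sketch below at the time, using the Vertex AI Python SDK; the project ID, bucket path, and parameter values are hypothetical placeholders, and the SDK surface may have changed since.)

```python
# Hedged sketch: launching a PaLM 2 (text-bison) tuning job from Colab
# via the Vertex AI Python SDK. Names and paths are placeholders; the
# tuning API has evolved, so check the current documentation.
import vertexai
from vertexai.language_models import TextGenerationModel

vertexai.init(project="my-gcp-project", location="us-central1")  # hypothetical project

model = TextGenerationModel.from_pretrained("text-bison@001")
model.tune_model(
    training_data="gs://my-bucket/reduced_output.jsonl",  # hypothetical GCS path
    train_steps=100,
    tuning_job_location="europe-west4",
    tuned_model_location="us-central1",
)
```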
@waneyvin · 1 year ago
Great video Sam. Is there any chance of a tutorial about RLHF as well?
@samwitteveenai · 1 year ago
currently putting one together
@waneyvin · 1 year ago
@samwitteveenai cheers mate!
@sam_joshua_s · 1 year ago
I love your videos. Can you make a video about fine-tuning using DeepSpeed?
@mohsenghafari7652 · 10 months ago
Hi, please help me: how can I create a custom model from many PDFs in the Persian language? Thank you.
@samwitteveenai · 10 months ago
Hi, the keys to this will be: 1. getting the data out of the PDFs (check out unstructured.io for this); 2. fine-tuning a model that has a good tokenizer. Gemini is probably a good fit for this.
@mohsenghafari7652 · 10 months ago
@samwitteveenai Thank you. The link is unavailable!
@mohsenghafari7652 · 10 months ago
@samwitteveenai Dear Sam, do you have a link to a sample? Thanks.
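(A minimal sketch of step 1 from Sam's reply above, assuming the unstructured library; file and folder names are placeholders.)

```python
# Pull plain text out of a folder of PDFs with the unstructured library
# (pip install "unstructured[pdf]"), as a first step before building a dataset.
from pathlib import Path
from unstructured.partition.pdf import partition_pdf

texts = []
for pdf_path in Path("pdfs/").glob("*.pdf"):
    elements = partition_pdf(filename=str(pdf_path))
    texts.append("\n".join(el.text for el in elements if el.text))

Path("corpus.txt").write_text("\n\n".join(texts), encoding="utf-8")
```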
@tommhk · 1 year ago
I got "The following quota metrics exceed quota limits:" no matter which accelerator type I chose (I have upgraded to a paid account).
@samwitteveenai · 1 year ago
It is probably related to the GPU or TPU quota in that region. Go to the quotas section and apply for quota in those regions.
@grym6857 · 1 year ago
Not me having already used my $300 free trial while PaLM was subpar :(
@kishantripathi4521 · 1 year ago
Unable to fine-tune with the free tier.
@iskrabesamrtna · 1 year ago
@kishantripathi4521 ofc not :)
@888felipe · 1 year ago
Never try this. Just training this emoji example cost me $40. The YouTube video is great, but he never mentioned the cost... maybe because he wanted the views. I could have just sent the $40 to him instead...
@samwitteveenai · 1 year ago
Wow, that seems high. When I made the video they were still in the testing phase. I am genuinely surprised it cost that much. I will check it out and put a warning in the description.
Fine-tuning Large Language Models (LLMs) | w/ Example Code
28:18
Shaw Talebi
386K views
PaLM2 from Scratch
17:05
Sam Witteveen
13K views
The EASIEST way to finetune LLAMA-v2 on local machine!
17:26
Abhishek Thakur
178K views
smolagents - HuggingFace's NEW Agent Framework
29:10
Sam Witteveen
25K views
Fine-tune Mixtral 8x7B (MoE) on Custom Data - Step by Step Guide
19:20
Prompt Engineering
38K views
To Fine Tune or Not Fine Tune? That is the question
26:48
Microsoft Developer
10K views
Fine-Tune Your Own Tiny-Llama on Custom Dataset
14:32
Prompt Engineering
27K views
Google Releases AI AGENT BUILDER! 🤖 Worth The Wait?
34:21
Matthew Berman
244K views