Рет қаралды 17,594
Code script how to fine-tune LLama 2 model with parameter efficient fine-tuning, a low rank approximation of matrix and tensor structures, a 4-bit quantization of tensors, a transformer based Reinforcement Learning (RL) and HuggingFace's Supervised Fine-tuning trainer. LLama v2 model, finetuning.
Plus we code a synthetic dataset for our LLama 2 model to fine-tune on, w/ GPT-4 (or your preferred CLAUDE 2 or ....) as the central intelligence - to create task specific datasets for a given user query to fine-tune LLMs on.
All rights with Matt Shumer for his Jupyter NB on fine-tuning LLama 2 model:
colab.research...
See also Matt Shumer's Github repo for the GPT-LLM-Trainer:
github.com/msh...
#gpt
#finetuning
#llama2