Fine tuning Embeddings Model

  Рет қаралды 1,003

Mosleh Mahamud

Mosleh Mahamud

Күн бұрын

Fine tuning with the new Sentence Transformers v3.0.
Join Skool Community for $129:
www.skool.com/data-society-42...
Have questions or ideas, meet similar people?
join the discord : / discord
Don't fall behind the AI revolution, I can help integrate machine learning/AI into your company.
mosleh587084.typeform.com/to/...
Notebook: github.com/mosh98/RAG_With_Mo...
This video you will learn
1. Fine tuning embeddings model
2. What types of Data sets can be used
3. How to to test fine tuned embeddings model.
What is sentence transformer?
Sentence Transformers v3.0 introduces significant improvements to the framework for creating and fine-tuning embedding models. This update includes a new training API, backed by `SentenceTransformerTrainer`, enhancing multi-GPU training and detailed loss logging. The version adds new similarity functions like cosine, dot, euclidean, and manhattan, specified via `similarity_fn_name`, for better adaptability to specific tasks Additionally, it supports hyperparameter optimization, extending capabilities from the broader `transformers` library. The release expands loss functions and datasets, ensuring a wide range of training scenarios are covered. While maintaining backward compatibility, the update encourages transitioning to the new API for full benefits.
You can used either BGE or nomic-embed-text model to fine tune your model.
Intro 0:00
Sentence Transformer v3.0 0:49
Download packages 1:08
Load Dataset 1:20
How to Adapt it to your data 2:15
Loading Data and Training Arguments 3:45
Training and Testing 4:45

Пікірлер: 9
@thevadimb
@thevadimb 13 күн бұрын
First, thank you for your video - I really appreciate your work! A question - I see the validation loss is actually growing... Am I missing some point here?
@moslehmahamud
@moslehmahamud 12 күн бұрын
You are right, i didn’t properly train the model with sufficient data or necessary steps/epochs. Please don’t be like me hahaha Hope that answers your question
@ashleeclaral3271
@ashleeclaral3271 10 күн бұрын
how should my own custom dataset look like?
@moslehmahamud
@moslehmahamud 10 күн бұрын
you can try using pair-wise, labeled dataset to train the embeddings model
@rahul01483
@rahul01483 12 күн бұрын
do you have any video on how I can train my own dataset from scratch and create embedding vector store
@moslehmahamud
@moslehmahamud 10 күн бұрын
yes, a new video will be uploaded tomorrow (as of writing), using hf model to get embeddings. You can use a chroma db to store the embeddings Hope that helps
@rahul01483
@rahul01483 9 күн бұрын
@@moslehmahamud sure it helps, as have been using chromadb for some time now... would love to see ur impl
@wilfredomartel7781
@wilfredomartel7781 29 күн бұрын
Great video! is it only for english?
@moslehmahamud
@moslehmahamud 28 күн бұрын
Thanks, you can train on other languages too, make sure to pick a multi-lingual model.
Prompt Engineering, RAG, and Fine-tuning: Benefits and When to Use
15:21
Session 8: Fine-Tuning Embedding Models for RAG Systems
15:46
AI Makerspace
Рет қаралды 4,3 М.
Please be kind🙏
00:34
ISSEI / いっせい
Рет қаралды 180 МЛН
How to set up RAG - Retrieval Augmented Generation (demo)
19:52
Don Woodlock
Рет қаралды 15 М.
Fine Tuning Qwen 2 with Custom Data
5:26
Mosleh Mahamud
Рет қаралды 1,2 М.
QLoRA-How to Fine-tune an LLM on a Single GPU (w/ Python Code)
36:58
DSPy Explained!
54:16
Connor Shorten
Рет қаралды 50 М.
Llama 3 Fine Tuning for Dummies (with 16k, 32k,... Context)
23:16
Nodematic Tutorials
Рет қаралды 21 М.
Adding Agentic Layers to RAG
19:40
AI User Group
Рет қаралды 15 М.
Autogen Full Beginner Course
1:24:45
Tyler AI
Рет қаралды 26 М.
iOS 18 vs Samsung, Xiaomi,Tecno, Android
0:54
AndroHack
Рет қаралды 95 М.
Will the battery emit smoke if it rotates rapidly?
0:11
Meaningful Cartoons 183
Рет қаралды 34 МЛН
Cadiz smart lock official account unlocks the aesthetics of returning home
0:30
Asus  VivoBook Винда за 8 часов!
1:00
Sergey Delaisy
Рет қаралды 1,1 МЛН
Ждёшь обновление IOS 18? #ios #ios18 #айоэс #apple #iphone #айфон
0:57
Best mobile of all time💥🗿 [Troll Face]
0:24
Special SHNTY 2.0
Рет қаралды 618 М.