Zephyr 7b beta: paper deep dive, code, & RAG

9,201 views

Sophia Yang

1 day ago

Is Zephyr 7B beta the best fine-tuned small open-source LLM? It is fine-tuned from Mistral 7B, the best small open-source foundation model, outperforms the other small models, and even beats Llama-2-Chat-70B and ChatGPT on multiple benchmarks.
What are the differences between Zephyr 7B alpha and Zephyr 7B beta?
From the author: "The main differences are (a) better dataset filtering of UltraChat, namely fixing grammatical errors and some unwanted responses and (b) training for longer, ie 3 DPO epochs vs 1"
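
For a concrete sense of what "training for longer, ie 3 DPO epochs" looks like in practice, here is a rough sketch of a DPO run with Hugging Face's TRL library. This is not the authors' exact training code; the model and dataset names are the public H4 artifacts, and the argument names are assumptions that vary across TRL versions.

from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

sft_model = "HuggingFaceH4/mistral-7b-sft-beta"  # the SFT checkpoint DPO starts from
model = AutoModelForCausalLM.from_pretrained(sft_model)
tokenizer = AutoTokenizer.from_pretrained(sft_model)

# Binarized preference pairs; in practice the chosen/rejected message lists
# still need to be flattened into plain "prompt"/"chosen"/"rejected" strings
# (preprocessing omitted here for brevity).
prefs = load_dataset("HuggingFaceH4/ultrafeedback_binarized", split="train_prefs")

trainer = DPOTrainer(
    model=model,
    ref_model=None,          # TRL keeps a frozen copy of the model as the reference
    beta=0.1,                # strength of the KL penalty in the DPO loss
    train_dataset=prefs,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="zephyr-dpo-sketch",
        num_train_epochs=3,  # beta trained for 3 DPO epochs; alpha used 1
        per_device_train_batch_size=2,
        bf16=True,
    ),
)
trainer.train()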
Model: huggingface.co...
Technical report: arxiv.org/abs/...
Colab: colab.research...
LlamaIndex Tweet (colab link in the tweet; a rough RAG sketch follows below): / 1718054631413363196
🔔 SUBSCRIBE to my channel: www.youtube.co...
⭐ Stay in touch ⭐
📚 DS/ML Book Club: dsbookclub.gith...
▶ YouTube: / sophiayangds
✍️ Medium: / sophiamyang
🐦 Twitter: / sophiamyang
🤝 Linkedin: / sophiamyang
💚 #ai
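
The LlamaIndex colab linked above covers RAG on top of Zephyr. Here is a rough sketch of that pattern, assuming the pre-0.10 LlamaIndex API with HuggingFaceLLM and a local embedding model; this is not the colab's exact code.

from llama_index import VectorStoreIndex, SimpleDirectoryReader, ServiceContext
from llama_index.llms import HuggingFaceLLM

# Use Zephyr 7B beta as the generator behind a simple RAG pipeline.
llm = HuggingFaceLLM(
    model_name="HuggingFaceH4/zephyr-7b-beta",
    tokenizer_name="HuggingFaceH4/zephyr-7b-beta",
    context_window=3900,
    max_new_tokens=256,
    device_map="auto",
)
service_context = ServiceContext.from_defaults(
    llm=llm,
    embed_model="local:BAAI/bge-small-en-v1.5",  # small local embedding model
)

documents = SimpleDirectoryReader("data").load_data()  # your own files
index = VectorStoreIndex.from_documents(documents, service_context=service_context)
query_engine = index.as_query_engine(similarity_top_k=3)
print(query_engine.query("What does the Zephyr paper say about DPO?"))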

Comments: 29
10 months ago
Thanks a lot. Using Zephyr with ollama and it's pretty cool and fast.
@SophiaYangDS · 10 months ago
Awesome! Ollama is really nice.
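
For anyone who wants to reproduce this setup, here is a minimal sketch of calling a locally served Zephyr through Ollama's HTTP API. It assumes you have already run "ollama pull zephyr" and the server is listening on the default port 11434.

import requests

resp = requests.post(
    "http://localhost:11434/api/generate",  # Ollama's default local endpoint
    json={
        "model": "zephyr",                  # the Zephyr 7B build in Ollama's model library
        "prompt": "Explain DPO in one sentence.",
        "stream": False,                    # return a single JSON object instead of a stream
    },
)
print(resp.json()["response"])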
@jstevh · 10 months ago
Thanks for that analysis. Like a good overview of a solid paper.
@SophiaYangDS · 10 months ago
Thank you!
@prestonmccauley43 · 10 months ago
Thanks for the information. I look forward to giving this model a try, though it does seem like we keep producing models every week. They really need to be run for many months to determine how well they work.
@SophiaYangDS · 10 months ago
Thanks! Yeah, too many models coming out. One of the takeaways is that Mistral 7B is a great foundation model to fine-tune on.
@Hash_Boy · 10 months ago
many many thanks
@taeyangoh7305 · 10 months ago
Great review, Sophia! I want to give it a try for a care robot use case!!
@SophiaYangDS · 10 months ago
Thanks! Sounds like a really cool use case. Good luck!
@foreignconta · 10 months ago
I am using this model as my default model currently. It is not perfect, but I prompted it according to my liking. I wish the prompt template were a bit simpler. 😅
@SophiaYangDS · 10 months ago
Nice!
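
For anyone else finding the prompt format fiddly: the Zephyr tokenizer ships a chat template, so you can let transformers assemble the <|system|>/<|user|>/<|assistant|> string instead of writing it by hand. A small sketch (assumes a transformers version recent enough to support apply_chat_template):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")
messages = [
    {"role": "system", "content": "You are a concise assistant."},
    {"role": "user", "content": "Summarize the Zephyr paper in two sentences."},
]
# tokenize=False returns the formatted prompt string;
# add_generation_prompt=True appends the trailing <|assistant|> tag.
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)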
@DistortedV12 · 10 months ago
It's okay... I tried it with different prompts. Can't wait till they scale it.
@SophiaYangDS · 10 months ago
It's a good 7B model.
@talhayousuf4599 · 10 months ago
Your channel is really informative, and it's good that you stay up to date with new developments. Are you a researcher? Do you have a blog related to LLM research?
@SophiaYangDS · 10 months ago
Thanks 🙏 I have a casual blog with book reading notes and other random things: sophiamyang.medium.com/
@TheReferrer72 · 10 months ago
Is this the model that does not reason or code well? How can this model be useful?
@SophiaYangDS · 10 months ago
It's still better than Llama 2 70B on coding and math.
@jsalsman · 10 months ago
Do they mention the cost of the GPT-4 calls during the training steps?
@SophiaYangDS · 10 months ago
They did not mention cost in the paper.
@jsalsman · 10 months ago
@SophiaYangDS Any idea roughly how many API calls they must have used?
@SophiaYangDS · 10 months ago
Must be a lot. Just to process and create the UltraFeedback dataset (huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized?row=0), it's around 74k x 5 API calls (4 model responses + 1 GPT-4 rating). @jsalsman
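
Spelled out, that back-of-the-envelope estimate (using the rough 74k figure quoted in the reply above, not a number reported in the paper) comes to:

prompts = 74_000           # approximate prompt count quoted above
calls_per_prompt = 4 + 1   # 4 model completions + 1 GPT-4 rating pass
print(prompts * calls_per_prompt)   # ~370,000 API calls for UltraFeedback alone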
@mshonle · 10 months ago
Does distilling help to remove overfitting?
@SophiaYangDS · 10 months ago
I'm not sure. You can overfit in distillation.
@mshonle · 10 months ago
I wonder how much this cost them to make, in terms of compute? (Or, maybe more relevant: how much would it cost someone else to do the same?)
@SophiaYangDS · 10 months ago
@mshonle I think they might have spent most of the compute/API calls on generating the datasets. The actual SFT and DPO might not be that crazy.
@mshonle · 10 months ago
Hmm, and those datasets are available?
@SophiaYangDS · 10 months ago
@mshonle Yes, they are public on Hugging Face.
@serkhetreo2489 · 10 months ago
Sometimes you discover new channels.
@SophiaYangDS · 10 months ago
Thanks for discovering my channel :)