Рет қаралды 118
A talk by Clemens Schroeer from Lemon AI.
This session covers Data Curation for Open Source LLM Fine-Tuning.
Everyone wants to fine-tune open source LLMs, but a lack of high quality data makes this hard. Even the data that companies do have is difficult to understand, making it challenging to iterate towards a high quality dataset that will provide good results from fine-tuning. Clemens will share his experience curating datasets to fine-tune models such as Mistral 7B and discuss some of the challenges that should be taken into consideration.
Technical Level: Technical practitioner
This session was part of the Data Science Festival MayDay event 2024. Find out more at datasciencefes...
The Data Science Festival is the place for data-driven people to come together, share cutting-edge ideas, and solve real-world problems. We run monthly events, meet-ups, and the biggest free-to-attend data festivals in the UK. Join the community at datasciencefes...