i love the ml community. Everyone is sharing information :)
@foreignconta Жыл бұрын
You know there is a new optimizer in your name. :) I wrote my nanoChatGPT from scratch and had to clip its context size to stop ooming my GPU memory. The only thing it does with 8 decoder layers is to output nonsense. Thinking about training it on a better instruct dataset. I am using Sentencepiece tokenizer. Subscribed.
@SophiaYangDS Жыл бұрын
Yes the Sophia optimizer! And thanks for the great insight 🙏
@foreignconta Жыл бұрын
@@SophiaYangDS You're welcome. I am looking forward to your videos.🙏
@xspydazx8 ай бұрын
Also thanks ! As after watching this video I also re-watch kaparthy and recreated everything and it worked ok for a text generation model .
@TerrorByteTW Жыл бұрын
Totally going to download by Twitter and Facebook data to train a model that talks like me lol. Let's see how this goes!
@SophiaYangDS Жыл бұрын
Sounds fun! Good luck!
@TheSterg987 ай бұрын
How did it work out?
@F3367 ай бұрын
Yea his video was like a missing link explained perfectly...This one is nice too...:)
@Avra_b Жыл бұрын
Wow Sophia , happy to see quick concise 15 mins on “how to implement nano-GPT” . Super helpful . 🎉😊 Btw , by any chance , did you try the lyrics / or any other datasets with more iterations ? How was the result ?
@kilianlindberg Жыл бұрын
I concur, kudos for the impressive work showcased in this video! On a side note, I experimented with Nano-GPT a few weeks back and found it quite user-friendly, thanks to the clear instructions provided on its GitHub page. I successfully ran it on both a 2018 iMac (running Big Sur) and a 2019 MacBook Pro (equipped with Ventura), each with 16GBs of RAM. The MacBook Pro's GPU offered a slight speed advantage. Undoubtedly, Nano-GPT conveys potential for more specialized/dedicated models running on a local machine.
@SophiaYangDS Жыл бұрын
Thanks so much Avra 🤗! Yes I did try it a little more and it looks better after 1k iterations. I should have trained more to show in the video 😅
@SophiaYangDS Жыл бұрын
@@kilianlindberg Thanks so much Kilian! Yes nano-GPT is so user-friendly. Andrej Karpathy is a legend! I'm so glad I finally watched his video and tried out nano-GPT.
@adamstewarton Жыл бұрын
@@SophiaYangDS how long roughly did it take?
@fab_spaceinvaders Жыл бұрын
came from its video and learned more this morning than in one week hands on 13b stuff 😂
@johnnyjax644411 ай бұрын
Can we train this Model on any thing , eg analyze the our project Code base by same way feed our source code
@iDineshKrishnan Жыл бұрын
Great! Thanks for doing this video, very informative.
@SophiaYangDS Жыл бұрын
Thanks so much for your support 🙏
@AllenLSY2 ай бұрын
6:40 on m3 max, each iter runs 14ms. Sophie, ask your company to give you a better laptop :)
@curioussoul5151 Жыл бұрын
Thank you so much for this video! It was very helpful
@SophiaYangDS Жыл бұрын
thanks for the kind words 🙏
@seattledude20223 ай бұрын
Andrej just told me that he likes your video :)
@Ashton35110 ай бұрын
Fantastic, thank you so much.
@宋保兴 Жыл бұрын
wow thx, i learn a lot from this video
@imai_official8 ай бұрын
--start is promt? also Sophia please tell, will it work for other language?
@amparoconsuelo9451 Жыл бұрын
If I will order from Amazon a GPT assembly kit, what would it deliver me? How much would the kit cost?
@venkatreddy-px8fm8 ай бұрын
Please, can you let me know if you connected your GPU to MAC M1 laptop or other source ?
@danasugu1767 Жыл бұрын
Hi, Sophia, thanks for this tutorial. Instead of a txt file could we used a pdf file?
@PotatoMagnet Жыл бұрын
PDFs are not very machine readable text. TXT is best. Maybe use a pdf parser?
@DaveisLearningtoPlay Жыл бұрын
i had this on in the background and all of the sudden heard the applause track when it generated some content. that was very funny. ;)
@xspydazx8 ай бұрын
Sorry but how to save the final model produced for deployment ? .. then how to fine tune this model for a downstream task such as question and answer ?
@mshonle Жыл бұрын
The “3” in the list of tokens was conspicuous. Turns out it’s a left over page heading.
@jmisc Жыл бұрын
can this be done in a chatGPT way, where we say, here is the lyric, revise it, or here is a song about potato, now write a lyric about tomato in the same style?
@datahacker1405 Жыл бұрын
It can be done with langchain prompting and custom llm wrappers
@AryanSingh-py7sm Жыл бұрын
i want to ask one doubt. cant we get an output from a query we write and not all the decoded data?
@SunGlassesBlue Жыл бұрын
Thank you ! You Saved my day ~~
@sw-ln1hh Жыл бұрын
Thank you for your video
@lutfiikbalmajid7 ай бұрын
Can i train this in Mac Air M1?
@bogdanbarbu363 Жыл бұрын
At 8:50 you say that you can pass the data type as a parameter. How do you do that?
@og_23yg54 Жыл бұрын
What if your watching this video you should know that before h came here go learn python then come back
@colinmaharaj3 ай бұрын
Can this be done in C++?
@yusufcan1304 Жыл бұрын
Edited. thanks
@fractal3588 Жыл бұрын
can this be a totally offline process?
@SophiaYangDS Жыл бұрын
Yes
@spencerfunk6697 Жыл бұрын
so this is using smaller lang models? could you train something even smaller like koala lol?
@RedCloudServices Жыл бұрын
Sophia if we can load lots of text into langchain and vectordb and have chatbot why do we still need to fine tune a LLM?
@SophiaYangDS Жыл бұрын
Hi great question! Check out my latest video interviewing Lewis Tunstall. He talked about zero-shot, few-shot learning vs fine-tuning.
@rockstreamlive Жыл бұрын
is nano gpt offline or need internet for working?
@SophiaYangDS Жыл бұрын
Offline. That's the beauty of nanoGPT : )
@sonkyung32978 ай бұрын
a tutorial of a tutorial -- lol -- much thanks!! 🙏
@HANKTHETANKAD117 ай бұрын
Do ai Poe chat how generation image and lyric on there can you do review on ai Poe chat
@axystorm5269 Жыл бұрын
does anyone have any idea how to get this working with windows?