Train your own language model with nanoGPT | Let’s build a songwriter

  Рет қаралды 50,841

Sophia Yang

Sophia Yang

Күн бұрын

Пікірлер: 54
@EkShunya
@EkShunya Жыл бұрын
i love the ml community. Everyone is sharing information :)
@foreignconta
@foreignconta Жыл бұрын
You know there is a new optimizer in your name. :) I wrote my nanoChatGPT from scratch and had to clip its context size to stop ooming my GPU memory. The only thing it does with 8 decoder layers is to output nonsense. Thinking about training it on a better instruct dataset. I am using Sentencepiece tokenizer. Subscribed.
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Yes the Sophia optimizer! And thanks for the great insight 🙏
@foreignconta
@foreignconta Жыл бұрын
@@SophiaYangDS You're welcome. I am looking forward to your videos.🙏
@xspydazx
@xspydazx 8 ай бұрын
Also thanks ! As after watching this video I also re-watch kaparthy and recreated everything and it worked ok for a text generation model .
@TerrorByteTW
@TerrorByteTW Жыл бұрын
Totally going to download by Twitter and Facebook data to train a model that talks like me lol. Let's see how this goes!
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Sounds fun! Good luck!
@TheSterg98
@TheSterg98 7 ай бұрын
How did it work out?
@F336
@F336 7 ай бұрын
Yea his video was like a missing link explained perfectly...This one is nice too...:)
@Avra_b
@Avra_b Жыл бұрын
Wow Sophia , happy to see quick concise 15 mins on “how to implement nano-GPT” . Super helpful . 🎉😊 Btw , by any chance , did you try the lyrics / or any other datasets with more iterations ? How was the result ?
@kilianlindberg
@kilianlindberg Жыл бұрын
I concur, kudos for the impressive work showcased in this video! On a side note, I experimented with Nano-GPT a few weeks back and found it quite user-friendly, thanks to the clear instructions provided on its GitHub page. I successfully ran it on both a 2018 iMac (running Big Sur) and a 2019 MacBook Pro (equipped with Ventura), each with 16GBs of RAM. The MacBook Pro's GPU offered a slight speed advantage. Undoubtedly, Nano-GPT conveys potential for more specialized/dedicated models running on a local machine.
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Thanks so much Avra 🤗! Yes I did try it a little more and it looks better after 1k iterations. I should have trained more to show in the video 😅
@SophiaYangDS
@SophiaYangDS Жыл бұрын
​@@kilianlindberg Thanks so much Kilian! Yes nano-GPT is so user-friendly. Andrej Karpathy is a legend! I'm so glad I finally watched his video and tried out nano-GPT.
@adamstewarton
@adamstewarton Жыл бұрын
​@@SophiaYangDS how long roughly did it take?
@fab_spaceinvaders
@fab_spaceinvaders Жыл бұрын
came from its video and learned more this morning than in one week hands on 13b stuff 😂
@johnnyjax6444
@johnnyjax6444 11 ай бұрын
Can we train this Model on any thing , eg analyze the our project Code base by same way feed our source code
@iDineshKrishnan
@iDineshKrishnan Жыл бұрын
Great! Thanks for doing this video, very informative.
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Thanks so much for your support 🙏
@AllenLSY
@AllenLSY 2 ай бұрын
6:40 on m3 max, each iter runs 14ms. Sophie, ask your company to give you a better laptop :)
@curioussoul5151
@curioussoul5151 Жыл бұрын
Thank you so much for this video! It was very helpful
@SophiaYangDS
@SophiaYangDS Жыл бұрын
thanks for the kind words 🙏
@seattledude2022
@seattledude2022 3 ай бұрын
Andrej just told me that he likes your video :)
@Ashton351
@Ashton351 10 ай бұрын
Fantastic, thank you so much.
@宋保兴
@宋保兴 Жыл бұрын
wow thx, i learn a lot from this video
@imai_official
@imai_official 8 ай бұрын
--start is promt? also Sophia please tell, will it work for other language?
@amparoconsuelo9451
@amparoconsuelo9451 Жыл бұрын
If I will order from Amazon a GPT assembly kit, what would it deliver me? How much would the kit cost?
@venkatreddy-px8fm
@venkatreddy-px8fm 8 ай бұрын
Please, can you let me know if you connected your GPU to MAC M1 laptop or other source ?
@danasugu1767
@danasugu1767 Жыл бұрын
Hi, Sophia, thanks for this tutorial. Instead of a txt file could we used a pdf file?
@PotatoMagnet
@PotatoMagnet Жыл бұрын
PDFs are not very machine readable text. TXT is best. Maybe use a pdf parser?
@DaveisLearningtoPlay
@DaveisLearningtoPlay Жыл бұрын
i had this on in the background and all of the sudden heard the applause track when it generated some content. that was very funny. ;)
@xspydazx
@xspydazx 8 ай бұрын
Sorry but how to save the final model produced for deployment ? .. then how to fine tune this model for a downstream task such as question and answer ?
@mshonle
@mshonle Жыл бұрын
The “3” in the list of tokens was conspicuous. Turns out it’s a left over page heading.
@jmisc
@jmisc Жыл бұрын
can this be done in a chatGPT way, where we say, here is the lyric, revise it, or here is a song about potato, now write a lyric about tomato in the same style?
@datahacker1405
@datahacker1405 Жыл бұрын
It can be done with langchain prompting and custom llm wrappers
@AryanSingh-py7sm
@AryanSingh-py7sm Жыл бұрын
i want to ask one doubt. cant we get an output from a query we write and not all the decoded data?
@SunGlassesBlue
@SunGlassesBlue Жыл бұрын
Thank you ! You Saved my day ~~
@sw-ln1hh
@sw-ln1hh Жыл бұрын
Thank you for your video
@lutfiikbalmajid
@lutfiikbalmajid 7 ай бұрын
Can i train this in Mac Air M1?
@bogdanbarbu363
@bogdanbarbu363 Жыл бұрын
At 8:50 you say that you can pass the data type as a parameter. How do you do that?
@og_23yg54
@og_23yg54 Жыл бұрын
What if your watching this video you should know that before h came here go learn python then come back
@colinmaharaj
@colinmaharaj 3 ай бұрын
Can this be done in C++?
@yusufcan1304
@yusufcan1304 Жыл бұрын
Edited. thanks
@fractal3588
@fractal3588 Жыл бұрын
can this be a totally offline process?
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Yes
@spencerfunk6697
@spencerfunk6697 Жыл бұрын
so this is using smaller lang models? could you train something even smaller like koala lol?
@RedCloudServices
@RedCloudServices Жыл бұрын
Sophia if we can load lots of text into langchain and vectordb and have chatbot why do we still need to fine tune a LLM?
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Hi great question! Check out my latest video interviewing Lewis Tunstall. He talked about zero-shot, few-shot learning vs fine-tuning.
@rockstreamlive
@rockstreamlive Жыл бұрын
is nano gpt offline or need internet for working?
@SophiaYangDS
@SophiaYangDS Жыл бұрын
Offline. That's the beauty of nanoGPT : )
@sonkyung3297
@sonkyung3297 8 ай бұрын
a tutorial of a tutorial -- lol -- much thanks!! 🙏
@HANKTHETANKAD11
@HANKTHETANKAD11 7 ай бұрын
Do ai Poe chat how generation image and lyric on there can you do review on ai Poe chat
@axystorm5269
@axystorm5269 Жыл бұрын
does anyone have any idea how to get this working with windows?
@shihuacheng5429
@shihuacheng5429 Жыл бұрын
WSL2
@pythonfamily
@pythonfamily Жыл бұрын
Dollar sign hahahaha
LlamaIndex overview & use cases | LangChain integration
12:36
Sophia Yang
Рет қаралды 25 М.
Run your own AI (but private)
22:13
NetworkChuck
Рет қаралды 1,6 МЛН
Car Bubble vs Lamborghini
00:33
Stokes Twins
Рет қаралды 20 МЛН
这是自救的好办法 #路飞#海贼王
00:43
路飞与唐舞桐
Рет қаралды 96 МЛН
When mom gets home, but you're in rollerblades.
00:40
Daniel LaBelle
Рет қаралды 126 МЛН
Large Language Models from scratch
8:25
Graphics in 5 Minutes
Рет қаралды 349 М.
NanoGPT using Simpsons Data: Get Started with Large Language Models
7:56
Rajistics - data science, AI, and machine learning
Рет қаралды 4,3 М.
I Analyzed My Finance With Local LLMs
17:51
Thu Vu data analytics
Рет қаралды 491 М.
How to Build an LLM from Scratch | An Overview
35:45
Shaw Talebi
Рет қаралды 259 М.
How I Made AI Assistants Do My Work For Me: CrewAI
19:21
Maya Akim
Рет қаралды 871 М.
Create Your Own AI Person (For Free)
23:33
Matt Wolfe
Рет қаралды 365 М.
Let's build GPT: from scratch, in code, spelled out.
1:56:20
Andrej Karpathy
Рет қаралды 4,8 МЛН
Car Bubble vs Lamborghini
00:33
Stokes Twins
Рет қаралды 20 МЛН