I really like how well the content is organized and displayed.
@davidcampos1463 · 2 months ago
KZbin closed captions are still full of errors. But not here and now.
@jonnysolaris · 2 months ago
There's an error at 3:16: the encoder module doesn't map tokens to a 3D space, but rather to a higher-dimensional hypersphere. More info below 👇🏻
@jonnysolaris · 2 months ago
Token embeddings in Large Language Models operate in a far more complex space than simple 3D representations might suggest. Instead of mapping tokens to basic coordinate points, they exist in a high-dimensional hypersphere where the dimensionality (d_model) can exceed 12,000 in large GPT models (12,288 in the largest GPT-3). Rather than thinking of these embeddings as fixed points in this vast vector space, it's more accurate to conceptualise them as directional vectors on the surface of this hypersphere. Each token's representation combines two key elements: its direction in the high-dimensional space (which encodes semantic meaning) and its magnitude (representing emphasis or importance). This approach enables the model to capture rich semantic relationships through directional similarities while maintaining information about each token's relative significance. The massive dimensionality allows for extremely nuanced distinctions between concepts, far beyond what could be achieved in lower-dimensional spaces. This is why simplistic visualisations of embedding spaces can be misleading: they necessarily lose much of the geometric complexity that makes these representations so powerful for language processing tasks.
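To make the direction-versus-magnitude point concrete, here is a minimal sketch in plain Python. It is only an illustration: the 4-dimensional vectors and the names `cat`, `cat_scaled` and `other` are made up for this example and are not taken from any real model. It shows that cosine similarity depends only on direction, so scaling a vector changes its magnitude but not how similar it looks to other vectors.

```python
import math

def norm(v):
    """Euclidean length (magnitude) of a vector."""
    return math.sqrt(sum(x * x for x in v))

def direction(v):
    """Unit vector pointing the same way as v (a point on the unit hypersphere)."""
    n = norm(v)
    return [x / n for x in v]

def cosine_similarity(a, b):
    """Dot product of the two directions; depends on the angle, not the lengths."""
    return sum(x * y for x, y in zip(direction(a), direction(b)))

# Toy 4-dimensional "embeddings"; the values are hypothetical.
cat = [1.0, 2.0, 0.0, 2.0]
cat_scaled = [3.0 * x for x in cat]  # same direction, three times the magnitude
other = [2.0, -1.0, 2.0, 0.0]        # chosen to be orthogonal to cat

print("magnitudes:", norm(cat), norm(cat_scaled))
print("cat vs scaled cat:", cosine_similarity(cat, cat_scaled))
print("cat vs other:", cosine_similarity(cat, other))
```

Real embeddings have thousands of dimensions rather than four, but the same decomposition into a unit direction and a length applies.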
@togai-dev · 2 months ago
I thought GPTs were decoder-only?
@MrCooldude03 · 2 months ago
Will mirroring the video solve the mirrored-text issue?
@Copa20777 · 2 months ago
❤ GPTs from Zambia 🇿🇲
@Idonthaveahandleok · 2 months ago
should have put the transcript into GPT without the ground truth first