Views: 10,492
This video explains the Unigram algorithm for tokenization: how it is trained on a text corpus and how it is applied to tokenize text.
This video is part of the Hugging Face course: huggingface.co/...
Related videos:
Byte Pair Encoding Tokenization: • Byte Pair Encoding Tok...
WordPiece Tokenization: • WordPiece Tokenization
Don't have a Hugging Face account? Join now: huggingface.co/...
Have a question? Check out the forums: discuss.huggin...
Subscribe to our newsletter: huggingface.cu...