Рет қаралды 5,775
BytePair embeddings are a really cool idea. BytePair Embeddings can be seen as a lightweight variant of FastText. They need less memory because they are more selective in what subtokens they remember. This also makes them useful in certain scenarios because they can ignore subwords as well. They're also available in 275 languages!
If you want to see the Rasa NLU examples repo, go here:
github.com/Ras...
If you want to see the Whatlies repo for these embeddings, go here:
rasahq.github....
If you want to see the BPEmb repo, go here:
nlp.h-its.org/...