Word2vec in PyTorch. We introduced N-gram models (unigrams, bigrams, trigrams), together with their probability modeling and issues
Chain rule and n-gram estimation. Perplexity and its close relationship with entropy. Smoothing and interpolation.
No comments:
Post a Comment