How do we do online learning with n-gram language models?
Learning Semantic Similarity for Multi-label Text Categorization (2014):
The continuous Skip-gram algorithm is an efficient deep learning method for learning high-quality distributed vector representations that capture a large number of precise semantic word relationships.
See also my quick and dirty webpage: