首页>
外国专利>
Unsupervised Topic Modeling For Short Texts
Unsupervised Topic Modeling For Short Texts
展开▼
机译:短文本的无监督主题建模
展开▼
页面导航
摘要
著录项
相似文献
摘要
Topics are determined for short text messages using an unsupervised topic model. In a training corpus created from a number of short text messages, a vocabulary of words is identified, and for each word a distributed vector representation is obtained by processing windows of the corpus having a fixed length. The corpus is modeled as a Gaussian mixture model in which Gaussian components represent topics. To determine a topic of a sample short text message, a posterior distribution over the corpus topics is obtained using the Gaussian mixture model.
展开▼