首页> 外文会议>Conference on the North American Chapter of the Association for Computational Linguistics: Human Language Technologies >Symmetric Patterns and Coordinations: Fast and Enhanced Representations of Verbs and Adjectives
【24h】

Symmetric Patterns and Coordinations: Fast and Enhanced Representations of Verbs and Adjectives

机译:对称模式和协调:动词和形容词的快速表示和增强表示

获取原文

摘要

State-of-the-art word embeddings, which are often trained on bag-of-words (BOW) contexts, provide a high quality representation of aspects of the semantics of nouns. However, their quality decreases substantially for the task of verb similarity prediction. In this paper we show that using symmetric pattern contexts (SPs, e.g., "X and Y") improves word2vec verb similarity performance by up to 15% and is also instrumental in adjective similarity prediction. The unsupervised SP contexts are even superior to a variety of dependency contexts extracted using a supervised dependency parser. Moreover, we observe that SPs and dependency coordination contexts (Coor) capture a similar type of information, and demonstrate that Coor contexts are superior to other dependency contexts including the set of all dependency contexts, although they are still inferior to SPs. Finally, there are substantially fewer SP contexts compared to alternative representations, leading to a massive reduction in training time. On an 8G words corpus and a 32 core machine, the SP model trains in 11 minutes, compared to 5 and 11 hours with BOW and all dependency contexts, respectively.
机译:经常在词袋(BOW)上下文中进行训练的最先进的词嵌入技术可提供名词语义方面的高质量表示。但是,它们的质量大大降低了动词相似性预测的任务。在本文中,我们表明使用对称模式上下文(SP,例如“ X和Y”)可将word2vec动词相似性性能提高多达15%,并且在形容词相似性预测中也非常有用。无监督的SP上下文甚至优于使用监督的依赖关系解析器提取的各种依赖关系上下文。此外,我们观察到SP和依赖项协调上下文(Coor)捕获了相似类型的信息,并表明Coor上下文优于包括所有依赖项上下文集的其他依赖项上下文,尽管它们仍然不如SP。最后,与替代表示相比,SP上下文明显更少,从而大大减少了培训时间。在8G单词语料库和32核心机器上,SP模型的训练时间为11分钟,而使用BOW和所有依赖项上下文的训练时间分别为5小时和11小时。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号