Embeddings are generic representations that are useful for many NLP tasks. In this paper, we introduce Densifier, a method that learns an orthogonal transformation of the embedding space that concentrates the information relevant for a task in an ultradense subspace whose dimensionality is smaller by a factor of 100 than that of the original space. We show that ultradense embeddings generated by Densifier reach state-of-the-art performance on a lexicon creation task in which words are annotated with three types of lexical information: sentiment, concreteness and frequency. On the SemEval2015 10B sentiment analysis task, we show that no information is lost when the ultradense subspace is used, but training is an order of magnitude more efficient due to the compactness of the ultradense space.
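The projection step described above can be sketched as follows. This is a minimal illustration of the mechanics only: here the orthogonal matrix `Q` is random, whereas Densifier learns `Q` from task supervision; the dimensions `d`, `k` and the embedding matrix are made-up examples.

```python
import numpy as np

# Sketch: an orthogonal matrix Q maps d-dimensional embeddings into a
# space whose first k dimensions (k ~ d/100) form the task-specific
# ultradense subspace. Q is random here for illustration; Densifier
# itself learns Q (the training objective is not covered by this abstract).
rng = np.random.default_rng(0)
d, k = 300, 3                                      # original dim, ultradense dim
Q, _ = np.linalg.qr(rng.standard_normal((d, d)))   # random orthogonal matrix

E = rng.standard_normal((5, d))                    # 5 example word embeddings
ultradense = E @ Q[:, :k]                          # keep first k transformed dims

# Because Q is orthogonal, the full transform E @ Q loses no information;
# the ultradense view simply discards the task-irrelevant dimensions.
assert np.allclose(Q.T @ Q, np.eye(d), atol=1e-8)
print(ultradense.shape)                            # (5, 3)
```

Downstream models can then train on the k-dimensional vectors instead of the full d-dimensional ones, which is the source of the efficiency gain claimed above.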