
Exploring Numeracy in Word Embeddings

Abstract

Word embeddings are now pervasive across NLP subfields as the de-facto method of forming text representations. In this work, we show that existing embedding models are inadequate at constructing representations that capture salient aspects of mathematical meaning for numbers, which is important for language understanding. Numbers are ubiquitous and frequently appear in text. Inspired by cognitive studies on how humans perceive numbers, we develop an analysis framework to test how well word embeddings capture two essential properties of numbers: magnitude (e.g. 3<4) and numeration (e.g. 3=three). Our experiments reveal that most models capture an approximate notion of magnitude, but are inadequate at capturing numeration. We hope that our observations provide a starting point for the development of methods which better capture numeracy in NLP systems.
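The magnitude property described in the abstract can be probed with a simple linear regression from an embedding to the (log-scaled) numeric value, checking whether predictions preserve the true ordering. The sketch below illustrates this probing setup on synthetic stand-in vectors; the random embeddings, the magnitude-correlated direction, and the log-value target are all illustrative assumptions, not the paper's actual framework or pretrained vectors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in embeddings for the numbers 1..10: random noise plus a weak
# direction correlated with log-magnitude (assumption for illustration only).
numbers = np.arange(1, 11)
dim = 50
direction = rng.normal(size=dim)
embeddings = rng.normal(size=(len(numbers), dim)) + np.log(numbers)[:, None] * direction

# Magnitude probe: least-squares fit from embedding (plus bias) to log-value,
# then test whether the predicted values rank the numbers correctly.
X = np.hstack([embeddings, np.ones((len(numbers), 1))])
w, *_ = np.linalg.lstsq(X, np.log(numbers), rcond=None)
pred = X @ w
order_preserved = bool(np.all(np.argsort(pred) == np.argsort(numbers)))
print("magnitude ordering preserved:", order_preserved)
```

A numeration probe would be analogous: compare the vector for a numeral (e.g. "3") against its word form ("three"), for instance by nearest-neighbor lookup, and check whether the two surface forms of the same number are each other's closest match.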
