Journal: Information Retrieval

Methods for automatically evaluating answers to complex questions



Abstract

Evaluation is a major driving force in advancing the state of the art in language technologies. In particular, automatically assessing the quality of machine output is the preferred way to measure progress, provided that the automatic metrics have been validated against human judgments. Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, an automatic technique for evaluating answers to complex questions based on n-gram co-occurrences between machine output and a human-generated answer key. Until now, the only way to assess the correctness of answers to such questions has been to manually determine whether each information "nugget" appears in a system's response. The lack of automatic methods for scoring system output is an impediment to progress in the field, which we address with this work. Experiments with the TREC 2003, TREC 2004, and TREC 2005 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics.
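To make the idea in the abstract concrete, the following is a minimal, hypothetical Python sketch of a POURPRE-style scorer: each answer-key nugget is matched against the system response by term overlap, and the resulting soft match scores are plugged into the standard TREC nugget F(beta) formula (recall over vital nuggets, length-allowance precision). The tokenization, the unigram-only matching, the 100-character allowance, and all function names are illustrative assumptions for exposition, not the authors' actual implementation.

```python
import re

# Hypothetical sketch of a POURPRE-style scorer: soft nugget matching via
# term overlap, plugged into a TREC-style nugget F(beta) measure.
# Names and parameters are illustrative, not from the original system.

def tokenize(text):
    return re.findall(r"\w+", text.lower())

def nugget_match(nugget, answer_text):
    """Fraction of the nugget's terms that appear in the system answer (0..1)."""
    nugget_terms = set(tokenize(nugget))
    answer_terms = set(tokenize(answer_text))
    if not nugget_terms:
        return 0.0
    return len(nugget_terms & answer_terms) / len(nugget_terms)

def pourpre_style_score(vital_nuggets, okay_nuggets, answer_text, beta=5.0):
    """Nugget F(beta) using soft matches instead of human yes/no judgments."""
    vital_scores = [nugget_match(n, answer_text) for n in vital_nuggets]
    okay_scores = [nugget_match(n, answer_text) for n in okay_nuggets]

    # Recall counts only vital nuggets, as in the official nugget metric.
    recall = sum(vital_scores) / len(vital_nuggets) if vital_nuggets else 0.0

    # Length-based precision: each matched nugget earns a character allowance.
    allowance = 100 * (sum(vital_scores) + sum(okay_scores))
    length = len(answer_text)
    precision = 1.0 if length <= allowance else 1.0 - (length - allowance) / length

    if precision + recall == 0:
        return 0.0
    return (beta ** 2 + 1) * precision * recall / (beta ** 2 * precision + recall)

# Usage example: score one system response against a tiny answer key.
vital = ["founded in 1998 by Larry Page and Sergey Brin"]
okay = ["headquartered in Mountain View"]
response = "Google was founded by Larry Page and Sergey Brin in 1998."
print(round(pourpre_style_score(vital, okay, response), 3))
```

Because the match scores are continuous rather than binary, a ranking of systems by this kind of score can be compared against the official, manually judged rankings, which is the validation strategy the abstract describes.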


