首页> 外文会议>China National Conference on Computational Linguistics >LiveQA: A Question Answering Dataset Over Sports Live
【24h】

LiveQA: A Question Answering Dataset Over Sports Live

机译:LiveQA:一个问题在体育生活中应答DataSet

获取原文

摘要

In this paper, we introduce LiveQA, a new question answering dataset constructed from play-by-play live broadcast. It contains 117k multiple-choice questions written by human commentators for over 1,670 NBA games, which are collected from the Chinese Hupu (https://nba. hupu.com/games.) website. Derived from the characteristics of sports games, LiveQA can potentially test the reasoning ability across timeline-based live broadcasts, which is challenging compared to the existing datasets. In LiveQA, the questions require understanding the timeline, tracking events or doing mathematical computations. Our preliminary experiments show that the dataset introduces a challenging problem for question answering models, and a strong baseline model only achieves the accuracy of 53.1% and cannot beat the dominant option rule. We release the code and data of this paper for future research, (code: https://github.com/PKU-TANGENT/GAReader-LiveQA), (data: https://github.com/PKU-TANGENT/LiveQA).
机译:在本文中,我们介绍了LiveQA,一个新的问题接听了由Play-Play直播的数据集。 它包含由人类评论员编写的117K多项选择题,超过1,670名NBA游戏,该游戏从中国Hupu(https:// nba。hupu.com/games。)网站。 源自运动游戏的特点,LiveQA可能会在与现有数据集相比,跨越基于时间线的现场广播的推理能力测试。 在LiveQA中,问题需要了解时间表,跟踪事件或进行数学计算。 我们的初步实验表明,数据集介绍了问题应答模型的具有挑战性问题,强大的基线模型仅实现了53.1%的准确性,无法击败主导选项规则。 我们释放本文的代码和数据以供未来的研究,(代码:https://github.com/pku-tangent/gariveqa),(数据:https://github.com/pku-tangent/liveqa) 。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号