Estimation of Discourse Segmentation Labels from Crowd Data


Abstract

For annotation tasks involving independent judgments, probabilistic models have been used to infer ground truth labels from data where a crowd of many annotators labels the same items. Such models have been shown to produce results superior to taking the majority vote, but have not been applied to sequential data. We present two methods to infer ground truth labels from sequential annotations where we assume judgments are not independent, based on the observation that an annotator's segments all tend to be several utterances long. The data consists of crowd labels for annotation of discourse segment boundaries. The new methods extend Hidden Markov Models to relax the independence assumption. The two methods are distinct, so positive labels proposed by both are taken to be ground truth. In addition, results of the models are checked using metrics that test whether an annotator's accuracy relative to a given model remains consistent across different conversations.
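The abstract contrasts model-based inference with a majority-vote baseline, and says that positive labels proposed by both new methods are taken as ground truth. The sketch below illustrates only those two combination rules on binary boundary labels. It does not implement the paper's HMM extensions; the function names, the toy crowd matrix, and the two "method" outputs are hypothetical values chosen for illustration.

```python
from typing import List


def majority_vote(crowd: List[List[int]]) -> List[int]:
    """Baseline: crowd[a][i] is 1 if annotator a marked a segment
    boundary at utterance i. An item is positive when strictly more
    than half of the annotators marked it."""
    n_annotators = len(crowd)
    n_items = len(crowd[0])
    return [
        1 if 2 * sum(row[i] for row in crowd) > n_annotators else 0
        for i in range(n_items)
    ]


def intersect_positives(labels_a: List[int], labels_b: List[int]) -> List[int]:
    """Keep a boundary only where both label sequences propose one."""
    return [1 if a == 1 and b == 1 else 0 for a, b in zip(labels_a, labels_b)]


# Toy data (assumed): 3 annotators, 6 utterances.
crowd = [
    [0, 1, 0, 0, 1, 0],
    [0, 1, 0, 1, 1, 0],
    [0, 0, 0, 1, 1, 0],
]
baseline = majority_vote(crowd)

# Hypothetical outputs standing in for the two inference methods.
method_a = [0, 1, 0, 0, 1, 0]
method_b = [0, 1, 0, 1, 1, 0]
consensus = intersect_positives(method_a, method_b)

print(baseline)
print(consensus)
```

Taking the intersection is a conservative choice: it trades recall for precision, which fits the abstract's reasoning that agreement between two distinct methods is stronger evidence than either alone.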

Bibliographic record

Source
Conference on Empirical Methods in Natural Language Processing, 2015, pp. 2190-2200 (11 pages)
Authors
Ziheng Huang; Jialu Zhong; Rebecca J. Passonneau
Original format: PDF