首页> 外文会议>22nd International Conference on Computational Linguistics >Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation
【24h】

Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation

机译:基于多标准的策略来停止主动学习数据注释

获取原文
获取原文并翻译 | 示例

摘要

In this paper, we address the issue of deciding when to stop active learning for building a labeled training corpus. Firstly, this paper presents a new stopping criterion, classification-change, which considers the potential ability of each unla-beled example on changing decision boundaries. Secondly, a multi-criteria-based combination strategy is proposed to solve the problem of predefining an appropriate threshold for each confidence-based stopping criterion, such as max-confidence, min-error, and overall-uncertainty. Finally, we examine the effectiveness of these stopping criteria on uncertainty sampling and heterogeneous uncertainty sampling for active learning. Experimental results show that these stopping criteria work well on evaluation data sets, and the combination strategies outperform individual criteria.
机译:在本文中,我们解决了决定何时停止主动学习以建立标记训练语料库的问题。首先,本文提出了一种新的停止标准,即分类变更,它考虑了每个无用的例子在改变决策边界上的潜在能力。其次,提出了一种基于多准则的组合策略,以解决针对每个基于置信度的停止准则(例如最大置信度,最小误差和总体不确定性)预先确定适当阈值的问题。最后,我们检查了这些停止标准对于主动学习的不确定性采样和异构不确定性采样的有效性。实验结果表明,这些停止标准在评估数据集上效果很好,并且组合策略的性能优于单个标准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号