Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation

机译：基于多标准的策略来停止主动学习数据注释

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we address the issue of deciding when to stop active learning for building a labeled training corpus. Firstly, this paper presents a new stopping criterion, classification-change, which considers the potential ability of each unla-beled example on changing decision boundaries. Secondly, a multi-criteria-based combination strategy is proposed to solve the problem of predefining an appropriate threshold for each confidence-based stopping criterion, such as max-confidence, min-error, and overall-uncertainty. Finally, we examine the effectiveness of these stopping criteria on uncertainty sampling and heterogeneous uncertainty sampling for active learning. Experimental results show that these stopping criteria work well on evaluation data sets, and the combination strategies outperform individual criteria.

机译：在本文中，我们解决了决定何时停止主动学习以建立标记训练语料库的问题。首先，本文提出了一种新的停止标准，即分类变更，它考虑了每个无用的例子在改变决策边界上的潜在能力。其次，提出了一种基于多准则的组合策略，以解决针对每个基于置信度的停止准则（例如最大置信度，最小误差和总体不确定性）预先确定适当阈值的问题。最后，我们检查了这些停止标准对于主动学习的不确定性采样和异构不确定性采样的有效性。实验结果表明，这些停止标准在评估数据集上效果很好，并且组合策略的性能优于单个标准。

著录项

来源
《22nd International Conference on Computational Linguistics》|2008年|1129-1136|共8页
会议地点 Manchester(GB);Manchester(GB)
作者
Jingbo Zhu; Huizhen Wang; Eduard Hovy;
展开▼
作者单位

Natural Language Processing Laboratory Northeastern University Shenyang, Liaoning, P.R.China 110004;

Natural Language Processing Laboratory Northeastern University Shenyang, Liaoning, P.R.China 110004;

University of Southern California Information Sciences Institute 4676 Admiralty Way Marina del Rey, CA 90292-6695;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类程序设计、软件工程;
关键词

相似文献

外文文献
中文文献
专利

1. Partition sampling: an active learning selection strategy for large database annotation [J] . F. Souvannavong, B. Merialdo, B. Huet IEE proceedings, Part K. Vision, image and signal processing . 2005,第3期

机译：分区采样：大型数据库注释的主动学习选择策略
2. Partition sampling: an active learning selection strategy for large database annotation [J] . F. Souvannavong, B. Merialdo, B. Huet IEE Proceedings. Part K, Vision, Image, and Signal Processing . 2005,第3期

机译：分区采样：大型数据库注释的主动学习选择策略
3. Effective Active Learning Strategies for the Use of Large-Margin Classifiers in Semantic Annotation: An Optimal Parameter Discovery Perspective [J] . Kaiquan Xu, Stephen Shaoyi Liao, Raymond Y. K. Lau, INFORMS journal on computing . 2014,第3期

机译：在语义注释中使用大幅度分类器的有效主动学习策略：最佳参数发现视角
4. Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation [C] . International Conference on Computational Linguistics . 2008

机译：基于多标准的策略，即停止积极学习数据注释
5. Active Learning, Human Annotation and Morpho-Syntactic Analysis of Ancient Greek [D] . Majidi, Saeed. 2021

机译：古希腊的积极学习，人体注释和态度句法分析
6. An active learning based classification strategy for the minority class problem: application to histopathology annotation [O] . Scott Doyle, James Monaco, Michael Feldman, 2011

机译：一种基于主动学习的少数群体问题分类策略：在组织病理学注释中的应用
7. Multi-Criteria-Based strategy to stop active learning for data annotation [O] . Jingbo Zhu, Huizhen Wang, Eduard Hovy 2013

机译：基于多标准的策略停止主动学习数据标注

Multi-Criteria-based Strategy to Stop Active Learning for Data Annotation

摘要

著录项

相似文献

相关主题

期刊订阅