首页> 外文会议>10th Workshop on statistical machine translation 2015 >Data Enhancement and Selection Strategies for the Word-level Quality Estimation
【24h】

Data Enhancement and Selection Strategies for the Word-level Quality Estimation

机译:词级质量估计的数据增强和选择策略

获取原文
获取原文并翻译 | 示例

摘要

This paper describes the DCU-SHEFF word-level Quality Estimation (QE) system submitted to the QE shared task at WMT15. Starting from a baseline set of features and a CRF algorithm to learn a sequence tagging model, we propose improvements in two ways: (ⅰ) by filtering out the training sentences containing too few errors, and (ⅱ) by adding incomplete sequences to the training data to enrich the model with new information. We also experiment with considering the task as a classification problem, and report results using a subset of the features with Random Forest classifiers.
机译:本文介绍了在WMT15上提交给QE共享任务的DCU-SHEFF单词级质量评估(QE)系统。从一组基线特征和一个CRF算法开始以学习序列标记模型,我们提出了两种改进方法:(ⅰ)通过过滤掉错误太少的训练语句,以及(ⅱ)在训练中添加不完整的序列数据以用新信息丰富模型。我们还尝试将任务视为分类问题,并使用带有随机森林分类器的部分功能报告结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号