首页> 外文会议>22nd International Conference on Computational Linguistics >A Discriminative Alignment Model for Abbreviation Recognition
【24h】

A Discriminative Alignment Model for Abbreviation Recognition

机译:缩写识别的判别比对模型

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a discriminative alignment model for extracting abbreviations and their full forms appearing in actual text. The task of abbreviation recognition is formalized as a sequential alignment problem, which finds the optimal alignment (origins of abbreviation letters) between two strings (abbreviation and full form). We design a large amount of finegrained features that directly express the events where letters produce or do not produce abbreviations. We obtain the optimal combination of features on an aligned abbreviation corpus by using the maximum entropy framework. The experimental results show the usefulness of the alignment model and corpus for improving abbreviation recognition.
机译:本文提出了一种判别性对齐模型,用于提取缩写及其在实际文本中出现的完整形式。缩写识别的任务被形式化为顺序对齐问题,该顺序对齐问题可以找到两个字符串(缩写和完整形式)之间的最佳对齐方式(缩写字母的来源)。我们设计了大量细粒度的功能,可以直接表达字母产生或不产生缩写的事件。通过使用最大熵框架,我们可以获得对齐的缩写语料库上特征的最佳组合。实验结果表明,比对模型和语料库对提高缩写识别的有效性。

著录项

  • 来源
  • 会议地点 Manchester(GB);Manchester(GB)
  • 作者单位

    Graduate School of Information Science and Technology University of Tokyo 7-3-1 Hongo, Bunkyo-ku Tokyo 113-8656, Japan;

    School of Computer Science, University of Manchester National Centre for Text Mining (NaCTeM) Manchester Interdisciplinary Biocentre 131 Princess Street, Manchester M1 7DN, UK;

    Graduate School of Information Science and Technology University of Tokyo 7-3-1 Hongo, Bunkyo-ku Tokyo 113-8656, Japan School of Computer Science, University of Manchester National Centre for Text Mining (NaCTeM) Manchester Interdisciplinary Biocentre 131 Princess Street, Manchester M1 7DN, UK;

  • 会议组织
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 程序设计、软件工程;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号