首页> 外文会议>ISKE 2012;International Conference on Intelligent Systems and Knowledge Engineering >Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination
【24h】

Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination

机译:基于模型组合的中文句子标点符预测

获取原文

摘要

Punctuation prediction is very important for automatic speech recognition (ASR). It greatly improves the readability of transcripts and user experience, and facilitates following natural language processing tasks. In this study, we develop a model combination based approach for the recovery of punctuation for Chinese spoken sentence. Our approach models the relationships between punctuation and sentence by the different ways of sentence representation. And the relationships modeled are combined by multi-layer perception to predict punctuation (period, question mark, and exclamation mark). Different from previous studies, our proposed approach is designed to use global lexical information, not only local information. Results indicate that, compared with the baseline, our proposed method results in an absolute improvement of 10.0 % unweighted accuracy and 4.9 % weighted accuracy, respectively. Our approach finally achieves an unweighted accuracy of 86.9 % and a weighted accuracy of 92.4 %.
机译:标点符号预测对于自动语音识别(ASR)非常重要。它大大提高了成绩单和用户体验的可读性,并促进了自然语言处理任务之后。在这项研究中,我们开发了一种基于模型组合的方法,用于恢复中文句子的标点符号。我们的方法通过不同的句子表示方式模拟标点符号和句子之间的关系。建模的关系由多层感知组合以预测标点符号(周期,问号和感叹号)。与以往的研究不同,我们提出的方法旨在使用全球词汇信息,不仅是本地信息。结果表明,与基线相比,我们所提出的方法可以绝对提高10.0%的未加权精度和4.9%的加权准确性。我们的方法最终实现了86.9%的不安全的准确性,加权准确性为92.4%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号