Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination

Abstract

Punctuation prediction is important for automatic speech recognition (ASR): it greatly improves the readability of transcripts and the user experience, and it facilitates downstream natural language processing tasks. In this study, we develop a model-combination approach for recovering punctuation in Chinese spoken sentences. Our approach models the relationships between punctuation and sentences using several different sentence representations. The modeled relationships are then combined by a multi-layer perceptron to predict the punctuation mark (period, question mark, or exclamation mark). Unlike previous studies, our approach is designed to use global lexical information, not only local information. Compared with the baseline, the proposed method yields absolute improvements of 10.0% in unweighted accuracy and 4.9% in weighted accuracy, reaching a final unweighted accuracy of 86.9% and a weighted accuracy of 92.4%.
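The abstract reports both unweighted and weighted accuracy without defining them. The sketch below assumes the convention common in speech classification work: weighted accuracy is the overall fraction of correctly labeled sentences, and unweighted accuracy is the mean of the per-class recalls, so rare marks count as much as periods. These definitions, the function names, and the toy labels are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter


def weighted_accuracy(y_true, y_pred):
    """Overall accuracy: correct predictions divided by all predictions (assumed definition)."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def unweighted_accuracy(y_true, y_pred):
    """Mean per-class recall: every class contributes equally (assumed definition)."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


# Hypothetical toy data: eight periods, one question mark, one exclamation mark.
y_true = ["."] * 8 + ["?"] + ["!"]
# The single question is mislabeled as a period; everything else is correct.
y_pred = ["."] * 8 + ["."] + ["!"]

wa = weighted_accuracy(y_true, y_pred)
ua = unweighted_accuracy(y_true, y_pred)
print(f"weighted accuracy:   {wa:.3f}")
print(f"unweighted accuracy: {ua:.3f}")
```

Under these assumed definitions, one missed question costs little weighted accuracy but a third of the unweighted accuracy in this toy set, which may be why a skewed class distribution leads to the two figures being reported side by side.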

Bibliographic details

Source: ISKE 2012: International Conference on Intelligent Systems and Knowledge Engineering | 2014 | 10 pages
Authors: Xiao Chen; Dengfeng Ke; Bo Xu
Original format: PDF
Chinese Library Classification: TP18-532
Keywords: Punctuation prediction; Model combine; Global lexical information

Similar literature

1. Automatically identifying the sentence skeleton of Chinese sentences based on the event model [J]. Xu Wei. Tsinghua Science and Technology. 2012, No. 3
2. Automatically Identifying the Sentence Skeleton of Chinese Sentences Based on the Event Model [J]. Wei Xu, Ke Zhao, Zhenzhen Yi. Journal of Tsinghua University (English Edition). 2012, No. 003
3. Research on Diagnosis Prediction of Traditional Chinese Medicine Diseases Based on Improved Bayesian Combination Model [J]. Zhulv Zhang, Jinghua Li, Wanting Zheng. Evidence-based complementary and alternative medicine: eCAM. 2021, No. a
4. Punctuation Prediction for Chinese Spoken Sentence Based on Model Combination [C]. Xiao Chen, Dengfeng Ke, Bo Xu. ISKE 2012. 2014
5. Hybrid System Combination for Machine Translation: An Integration of Phrase-level and Sentence-level Combination Approaches [D]. Ma, Wei-Yun. 2014
6. Research on Diagnosis Prediction of Traditional Chinese Medicine Diseases Based on Improved Bayesian Combination Model [O]. Zhulv Zhang, Jinghua Li, Wanting Zheng. 2021
7. Automatic Sentence Segmentation and Punctuation Prediction for Spoken Language Translation [O]. Matusov Evgeny, Mauser Arne, Ney Hermann. 2006