Paraphrastic language models

X. Liu; M.J.F. Gales; P.C. Woodland

Abstract

Natural languages are known for their expressive richness: many sentences can be used to represent the same underlying meaning. Modelling only the observed surface word sequence can result in poor context coverage and generalization, for example, when using n-gram language models (LMs). This paper proposes a novel form of language model, the paraphrastic LM, that addresses these issues. A phrase-level paraphrase model, statistically learned from standard text data with no semantic annotation, is used to generate multiple paraphrase variants. LM probabilities are then estimated by maximizing their marginal probability. Multi-level language models estimated at both the word level and the phrase level are combined. An efficient weighted finite-state transducer (WFST) based paraphrase generation approach is also presented. Significant error rate reductions of 0.5-0.6% absolute were obtained over the baseline n-gram LMs on two state-of-the-art recognition tasks, English conversational telephone speech and Mandarin Chinese broadcast speech, using a paraphrastic multi-level LM modelling both word and phrase sequences. When this model is further combined with word-level and phrase-level feed-forward neural network LMs, significant error rate reductions of 0.9% absolute (9% relative) and 0.5% absolute (5% relative) were obtained over the baseline n-gram and neural network LMs respectively.
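
The abstract quotes each gain both as an absolute and as a relative error rate reduction. The short Python sketch below shows how the two figures relate and what baseline error rate a given pair implies. The function names are hypothetical, chosen for illustration only, and because the published percentages are rounded, any baseline derived this way is approximate rather than a figure reported by the authors.

```python
def relative_reduction(baseline: float, improved: float) -> float:
    """Relative error rate reduction in percent, given two error rates in percent."""
    return (baseline - improved) / baseline * 100.0


def implied_baseline(absolute_reduction: float, relative_reduction_pct: float) -> float:
    """Baseline error rate (percent) implied by an absolute/relative reduction pair.

    Uses relative = absolute / baseline, so baseline = absolute / relative.
    """
    return absolute_reduction / relative_reduction_pct * 100.0


if __name__ == "__main__":
    # Figures quoted in the abstract: 0.9% absolute corresponds to 9% relative.
    baseline = implied_baseline(0.9, 9.0)
    print(f"implied baseline error rate: about {baseline:.1f}%")

    # Sanity check in the other direction with the implied baseline.
    print(f"relative reduction: about {relative_reduction(baseline, baseline - 0.9):.1f}%")
```

Read this way, a 0.9% absolute gain that is also a 9% relative gain points to a baseline error rate of roughly 10%, which is why a sub-1% absolute improvement is still reported as significant.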

Bibliographic record

Source
Computer Speech and Language | 2014, Issue 6 | pp. 1298-1316 | 19 pages
Authors
X. Liu; M.J.F. Gales; P.C. Woodland;
Affiliations

Cambridge University, Engineering Department, Trumpington Street, Cambridge CB2 1PZ, England (all three authors)

Indexed in: Science Citation Index (SCI, USA); Engineering Index (EI, USA)
Original format: PDF
Language: English
Keywords
Language modelling; Paraphrase; Speech recognition;

Similar literature

1. An empirical study of statistical language models: n-gram language models vs. neural network language models [J]. Freha Mezzoudj, Abdelkader Benyettou. International Journal of Innovative Computing and Applications, 2018, Issue 4.
2. Characterizing and evaluating the quality of software process modeling language: Comparison of ten representative model-based languages [J]. Garcia-Garcia J. A., Enriquez J. G., Dominguez-Mayo F. J. Computer Standards & Interfaces, 2019, Issue MARa.
3. The challenge of conceptual modeling for product-service systems: status-quo and perspectives for reference models and modeling languages [J]. Joerg Becker, Daniel F. Beverungen, Ralf Knackstedt. Information Systems and e-Business Management, 2010, Issue 1.
4. Paraphrastic language models and combination with neural network language models [C]. Liu X., Gales M.J.F., Woodland P.C. IEEE International Conference on Acoustics, Speech and Signal Processing, 2013.
5. Metalinguistic awareness: paraphrastic ability in normal and learning disabled secondary students [D]. Bylsma, Jacqueline Gunner. 1985.
6. Enhancing African low-resource languages: Swahili data for language modelling [O]. Casper S. Shikali, Refuoe Mokhosi. 2020.
7. Paraphrastic language models and combination with neural network language models [O]. Liu X, Gales Mark John, Woodland Philip Charles. 2013.
