International Speech Communication Association

Maximum Mutual Information Estimation with Unlabeled Data for Phonetic Classification



Abstract

This paper proposes a new training framework for mixed labeled and unlabeled data and evaluates it on the task of binary phonetic classification. Our training objective function combines Maximum Mutual Information (MMI) for labeled data with Maximum Likelihood (ML) for unlabeled data. Through the modified training objective, MMI estimates are smoothed with ML estimates obtained from unlabeled data. Moreover, our training criterion can help an existing model adapt to new speech characteristics present in unlabeled speech. In our phonetic classification experiments, the error rate drops consistently from MLE to MMIE with I-smoothing, and then to MMIE with unlabeled-data smoothing; transductive MMIE reduces it further. We also experimented with the gender-mismatched case, in which the best result shows that MMIE with unlabeled data has a 9.3% absolute lower error rate than MLE and a 2.35% absolute lower error rate than MMIE with I-smoothing.
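One plausible form of the combined objective described above, written as a minimal sketch: an MMI (conditional likelihood) term over the labeled set plus an ML (marginal likelihood) term over the unlabeled set. The interpolation weight $\alpha$ and the set symbols $\mathcal{L}$, $\mathcal{U}$ are assumed notation, not taken from the paper.

```latex
% Sketch of a mixed labeled/unlabeled training objective (assumed notation):
% MMI on the labeled set \mathcal{L}, ML on the unlabeled set \mathcal{U},
% blended by an interpolation weight \alpha.
\mathcal{F}(\lambda) =
  \sum_{(x,y)\in\mathcal{L}}
    \log \frac{p_\lambda(x \mid y)\,P(y)}{\sum_{y'} p_\lambda(x \mid y')\,P(y')}
  \;+\; \alpha \sum_{x\in\mathcal{U}}
    \log \sum_{y} p_\lambda(x \mid y)\,P(y)
```

The first term is the usual class-posterior (MMI) criterion on labeled pairs; the second is the marginal likelihood of unlabeled observations, which is what allows ML estimates from unlabeled data to smooth the MMI estimates.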


