Improved Data-Driven Generation of Pronunciation Dictionaries Using an Adapted Word List

机译：使用自适应单词列表改进了数据驱动的发音词典的生成

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

Data-driven approaches to learning pronunciation variants for phonetic dictionaries have to deal with the problem of acquiring a sufficient amount of training data. The reason is not the size of the databases, but the unfavorable distribution of word frequencies in natural speech, which is known as Zipfs law. In this paper we suggest a method which reorganizes a phonetic dictionary according to a given speech database in order to maximize the number of word models for which pronunciation variants can be learned with this corpus. Reorganization takes place automatically by analyzing the orthographic and phonetic transcriptions of the corpus. The method produces an alternative word list consisting of units ranging from partial words to multi-words. The efficiency and the limits of the approach are discussed on the basis of experiments carried out on the German VERBMOBIL corpus.

机译：数据驱动的方法来学习语音词典的发音变体必须解决获取足够数量的训练数据的问题。原因不是数据库的大小，而是自然语音中词频的不利分布，这被称为Zipfs定律。在本文中，我们提出了一种根据给定的语音数据库重新组织语音词典的方法，以最大程度地利用该语料库学习语音变体的单词模型数量。通过分析语料库的正字法和音标会自动进行重组。该方法产生由范围从部分单词到多单词的单位组成的替代单词列表。在对德国VERBMOBIL语料库进行的实验的基础上，讨论了该方法的效率和局限性。

著录项

来源
《European Conference on Speech Communication and Technology v.2; 20010903-20010907; Aalborg; DK》|2001年|P.1433-1436|共4页
会议地点 Aalborg(DK);Aalborg(DK)
作者
Matthias Wolff; Matthias Eichner; Ruediger Hoffmann;
展开▼
作者单位

Dresden University of Technology, Laboratory of Acoustics and Speech Communication D-01062 Dresden, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类传播理论;
关键词

相似文献

外文文献
中文文献
专利

1. Within-word pronunciation variation modeling for Arabic ASRs:a direct data-driven approach [J] . Dia AbuZeina, Wasfi Al-Khatib, Moustafa Elshafei, International journal of speech technology . 2012,第2期

机译：阿拉伯语ASR的词内发音变化建模：直接数据驱动方法
2. Adaption Model for Building Agile Pronunciation Dictionaries Using Phonemic Distance Measurements [J] . Akella Amarendra Babu, Rama Devi Yellasiri, Natukula Sainath International Journal of Information Technology . 2018,第6期

机译：使用音位距离测量构建敏捷发音词典的适应模型
3. Building Words Dictionary List Using Symbol Enumeration and Hashing Methodology [J] . Safa S. Abdul-Jabbar, Dr. Loay E. George Research journal of applied science, engineering and technology . 2016,第12期

机译：使用符号枚举和散列方法构建单词词典列表
4. Improved Data-Driven Generation of Pronunciation Dictionaries Using an Adapted Word List [C] . Matthias Wolff, Matthias Eichner, Ruediger Hoffmann European conference on speech communication and technology . 2001

机译：使用适应的单词列表改进数据驱动的发音词典
5. A Study on Homophone Words in the Dictionary-Based Password Cracking [D] . Mandapaka, Ajay. 2017

机译：基于字典的密码破解中的同音词研究
6. The American Illustrated Medical Dictionary: A Complete Dictionary of the Terms Used in Medicine Surgery Dentistry Pharmacy Chemistry Nursing Veterinary Science Biology Medical Biography etc. with the Pronunciation Derivation and Definition [O] . 1935

机译：美国插图医学词典：医学外科牙科药学化学护理兽医学生物学医学传记等术语的完整词典包括发音派生和定义
7. Improving the Arabic Pronunciation Dictionary for Phone and Word Recognition with Linguistically-Based Pronunciation Rules [O] . Biadsy Fadi, Hirschberg Julia Bell, Habash Nizar Y. 2009

机译：使用基于语言的发音规则改进用于电话和单词识别的阿拉伯语发音词典
8. The Pronunciation of English Air Traffic Control Words by Controllers from Twelve Icao Nations [R] . Moser, H. M. 1964

机译：十二Icao国家控制人员英语空中交通管制词的发音

Improved Data-Driven Generation of Pronunciation Dictionaries Using an Adapted Word List

摘要

著录项

相似文献

相关主题

期刊订阅