LEARNING SPOKEN WORDS FROM MULTISENSORY INPUT

Abstract

Speech recognition and speech translation are traditionally addressed by processing acoustic signals, while nonlinguistic information is typically not used. In this paper, we present a new method that explores spoken word learning from naturally co-occurring multisensory information in a dyadic (two-person) conversation. It has been observed that listeners have a strong tendency to look toward the objects referred to by the speaker during a conversation. In light of this, we propose to use eye gaze to integrate acoustic and visual signals and to build audio-visual lexicons of objects. With such data gathered from conversations in different languages, the spoken names of objects in different languages can be translated based on their visual semantics. We have developed a multimodal learning system and report the results of experiments that use speech and video, in concert with eye movement records, as training data.
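The core idea is gaze-aligned co-occurrence between spoken word tokens and visually attended objects. The sketch below is a deliberately simplified illustration of that idea, not the system described in the paper: it assumes speech has already been segmented into time-stamped word-like units, that video processing plus eye tracking yields time-stamped fixated-object labels, and that a fixed time window (here 0.5 s) defines co-occurrence. All names (build_lexicon, best_object, translate) are hypothetical. It builds an audio-visual lexicon from co-occurrence counts and translates object names across languages by matching lexicon entries through their shared visual categories.

# Hypothetical sketch of gaze-guided audio-visual word learning (Python).
# Not the paper's actual algorithm; see the assumptions stated above.
from collections import defaultdict

def build_lexicon(word_events, gaze_events, window=0.5):
    """Associate spoken word tokens with visually attended objects.

    word_events: list of (timestamp_sec, word_token) from segmented speech
    gaze_events: list of (timestamp_sec, fixated_object_label) from eye tracking
    window:      max time offset (sec) for a word/gaze pair to count as co-occurring
    Returns {word_token: {object_label: count}}.
    """
    lexicon = defaultdict(lambda: defaultdict(int))
    for t_w, word in word_events:
        for t_g, obj in gaze_events:
            if abs(t_w - t_g) <= window:
                lexicon[word][obj] += 1
    return lexicon

def best_object(lexicon, word):
    """Most frequently co-attended object for a spoken word, or None."""
    counts = lexicon.get(word, {})
    return max(counts, key=counts.get) if counts else None

def translate(word, lex_src, lex_tgt):
    """Map a source-language word to a target-language word whose
    lexicon entry points to the same visual object."""
    obj = best_object(lex_src, word)
    if obj is None:
        return None
    candidates = {w: c[obj] for w, c in lex_tgt.items() if obj in c}
    return max(candidates, key=candidates.get) if candidates else None

if __name__ == "__main__":
    # Toy example: English and (romanized) Mandarin sessions about the same objects.
    lex_en = build_lexicon([(1.0, "cup"), (3.0, "book")],
                           [(1.1, "CUP"), (3.2, "BOOK")])
    lex_zh = build_lexicon([(2.0, "beizi"), (4.0, "shu")],
                           [(2.1, "CUP"), (4.1, "BOOK")])
    print(translate("cup", lex_en, lex_zh))  # -> "beizi"

In this toy setting, translation never compares the acoustic forms of the two languages directly; the shared visual object category serves as the pivot, which is the role the abstract assigns to visual semantics.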