Online detection of vocal Listener Responses with maximum latency constraints

Abstract

When human listeners utter Listener Responses (e.g. back-channels or acknowledgments) such as ‘yeah’ and ‘mmhmm’, interlocutors commonly continue to speak or resume their speech even before the listener has finished his/her response. This type of speech interactivity results in frequent speech overlap which is common in human-human conversation. To allow for this type of speech interactivity to occur between humans and spoken dialog systems, which will result in more human-like continuous and smoother human-machine interaction, we propose an on-line classifier which can classify incoming speech as Listener Responses. We show that it is possible to detect vocal Listener Responses using maximum latency thresholds of 100–500 ms, thereby obtaining equal error rates ranging from 34% to 28% by using an energy based voice activity detector.
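The equal error rate (EER) quoted in the abstract is the operating point at which the false-accept rate and the false-reject rate of a detector coincide. The following is a minimal sketch of how such a figure can be computed from classifier scores. It is not the authors' evaluation code: the function name `equal_error_rate`, the score lists, and the convention that a higher score means "more likely a Listener Response" are all assumptions made for illustration.

```python
def equal_error_rate(target_scores, nontarget_scores):
    """Return (eer, threshold) at the point where the false-reject rate
    and the false-accept rate are closest.

    Hypothetical helper: a segment is accepted as a Listener Response
    when its score is >= threshold.
    """
    # Candidate thresholds are the observed scores themselves.
    thresholds = sorted(set(target_scores) | set(nontarget_scores))
    best = None
    for t in thresholds:
        # False rejects: true Listener Responses scored below the threshold.
        frr = sum(s < t for s in target_scores) / len(target_scores)
        # False accepts: other speech scored at or above the threshold.
        far = sum(s >= t for s in nontarget_scores) / len(nontarget_scores)
        gap = abs(far - frr)
        if best is None or gap < best[0]:
            # Report the midpoint of the two rates as the EER estimate.
            best = (gap, (far + frr) / 2, t)
    return best[1], best[2]


if __name__ == "__main__":
    # Made-up scores, for illustration only.
    listener_responses = [0.9, 0.8, 0.7, 0.3]
    other_speech = [0.6, 0.4, 0.2, 0.1]
    eer, threshold = equal_error_rate(listener_responses, other_speech)
    print(eer, threshold)  # prints 0.25 0.6
```

Under this reading, an EER of 28% would mean that at the chosen threshold roughly 28% of true Listener Responses are missed and roughly 28% of other speech is wrongly accepted.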

Bibliographic record

Source: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011, pp. 5836-5839 (4 pages)
Authors: Neiberg, Daniel; Truong, Khiet P.
Original format: PDF
Chinese Library Classification: Communication theory
Keywords: Speech processing; speech analysis

Similar documents

1. Making 'Maximum Fun' for fans: Examining podcast listener participation online [J]. Kyle Wrather. Radio Journal. 2016, No. 1
2. Sequential Two-Dimensional Partial Response Maximum Likelihood Detection Scheme with Constant-Weight Constraint Code for Holographic Data Storage Systems [J]. Gyuyeol Kong, Sooyong Choi. Japanese Journal of Applied Physics. 2012, No. 8, Issue 3
3. Middle latency auditory-evoked fields reflect psychoacoustic gap detection thresholds in human listeners [J]. Rupp A, Gutschalk A, Uppenkamp S, Journal of Neurophysiology. 2004, No. 4
4. Online Detection of Vocal Listener Responses with Maximum Latency Constraints [C]. Daniel Neiberg, Khiet P. Truong. IEEE International Conference on Acoustics, Speech and Signal Processing. 2011
5. Entity Relation Detection with Factorial Hidden Markov Models and Maximum Entropy Discriminant Latent Dirichlet Allocations [D]. Li, Dingcheng. 2011
6. Auditory brainstem response latency in forward masking a marker of sensory deficits in listeners with normal hearing thresholds [O]. Golbarg Mehraei, Andreu Paredes Gallardo, Barbara G. Shinn-Cunningham
7. Online Detection of Vocal Listener Responses with Maximum Latency Constraints [O]. Neiberg, Daniel; Truong, Khiet Phuong. 2011
8. A Method of Calculating the Vibratory Response of a Rigid Body to Arbitrary Excitation. Part I. Maximum of 4 Supports Exerting Longitudinal Constraint Only, Arbitrary Location and Direction [R]. Parkins, D. W. 1980
