首页> 外文期刊>Journal of visual communication & image representation >Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping
【24h】

Geometrical-based lip-reading using template probabilistic multi-dimension dynamic time warping

机译:使用模板概率多维动态时间规整的基于几何的唇读

获取原文
获取原文并翻译 | 示例
           

摘要

By identifying lip movements and characterizing their associations with speech sounds, the performance of speech recognition systems can be improved, particularly when operating in noisy environments. In this paper, we present a geometrical-based automatic lip reading system that extracts the lip region from images using conventional techniques, but the contour itself is extracted using a novel application of a combination of border following and convex hull approaches. Classification is carried out using an enhanced dynamic time warping technique that has the ability to operate in multiple dimensions and a template probability technique that is able to compensate for differences in the way words are uttered in the training set. The performance of the new system has been assessed in recognition of the English digits 0 to 9 as available in the CUAVE database. The experimental results obtained from the new approach compared favorably with those of existing lip reading approaches, achieving a word recognition accuracy of up to 71% with the visual information being obtained from estimates of lip height, width and their ratio. (C) 2015 Elsevier Inc. All rights reserved.
机译:通过识别嘴唇运动并表征其与语音的关联,可以提高语音识别系统的性能,尤其是在嘈杂的环境中操作时。在本文中,我们提出了一种基于几何的自动嘴唇读取系统,该系统使用常规技术从图像中提取嘴唇区域,但轮廓本身是使用边界跟随和凸包方法的组合的新颖应用程序提取的。使用增强的动态时间规整技术(可以在多个维度上操作)和模板概率技术(可以补偿训练集中说出单词方式的差异)来进行分类。新系统的性能已通过对CUAVE数据库中可用的英文数字0到9的识别进行了评估。从新方法获得的实验结果与现有的唇读方法相比具有优势,通过从唇高,宽度及其比例的估计获得的视觉信息,实现了高达71%的单词识别精度。 (C)2015 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号