首页> 中文期刊> 《计算机应用》 >基于SIFT的说话人唇动识别

基于SIFT的说话人唇动识别

         

摘要

Aiming at the problem that the lip feature dimension is too high and sensitive to the scale space,a technique based on the Scale-Invariant Feature Transform (SIFT) algorithm was proposed to carry out the speaker authentication.Firstly,a simple video frame image neat algorithm was proposed to adjust the length of the lip video to the same length,and the representative lip motion pictures were extracted.Then,a new algorithm based on key points of SIFT was proposed to extract the texture and motion features.After the integration of Principal Component Analysis (PCA) algorithm,the typical lip motion features were obtained for authentication.Finally,a simple classification algorithm was presented according to the obtained features.The experimental results show that compared to the common Local Binary Pattern (LBP) feature and the Histogram of Oriental Gradient (HOG) feature,the False Acceptance Rate (FAR) and False Rejection Rate (FRR) of the proposed feature extraction algorithm are better,which proves that the whole speaker lip motion recognition algorithm is effective and can get the ideal results.%针对唇部特征提取维度过高以及对尺度空间敏感的问题,提出了一种基于尺度不变特征变换(SIFT)算法作特征提取来进行说话人身份认证的技术.首先,提出了一种简单的视频帧图片规整算法,将不同长度的唇动视频规整到同一的长度,提取出具有代表性的唇动图片;然后,提出一种在SIFT关键点的基础上,进行纹理和运动特征的提取算法,并经过主成分分析(PCA)算法的整合,最终得到具有代表性的唇动特征进行认证;最后,根据所得到的特征,提出了一种简单的分类算法.实验结果显示,和常见的局部二元模式(LBP)特征和方向梯度直方图(HOG)特征相比较,该特征提取算法的错误接受率(FAR)和错误拒绝率(FRR)表现更佳.说明整个说话人唇动特征识别算法是有效的,能够得到较为理想的结果.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号