首页> 外国专利> Audiovisual communication - method and - a device with integrated, perception dependent speech - and video coding

Audiovisual communication - method and - a device with integrated, perception dependent speech - and video coding

机译:视听通信-方法和-具有集成的,与感知相关的语音的设备-和视频编码

摘要

Disclosed is a low bit rate audio and video communication system which employs an integrated encoding system that dynamically allocates available bits among the audio and video signals to be encoded based on the content of the audio and video information and the manner in which the audio and video information will be perceived by a viewer. A dynamic bit allocation and encoding process will evaluate the current content of the audio and video information and allocate the available bits among the audio and video signals to be encoded. In addition, an appropriate audio encoding technique is dynamically selected based on the current content of the audio signal. A face location detection subroutine will detect and model the location of faces in each video frame, in order that the facial regions may be more accurately encoded than other portions of the video frame. A lip motion detection subroutine will detect the location and movement of the lips of a person present in a video scene, in order to determine when a person is speaking and to encode the lip regions more accurately. The audio and video signals generated by a second party to a communication are monitored to determine if the second party is paying attention to the audio and video information transmitted by the first party to the communication.
机译:公开了一种低比特率的音频和视频通信系统,该系统采用集成的编码系统,该系统根据音频和视频信息的内容以及音频和视频的方式在要编码的音频和视频信号之间动态分配可用位。信息将被观看者感知。动态位分配和编码过程将评估音频和视频信息的当前内容,并在要编码的音频和视频信号之间分配可用位。另外,基于音频信号的当前内容来动态地选择适当的音频编码技术。面部位置检测子例程将对每个视频帧中的面部位置进行检测和建模,以便可以比视频帧的其他部分更准确地编码面部区域。嘴唇运动检测子例程将检测视频场景中出现的人的嘴唇的位置和运动,以便确定某人正在讲话并更准确地对嘴唇区域进行编码。监视由第二方通信产生的音频和视频信号,以确定第二方是否正在关注由第一方向通信发送的音频和视频信息。

著录项

  • 公开/公告号DE69523503T2

    专利类型

  • 公开/公告日2002-07-11

    原文格式PDF

  • 申请/专利权人 AT & T CORP. NEW YORK;

    申请/专利号DE1995623503T

  • 发明设计人 ZHOU YONG;

    申请日1995-03-29

  • 分类号H04N7/26;H04N7/52;G06K9/00;G10L17/00;H04N7/15;

  • 国家 DE

  • 入库时间 2022-08-22 00:25:32

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号