首页> 外国专利> SOUND SEGMENT CLASSIFICATION DEVICE, SOUND SEGMENT CLASSIFICATION METHOD, AND SOUND SEGMENT CLASSIFICATION PROGRAM

SOUND SEGMENT CLASSIFICATION DEVICE, SOUND SEGMENT CLASSIFICATION METHOD, AND SOUND SEGMENT CLASSIFICATION PROGRAM

机译:声音段分类装置,声音段分类方法和声音段分类程序

摘要

A sound segment classification device that appropriately classifies sound segments of an observation signal by sound source, when the volume from a sound source fluctuates, when the number of sound sources is unknown, and even when a mixture of microphones of different types is used. The sound segment classification device (100) comprises: a vector calculation means (101) that calculates, from a time series of the power spectrum for sound signals collected by a plurality of microphones, a multidimensional vector series which is a vector series of the power spectrum having the same number of dimensions as there are microphones; a difference calculation means (104) that calculates, for each point in time in the multidimensional vector series that is divided into lengths of any time period, the difference vector between a point in time and the immediately preceding point in time; a sound source direction estimation means (105) that estimates as the sound source direction the main component of the difference vector found in a state where both non-orthogonality and exceeding spatial dimensions are permitted; and a sound segment determination means (106) that determines whether a sound source direction is a sound segment or a silence segment, for each sound source direction found using the sound source direction estimation means, using a prescribed sound characteristics index indicating the sound segment characteristics of sound signals input for each point in time.
机译:声音片段分类装置,在来自声音源的音量变动,未知的声音源数量,甚至混合使用不同种类的麦克风的情况下,也可以通过声音源对观察信号的声音片段进行适当的分类。声音片段分类装置(100)包括:矢量计算装置(101),该矢量计算装置(101)根据多个麦克风收集的声音信号的功率谱的时间序列,计算作为功率的矢量序列的多维矢量序列。具有与麦克风相同尺寸的频谱;差计算装置(104),对于被划分为任意时间段的长度的多维矢量序列中的每个时间点,计算该时间点与紧接在前的时间点之间的差向量;声源方向估计装置(105),估计在允许非正交且超过空间尺寸的状态下找到的差矢量的主要成分作为声源方向;音段确定装置(106),使用指示所述音段特性的规定的声音特性指标,针对使用所述音源方向估计装置找到的每个音源方向,确定声源方向是声音段还是静音段。每个时间点输入的声音信号的数量。

著录项

  • 公开/公告号WO2012105385A1

    专利类型

  • 公开/公告日2012-08-09

    原文格式PDF

  • 申请/专利权人 NEC CORPORATION;ONISHI YOSHIFUMI;

    申请/专利号WO2012JP51553

  • 发明设计人 ONISHI YOSHIFUMI;

    申请日2012-01-25

  • 分类号G10L21/02;G10L15/04;G10L15/28;

  • 国家 WO

  • 入库时间 2022-08-21 17:14:25

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号