首页> 外文期刊>Multimedia, IEEE Transactions on >Detection and Classification of Acoustic Scenes and Events
【24h】

Detection and Classification of Acoustic Scenes and Events

机译:声音场景和事件的检测和分类

获取原文
获取原文并翻译 | 示例
           

摘要

For intelligent systems to make best use of the audio modality, it is important that they can recognize not just speech and music, which have been researched as specific tasks, but also general sounds in everyday environments. To stimulate research in this field we conducted a public research challenge: the IEEE Audio and Acoustic Signal Processing Technical Committee challenge on Detection and Classification of Acoustic Scenes and Events (DCASE). In this paper, we report on the state of the art in automatically classifying audio scenes, and automatically detecting and classifying audio events. We survey prior work as well as the state of the art represented by the submissions to the challenge from various research groups. We also provide detail on the organization of the challenge, so that our experience as challenge hosts may be useful to those organizing challenges in similar domains. We created new audio datasets and baseline systems for the challenge; these, as well as some submitted systems, are publicly available under open licenses, to serve as benchmarks for further research in general-purpose machine listening.
机译:为了使智能系统充分利用音频模式,重要的是它们不仅可以识别语音和音乐(已被研究为特定任务),而且还可以识别日常环境中的一般声音。为了刺激这一领域的研究,我们进行了一项公共研究挑战:IEEE音频和声信号处理技术委员会对声学场景和事件的检测和分类(DCASE)的挑战。在本文中,我们报告了自动分类音频场景,自动检测和分类音频事件的最新技术。我们调查了以前的工作以及各个研究小组对挑战提出的意见所代表的最新技术水平。我们还提供了有关挑战组织的详细信息,因此我们作为挑战主持人的经验可能对组织相似领域挑战的人有用。我们为挑战创建了新的音频数据集和基准系统;这些以及一些提交的系统都可以公开许可的形式公开获得,以作为进一步研究通用机器侦听的基准。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号