Singing voice separation with deep U-Net convolutional networks

Abstract

A system, method and computer product for estimating a component of a provided audio signal. The method comprises converting the provided audio signal to an image, processing the image with a neural network trained to estimate one of vocal content and instrumental content, and storing a spectral mask output from the neural network as a result of the image being processed by the neural network. The neural network is a U-Net. The method also comprises providing the spectral mask to a client media playback device, which applies the spectral mask to a spectrogram of the provided audio signal, to provide a masked spectrogram. The media playback device also transforms the masked spectrogram to an audio signal, and plays back that audio signal via an output user interface.
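The playback-side steps described above (apply a spectral mask to a spectrogram of the provided audio signal, then transform the masked spectrogram back to audio) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the helper names `stft`, `istft` and `apply_mask`, the frame length and hop size, the reuse of the mixture phase for inversion, and the hand-built low-pass mask standing in for the U-Net output are all hypothetical choices made for the example. The abstract does not specify any of them.

```python
import numpy as np


def stft(signal, frame_len=1024, hop=256):
    """Short-time Fourier transform; returns an array of shape (freq_bins, frames)."""
    window = np.hanning(frame_len + 1)[:-1]  # periodic Hann window
    n_frames = 1 + (len(signal) - frame_len) // hop
    frames = np.stack(
        [signal[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1).T


def istft(spec, frame_len=1024, hop=256):
    """Inverse STFT by weighted overlap-add, normalised by the summed squared window."""
    window = np.hanning(frame_len + 1)[:-1]
    frames = np.fft.irfft(spec.T, n=frame_len, axis=1)
    n_frames = frames.shape[0]
    out = np.zeros(frame_len + hop * (n_frames - 1))
    norm = np.zeros_like(out)
    for i in range(n_frames):
        start = i * hop
        out[start : start + frame_len] += frames[i] * window
        norm[start : start + frame_len] += window ** 2
    # The first and last samples have near-zero window weight and are not recovered exactly.
    return out / np.maximum(norm, 1e-8)


def apply_mask(audio, mask, frame_len=1024, hop=256):
    """Mask the magnitude spectrogram and invert it.

    Assumption: the phase of the input mixture is reused for the inversion.
    """
    spec = stft(audio, frame_len, hop)
    magnitude, phase = np.abs(spec), np.angle(spec)
    masked_magnitude = mask * magnitude
    return istft(masked_magnitude * np.exp(1j * phase), frame_len, hop)


# Toy mixture: a 220 Hz tone plus a 3000 Hz tone, sampled at 16 kHz.
sample_rate = 16000
t = np.arange(sample_rate) / sample_rate
low_tone = np.sin(2 * np.pi * 220 * t)
high_tone = 0.5 * np.sin(2 * np.pi * 3000 * t)
mixture = low_tone + high_tone

# Placeholder mask standing in for the neural network output (hypothetical):
# keep every time-frequency bin below 1 kHz and zero out the rest.
spec_shape = stft(mixture).shape
bin_freqs = np.fft.rfftfreq(1024, d=1 / sample_rate)
placeholder_mask = np.zeros(spec_shape)
placeholder_mask[bin_freqs < 1000, :] = 1.0

estimate = apply_mask(mixture, placeholder_mask)
```

In a real system the mask would come from the trained network rather than a fixed frequency cut-off, and it would typically hold soft values between 0 and 1 for each time-frequency bin. Here `estimate` approximates `low_tone` away from the signal edges.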