首页>外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing

IEEE International Conference on Acoustics, Speech and Signal Processing

召开年：2020
召开地：Barcelona(ES)
出版时间：-

会议文集：-

会议论文

热门论文

全部论文

全选（0）

1.Deep Joint Source-Channel Coding for Wireless Image Retrieval

机译：用于无线图像检索的深度联合源通道编码
- 作者：Mikolaj Jankowski;Deniz Gündüz;Krystian Mikolajczyk
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Joint source-channel coding;
- retrieval;
- person re-identification;
- IoT;
- deep learning;
2.Projected Weight Regularization to Improve Neural Network Generalization

机译：预测权重正则化以改善神经网络的泛化
- 作者：Guoqiang Zhang;Kenta Niwa;W. Bastiaan Kleijn
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- DNN;
- projected weight regularization;
- CWN;
3.Multimodal Active Speaker Detection and Virtual Cinematography for Video Conferencing

机译：用于视频会议的多模式主动扬声器检测和虚拟摄影
- 作者：Ross Cutler;Ramin Mehran;Sam Johnson;Cha Zhang;Adam Kirk;Oliver Whyte;Adarsh Kowdle
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
4.Riemannian Framework for Robust Covariance Matrix Estimation in Spiked Models

机译：尖峰模型中鲁棒协方差矩阵估计的黎曼框架
- 作者：Florent Bouchard;Arnaud Breloy;Guillaume Ginolhac;Frédéric Pascal
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Covariance Matrices;
- Spiked Models;
- Robust Estimation;
- Riemannian Optimization;
5.Meta Learning for End-To-End Low-Resource Speech Recognition

机译：元学习用于端到端的低资源语音识别
- 作者：Jui-Yang Hsu;Yuan-Jui Chen;Hung-yi Lee
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- meta-learning;
- low-resource;
- speech recognition;
- language adaptation;
- IARPA-BABEL;
6.Towards Fast and Accurate Streaming End-To-End ASR

机译：迈向快速准确的流式端到端ASR
- 作者：Bo Li;Shuo-yiin Chang;Tara N. Sainath;Ruoming Pang;Yanzhang He;Trevor Strohman;Yonghui Wu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Training;
- Recurrent neural networks;
- Decoding;
- Error analysis;
- Transducers;
- Acoustics;
- Speech recognition;
7.Aipnet: Generative Adversarial Pre-Training of Accent-Invariant Networks for End-To-End Speech Recognition

机译：Aipnet：用于端到端语音识别的重音不变网络的生成对抗性预训练
- 作者：Yi-Chen Chen;Zhaojun Yang;Ching-Feng Yeh;Mahaveer Jain;Michael L. Seltzer
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Generative adversarial network;
- end-to-end speech recognition;
- accent-invariance;
8.Improving the Performance of Transformer Based Low Resource Speech Recognition for Indian Languages

机译：提高基于变压器的印度语言低资源语音识别性能
- 作者：Vishwas M. Shetty;Metilda Sagaya Mary N J;S. Umesh
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Transformer;
- Automatic Speech Recognition;
- Multilingual;
- Low Resource;
9.An Improved Solution to the Frequency-Invariant Beamforming with Concentric Circular Microphone Arrays

机译：同心圆麦克风阵列的频率不变波束形成的改进解决方案
- 作者：Xudong Zhao;Gongping Huang;Jingdong Chen;Jacob Benesty
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Microphone arrays;
- concentric circular arrays;
- frequency-invariant beamforming;
- spatial aliasing;
10.Subject Transfer Framework Based on Source Selection and Semi-Supervised Style Transfer Mapping for Semg Pattern Recognition

机译：基于来源选择和半监督风格转移映射的主题转移框架用于模式识别
- 作者：Suguru Kanoga;Takayuki Hoshino;Hideki Asoh
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Transfer learning;
- style transfer mapping;
- surface electromyogram (sEMG);
- covariate shift adaptation.;
11.Sequential Deep Unrolling With Flow Priors For Robust Video Deraining

机译：具有流量先验的顺序深度展开功能可实现强大的视频排空
- 作者：Xinwei Xue;Ying Ding;Pan Mu;Long Ma;Risheng Liu;Xin Fan
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Video Deraining;
- Deep Unrolling;
- Optical Flow;
- Video Restoration;
12.Srzoo: An Integrated Repository For Super-Resolution Using Deep Learning

机译：Srzoo：使用深度学习实现超分辨率的集成存储库
- 作者：Jun-Ho Choi;Jun-Hyuk Kim;Jong-Seok Lee
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Super-resolution;
- deep learning;
- image enhancement;
13.Weakly Supervised Crowd-Wise Attention For Robust Crowd Counting

机译：缺乏监督的明智人群关注稳健的人群计数
- 作者：Xiyu Kong;Muming Zhao;Hao Zhou;Chongyang Zhang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- crowd counting;
- crowd segmentation;
- spatial attention;
14.Sight to Sound: An End-to-End Approach for Visual Piano Transcription

机译：声音的视觉：视觉钢琴转录的端到端方法
- 作者：A. Sophia Koepke;Olivia Wiles;Yael Moses;Andrew Zisserman
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- visual music transcription;
- automatic music transcription;
- music information retrieval;
- deep learning;
15.Meta-Learning to Communicate: Fast End-to-End Training for Fading Channels

机译：交流的元学习：衰落频道的快速端到端培训
- 作者：Sangwoo Park;Osvaldo Simeone;Joonhyuk Kang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Machine learning;
- autoencoder;
- fading channels;
16.Learning Domain Invariant Representations for Child-Adult Classification from Speech

机译：从语音中学习儿童成人分类的领域不变表示
- 作者：Rimita Lahiri;Manoj Kumar;Somer Bishop;Shrikanth Narayanan
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Child speech;
- domain adversarial learning;
- gradient reversal;
- autism spectrum disorder;
17.Ts-Fen: Probing Feature Selection Strategy for Face Anti-Spoofing

机译：Ts-Fen：探索面部反欺骗的特征选择策略
- 作者：Dongmei Peng;Jing Xiao;Rong Zhu;Ge Gao
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- face anti-spoofing;
- feature enhancement;
- feature selection;
- category discrepancy;
18.CIF: Continuous Integrate-And-Fire for End-To-End Speech Recognition

机译：CIF：连续集成和发射，用于端到端语音识别
- 作者：Linhao Dong;Bo Xu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Decoding;
- Hidden Markov models;
- Acoustics;
- Training;
- Computational modeling;
- Predictive models;
- Prediction algorithms;
19.Re-Translation Strategies for Long Form, Simultaneous, Spoken Language Translation

机译：长格式，同时，口语翻译的重新翻译策略
- 作者：Naveen Arivazhagan;Colin Cherry;I Te;Wolfgang Macherey;Pallavi Baljekar;George Foster
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech Recognition Neural Machine Translation;
20.SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation

机译：SkinAugment：自动编码的说话人转换，用于自动语音翻译
- 作者：Arya D. McCarthy;Liezl Puzon;Juan Pino
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- automatic speech translation;
- end-to-end speech translation;
- data augmentation;
- speaker normalization;
21.End-to-End Speech Translation with Self-Contained Vocabulary Manipulation

机译：端到端语音翻译和独立词汇处理
- 作者：Mei Tu;Fan Zhang;Wei Liu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- end-to-end;
- speech-to-text;
- speed;
22.Time Domain Velocity Vector for Retracing the Multipath Propagation

机译：用于追踪多径传播的时域速度矢量
- 作者：Jérôme Daniel;Srđan Kitić
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Ambisonics;
- intensity;
- localization;
- DoA;
- distance;
23.Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset

机译：LRS2数据集的重叠语音的视听识别
- 作者：Jianwei Yu;Shi-Xiong Zhang;Jian Wu;Shahram Ghorbani;Bo Wu;Shiyin Kang;Shansong Liu;Xunying Liu;Helen Meng;Dong Yu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- audio-visual speech recognition;
- overlapped speech;
- speech separation;
- multi-modal;
24.SED-MDD: Towards Sentence Dependent End-To-End Mispronunciation Detection and Diagnosis

机译：SED-MDD：依赖于句子的端到端错误诊断和诊断
- 作者：Yiqing Feng;Guanyu Fu;Qingcai Chen;Kai Chen
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
25.Zero-Crossing Precoding with Maximum Distance to the Decision Threshold for Channels with 1-Bit Quantization and Oversampling

机译：具有1位量化和过采样的通道的零交叉预编码与决策阈值的最大距离
- 作者：Diana M. V. Melo;Lukas T. N. Landau;Rodrigo C. de Lamare
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- zero-crossing precoding;
- 1-bit quantization;
- MIMO systems;
- faster-than-Nyquist signaling;
- oversampling;
26.Separable Optimization for Joint Blind Deconvolution and Demixing

机译：联合盲解卷积和解混的可分离优化
- 作者：Dana Weitzner;Raja Giryes
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- blind deconvolution;
- demixing;
- low-rank;
27.Deep Clustering for Domain Adaptation

机译：深度集群以适应领域
- 作者：Boyan Gao;Yongxin Yang;Henry Gouk;Timothy M. Hospedales
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Domain Adaptation;
- Deep Clustering;
- Unsupervised Learning;
- Semi-Supervised Learning;
28.Misspecified Cramer-Rao Bound For Delay Estimation with a Mismatched Waveform: A Case Study

机译：错误指定的Cramer-Rao界限用于不匹配波形的延迟估计：一个案例研究
- 作者：Florian Roemer
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Cramér-Rao Bound;
- Delay Estimation;
- Ultrasound;
- Nondestructive Testing;
29.A New Variational Method for Deep Supervised Semantic Image Hashing

机译：深度监督语义图像散列的新变分方法
- 作者：Furen Zhuang;Pierre Moulin
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- supervised;
- hashing;
- image;
- retrieval;
30.DOA Estimation in Systems with Nonlinearities for MMWAVE Communications

机译：MMWAVE通信的非线性系统中的DOA估计
- 作者：Aditya Sant;Bhaskar D. Rao
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- mmWave Communication;
- Direction of Arrival;
- Channel Estimation;
- One-bit Quantization;
- Nonlinear Systems;
31.Multi-Task Self-Supervised Learning for Robust Speech Recognition

机译：多任务自我监督学习，实现强大的语音识别
- 作者：Mirco Ravanelli;Jianyuan Zhong;Santiago Pascual;Pawel Swietojanski;Joao Monteiro;Jan Trmal;Yoshua Bengio
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- self-supervised learning;
- speech recognition;
32.Improving Language Identification for Multilingual Speakers

机译：改进多语言发言人的语言识别
- 作者：Andrew Titus;Jan Silovsky;Nanxin Chen;Roger Hsiao;Mary Young;Arnab Ghoshal
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Language identification;
- multilingual;
33.Generative Pre-Training for Speech with Autoregressive Predictive Coding

机译：自回归预测编码的语音生成预训练
- 作者：Yu-An Chung;James Glass
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- representation learning;
- self-supervised learning;
- pre-training;
- transfer learning;
- autoregressive modeling;
34.A-CRNN: A Domain Adaptation Model for Sound Event Detection

机译：A-CRNN：用于声音事件检测的域自适应模型
- 作者：Wei Wei;Hongning Zhu;Emmanouil Benetos;Ye Wang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- sound event detection;
- domain adaptation;
- computational sound scene analysis;
- CRNNs;
35.Xpsnr: A Low-Complexity Extension of The Perceptually Weighted Peak Signal-To-Noise Ratio For High-Resolution Video Quality Assessment

机译：Xpsnr：感知加权峰值信噪比的低复杂度扩展，用于高分辨率视频质量评估
- 作者：Christian R. Helmrich;Mischa Siekmann;Sören Becker;Sebastian Bosse;Detlev Marpe;Thomas Wiegand
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- PSNR;
- SSIM;
- UHD;
- video coding;
- VQA;
36.Adaptive Distributed Stochastic Gradient Descent for Minimizing Delay in the Presence of Stragglers

机译：自适应分布随机梯度下降算法，可最大程度地减少拖曳者在场时的延迟
- 作者：Serge Kas Hanna;Rawad Bitar;Parimal Parag;Venkat Dasari;Salim El Rouayheb
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Distributed SGD;
- adaptive policy;
- stragglers.;
37.A Recurrent Variational Autoencoder for Speech Enhancement

机译：用于语音增强的递归变分自动编码器
- 作者：Simon Leglaive;Xavier Alameda-Pineda;Laurent Girin;Radu Horaud
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
38.Joint Semi-Supervised Feature Auto-Weighting and Classification Model for EEG-Based Cross-Subject Sleep Quality Evaluation

机译：基于EEG的跨学科睡眠质量评估的联合半监督特征自动加权和分类模型
- 作者：Yong Peng;Qingxi Li;Wanzeng Kong;Jianhai Zhang;Bao-Liang Lu;Andrzej Cichocki
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Sleep quality evaluation;
- EEG;
- Feature auto-weighting;
- Semi-supervised learning;
- Classification;
39.Domain Adaptation for Generalization of Face Presentation Attack Detection in Mobile Settengs with Minimal Information

机译：具有最小信息的移动环境中人脸呈现攻击检测的通用化域自适应
- 作者：Amir Mohammadi;Sushil Bhattacharjee;Sébastien Marcel
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- presentation attack detection;
- domain adaptation;
- domain generalization;
- pruning;
- feature selection;
40.Global and Local Discriminative Patches Exploiting for Action Recognition

机译：全局和局部区分补丁，用于行动识别
- 作者：Jintao Wu;Wu Luo;Weiwei Liu;Chongyang Zhang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Discriminative Patches;
- Class Activation Map;
- 2D and 3D ConvNets;
- Feature Fusion;
41.Multi-Scale Deep Feature Fusion for Vehicle Re-Identification

机译：用于汽车重新识别的多尺度深度特征融合
- 作者：Yiting Cheng;Chuanfa Zhang;Kangzheng Gu;Lizhe Qi;Zhongxue Gan;Wenqiang Zhang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Vehicle Re-Identification;
- Multi-Scale;
- Multi-Level;
- Deep Neural Network;
42.Compare Learning: Bi-Attention Network for Few-Shot Learning

机译：比较学习：少注意力学习的双注意力网络
- 作者：Li Ke;Meng Pan;Weigao Wen;Dong Li
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Few-shot;
- Bi-attention;
- Compare learning;
- Metric learning;
43.Complex Transformer: A Framework for Modeling Complex-Valued Sequence

机译：复杂变压器：建模复杂值序列的框架
- 作者：Muqiao Yang;Martin Q. Ma;Dongyu Li;Yao-Hung Hubert Tsai;Ruslan Salakhutdinov
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Deep learning;
- transformer network;
- sequence modeling;
- complex-valued deep neural network;
44.SPIDERnet: Attention Network For One-Shot Anomaly Detection In Sounds

机译：SPIDERnet：声音一发异常检测的注意力网络
- 作者：Yuma Koizumi;Masahiro Yasuda;Shin Murata;Shoichiro Saito;Hisashi Uematsu;Noboru Harada
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Anomaly detection in sounds;
- acoustic condition monitoring;
- one-shot learning;
- and multi-head attention;
45.Audio-Based Auto-Tagging With Contextual Tags for Music

机译：基于音乐的带有上下文标签的基于音频的自动标签
- 作者：Karim M. Ibrahim;Jimena Royo-Letelier;Elena V. Epure;Geoffroy Peeters;Gaël Richard
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- music auto-tagging;
- user context;
- dataset collection;
- multi-label classification;
- missing labels;
46.Multi-Modal Self-Supervised Pre-Training for Joint Optic Disc and Cup Segmentation in Eye Fundus Images

机译：眼底图像中联合视盘和杯分割的多模态自我监督预训练
- 作者：Álvaro S. Hervella;Lucía Ramos;José Rouco;Jorge Novo;Marcos Ortega
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Deep learning;
- self-supervised learning;
- segmentation;
- eye fundus;
- glaucoma;
47.Space Filling Curves for MRI Sampling

机译：MRI采样的空间填充曲线
- 作者：Shubham Sharma;K.V.S. Hari;Geert Leus
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- MRI;
- k-space trajectories;
- space filling curves;
48.Polarizing Front Ends for Robust Cnns

机译：极化前端，可实现可靠的CNN
- 作者：Can Bakiskan;Soorya Gopalakrishnan;Metehan Cekic;Upamanyu Madhow;Ramtin Pedarsani
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- adversarial machine learning;
- quantization;
- front-end defense;
49.Crowdsourcing-Based Ranking Aggregation for Person Re-Identification

机译：基于众包的人员重新识别排名汇总
- 作者：Yinxue Yu;Chao Liang;Weijian Ruan;Longxiang Jiang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Person Re-Identification;
- Crowdsourcing;
- Aggregation;
50.Arnet: Attention-Based Refinement Network for Few-Shot Semantic Segmentation

机译：Arnet：基于注意力的细化语义分割网络
- 作者：Rusheng Li;Hanhui Liu;Yuesheng Zhu;Zhiqiang Bai
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Semantic Segmentation;
- Few-shot Learning;
- Attention Mechanism;
- Deep learning;
51.A Low-Latency Successive Cancellation Hybrid Decoder for Convolutional Polar Codes

机译：用于卷积极化码的低延迟连续消除混合解码器
- 作者：Yu Wang;Shikai Qiu;Lirui Chen;Qinglin Wang;Yang Zhang;Cang Liu;Zuocheng Xing
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- List decoding;
- bit-flipping;
- convolutional polar codes;
- path metric;
- low latency;
52.Similarity Learning For Cover Song Identification Using Cross-Similarity Matrices of Multi-Level Deep Sequences

机译：多层深度序列的交叉相似度矩阵用于翻唱歌曲识别的相似度学习
- 作者：Chaoya Jiang;Deshun Yang;Xiaoou Chen
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- similarity learning;
- cover song identification;
- cross-similarity matrices;
- Siamese network;
- multi-level deep sequences;
53.A Deep Gradient Boosting Network for Optic Disc and Cup Segmentation

机译：用于光盘和杯分割的深度梯度提升网络
- 作者：Qing Liu;Beiji Zou;Yang Zhao;Yixiong Liang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Fundus image;
- OD and OC segmentation;
- gradient boosting;
- deep supervision;
54.Training LSTM for Unsupervised Anomaly Detection Without A Priori Knowledge

机译：在没有先验知识的情况下训练LSTM进行无监督异常检测
- 作者：Yann Cherdo;Paul de Kerret;Renaud Pawlak
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
55.A Model of Double Descent for High-Dimensional Logistic Regression

机译：高维Logistic回归的双下降模型
- 作者：Zeyu Deng;Abla Kammoun;Christos Thrampoulidis
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Generalization error;
- Binary Classification;
- Overparameterization;
- Max-margin;
- Asymptotics;
56.Two-dimensional DOA Estimation for Coprime Planar Array: A Coarray Tensor-based Solution

机译：互质平面阵列的二维DOA估计：基于互阵列张量的解决方案
- 作者：Hang Zheng;Chengwei Zhou;Yujie Gu;Zhiguo Shi
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Coarray tensor;
- coprime planar array;
- DOA estimation;
- structured tensorization;
- underdetermined.;
57.Epigraphical Reformulation for Non-Proximable Mixed Norms

机译：不可逼近混合范式的表位重构
- 作者：Seisuke Kyochi;Shunsuke Ono;Ivan Selesnick
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Erbium;
- Minimization;
- TV;
- Indexes;
- Convex functions;
- Optimization;
- Decorrelation;
58.A Time-Based Sampling Framework for Finite-Rate-of-Innovation Signals

机译：有限速率创新信号的基于时间的采样框架
- 作者：Sunil Rudresh;Abijith Jagannath Kamath;Chandra Sekhar Seelamantula
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Time-encoding machine (TEM);
- finite-rate-of-innovation signals;
- time-based sampling;
- crossing TEM;
- integrate-and-fire TEM;
59.Transformer-Based Online CTC/Attention End-To-End Speech Recognition Architecture

机译：基于变压器的在线CTC /注意力端到端语音识别架构
- 作者：Haoran Miao;Gaofeng Cheng;Changfeng Gao;Pengyuan Zhang;Yonghong Yan
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Transformer;
- end-to-end speech recognition;
- online speech recognition;
- CTC/attention speech recognition;
60.Single Frequency Filter Bank Based Long-Term Average Spectra for Hypernasality Detection and Assessment in Cleft Lip and Palate Speech

机译：基于单频滤波器组的长期平均频谱，用于唇裂和Speech裂语音的鼻音检测和评估
- 作者：Mohammad Hashim Javid;Krishna Gurugubelli;Anil Kumar Vuppala
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Cleft Lip and Palate;
- Hypernasality;
- Long-Term Average Spectra;
- Single Frequency Filtering;
61.End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection

机译：端到端自动语音识别与基于CTC的语音活动检测相集成
- 作者：Takenori Yoshimura;Tomoki Hayashi;Kazuya Takeda;Shinji Watanabe
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech recognition;
- end-to-end;
- voice activity detection;
- streaming;
- CTC greedy search;
62.Exocentric to Egocentric Image Generation Via Parallel Generative Adversarial Network

机译：通过平行生成对抗网络生成以外为中心的图像
- 作者：Gaowen Liu;Hao Tang;Hugo Latapie;Yan Yan
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Egocentric;
- Exocentric;
- Cross-View Image Generation;
- Parallel GANs;
63.Classification of High-Dimensional Motor Imagery Tasks Based on An End-To-End Role Assigned Convolutional Neural Network

机译：基于端到端角色分配卷积神经网络的高维运动图像任务分类
- 作者：Byeong-Hoo Lee;Ji-Hoon Jeong;Kyung-Hwan Shim;Seong-Whan Lee
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Brain-computer interface (BCI);
- Electroencephalogram (EEG);
- Motor imagery;
- Convolutional Neural Network (CNN);
64.Fast and Accurate Embedded DCNN for Rgb-D Based Sign Language Recognition

机译：快速，准确的嵌入式DCNN，用于基于Rgb-D的手语识别
- 作者：Ching-Chen Wang;Ching-Te Chiu;Chao-Tsung Huang;Yu-Chun Ding;Li-Wei Wang
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- RGB-D multi-modality;
- Gesture recognition;
- RGB-D dataset;
- Hardware oriented DCNN;
- CNN accer-leration;
65.Regression Before Classification for Temporal Action Detection

机译：分类进行时间动作检测之前的回归
- 作者：Cece Jin;Tao Zhang;Weijie Kong;Thomas Li;Ge Li
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Video Analysis;
- Temporal Action Detection;
- Action Classification;
- Location Regression;
66.Pixel-Level Self-Paced Learning For Super-Resolution

机译：像素级自定步学习，可实现超分辨率
- 作者：Wei Lin;Junyu Gao;Qi Wang;Xuelong Li
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- super-resolution;
- training strategy;
- self-paced learning;
67.Learning Multi-Scale Attentive Features for Series Photo Selection

机译：学习用于系列照片选择的多尺度注意功能
- 作者：Jin Huang;Chaoran Cui;Chunyun Zhang;Zhen Shen;Jun Yu;Yilong Yin
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Aesthetics assessment;
- series photo selection;
- multi-scale;
- self-attention mechanism;
68.Stargan for Emotional Speech Conversion: Validated by Data Augmentation of End-To-End Emotion Recognition

机译：Stargan用于情感语音转换：通过端到端情感识别的数据增强进行验证
- 作者：Georgios Rizos;Alice Baird;Max Elliott;Björn Schuller
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- adversarial networks;
- data augmentation;
- end-to-end affective computing;
- emotional speech synthesis;
69.Improving Convergent Cross Mapping for Causal Discovery with Gaussian Processes

机译：使用高斯过程改进因果发现的收敛交叉映射
- 作者：Guanchao Feng;Kezi Yu;Yunlong Wang;Yilian Yuan;Petar M. Djurić
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Convergent cross mapping;
- causal discovery;
- Gaussian processes;
- state space reconstruction;
- attractor;
70.Deep Metric Learning Based On Center-Ranked Loss for Gait Recognition

机译：基于中心秩损失的深度度量学习用于步态识别
- 作者：Jingran Su;Yang Zhao;Xuelong Li
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Gait recognition;
- deep metric learning;
- loss function;
71.Joint Phoneme-Grapheme Model for End-To-End Speech Recognition

机译：端到端语音识别的联合音素字素模型
- 作者：Yotaro Kubo;Michiel Bacchiani
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Automatic speech recognition;
- Listen-Attend-Spell;
- multi-task learning;
- iterative refinement;
72.End-To-End Spoken Language Understanding Without Matched Language Speech Model Pretraining Data

机译：没有匹配语言语音模型预训练数据的端到端口语理解
- 作者：Ryan Price
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- spoken language understanding;
- end-to-end;
- data augmentation;
- multilingual;
- pretraining;
73.Multi-Head Attention for Speech Emotion Recognition with Auxiliary Learning of Gender Recognition

机译：语音识别的多头注意力与性别识别辅助学习
- 作者：Anish Nediyanchath;Periyasamy Paramasivam;Promod Yenigalla
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech emotion recognition;
- Multi-Head Attention;
- multi-task learning;
- position embedding;
74.Study of Closed Phase Resonance Bandwidths for Oral and Nasal Tracts Using Zero Time Windowing

机译：零时窗研究口腔和鼻道闭合相共振带宽
- 作者：Haala Deeba Abbas;RaviShankar Prasad;Bhanu Teja Nellore;Suryakanth V Gangashetty
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
75.Stable Training of Dnn for Speech Enhancement Based on Perceptually-Motivated Black-Box Cost Function

机译：基于感知动机的黑盒成本函数的Dnn的语音增强稳定训练
- 作者：Masaki Kawanaka;Yuma Koizumi;Ryoichi Miyazaki;Kohei Yatabe
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech enhancement;
- sound quality assessment;
- perceptual evaluation of speech quality;
- function approximation;
76.A Hierarchical Model for Dialog Act Recognition Considering Acoustic and Lexical Context Information

机译：考虑声音和词汇上下文信息的对话行为识别的层次模型
- 作者：Yuke Si;Longbiao Wang;Jianwu Dang;Mengfei Wu;Aijun Li
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Mandarin dialog act recognition;
- acoustic and lexical context information;
- Hierarchical model.;
77.Multilingual Grapheme-To-Phoneme Conversion with Byte Representation

机译：具有字节表示的多语言音素到音素转换
- 作者：Mingzhi Yu;Hieu Duy Nguyen;Alex Sokolov;Jack Lepird;Kanthashree Mysore Sathyendra;Samridhi Choudhary;Athanasios Mouchtaris;Siegfried Kunzmann
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Grapheme-to-phoneme (G2P);
- multilingual;
- end-to-end models;
- byte representation;
- pronunciation generation;
78.Constrained Spectral Clustering for Dynamic Community Detection

机译：约束谱聚类用于动态社区检测
- 作者：Abdullah Karaaslanli;Selin Aviyente
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Community Detection;
- Dynamic Networks;
- Stochastic Block Model;
- Spectral Clustering;
79.A Hardware Architecture For Reconfigurable Intelligent Surfaces with Minimal Active Elements for Explicit Channel Estimation

机译：具有最少活动元素的可重构智能表面的硬件体系结构，用于显式信道估计
- 作者：George C. Alexandropoulos;Evangelos Vlachos
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Channel estimation;
- hardware architecture;
- matrix completion;
- metasurface;
- intelligent surface;
80.Opportunistic use of GNSS Signals to Characterize the Environment by Means of Machine Learning Based Processing

机译：通过基于机器学习的处理方法对GNSS信号进行机会性使用来表征环境
- 作者：Fabio Dovis;Rayan Imam;Wenjian Qin;Caner Savas;Hans Visser
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Multipath;
- interference;
- scintillation;
- K-means clustering;
- support vector machines;
81.An Optimal Symmetric Threshold Strategy for Remote Estimation Over The Collision Channel

机译：碰撞通道上远程估计的最佳对称阈值策略
- 作者：Xu Zhang;Marcos M. Vasconcelos;Wei Cui;Urbashi Mitra
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Remote estimation;
- threshold strategies;
- collision channel;
- network control systems;
- optimization;
82.Multi-Agent Deep Reinforcement Learning For Distributed Handover Management In Dense MmWave Networks

机译：密集MmWave网络中用于分布式切换管理的多智能体深度强化学习
- 作者：Mohamed Sana;Antonio De Domenico;Emilio Calvanese Strinati;Antonio Clemente
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Handover Management;
- mmWave;
- Multi-Agent Deep Reinforcement Learning;
83.Leveraging Unpaired Text Data for Training End-To-End Speech-to-Intent Systems

机译：利用未配对的文本数据来训练端到端语音到意图系统
- 作者：Yinghui Huang;Hong-Kwang Kuo;Samuel Thomas;Zvi Kons;Kartik Audhkhasi;Brian Kingsbury;Ron Hoory;Michael Picheny
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
84.Towards an Efficient and General Framework of Robust Training for Graph Neural Networks

机译：建立图神经网络的有效而强大的鲁棒训练框架
- 作者：Kaidi Xu;Sijia Liu;Pin-Yu Chen;Mengshu Sun;Caiwen Ding;Bhavya Kailkhura;Xue Lin
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Graph neural networks;
- adversarial training;
- robustness;
- greedy algorithm;
- large-scale learning;
85.Confirmnet: Convolutional Firmnet and Application to Image Denoising and Inpainting

机译：Confirmnet：卷积固件网及其在图像去噪和修复中的应用
- 作者：Praveen Kumar Pokala;Prakash Kumar Uttam;Chandra Sekhar Seelamantula
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Convolutional sparse coding;
- FirmNet;
- convolutional IFTA;
- LISTA;
- deep unfolding;
86.Joint Learning of Cartesian under Sampling Andre Construction for Accelerated MRI

机译：采样安德烈构造下的直角坐标系加速MRI的联合学习
- 作者：Tomer Weiss;Sanketh Vedula;Ortal Senouf;Oleg Michailovich;Michael Zibulevsky;Alex Bronstein
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Magnetic Resonance Imaging (MRI);
- fast image acquisition;
- image reconstruction;
- deep learning;
87.Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models

机译：基于深度语音的情感识别模型的生成和防御对抗攻击
- 作者：Zhao Ren;Alice Baird;Jing Han;Zixing Zhang;Björn Schuller
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech Emotion Recognition;
- Adversarial Attacks;
- Adversarial Training;
- Convolutional Neural Network;
88.Deep Encoded Linguistic and Acoustic Cues for Attention Based End to End Speech Emotion Recognition

机译：基于注意力的端到端语音情感识别的深度编码语言和声学提示
- 作者：Swapnil Bhosale;Rupayan Chakraborty;Sunil Kumar Kopparapu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech emotion recognition;
- End-to-End system;
- multi-head self attention;
- linguistic features;
- acoustic features;
89.A Hierarchical Tracker for Multi-Domain Dialogue State Tracking

机译：用于多域对话状态跟踪的分层跟踪器
- 作者：Jieyu Li;Su Zhu;Kai Yu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Dialogue State Tracking;
- Data Sparsity;
- Hierarchical;
90.Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech

机译：语音语音的声门闭合瞬时检测算法比较
- 作者：Sudarsana Reddy Kadiri;Paavo Alku;B. Yegnanarayana
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech analysis;
- Excitation source;
- Epochs;
- Glottal Closure Instants;
- Emotions;
91.Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR

机译：使用基于注意力的扬声器内存进行端到端ASR的无监督扬声器自适应
- 作者：Leda Sarı;Niko Moritz;Takaaki Hori;Jonathan Le Roux
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Training;
- Decoding;
- Adaptation models;
- Hidden Markov models;
- Neural networks;
- Turing machines;
- Acoustics;
92.A BI-Model Approach for Handling Unknown Slot Values in Dialogue State Tracking

机译：在对话状态跟踪中处理未知插槽值的BI模型方法
- 作者：Yu Wang;Yilin Shen;Hongxia Jin
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
93.Improving Sample-Efficiency in Reinforcement Learning for Dialogue Systems by Using Trainable-Action-Mask

机译：通过使用可训练的动作蒙版提高对话系统的强化学习中的样本效率
- 作者：Yen-Chen Wu;Bo-Hsiang Tseng;Carl Edward Rasmussen
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- model-based reinforcement learning;
- sample-efficiency;
- spoken dialogue systems;
94.Multi-Conditioning and Data Augmentation Using Generative Noise Model for Speech Emotion Recognition in Noisy Conditions

机译：噪声条件下基于生成噪声模型的多条件和数据增强用于语音情感识别
- 作者：Upasana Tiwari;Meet Soni;Rupayan Chakraborty;Ashish Panda;Sunil Kumar Kopparapu
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Speech emotion recognition;
- noise robustness;
- generative noise model;
- multi conditioning;
- deep neural network;
95.Channel Attention Based Generative Network for Robust Visual Tracking

机译：基于通道注意的生成网络进行稳健的视觉跟踪
- 作者：Ying Hu;Hanyu Xuan;Jian Yang;Yan Yan
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Single-target tracking;
- Siamese convolutional neural network;
- Channel attention mechanism;
- Generative network;
96.Cross-VAE: Towards Disentangling Expression from Identity For Human Faces

机译：跨VAE：从人脸识别中脱颖而出
- 作者：Haozhe Wu;Jia Jia;Lingxi Xie;Guojun Qi;Yuanchun Shi;Qi Tian
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Facial expression recognition;
- Disentangle;
- Variational Autoencoder;
97.End-To-End Multi-Talker Overlapping Speech Recognition

机译：端到端多通话者重叠语音识别
- 作者：Anshuman Tripathi;Han Lu;Hasim Sak
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- multi-talker;
- multi-speaker;
- overlapping speech;
- end-to-end;
98.End-To-End Multi-Speaker Speech Recognition With Transformer

机译：变压器端对端多说话者语音识别
- 作者：Xuankai Chang;Wangyou Zhang;Yanmin Qian;Jonathan Le Roux;Shinji Watanabe
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- Transformer;
- end-to-end;
- overlapped speech recognition;
- neural beamforming;
- speech separation;
99.Large-Scale Unsupervised Pre-Training for End-to-End Spoken Language Understanding

机译：大规模的无监督预训练，用于端到端口语理解
- 作者：Pengwei Wang;Liangchen Wei;Yong Cao;Jinghui Xie;Zaiqing Nie
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- End-to-end Spoken Language Understanding;
- Large-Scale Unsupervised Pre-training;
100.Learning Asr-Robust Contextualized Embeddings for Spoken Language Understanding

机译：学习Asr-Robust上下文化嵌入以理解口语
- 作者：Chao-Wei Huang;Yun-Nung Chen
- 会议名称：《IEEE International Conference on Acoustics, Speech and Signal Processing》 | 2020年
- spoken language understanding;
- contextualized embedding;
- ASR robustness;