IEEE International Conference on Acoustics, Speech and Signal Processing

Making Punctuation Restoration Robust and Fast with Multi-Task Learning and Knowledge Distillation



Abstract

In punctuation restoration, we try to recover the missing punctuation from automatic speech recognition output to improve understandability. Currently, large pre-trained transformers such as BERT set the benchmark on this task but there are two main drawbacks to these models. First, the pre-training data does not match the output data from speech recognition that contains errors. Second, the large number of model parameters increases inference time. To address the former, we use a multi-task learning framework with ELECTRA, a recently proposed improvement on BERT, that has a generator-discriminator structure. The generator allows us to inject errors into the training data and, as our experiments show, this improves robustness against speech recognition errors during inference. To address the latter, we investigate knowledge distillation and parameter pruning of ELECTRA. In our experiments on the IWSLT 2012 benchmark data, a model with less than 11% the size of BERT achieved better performance while having an 82% faster inference time.
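The abstract mentions knowledge distillation as one of the two compression techniques applied to ELECTRA. The paper's own code is not shown here; the following is a minimal PyTorch sketch of a standard token-level knowledge-distillation objective of the kind such a setup typically uses. The function name, the temperature of 2.0, and the mixing weight alpha are illustrative assumptions, not values taken from the paper.

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    """Blend a soft-target KL term (teacher -> student) with hard-label cross-entropy.

    student_logits, teacher_logits: (batch, seq_len, num_punct_classes)
    labels: (batch, seq_len) gold punctuation tags per token.
    """
    # Soft targets: match the student's temperature-scaled distribution to the
    # teacher's. The KL term is scaled by T^2 to keep gradient magnitudes stable.
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2

    # Hard targets: ordinary cross-entropy on the gold punctuation tags.
    hard = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)),
        labels.view(-1),
    )
    return alpha * soft + (1.0 - alpha) * hard
```

In the paper this kind of distillation is combined with parameter pruning of ELECTRA; the sketch above only illustrates the loss term that transfers the teacher's soft predictions to a smaller student tagger.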
