Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts

Abstract

This paper describes an Arabic dialect identification system which we developed for the Discriminating Similar Languages (DSL) 2016 shared task. We classified Arabic dialects using the Waikato Environment for Knowledge Analysis (WEKA) data analytics tool, which contains many alternative filters and classifiers for machine learning. We experimented with several classifiers, and the best accuracy was achieved using the Sequential Minimal Optimization (SMO) algorithm for training and testing, using three different feature sets, one for each testing process. Our approach achieved an accuracy of 42.85%, which is considerably worse than the 80-90% obtained when evaluating on the training set, and also below the roughly 50% obtained with a 60:40 percentage split of the training set. We observed that the Buckwalter transcripts from the Saarland Automatic Speech Recognition (ASR) system are given without short vowels, although the Buckwalter system has notation for these. We elaborate on these observations, describe our methods and analyse the training dataset.
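
The observation about missing short vowels can be checked mechanically. The following is a minimal sketch, not the authors' code: it assumes the standard Buckwalter convention in which the lowercase letters a, u and i denote the short vowels fatha, damma and kasra, and the function names `count_short_vowels` and `is_unvowelled` are hypothetical, introduced only for illustration.

```python
# Buckwalter symbols for the three Arabic short vowels
# (a = fatha, u = damma, i = kasra). Long vowels use other
# symbols (A, w, y), so they are not counted here.
SHORT_VOWELS = frozenset("aui")


def count_short_vowels(transcript: str) -> int:
    """Return how many short-vowel symbols occur in a Buckwalter string."""
    return sum(1 for ch in transcript if ch in SHORT_VOWELS)


def is_unvowelled(transcript: str) -> bool:
    """True if the transcript contains no short-vowel symbols at all."""
    return count_short_vowels(transcript) == 0


if __name__ == "__main__":
    # "kataba" is a fully vowelled form; "ktb" is the same word
    # written without short vowels, as in the ASR transcripts described.
    for text in ("kataba", "ktb"):
        print(text, count_short_vowels(text), is_unvowelled(text))
```

Running a check like this over a whole transcript file would show whether short-vowel symbols are absent throughout or only in some utterances.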

Bibliographic record

Authors
Alshutayri A; Atwell ES; Alosaimy A; Dickins J; Ingleby M; Watson J
Year 2016
Original format PDF
Text language en

Similar documents

1. Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect [J]. Mohamed Amine Menacer, Odile Mella, Dominique Fohr. Procedia Computer Science. 2017, no. 1
2. Arabic Sign Language Recognition and Generating Arabic Speech Using Convolutional Neural Network [J]. M. M. Kamruzzaman. Wireless Communications & Mobile Computing. 2020, no. 1
3. Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts [C]. Areej Alshutayri, Eric Atwell, AbdulRahman AlOsaimy. Workshop on NLP for Similar Languages, Varieties and Dialects. 2016
4. Arabic language modeling with stem-derived morphemes for automatic speech recognition [D]. Heintz, Ilana. 2010
5. Formant analysis in dysphonic patients and automatic Arabic digit speech recognition [O]. Ghulam Muhammad, Tamer A Mesallam, Khalid H Malki. 2011
6. Development of the Arabic Loria Automatic Speech Recognition system (ALASR) and its evaluation for Algerian dialect [O]. Menacer, Mohamed; Mella, Odile; Fohr, Dominique. 2017
