
Real-time Assistive Reader Pen for Arabic Language

Abstract

Disability is an impairment that affects an individual's livelihood and independence. Assistive technology enables people with disabilities to break down barriers to learning, access information, contribute to the community, and live independently. This article proposes an assistive device that enables people with visual and learning disabilities to access printed Arabic material in real time, and helps them participate in the education system and the professional workforce. The proposed device combines Optical Character Recognition (OCR) with Text-To-Speech (TTS) conversion. OCR is achieved through image processing, character extraction, and classification, while Arabic speech synthesis is achieved through concatenation synthesis followed by Multi-Band Resynthesis Overlap-Add (MBROLA). Waveform generation in the second phase produces the vocal output that the user hears. OCR character and word accuracy tests were conducted for nine Arabic fonts. The results show that six fonts were recognized with over 60% character accuracy and two fonts were recognized with over 88% accuracy. A Mean Opinion Score (MOS) test for speech quality yielded an overall MOS of 3.53/5 and indicated that users were able to understand the speech. A real-time usability test was conducted with 10 subjects. It yielded an overall average agreement score of 3.9/5 and indicated that the proposed Arabic reader pen meets the real-time constraints, is pleasant and satisfying to use, and can help make printed Arabic material accessible to visually impaired persons and people with learning disabilities.
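The abstract reports character accuracy and MOS figures without stating how they were computed. As a rough illustration only, the sketch below shows one common way such metrics are calculated: character accuracy as one minus the Levenshtein edit distance divided by the reference length, and MOS as the arithmetic mean of listener ratings on a 1 to 5 scale. The function names and the choice of edit distance are assumptions made for this example and are not taken from the article.

```python
from typing import Sequence


def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, hypothesis: str) -> float:
    """Assumed metric: 1 - (edit distance / reference length), floored at 0."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return max(0.0, 1.0 - edit_distance(reference, hypothesis) / len(reference))


def mean_opinion_score(ratings: Sequence[int]) -> float:
    """Arithmetic mean of listener ratings, each expected in the range 1..5."""
    if not ratings:
        raise ValueError("at least one rating is required")
    if any(r < 1 or r > 5 for r in ratings):
        raise ValueError("ratings must lie between 1 and 5")
    return sum(ratings) / len(ratings)


if __name__ == "__main__":
    # Hypothetical OCR output that drops one letter of a four-letter word.
    print(character_accuracy("كتاب", "كتب"))
    # Hypothetical listener ratings, not data from the article.
    print(mean_opinion_score([4, 3, 4, 3]))
```

Python strings are sequences of Unicode code points, so each Arabic letter counts as one character here. Text carrying diacritics would need a normalization step before comparison, which this sketch omits.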