IEEE International Conference on Real-time Computing and Robotics

Perception Framework through Real-Time Semantic Segmentation and Scene Recognition on a Wearable System for the Visually Impaired

Abstract

Since scene information, including objectness and scene type, is important for people with visual impairment, in this work we present a multi-task, efficient perception system for scene parsing and recognition. Built on a compact ResNet backbone, the network architecture has two paths with shared parameters. The semantic segmentation path integrates fast attention, with the aim of harvesting long-range contextual information in an efficient manner. Simultaneously, the scene recognition path infers the scene type by passing the semantic features into semantic-driven attention networks and combining the semantically extracted representations with the RGB-extracted representations through a gated attention module. In the experiments, we verified the system's accuracy and efficiency on both public datasets and real-world scenes. The system runs on a wearable belt with an Intel RealSense LiDAR camera and an NVIDIA Jetson AGX Xavier processor, so it can accompany visually impaired people and provide assistive scene information during their navigation tasks.
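The dual-path design summarized in the abstract can be illustrated with a minimal PyTorch-style sketch. The module names (FastAttention, DualPathPerception), the ResNet-18 stand-in for the compact backbone, the class counts, and the exact gating layout are illustrative assumptions; the abstract does not specify the paper's layer configuration.

```python
# Minimal sketch of a shared-backbone, dual-path network: a segmentation path with
# an efficient ("fast") attention block and a scene-recognition path that gates RGB
# features with the semantic prediction. All sizes and names are assumptions.
import torch
import torch.nn as nn
import torchvision.models as models


class FastAttention(nn.Module):
    """Lightweight attention over flattened spatial features (assumed form)."""
    def __init__(self, channels):
        super().__init__()
        self.query = nn.Conv2d(channels, channels // 8, 1)
        self.key = nn.Conv2d(channels, channels // 8, 1)
        self.value = nn.Conv2d(channels, channels, 1)

    def forward(self, x):
        b, c, h, w = x.shape
        q = self.query(x).flatten(2)                      # (B, C/8, HW)
        k = self.key(x).flatten(2)                        # (B, C/8, HW)
        v = self.value(x).flatten(2)                      # (B, C, HW)
        # Aggregate values with keys first to avoid forming an HW x HW attention map.
        context = torch.bmm(v, k.transpose(1, 2).softmax(dim=1))  # (B, C, C/8)
        out = torch.bmm(context, q.softmax(dim=1))        # (B, C, HW)
        return x + out.view(b, c, h, w)


class DualPathPerception(nn.Module):
    """Shared compact ResNet backbone feeding a segmentation and a scene path."""
    def __init__(self, num_classes_seg=21, num_classes_scene=10):
        super().__init__()
        backbone = models.resnet18(weights=None)          # compact ResNet stand-in
        self.stem = nn.Sequential(*list(backbone.children())[:-2])  # (B, 512, H/32, W/32)
        # Semantic segmentation path with fast attention.
        self.fast_attn = FastAttention(512)
        self.seg_head = nn.Conv2d(512, num_classes_seg, 1)
        # Scene recognition path: gated fusion of semantic and RGB representations.
        self.gate = nn.Sequential(nn.Conv2d(512 + num_classes_seg, 512, 1), nn.Sigmoid())
        self.scene_head = nn.Linear(512, num_classes_scene)

    def forward(self, x):
        feat = self.stem(x)                               # shared RGB features
        seg_feat = self.fast_attn(feat)
        seg_logits = self.seg_head(seg_feat)              # dense semantic prediction
        # Semantic-driven gating of RGB features by the semantic output.
        gate = self.gate(torch.cat([feat, seg_logits], dim=1))
        fused = (feat * gate).mean(dim=(2, 3))            # global average pooling
        scene_logits = self.scene_head(fused)
        return seg_logits, scene_logits


if __name__ == "__main__":
    model = DualPathPerception()
    seg, scene = model(torch.randn(1, 3, 224, 224))
    print(seg.shape, scene.shape)   # torch.Size([1, 21, 7, 7]) torch.Size([1, 10])
```

The sketch only shows how shared parameters, fast attention, and gated semantic-RGB fusion could fit together; low-resolution segmentation logits would still need upsampling, and real-time deployment on a Jetson AGX Xavier would typically involve TensorRT or mixed-precision optimization not shown here.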
