首页> 外文会议>International Technology, Education and Development Conference >(864) A TUTORIAL ON OBJECT RECOGNITION BY MACHINE LEARNING TECHNIQUES USING PYTHON
【24h】

(864) A TUTORIAL ON OBJECT RECOGNITION BY MACHINE LEARNING TECHNIQUES USING PYTHON

机译:(864)使用Python的机器学习技术对物体识别的教程

获取原文

摘要

This paper presents our experiences towards a tutorial on a hot Computer Vision problem, object recognition employing Machine Learning techniques, which we consider that can be a resource didactically interesting for both practitioners’ and lecturers’ communities. The recognition of objects is an innate ability of human beings. When we look at a picture, we are able to effortlessly detect elements like animals, signals, objects of interest, etc. In the Computer Vision field, this process is carried out by Machine Learning tools, aiming to retrieve information about the content of an image. The possible applications of object recognition are numerous, to name a few: image panoramas, robot localization, face detection/recognition, autonomous driving, or pedestrian detection. Given the relevance of this problem, numerous approaches have been explored to address it. Among the most popular techniques in both, real applications and academia, we can find those recognizing objects by means of hand-crafted features. Typical features are descriptors of their keypoints extracted according to a certain criterion (e.g. Scale-Invariant Feature Transform, SIFT), or those geometrically (size, orientation, shape, etc.) or visually (colour, texture, etc.) describing the regions to be recognized. The number of software tools developed to carry out this task is large, including wellknown software libraries as OpenCV or scikit-learn. However, object recognition is far from being a one-step task, and most of them lack a comprehensive step-by-step description of the pipeline needed for accomplishing it. This pipeline usually includes the management and analysis of the data used to train the recognition model, a pre-processing of such data to convert them into an usable form, the fitting of the model and, finally, its validation. The tutorial presented in this work encompasses those steps, giving detailed information and hints about how to complete each one. It is accompanied by a public implementation of the object recognition pipeline (https://github.com/jotaraul/object_recognition_in_python) using trendy python tools (Pandas, seaborn and scikit-learn), so it can be profitable by Computer Vision practitioners and lecturers aiming to gain insight/illustrate good practices for the design of a successful object recognition system. It is also open to any contribution from those communities. Finally, we also provide some directions and experiences regarding its utilization in academia.
机译:本文介绍了对热电计算机视觉问题的教程,对象识别采用机器学习技术的教程,我们认为这可以是为从业者和讲师社区进行积累的资源。对物体的认可是人类的先天能力。当我们看一张照片时,我们能够毫不费力地检测像动物,信号,感兴趣的对象等。在计算机视觉领域中,该过程是通过机器学习工具执行的,旨在检索关于内容的信息图片。对象识别的可能应用是众多,名称少数:图像全景,机器人定位,面部检测/识别,自主驾驶或行人检测。鉴于这个问题的相关性,已经探索了许多方法来解决它。在真正的应用和学术界中最受欢迎的技术中,我们可以通过手工制作的功能找到识别物体的人。典型的特征是根据某个标准(例如鳞片不变特征变换,SIFT)或几何上(尺寸,方向,形状等)或视觉(颜色,纹理等)来提取其关键点的描述符被认可。开发该任务的软件工具数量很大,包括众所周知的软件库作为OpenCV或Scikit-rement。然而,对象识别远非是一步任务,而且大多数人缺乏完成完成管道所需的流量的综合逐步描述。该流水线通常包括用于训练识别模型的数据的管理和分析,预处理这些数据以将它们转换为可用形式,模型的拟合,最后,其验证。本工作中呈现的教程包含这些步骤,提供详细信息和暗示如何完成每个的信息和提示。它伴随着使用时尚Python工具(Pandas,Seaborn和Scikit-Learn)的对象识别管道(https://github.com/jotaraul/object_recognition_in_python)的公开实施,因此它可以通过计算机视觉从业者和讲师盈利旨在获得洞察/说明设计成功对象识别系统的良好实践。它也对这些社区的任何贡献都开放。最后,我们还提供了一些关于其在学术界的利用的方向和经验。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号