首页> 外国专利> METHOD AND APPARATUS FOR VISUAL QUESTION ANSWERING, COMPUTER DEVICE AND MEDIUM

METHOD AND APPARATUS FOR VISUAL QUESTION ANSWERING, COMPUTER DEVICE AND MEDIUM

机译:用于视觉问题的方法和装置,用于视觉问题,计算机设备和介质

摘要

The present disclosure provides a method for visual question answering, which relates to fields of computer vision and natural language processing. The method includes: acquiring an input image and an input question; detecting visual information and position information of each of at least one text region in the input image; determining semantic information and attribute information of each of the at least one text region based on the visual information and the position information; determining a global feature of the input image based on the visual information, the position information, the semantic information, and the attribute information; determining a question feature based on the input question; and generating a predicted answer for the input image and the input question based on the global feature and the question feature. The present disclosure further provides a device for visual question answering, a computer device and a non-transitory medium.
机译:本公开提供了一种用于视觉问题的方法,其涉及计算机视觉和自然语言处理的字段。该方法包括:获取输入图像和输入问题;检测输入图像中每个文本区域中的每一个的视觉信息和位置信息;基于视觉信息和位置信息确定至少一个文本区域中的每一个的语义信息和属性信息;基于视觉信息,位置信息,语义信息和属性信息确定输入图像的全局特征;根据输入问题确定问题特征;基于全局特征和问题特征生成输入图像的预测答案和输入问题。本公开还提供了一种用于视觉问题应答,计算机设备和非暂时性介质的设备。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号