Video Highlight Detection via Region-Based Deep Ranking Model

Jiao Yifan; Zhang Tianzhu; Huang Shucheng; Liu Bin; Xu Changsheng

首页> 外文期刊>International Journal of Pattern Recognition and Artificial Intelligence >Video Highlight Detection via Region-Based Deep Ranking Model

【24h】

Video Highlight Detection via Region-Based Deep Ranking Model

机译：通过基于区域的深度排名模型进行视频精彩片段检测

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

The video highlight detection task is to localize key elements (moments of user's major or special interest) in a video. Most of the existing highlight detection approaches extract features from the video segment as a whole without considering the difference of local features spatially. In spatial extent, not all regions are worth watching because some of them only contain the background of the environment without human or other moving objects, especially when there is lots of clutter in the background. To deal with this issue, we propose a novel region-based model which can automatically localize the key elements in a video without any extra supervised annotations. Specifically, the proposed model produces position-sensitive score maps for local regions in the spatial dimension of the video segment, and then aggregates all position-wise scores with position-pooling operation. The regions with higher response values will be extracted as key elements. Thus more effective features of the video segment are obtained to predict the highlight score. The proposed position-sensitive scheme can be easily integrated into an endto-end fully convolutional network which aims to update parameters via stochastic gradient descent method in the backward propagation to improve the robustness of the model. Extensive experimental results on the YouTube and SumMe datasets demonstrate that the proposed approach achieves significant improvement over state-of-the-art methods.

机译：视频精彩片段检测任务是定位视频中的关键元素（用户的主要兴趣或特殊兴趣的时刻）。大多数现有的高光检测方法从整个视频片段中提取特征，而不考虑空间上局部特征的差异。在空间范围内，并不是所有区域都值得一看，因为其中一些区域仅包含环境背景而没有人或其他移动物体，尤其是在背景中杂乱无章的情况下。为了解决这个问题，我们提出了一种新颖的基于区域的模型，该模型可以自动定位视频中的关键元素，而无需任何额外的监督注释。具体而言，提出的模型为视频片段的空间维度中的局部区域生成位置敏感得分图，然后使用位置合并操作汇总所有位置得分。具有较高响应值的区域将被提取为关键元素。因此，获得了视频片段的更有效特征来预测精彩片段。所提出的位置敏感方案可以容易地集成到端到端全卷积网络中，该网络旨在通过随机梯度下降法在向后传播中更新参数，以提高模型的鲁棒性。 YouTube和SumMe数据集上的大量实验结果表明，所提出的方法相对于最新方法取得了显着改进。

著录项

来源
《International Journal of Pattern Recognition and Artificial Intelligence》 |2019年第7期|1940001.1-1940001.14|共14页
作者
Jiao Yifan; Zhang Tianzhu; Huang Shucheng; Liu Bin; Xu Changsheng;
展开▼
作者单位

Jiangsu Univ Sci & Technol, Zhenjiang 212003, Jiangsu, Peoples R China|Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

Jiangsu Univ Sci & Technol, Zhenjiang 212003, Jiangsu, Peoples R China;

Moshanghua Tech Co Ltd, Beijing 100030, Peoples R China;

Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Video highlight detection; position-sensitive; fully convolutional network;

机译：视频突出显示;位置敏感;完全卷积的网络;

相似文献

外文文献
中文文献
专利

1. Video Highlight Detection via Region-Based Deep Ranking Model [J] . Jiao Yifan, Zhang Tianzhu, Huang Shucheng, International Journal of Pattern Recognition and Artificial Intelligence . 2019,第7期

机译：通过基于区域的深度排名模型进行视频突出显示
2. Video Summarization Using Highlight Detection and Pairwise Deep Ranking Model [J] . M. Sridevi, Mayuri Kharde Procedia Computer Science . 2020,第5期

机译：使用突出检测和成对深度排名模型的视频摘要
3. Exploiting Web Images for Video Highlight Detection With Triplet Deep Ranking [J] . Hoseong Kim, Tao Mei, Hyeran Byun, IEEE transactions on multimedia . 2018,第9期

机译：利用Web图像进行三重态深度排名的视频突出显示检测
4. Video Highlight Detection via Deep Ranking Modeling [C] . Yifan Jiao, Xiaoshan Yang, Tianzhu Zhang, Pacific-rim symposium on image and video technology . 2018

机译：通过深度排名建模检测视频高光
5. Machine learning techniques in nuclear material detection, drug ranking and video tracking. [D] . Yang, Yan. 2013

机译：核材料检测，药物排名和视频跟踪中的机器学习技术。
6. Real-Time Human Detection for Aerial Captured Video Sequences via Deep Models [O] . Nouar AlDahoul, Aznul Qalid Md Sabri, Ali Mohammed Mansoor 2018

机译：通过深度模型对空中捕获的视频序列进行实时人体检测
7. MINI-Net: Multiple Instance Ranking Network for Video Highlight Detection [O] . Fa-Ting Hong, Xuanteng Huang, Wei-Hong Li, 2020

机译：MINI-NET：多实例排名网络用于视频亮点检测

Video Highlight Detection via Region-Based Deep Ranking Model

摘要

著录项

相似文献

相关主题

期刊订阅