Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks

Yuan Li; Tay Francis Eng Hock; Li Ping; Feng Jiashi

首页> 外文期刊>IEEE transactions on multimedia >Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks

【24h】

Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks

机译：循环一致的对冲LSTM网络无监督的视频摘要

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Video summarization is an important technique to browse, manage and retrieve a large amount of videos efficiently. The main objective of video summarization is to minimize the information loss when selecting a subset of video frames from the original video, hence the summary video can faithfully represent the overall story of the original video. Recently developed unsupervised video summarization approaches are free of requiring tedious annotation on important frames to train a video summarization model and thus are practically attractive. However, their performance is still limited due to the difficulty of minimizing information loss between the summary and original videos. In this paper, we address unsupervised video summarization by developing a novel Cycle-consistent Adversarial LSTM architecture to effectively reduce the information loss in the summary video. The proposed model, named Cycle-SUM, consists of a frame selector and a cycle-consistent learning based evaluator. The selector is a bi-directional LSTM network to capture the long-range relationship between video frames. To overcome the difficulty of specifying a suitable information preserving metric between original video and summary video, the evaluator is introduced to "supervise" selector to improve the video summarization quality. Specifically, the evaluator is composed of two generative adversarial networks (GANs), in which the forward GAN component is learned to reconstruct the original video from summary video, while the backward GAN learns to invert the process. We establish the relation between mutual information maximization and such cycle learning procedure and further introduce cycle-consistent loss to regularize the summarization. Extensive experiments on three video summarization benchmark datasets demonstrate a state-of-the-art performance, and show the superiority of the Cycle-SUM model compared with other unsupervised approaches.

机译：视频摘要是有效浏览，管理和检索大量视频的重要技术。视频摘要的主要目的是在从原始视频选择视频帧的子集时最小化信息丢失，因此摘要视频可以忠实地代表原始视频的整体故事。最近开发了无监督的视频摘要方法，无需对培训视频概要模型的重要帧来说，繁琐的注释，因此实际上是有吸引力的。但是，由于难度最小化了摘要和原始视频之间的信息丢失，它们的性能仍然有限。在本文中，我们通过开发一种新的循环一致的对抗性LSTM架构来解决无监督的视频摘要，以有效地降低摘要视频中的信息丢失。所提出的模型，命名为周期 - 总和，包括帧选择器和基于周期一致的基于学习的评估器。选择器是一个双向LSTM网络，用于捕获视频帧之间的远程关系。为了克服在原始视频和摘要视频之间指定保存度量的难度，评估器被引入“监督”选择器以提高视频概括质量。具体地，评估者由两种生成的对抗性网络（GAN）组成，其中学习前向GaN组件来重建摘要视频的原始视频，而后退GaN学会反转该过程。我们建立了相互信息最大化与此类循环学习程序之间的关系，并进一步引入了周期一致的损失，以规范摘要。三个视频摘要基准数据集的广泛实验证明了最先进的性能，并与其他无监督的方法相比，循环和模型的优越性。

著录项

来源
《IEEE transactions on multimedia》 |2020年第10期|2711-2722|共12页
作者
Yuan Li; Tay Francis Eng Hock; Li Ping; Feng Jiashi;
展开▼
作者单位

Natl Univ Singapore Singapore 119077 Singapore;

Natl Univ Singapore Singapore 119077 Singapore;

Hangzhou Dianzi Univ Sch Comp Sci & Technol Hangzhou 310018 Peoples R China;

Natl Univ Singapore Singapore 119077 Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Video summarization; cycle-consistent learning; LSTM; variation autoencoder;

机译：视频摘要;周期一致的学习;LSTM;变异AutoEncoder;

相似文献

外文文献
中文文献
专利

1. Recurrent generative adversarial networks for unsupervised WCE video summarization [J] . Lan Libin, Ye Chunxiao Knowledge-Based Systems . 2021,第Juna21期

机译：用于无监督的WCE视频摘要的经常性发生的对抗网络
2. Unsupervised feature learning for environmental sound classification using Weighted Cycle-Consistent Generative Adversarial Network [J] . Esmaeilpour Mohammad, Cardinal Patrick, Koerich Alessandro Lameiras Applied Soft Computing . 2020,第期

机译：使用加权循环一致的生成对抗网络进行环境声音分类的无监督特征
3. Unsupervised Variational Video Hashing With 1D-CNN-LSTM Networks [J] . Li Shuyan, Chen Zhixiang, Li Xiu, IEEE transactions on multimedia . 2020,第6期

机译：与1D-CNN-LSTM网络无监督变分视频散列
4. Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization [C] . Li Yuan, Francis EH Tay, Ping Li, AAAI Conference on Artificial Intelligence . 2019

机译：周期 - 总和：无监督视频摘要的周期 - 一致的对抗性LSTM网络
5. Deepfakes Generation Using LSTM Based Generative Adversarial Networks [D] . Santha, Akhil. 2020

机译：使用基于LSTM的生成对抗网络的Deepfakes生成
6. LCPR-Net: low-count PET image reconstruction using the domain transform and cycle-consistent generative adversarial networks [O] . Hengzhi Xue, Qiyang Zhang, Sijuan Zou, 2021

机译：LCPR-net：使用域变换和循环一致的生成对抗网络的低计数PET图像重建
7. Cycle-SUM: Cycle-Consistent Adversarial LSTM Networks for Unsupervised Video Summarization [O] . Li Yuan, Francis EH Tay, Ping Li, 2019

机译：周期 - 总和：无监督视频摘要的循环一致的对抗性LSTM网络

Unsupervised Video Summarization With Cycle-Consistent Adversarial LSTM Networks

摘要

著录项

相似文献

相关主题

期刊订阅