Recaptured screen image identification based on vision transformer

Li Guihao; Yao Heng; Le Yanfen; Qin Chuan

Abstract

Because recapturing LCD screen content often raises copyright issues, recaptured screen image identification has attracted considerable attention in image source forensics. This paper analyzes how convolutional neural networks (CNNs) and vision transformers (ViTs) extract features, and proposes a cascaded network structure that combines local-feature and global-feature extraction modules to distinguish recaptured screen images from original images, with or without a demoiréing operation. We first extract local features from the input images with five convolutional layers, then feed these local features into the ViT, which strengthens the ViT module's local perception capability and further extracts global features of the input images. In our experiments, the method achieves a detection accuracy of 0.9691 on our generated dataset and 0.9940 on the existing mixture dataset, the best performance among the compared methods in both cases.
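To make the data flow of the cascaded design more concrete, the following is a minimal shape-tracing sketch in plain Python, not the authors' implementation. Only the count of five convolutional layers comes from the abstract. The input size (224 x 224), kernel sizes, strides, padding, channel widths, and the choice to treat each spatial position of the final feature map as one ViT token are all assumptions made for illustration, and the names `ConvSpec`, `trace_shapes`, and `tokens_for_vit` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConvSpec:
    """Hypothetical settings for one convolutional layer (not from the paper)."""
    out_channels: int
    kernel: int = 3
    stride: int = 2
    padding: int = 1


def conv_out_size(size: int, spec: ConvSpec) -> int:
    """Standard output-size formula for a convolution along one spatial axis."""
    return (size + 2 * spec.padding - spec.kernel) // spec.stride + 1


def trace_shapes(height: int, width: int, stem: list) -> list:
    """Return the (channels, height, width) shape after each conv layer."""
    shapes = []
    for spec in stem:
        height = conv_out_size(height, spec)
        width = conv_out_size(width, spec)
        shapes.append((spec.out_channels, height, width))
    return shapes


def tokens_for_vit(feature_shape: tuple) -> tuple:
    """Flatten a (C, H, W) feature map into (num_tokens, token_dim).

    Assumption: each spatial position becomes one token whose embedding
    is the channel vector at that position.
    """
    channels, height, width = feature_shape
    return height * width, channels


# Five layers, as stated in the abstract; widths and strides are assumed.
STEM = [
    ConvSpec(32),
    ConvSpec(64),
    ConvSpec(128),
    ConvSpec(256, stride=1),
    ConvSpec(384, stride=1),
]

shapes = trace_shapes(224, 224, STEM)
num_tokens, token_dim = tokens_for_vit(shapes[-1])
print(shapes[-1], num_tokens, token_dim)
```

Under these assumed settings, the convolutional stage reduces a 224 x 224 input to a 28 x 28 map with 384 channels, so the transformer stage would receive 784 tokens of dimension 384. Each token then already summarizes a local neighborhood before self-attention relates tokens globally, which is the local-then-global ordering the abstract describes.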

Bibliographic details

Source: Journal of visual communication & image representation, 2023, Issue 2, pp. 103692.1-103692.10 (10 pages)
Authors: Li Guihao; Yao Heng; Le Yanfen; Qin Chuan
Affiliation: Univ Shanghai Sci & Technol
Original format: PDF
Language: English
Keywords: Recaptured screen images; Image forensics; Demoiréing operation; Vision transformer; Recapture identification
