ResViT: A Framework for Deepfake Videos Detection

Wasim  Ahmad; Imad Ali; Adil Shahzad; Ammarah  Hashmi; Faisal  Ghaffar

doi:10.32985/ijeces.13.9.9

Authors

Wasim Ahmad Institute of Information Sciences, Academia Sinica, Taiwan Department of Computer Science, National ChengChi University, Taiwan
Imad Ali Department of Computer Science, University of Swat, KP, Pakistan
Sahibzada Adil Shahzad Institute of Information Sciences, Academia Sinica, Taiwan Department of Computer Science, National Chengchi University, Taiwan
Ammarah Hashmi Institute of Information Science, Academia Sinica, Taiwan Institute of Information Systems and Applications, National Tsing Hua University, Taiwan
Faisal Ghaffar System Design Engineering Department, University of Waterloo, Canada

DOI:

https://doi.org/10.32985/ijeces.13.9.9

Keywords:

deepfake, detection, vision transformer, GAN

Abstract

Deepfake makes it quite easy to synthesize videos or images using deep learning techniques, which leads to substantial danger and worry for most of the world's renowned people. Spreading false news or synthesizing one's video or image can harm people and their lack of trust on social and electronic media. To efficiently identify deepfake images, we propose ResViT, which uses the ResNet model for feature extraction, while the vision transformer is used for classification. The ResViT architecture uses the feature extractor to extract features from the images of the videos, which are used to classify the input as fake or real. Moreover, the ResViT architectures focus equally on data pre-processing, as it improves performance. We conducted extensive experiments on the five mostly used datasets our results with the baseline model on the following datasets of Celeb-DF, Celeb-DFv2, FaceForensics++, FF-Deepfake Detection, and DFDC2. Our analysis revealed that ResViT performed better than the baseline and achieved the prediction accuracy of 80.48%, 87.23%, 75.62%, 78.45%, and 84.55% on Celeb-DF, Celeb-DFv2, FaceForensics++, FF-Deepfake Detection, and DFDC2 datasets, respectively.

ResViT: A Framework for Deepfake Videos Detection

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Make a Submission

JCR Impact factor for 2024

0.9