Leveraging Frequency Analysis for Deepfake Image Classification

The project entails two main objectives: to understand the entire architecture of the underlying neural network, and to implement web application for distinguish Deepfake images.

INTRODUCTION

Deep fake technology has gained significant attention in recent years due to its potential to generate highly realistic counterfeit images and videos, raising concerns about the integrity of visual media. Addressing the challenge of detecting deep fake images is crucial to ensure trustworthiness in various domains, including journalism, forensics, and social media platforms. This project aims to explore the effectiveness of frequency analysis techniques and deep learning for deep fake image recognition, offering a comprehensive study to enhance the reliability of detection methods.

The project utilizes a frequency analysis technique DCT, to extract spectral features that capture unique characteristics of deep fake images. These features are then used to train deep learning models, enhancing their ability to accurately classify deep fake images. We also aim to build a frontend web app which identifies a deep fake image and if possible where one can upload an image and it will classify the image as deep fake or real. We use Gradio for the frontend display of the web-app. By empowering users to detect deep fake images and promoting transparency, the project contributes to combating the harmful effects of deepfakes and fostering a more trustworthy digital media environment.

TECHNOLOGIES USED

ARCHITECTURE

The AdaIN layer is normalizing the statistics of inputs and outputs to the convolution layer, and feeding the statistics of the style input. This way we can keep the information of x while transforming the distribution of the output of AdaIN to be similar to the style input.

The synthesis network uses a progressive GAN structure as its backbone, where the network grows from low resolution to high resolution as training continues. Noise is given as an input to each synthesis block in order to capture variational details and thus rendering the image to be seen more realistic. The input latent vector characterizes important features such as gender, ethnicity and hairstyle. This leads to more controllability of the input style vector. Style mixing in StyleGAN involves interpolating the latent code vectors (Z vectors) of two input images at specific layers in the network. By blending the style vectors (W vectors) of these images, it allows for the synthesis of new images that exhibit a combination of visual attributes from both source images, effectively influencing the appearance of generated outputs at different hierarchical levels within the network.

We transform images into the frequency domain using the discrete cosine transform (DCT). The DCT expresses, much like the discrete Fourier transform (DFT), a finite sequence of data points as a sum of cosine functions oscillating at different frequencies. In practice, we compute the 2D-DCT as a product of two 1D-DCTs, i.e. for images we first compute a DCT along the columns and then a DCT along the rows. When we plot the DCT spectrum, we depict the DCT coefficients as a heatmap. To classify the images based on their frequency domain counterparts, we use a simple linear classifier. To demonstrate this, we perform a ridge regression on real and generated images, after applying a DCT. We also perform a ridge regression on the original images without any transformations for comparative analysis. We use a ridge classifier for the classification which is a modification of linear regression where regularization, specifically L2 regularization, is applied to the coefficients.

Dataset

FFHQ-dataset has been used for real images. Fake images have been generated via the use of StyleGAN

Performance and Results

DCT Performance

Standard Performance

Accuracy of DCT of Image: 1.0

Accuracy of Standard Image: 0.547

A large improvement in detection of deepfake images is noticed when taking the DCT of images compared to normal images as input to a ridge classifier. DCT is able to pickup on artifacts present in deepfake images generated by the STYLEGAN.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
DeepFake_DCT.ipynb		DeepFake_DCT.ipynb
README.md		README.md
StyleGanGen.ipynb		StyleGanGen.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DeepFake_DCT.ipynb

DeepFake_DCT.ipynb

README.md

README.md

StyleGanGen.ipynb

StyleGanGen.ipynb

Repository files navigation

Leveraging Frequency Analysis for Deepfake Image Classification

INTRODUCTION

TECHNOLOGIES USED

ARCHITECTURE

Dataset

Performance and Results

DCT Performance

Standard Performance

References

Project Mentors:

Project Mentees:

About

Releases

Packages

Contributors 5

Languages

IEEE-NITK/Leveraging-Frequency-Analysis-for-Deepfake-Image-Classification

Folders and files

Latest commit

History

Repository files navigation

Leveraging Frequency Analysis for Deepfake Image Classification

INTRODUCTION

TECHNOLOGIES USED

ARCHITECTURE

Dataset

Performance and Results

DCT Performance

Standard Performance

References

Project Mentors:

Project Mentees:

About

Resources

Stars

Watchers

Forks

Languages