Research done by: Mehdi Abdallahi
Supervisor: Yuhang Lu
Professor: Dr. Touradj Ebrahimi
This repository contains the material used for the research conducted during the second semester of my Master's at EPFL on the detection of diffusion-generated images. The code is adapted from the paper CNN-generated images are surprisingly easy to spot... for now, published by Wang et al. in 2020, and the code they used can be found in this repository. Click here for the report and presentation slides of this project. The trainings were run on the izar cluster at EPFL.
For this research, the main focus was to gather useful insights into the generalization of entire face synthesis detection. To address this task, a state-of-the-art synthesized-image detection method was first established as a baseline. In order to train and assess the performance of a wide range of models, a convenient pipeline for training, validation, and evaluation was implemented. This made it possible to move on to the next step: the exploration of large pre-trained vision models. More than 15 architectures were explored, yielding interesting insights into the generalization of entire face synthesis classification.
git clone https://github.com/mehdi533/Deepfake-Detection
cd Deepfake-Detection/
The required libraries are in the requirements.txt file and can be installed using the command
pip install -r requirements.txt
The datasets used for this study, already in the format described below, are available on the MMSPG page.
The images of a specific generator (e.g. ProGAN) are all located in the same folder; the correct images for training, validation, and testing are selected by loading the lists of images in the metadata folder. The location of the metadata folder has to be passed as an argument; by default it is set to Deepfake-Detection/dataset/metadata.
The format in which the metadata is stored has to be similar to this:
├── Metadata
│   ├── train
│   │   ├── 0_real
│   │   │   ├── X_real_train_list.txt
│   │   │   └── Y_real_train_list.txt
│   │   ├── 1_fake
│   │   │   ├── A_fake_train_list.txt
│   │   │   ├── B_fake_train_list.txt
│   │   │   └── ...
│   ├── val
│   │   ├── 0_real
│   │   │   ├── X_real_val_list.txt
│   │   │   └── Y_real_val_list.txt
│   │   ├── 1_fake
│   │   │   ├── A_fake_val_list.txt
│   │   │   ├── B_fake_val_list.txt
│   │   │   └── ...
│   ├── test
│   │   ├── 0_real
│   │   │   ├── X_real_test_list.txt
│   │   │   └── Y_real_test_list.txt
│   │   ├── 1_fake
│   │   │   ├── A_fake_test_list.txt
│   │   │   ├── B_fake_test_list.txt
│   │   │   └── ...
X, Y, A, and B represent the names of the folders in which the images are stored; they need to match for the image paths to be fetched properly (a hypothetical sketch of how a path can be reconstructed is given after the image layout below).
The images have to be stored as follows:
├── data
│   ├── ProGAN
│   │   ├── 00000.png
│   │   ├── ...
│   ├── DDIM
│   │   ├── 00000.png
│   │   ├── ...
│   ├── CelebA-HQ
│   │   ├── 00000.png
│   │   ├── ...
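To make the correspondence concrete, here is a minimal, hypothetical sketch of how an image path could be rebuilt from a metadata list file and the data folder. The folder name CelebA-HQ, the list file name, and the one-filename-per-line format are assumptions for illustration; this is not the repository's actual loading code.

```python
import os

# Hypothetical illustration (not the repository's actual loader).
# Assumption: each *_list.txt file contains one image filename per line, and the
# prefix of the list file (here "CelebA-HQ") matches a folder name under dataroot.
dataroot = "./dataset/data"      # value passed with --dataroot
metadata = "./dataset/metadata"  # value passed with --metadata

list_file = os.path.join(metadata, "train", "0_real", "CelebA-HQ_real_train_list.txt")
with open(list_file) as f:
    image_paths = [
        os.path.join(dataroot, "CelebA-HQ", line.strip())
        for line in f
        if line.strip()
    ]
```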
It is easy to add new datasets for training/testing. When testing on datasets that do not have the same names as the ones used for this study, please adapt the "list_data" in the util.py file; your models will then be tested on the datasets present in this list (see the hypothetical excerpt below).
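As an illustration, the adaptation could be as simple as the following; the exact variable layout in util.py may differ, and MyDiffusionModel is a placeholder name.

```python
# Hypothetical excerpt of util.py: each entry must match a folder name under
# dataroot and the prefix of the corresponding metadata list files.
list_data = ["CelebA-HQ", "ProGAN", "DDIM", "MyDiffusionModel"]
```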
To train a model, many options are available. The flags we propose are:

--checkpoints_dir: trained models will be saved in this directory
--name: the name of the experiment; results are saved in checkpoints_dir under this name
--arch: architecture of the model (see the list of available architectures below)
--intermediate: adds a fully connected layer in the classifier (use when training with a frozen backbone)
--intermediate_dim: the dimension of the added layer
--freeze: freezes the backbone of the model
--pre_trained: uses the model with pre-trained weights
--models: models/generators on which the model will be trained (e.g. CelebA-HQ,ProGAN,DDIM)
--multiply_real: upsamples the real class; the number of real images is multiplied by this amount
--batch_size: the batch size
--dataroot: path to the folder containing the images from the different generators (e.g. the path to the "data" folder shown in the image storage layout above)
--metadata: path to the folder containing the metadata, structured as shown above
--num_threads: the number of threads to use
--cropping: crops images into random patches
--compr_prob: the percentage of images to be pre-processed with compression
--blur_prob: the percentage of images to be pre-processed with blurring
You simply have to run a command with your chosen options. Use the flags to select the training mode: fine-tuning by default; use --freeze together with --intermediate for a frozen backbone with an added fully connected layer; use --no-pre_trained to train newly initialized layers:
# ResNet50 (Fine-tuning)
python train.py --arch res50 --name 1106 --pre_trained --multiply_real 2 --batch_size 256 --blur_prob 0.3 --models real,DDIM,ProGAN --checkpoints_dir ./checkpoints/ --dataroot ./dataset/ --metadata ./dataset/metadata/
# Swin Tiny (Frozen backbone)
python train.py --arch swin_tiny --name 1206 --freeze --intermediate --pre_trained --batch_size 256 --blur_prob 0.1 --models CelebAHQ,FFpp0,FFpp1,ProGAN --checkpoints_dir ./checkpoints/ --dataroot ./dataset/ --metadata ./dataset/metadata/
# Big Transfer (Training newly initialized layers)
python train.py --arch bit --name 1206 --no-pre_trained --batch_size 256 --blur_prob 0.1 --models CelebAHQ,FFpp0,FFpp1,ProGAN --checkpoints_dir ./checkpoints/ --dataroot ./dataset/ --metadata ./dataset/metadata/
The details of the implementation (optimizer, learning rate scheduler...) can be found in the report.
To test a model, the procedure is similar; the options listed below are available:

--path: path to the model (.pth), or to multiple models if using the model ensemble
--name: the name of the experiment under which the results will be saved
--intermediate: adds a fully connected layer in the classifier (use when the model was trained with a frozen backbone)
--intermediate_dim: the dimension of the added layer
--batch_size: the batch size
--dataroot: path to the folder containing the images from the different generators (e.g. the path to the "data" folder shown in the image storage layout above)
--metadata: path to the folder containing the metadata, structured as shown above
--num_threads: the number of threads to use
--meta_model: trains a model on the validation set to optimize the weights for the vote of the models (LR or kNN)
--models: models/generators on which the meta model will be trained (e.g. CelebA-HQ,ProGAN,DDIM)
To change the directory in which the results are saved, check the util.py file. The --intermediate and --intermediate_dim flags are mandatory when testing a model that was trained with those flags. The default meta model is none; to choose one, pass LR (linear regression) or kNN (k-Nearest Neighbors) to the --meta_model flag. If you want to change the number of neighbors, the distance metric, or other parameters, you will have to do so manually in the model_ensemble.py file. A minimal sketch of the idea is given below.
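For intuition only, the following is a hypothetical sketch of the stacking idea behind the model ensemble, not the code in model_ensemble.py. It assumes each base detector outputs a score in [0, 1] (probability of "fake"); the array shapes, the toy values, and the 0.5 threshold are assumptions for illustration.

```python
# Hypothetical sketch of the ensemble idea, NOT the implementation in model_ensemble.py.
import numpy as np
from sklearn.linear_model import LinearRegression

# Scores of two detectors on four validation images; labels: 0 = real, 1 = fake.
val_scores = np.array([[0.2, 0.4], [0.9, 0.7], [0.1, 0.3], [0.8, 0.9]])
val_labels = np.array([0, 1, 0, 1])

# The meta model is fitted on the validation set and learns how to weight the votes.
meta_model = LinearRegression().fit(val_scores, val_labels)

# At test time, the combined score is thresholded to obtain the final decision.
test_scores = np.array([[0.6, 0.8]])
is_fake = meta_model.predict(test_scores) > 0.5
```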
To evaluate a model, you simply have to run a command with your chosen options:
# Simple evaluation of a model
python eval.py --name testFFpp3 --batch_size 256 --path swin_tiny_0506_FFpp3/model_epoch_best.pth
# With Model Ensemble
python eval.py --name LRtest --batch_size 256 --meta_model LR --path trained/swin_tiny_Forensics --models FFpp0,FFpp1,FFpp2,FFpp3 --num_threads 8
The different architectures that you can train using this code are listed below; the value to pass with the --arch flag is given first:

res50: ResNet50
vgg16: VGG16
efficient_b0: EfficientNet b0
efficient_b4: EfficientNet b4
bit: Big Transfer
vit_base: Vision Transformer (base size)
vit_large: Vision Transformer (large size)
deit_small: Distilled Data-efficient Image Transformer (small-sized model)
deit_base: Distilled Data-efficient Image Transformer (base-sized model)
coatnet: CoAtNet
resnext: ResNeXt
beit: BEiT (base-sized model)
convnext: ConvNeXt
regnet: RegNetY-400MF
swin_tiny: Swin Transformer (tiny version)
swin_base: Swin Transformer (base version)
swin_large: Swin Transformer (large version)
If you are planning on using a new architecture, add it in the custom_models.py file and update the list in util.py (a hypothetical sketch is given below). For new datasets, also add their names to the list if you want your models to be evaluated on them, and make sure they follow the same conventions as described in this README; otherwise you will have to adapt the code to ensure the training/evaluation process runs smoothly.
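As a hypothetical illustration of what adding an architecture could look like (the actual wrapper conventions live in custom_models.py, and DenseNet-121 is just an example backbone not used in this study):

```python
# Hypothetical sketch: wrap a torchvision backbone, replace its classification
# head with a single real/fake output, and expose it under a new --arch value.
import torch.nn as nn
from torchvision.models import densenet121, DenseNet121_Weights

def build_densenet121(pre_trained: bool = True) -> nn.Module:
    weights = DenseNet121_Weights.DEFAULT if pre_trained else None
    model = densenet121(weights=weights)
    model.classifier = nn.Linear(model.classifier.in_features, 1)  # binary fake score
    return model

# Remember to also add the new name (e.g. "densenet121") to the list in util.py.
```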
For the dataset and the guidance during the project: Yuhang Lu.
For the code: the GitHub repository of Peter Wang, licensed under CC BY-NC-SA 4.0.