Enhancing Tampered Text Detection through Frequency Feature Fusion and Decomposition

This repository is the official implementation of Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition, ECCV 2024.

We introduce a Feature Fusion and Decomposition Network (FFDN) that combines a Visual Enhancement Module (VEM) with a Wavelet-like Frequency Enhancement (WFE). The VEM makes tampering traces visible while preserving the integrity of original RGB features using zero-initialized convolutions. Meanwhile, the WFE decomposes the features to explicitly retain high-frequency details that are often overlooked during downsampling, focusing on small but critical tampering clues.
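To illustrate why a frequency decomposition can keep details that plain downsampling discards, here is a minimal pure-Python sketch. It uses a 2x2 Haar transform as a stand-in; this is an assumption for illustration only and is not the WFE module from this repository. The function names `haar_split` and `haar_merge` are hypothetical.

```python
def haar_split(x):
    """Split a 2D grid (even height and width) into four half-resolution bands."""
    ll, lh, hl, hh = [], [], [], []
    for i in range(0, len(x), 2):
        row_ll, row_lh, row_hl, row_hh = [], [], [], []
        for j in range(0, len(x[0]), 2):
            a, b = x[i][j], x[i][j + 1]
            c, d = x[i + 1][j], x[i + 1][j + 1]
            row_ll.append((a + b + c + d) / 4)  # low-pass: same as 2x2 average pooling
            row_lh.append((a - b + c - d) / 4)  # left-minus-right detail
            row_hl.append((a + b - c - d) / 4)  # top-minus-bottom detail
            row_hh.append((a - b - c + d) / 4)  # diagonal detail
        ll.append(row_ll)
        lh.append(row_lh)
        hl.append(row_hl)
        hh.append(row_hh)
    return ll, lh, hl, hh


def haar_merge(ll, lh, hl, hh):
    """Invert haar_split: rebuild the full-resolution grid from the four bands."""
    out = []
    for i in range(len(ll)):
        top, bottom = [], []
        for j in range(len(ll[0])):
            s, u, v, w = ll[i][j], lh[i][j], hl[i][j], hh[i][j]
            top += [s + u + v + w, s - u + v - w]
            bottom += [s + u - v - w, s - u - v + w]
        out += [top, bottom]
    return out


# A flat 4x4 patch with one altered pixel, standing in for a small tampering trace.
patch = [[0, 8, 0, 0],
         [0, 0, 0, 0],
         [0, 0, 0, 0],
         [0, 0, 0, 0]]

ll, lh, hl, hh = haar_split(patch)
restored = haar_merge(ll, lh, hl, hh)
```

The low-pass band `ll` alone only records that the top-left 2x2 block has a higher average; the position of the altered pixel inside the block lives in `lh`, `hl`, and `hh`. Keeping all four bands makes the downsampling lossless, which is the general idea behind retaining high-frequency components.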

Prepare Dataset

Download the dataset from DocTamper.
Link the dataset to ./data/DocTamperV1/unzip_files.

Your folder structure should look like this:

data
└── DocTamperV1
    ├── unzip_files
    │   ├── DocTamperV1-TrainingSet
    │   ├── DocTamperV1-TestingSet
    │   ├── DocTamperV1-FCD
    │   └── DocTamperV1-SCD
    ├── pks
    │   ├── DocTamperV1-TestingSet_75.pk
    │   ├── DocTamperV1-FCD_75.pk
    │   └── DocTamperV1-SCD_75.pk
    └── processed
        ├── train.txt
        ├── val.txt
        ├── fcd.txt
        └── scd.txt
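
Before training, it can help to confirm that the layout above is in place. The following is a small sketch using only the standard library; the helper name `missing_entries` is hypothetical and is not part of this repository. It only checks that each path exists, not that its contents are valid.

```python
from pathlib import Path

# Paths taken from the folder structure above, relative to data/DocTamperV1.
EXPECTED = [
    "unzip_files/DocTamperV1-TrainingSet",
    "unzip_files/DocTamperV1-TestingSet",
    "unzip_files/DocTamperV1-FCD",
    "unzip_files/DocTamperV1-SCD",
    "pks/DocTamperV1-TestingSet_75.pk",
    "pks/DocTamperV1-FCD_75.pk",
    "pks/DocTamperV1-SCD_75.pk",
    "processed/train.txt",
    "processed/val.txt",
    "processed/fcd.txt",
    "processed/scd.txt",
]


def missing_entries(root):
    """Return the expected relative paths that do not exist under root."""
    root = Path(root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_entries("data/DocTamperV1")
    if missing:
        print("Missing entries:")
        for rel in missing:
            print("  " + rel)
    else:
        print("Dataset layout looks complete.")
```

Run it from the repository root so that the relative path `data/DocTamperV1` resolves correctly.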

Getting Started

Installation

To install FFDN, follow these steps:

# install jpegio
cd FFDN/libs/jpegio
pip install -r requirements.txt
python setup.py install
# install mmsegmentation
cd ../../
pip install -r requirements.txt

Inference

Run the demo in inference.ipynb.

Train and Evaluate

export GPU_NUMS=4
export PRETRAINED_MODEL=work_dirs/FFDN/FFDN.pth

bash tools/dist_train_val.sh work_config/FFDN/FFDN.py ${GPU_NUMS}
bash tools/dist_test_docTamper_lmdb.sh work_config/FFDN/FFDN.py ${PRETRAINED_MODEL} ${GPU_NUMS}

Note

For a fair comparison, we apply compression once instead of three times, because the DocTamper project actually compresses only once due to a code implementation error. For more details, please refer to jpeg_compress_vis.ipynb.

Acknowledgement

This project builds upon several open-source projects and datasets:

MMSeg: We leverage the MMSegmentation framework for our model implementation and training pipeline. Their modular design and extensive tools greatly facilitated our research.

DocTamper: We utilize the DocTamper dataset and build upon their baseline methods. Their work in document tampering detection has been instrumental in advancing this field.

JPEGIO: We use the JPEG IO library for efficient JPEG image processing, which is crucial for our frequency domain analysis.

We express our sincere gratitude to the developers and researchers behind these projects. Their contributions to the open-source community have been invaluable to our research.

Citation

If you find this work useful in your research, please consider citing:

@inproceedings{chen2024enhancing,
  title={Enhancing Tampered Text Detection Through Frequency Feature Fusion and Decomposition},
  author={Chen, Zhongxi and Chen, Shen and Yao, Taiping and Sun, Ke and Ding, Shouhong and Lin, Xianming and Cao, Liujuan and Ji, Rongrong},
  booktitle={European Conference on Computer Vision},
  pages={200--217},
  year={2024},
  organization={Springer}
}
