Datasets

AI or Not Model Code

Given an image, this model will predict a probability that an image is AI-generated Image.

It's a Tensorflow Model built using TF Dataset for data processing.
Training code can run on Mac or AWS.
F1 Score is 0.90 for Current Evaluation Dataset.

Datasets

Below are the datasets used for training and evaluating the model. Datasets are chosen to solve the problem of identifying Low effort AI generated Crypto Art. Approxmately 1.35 Million images are used for trianing, 350K for Validation and 500K Images are used for testing. For more details refer Training Report

Dataset Name	Description	License
Artbench	Real Artistic Images	MIT
CIFAKE	Real from CIFAR-10 and Fake are generated using Stable Diffusion	MIT
DIRE	Broad dataset of other Dataset	Inherited from other datasets
Dalle3	Generated using Dalle3 Model	N/A
FakeImageDataset	Benchmarking Dataset	Apache-2.0
LAION-5B	Real Artistic Images with Aesthetics score greater than 6.5 and modified before Jan 2020	Creative Common CC-BY 4.0
Midjourney	Generated using the Midjourney Model	N/A

Evaluation and Performance

For evaluation, we employed multiple datasets to test the generalization capabilities of the model. Key evaluation metrics include precision, recall, and F1 score:

Evaluation Dataset	Precision	Recall	F1 Score
Combined Test Set	0.95	0.86	0.90

Sample code

Below is the sample code to test the model on a single image. Refer notebooks folder for more examples.

from PIL import Image
import numpy as np
import tensorflow as tf
from huggingface_hub import from_pretrained_keras

# Download Model
REPO_ID = "konerusudhir/ai-or-not-model"
model = from_pretrained_keras(REPO_ID, token=True, force_download=True)

# Read Image
file_path = "examples/1-1900923-343813.jpg"
image = Image.open(file_path)

# Convert to RGB
if image.mode != "RGB":
    image = image.convert("RGB")

# normalize to 0-1
image = np.array(image) / 255.0

# resize
resized_image = tf.image.resize_with_pad(image, 224, 224)

print(f"File: {file_path} Image shape: {resized_image.shape}")

# Predict
predictions = model(np.array([resized_image]))

# Compute AI Probability
real_probability = predictions[0][0]
ai_probability = 1 - real_probability
print(f"AI Image Prediction: {ai_probability:.2f}")

Huggingface Model

Limitations of the Model

The goal of the model is to detect Low effort AI generated images.
If multiple post processing steps are applied to images after AI generation, model might not be good on detecting these type of images.
Model might not be able to detect Hyper Realistic AI generated images from latest models

Training

Please set below env variables before running the training pipeline(pipeline.py). Training details and Dataset tar files will be released in coming weeks.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
notebooks		notebooks
training_pipeline		training_pipeline
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
create_env.zsh		create_env.zsh
dev_environment.yml		dev_environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI or Not Model Code

Datasets

Evaluation and Performance

Sample code

Limitations of the Model

Training

About

Releases

Packages

Languages

License

MakersPlace/ai_or_not

Folders and files

Latest commit

History

Repository files navigation

AI or Not Model Code

Datasets

Evaluation and Performance

Sample code

Limitations of the Model

Training

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages