Abricate Genomes

Identify some fun stuff in a slew of genomes

This is a simple wrapper for Torsten's ABRicate to generate a .csv (please forgive me Torsten) that concatenates all virulence factors found among the eight databases that can be searched with ABRicate.

Quick Start

git clone https://github.com/greenkidneybean/abricate_genomes.git
cd abricate_genomes
conda create --name abricate_env --file env/abricate_linux.txt
python abricate_genomes.py samples.csv

Setup

Clone this repo

git clone https://github.com/greenkidneybean/abricate_genomes.git

Install Miniconda, here's a guide for setup on Biowulf
Create the abricate_env conda environment:

conda create --name abricate_env --file abricate_linux.txt

Input

Takes a .csv file with two columns: "samples" and "path". Check-out the samples.csv file as a guide for formating the sample input.

sample,path
sample_1,test/sample_1.fa
sample_2,test/sample_2.fa
sample_3,test/sample_3.fa

Output

The abricate_genomes.py file will generate a new directory titled "abricate" containing a pile of files. The primary output file of interest is the abricate/abricate.csv, which flattens the results of each sample summary.csv into a single line. A zero (0) indicates that the virulence factor was not found in any of the eight databases. A one (1) indicates that the virulence factor was found in at least one of the eight databases.

abricate
├── abricate.csv                # summary file of virulence factors
├── fig                         # sample heatmaps with database hits
│   ├── sample_1.png
│   ├── sample_2.png
│   └── sample_3.png
└── samples
    ├── sample_1                # database.out for each database
    │   ├── argannot.out
    │   ├── card.out
    │   ├── ecoh.out
    │   ├── ecoli_vf.out
    │   ├── ncbi.out
    │   ├── plasmidfinder.out
    │   ├── resfinder.out
    │   ├── summary.csv         # summary table of percent hit in each database
    │   ├── summary.tab         # summary table of percent hit in each database
    │   └── vfdb.out
    └── ...

Run

# activate the "abricate_env" conda environment
conda activate abricate_env

# check the very mild help flag for script options
python abricate_genomes.py -h

# run abricate_genomes.py with the provided test data
python abricate_genomes.py samples.csv

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
env		env
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
abricate_genomes.py		abricate_genomes.py
samples.csv		samples.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Abricate Genomes

Quick Start

Setup

Input

Output

Run

About

Releases

Packages

Languages

License

greenkidneybean/abricate_genomes

Folders and files

Latest commit

History

Repository files navigation

Abricate Genomes

Quick Start

Setup

Input

Output

Run

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages