-
Notifications
You must be signed in to change notification settings - Fork 28
Available VADR model files
Eric Nawrocki edited this page May 7, 2020
·
33 revisions
You can download pre-built models to use to validate and annotate
viruses or cox1 genes as listed below. Importantly, to use a set
of models other than the default set that is installed with VADR, you
will need to use either the --mdir
and --mkey
options to
v-annotate.pl
as shown in the SARS-CoV-2 example here, or with some
combination of the -m
, -a
, -i
, -n
, and -x
options. Those
options are described more
here
- FTP page with current model file gzipped tarball
- see the
00README.txt
file included in the tarball of models for more information and recommendedv-annotate.pl
commands. -
WARNING: using
v-annotate.pl
with these models requires a lot of RAM (64Gb recommended) and you will need to add the option--mxsize 64000
to yourv-annotate.pl
commands. - these models (~30Kb) are larger than the maximum length (25Kb)
allowed by
v-build.pl
and were built with a still in-development version of VADR that is not yet publicly available.
RefSeqs](https://ftp.ncbi.nlm.nih.gov/pub/nawrocki/vadr-models/CURRENT)
along with other *Flaviviridae* and *Caliciviridae* RefSeqs.
* ***THIS SET OF MODELS IS INSTALLED WITH VADR*** and is referred
to in the documentation as the "default" set of models
* [this
link](https://ftp.ncbi.nlm.nih.gov/pub/nawrocki/vadr-models/CURRENT/vadr-models-1.0-1.tar.gz)
will download the specific set of models
(vadr-models-1.0-1.tar.gz) used in the [manuscript describing VADR
v1.0](https://www.biorxiv.org/content/10.1101/852657v1)
* construction of this set of models is detailed
[here](documentation/build.md#1.0library)
* see the `00README.txt` file included in the tarball of models for more
information and recommended `v-annotate.pl` commands.
* these models are currently in testing for analyzing metazoan cox1
sequence submissions to GenBank
* these models were built from alignments instead of single
RefSeqs. The alignments are included in Stockholm format in the tarballs as `.stk` files.