-
Notifications
You must be signed in to change notification settings - Fork 39
The mitoz tools gbseqextractor command
Guanliang MENG edited this page Jun 22, 2023
·
1 revision
$ mitoz-tools gbseqextractor -h
usage: mitoz-tools gbseqextractor [-h] -f <STR> -prefix <STR> [-seqPrefix <STR>]
[-types {CDS,rRNA,tRNA,wholeseq,gene} [{CDS,rRNA,tRNA,wholeseq,gene} ...]]
[-cds_translation] [-gi] [-p] [-t] [-rv] [-s] [-l] [-F]
Extract any CDS or rNRA or tRNA DNA sequences of genes from Genbank file.
Seqid will be the value of '/gene=' or '/product=', if they both were not
present, the gene will not be output!
version 20201128:
Now we can handle compounlocation (feature location with "join")!
We can also output the translation for each CDS (retrived from '/translation=')
Please cite:
Guanliang Meng, Yiyuan Li, Chentao Yang, Shanlin Liu,
MitoZ: a toolkit for animal mitochondrial genome assembly, annotation
and visualization, Nucleic Acids Research, https://doi.org/10.1093/nar/gkz173
optional arguments:
-h, --help show this help message and exit
-f <STR> Genbank file
-prefix <STR> prefix of output file. required.
-seqPrefix <STR> prefix of each seq id. default: None
-types {CDS,rRNA,tRNA,wholeseq,gene} [{CDS,rRNA,tRNA,wholeseq,gene} ...]
what kind of genes you want to extract? wholeseq for whole fasta seq. WARNING: Each
sequence in the result files corresponds to ONE feature in the GenBank file, I will NOT
combine multiple CDS of the same gene into ONE! [CDS]
-cds_translation Also output translated CDS (required -types CDS). The translations are retrived directly
from the '/translation=' key word. [False]
-gi use gi number as sequence ID instead of accession number when " gi number is present.
(default: accession number)
-p output the position information on the ID line. Warning: the position on ID line is 0
left-most! [False]
-t output the taxonomy lineage on ID line [False]
-rv reverse and complement the output sequences if they are on the 3'-5' direction [True]
-s output the species name on the ID line [False]
-l output the seq length on the ID line [False]
-F only output full length genes,i.e., exclude the genes with '>' or '<' in their location
[False]
About:
Commands:
- The -all- subcommand
- The -filter- subcommand
- The -assemble- subcommand
- The -findmitoscaf- subcommand
- The -annotate- subcommand
- The -visualize- subcommand
Usages:
- Installation
- Tutorial
- Extending MitoZ-s database
- Batch processing of many samples
- Known issues
- FAQ
- Some important intermediate files
- Upload to GenBank
MitoZ-tools:
- Overview: The -mitoz tools- command
- The -mitoz-tools--group_seq_by_gene- command
- The -mitoz tools bold_identification- command
- The -mitoz tools circle_check- command
- The -mitoz tools gbfiletool- command
- The -mitoz tools gbseqextractor- command
- The -mitoz tools msaconverter- command
- The -mitoz tools taxonomy_ranks- command