Skip to content

Commit 434be98

Browse files
Updated reference genome information (#3267)
1 parent 4712985 commit 434be98

File tree

1 file changed

+14
-15
lines changed

1 file changed

+14
-15
lines changed

qiita_pet/support_files/doc/source/processingdata/processing-recommendations.rst

Lines changed: 14 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Currently, Qiita supports the processing of raw data from:
66
#. Target gene barcoded sequencing
77
#. Shotgun sequencing
88
#. Metatranscriptome sequencing
9-
#. Genome Isolate sequencing
9+
#. Genome isolate sequencing
1010

1111
Note that the selected processing recommendations are mainly guided towards performing meta-analyses,
1212
this is combine different studies, even from different wet lab techniques or
@@ -63,20 +63,19 @@ The current workflow is as follows:
6363

6464
Note that we recommend only uploading sequences that have already been through QC and human sequence removal. However, we
6565
recommend that all sequence files go through adapter and host filtering within the system to ensure they are ready for
66-
subsequent meta-analyses. Currently, the `fastp` command is set to autodetect adaptors so this command is available for all different
67-
wetlab processing and we provide the following host references for your convenience:
68-
69-
- auto-detect adapters and artifacts + phix filtering: This is a `deblur artifacts <https://github.com/biocore/deblur/blob/master/deblur/support_files/artifacts.fa>`_ reference, mainly for debugging and testing
70-
- auto-detect adapters and `cheetah <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/003/709/585/GCF_003709585.1_Aci_jub_2/GCF_003709585.1_Aci_jub_2_genomic.fna.gz>`_ + phix filtering
71-
- auto-detect adapters and `cow <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/003/205/GCA_000003205.6_Btau_5.0.1/GCA_000003205.6_Btau_5.0.1_genomic.fna.gz>`_ + phix filtering
72-
- auto-detect adapters and `hamster <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/017/639/785/GCF_017639785.1_BCM_Maur_2.0/GCF_017639785.1_BCM_Maur_2.0_genomic.fna.gz>`_ + phix filtering
73-
- auto-detect adapters and `horse <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/002/305/GCA_000002305.1_EquCab2.0/GCA_000002305.1_EquCab2.0_genomic.fna.gz>`_ + phix filtering
74-
- auto-detect adapters and merge_genomes + phix filtering : is the combined genomes of a cheetah, cow, hamster, horse, human, mouse, pig, rabbit, and rat
75-
- auto-detect adapters and `mouse <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/635/GCF_000001635.27_GRCm39/GCF_000001635.27_GRCm39_genomic.fna.gz>`_ + phix filtering
76-
- auto-detect adapters and `pig <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/025/GCF_000003025.6_Sscrofa11.1/GCF_000003025.6_Sscrofa11.1_genomic.fna.gz>`_ + phix filtering
77-
- auto-detect adapters and `rabbit <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/625/GCF_000003625.3_OryCun2.0/GCF_000003625.3_OryCun2.0_genomic.fna.gz>`_ + phix filtering
78-
- auto-detect adapters and `rat <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/895/GCA_000001895.4_Rnor_6.0/GCA_000001895.4_Rnor_6.0_genomic.fna.gz>`_ + phix filtering
79-
- auto-detect adapters only filtering
66+
subsequent meta-analyses. We currently provide the several options for your convenience. For each the `fastp` command is set to autodetect and remove universal adapter sequences (i.e., 'GATCGGAAGAGCACACGTCTGAACTCCAGTCAC' for R1 reads and 'GATCGGAAGAGCGTCGTGTAGGGAAAGGAGTGT' for R2 reads). We also provide the following host reference genomes for filtering against; each also filters against three phi x sequences (i.e., `HM753704.1 <https://www.ncbi.nlm.nih.gov/nuccore/HM753704.1/>`_, `JF719728.1 <https://www.ncbi.nlm.nih.gov/nuccore/JF719728.1>`_, `J02482.1 <https://www.ncbi.nlm.nih.gov/nuccore/J02482.1>`_):
67+
68+
- auto-detect adapters and artifacts + phix filtering: This is a `deblur artifacts <https://github.com/biocore/deblur/blob/master/deblur/support_files/artifacts.fa>`_ reference, mainly for debugging and testing. Includes another adapter sequence (i.e., 'ATCTCGTATGCCGTCTTCTGC').
69+
- auto-detect adapters and **cheetah** + phix filtering. Includes cheetah (*Acinonyx jubatus*) reference `GCF_003709585.1 (Aci_jub_2) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_003709585.1/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/003/709/585/GCF_003709585.1_Aci_jub_2/GCF_003709585.1_Aci_jub_2_genomic.fna.gz>`_
70+
- auto-detect adapters and **cow** + phix filtering. Includes cow (*Bos taurus*) reference `GCF_000003205.7 (Btau_5.0.1) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000003205.7/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/003/205/GCA_000003205.6_Btau_5.0.1/GCA_000003205.6_Btau_5.0.1_genomic.fna.gz>`_
71+
- auto-detect adapters and **hamster** + phix filtering. Includes golden hamster (*Mesocricetus auratus*) reference `GCF_017639785.1 (BCM_Maur_2.0) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_017639785.1/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/017/639/785/GCF_017639785.1_BCM_Maur_2.0/GCF_017639785.1_BCM_Maur_2.0_genomic.fna.gz>`_
72+
- auto-detect adapters and **horse** + phix filtering. Includes horse (*Equus caballus*) reference `GCF_000002305.2 (EquCab2.0) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000002305.2/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/002/305/GCA_000002305.1_EquCab2.0/GCA_000002305.1_EquCab2.0_genomic.fna.gz>`_
73+
- auto-detect adapters and **merge_genomes** + phix filtering. Includes the genomes of cheetah, cow, hamster, horse, mouse, pig, rabbit, and rat described here.
74+
- auto-detect adapters and **mouse** + phix filtering. Includes house mouse (*Mus musculus*) reference `GCF_000001635.27 (GRCm39) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000001635.27/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/001/635/GCF_000001635.27_GRCm39/GCF_000001635.27_GRCm39_genomic.fna.gz>`_
75+
- auto-detect adapters and **pig** + phix filtering. Includes pig (*Sus scrofa*) reference `GCF_000003025.6 (Sscrofa11.1) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000003025.6/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/025/GCF_000003025.6_Sscrofa11.1/GCF_000003025.6_Sscrofa11.1_genomic.fna.gz>`_
76+
- auto-detect adapters and **rabbit** + phix filtering. Includes rabbit (*Oryctolagus cuniculus*) reference `GCF_000003625.3 (OryCun2.0) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000003625.3/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/003/625/GCF_000003625.3_OryCun2.0/GCF_000003625.3_OryCun2.0_genomic.fna.gz>`_
77+
- auto-detect adapters and **rat** + phix filtering. Includes Norway rat (*Rattus norvegicus*) reference `GCF_000001895.5 (Rnor_6.0) <https://www.ncbi.nlm.nih.gov/data-hub/genome/GCF_000001895.5/>`_. `Download link <https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/895/GCA_000001895.4_Rnor_6.0/GCA_000001895.4_Rnor_6.0_genomic.fna.gz>`_
78+
- auto-detect adapters only filtering. Only includes the two adapter sequences noted above.
8079

8180
Note that the command produces up to 6 output artifacts based on the aligner and database selected:
8281

0 commit comments

Comments
 (0)