-
Notifications
You must be signed in to change notification settings - Fork 161
Added TAPS to methylseq #565
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
eduard-watchmaker
wants to merge
10
commits into
nf-core:dev
from
watchmaker-genomics:methylseq_taps
Closed
Changes from all commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
db60bfe
Merge pull request #558 from nf-core/dev
sateeshperi 83dd7eb
Added TAPS to methylseq
eduard-watchmaker 8ace1d6
Resolving conflicts from dev
thomas-watchmaker 428735e
new ro-crate and snapshots
870b936
rm aws conf
eduard-watchmaker 3e1eacf
rm aws conf for nf-test
eduard-watchmaker 2b5e48b
changes
eduard-watchmaker 3f893b9
Fixed join and combine to run all samples
eduard-watchmaker 4ff99f6
Fixed new image for rastair
eduard-watchmaker 1fe460f
Fixed tests
eduard-watchmaker File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,5 @@ | ||
| process { | ||
| withName: GATK4_REMOVEDUPLICATES { | ||
| ext.args = "--REMOVE_DUPLICATES true --TAG_DUPLICATE_SET_MEMBERS true" | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,8 @@ | ||
| process { | ||
| withName: PICARD_ADDORREPLACEREADGROUPS { | ||
| ext.args = "--RGID 1 --RGLB lib1 --RGPL illumina --RGPU unit1 --RGSM sample1" | ||
| } | ||
| } | ||
|
|
||
|
|
||
|
|
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,18 @@ | ||
| process { | ||
| withName: PICARD_REMOVEDUPLICATES { | ||
| ext.args = "--ASSUME_SORTED true --REMOVE_DUPLICATES true --VALIDATION_STRINGENCY LENIENT --PROGRAM_RECORD_ID 'null' --TMP_DIR tmp" | ||
| ext.prefix = { "${meta.id}.dedup.sorted" } | ||
| publishDir = [ | ||
| [ | ||
| path: { "${params.outdir}/${params.aligner}/deduplicated/picard_metrics" }, | ||
| pattern: "*.metrics.txt", | ||
| mode: params.publish_dir_mode | ||
| ], | ||
| [ | ||
| path: { "${params.outdir}/${params.aligner}/deduplicated" }, | ||
| pattern: "*.bam", | ||
| mode: params.publish_dir_mode | ||
| ] | ||
| ] | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,20 @@ | ||
| process { | ||
| withName: RASTAIR_CALL { | ||
| ext.args = [ | ||
| // Pending the resolution of the mbias_parse process | ||
| params.trim_OT ?: '0,0,10,0', // [r1_start, r1_end, r2_start, r2_end] | ||
| params.trim_OB ?: '0,0,10,0' // [r1_start, r1_end, r2_start, r2_end] | ||
| ].join(" ").trim() | ||
| publishDir = [ | ||
| [ | ||
| path: { "${params.outdir}/rastair/call" }, | ||
| mode: params.publish_dir_mode, | ||
| pattern: "*.txt" | ||
| ], [ | ||
| path: { "${params.outdir}/rastair/call" }, | ||
| mode: params.publish_dir_mode, | ||
| pattern: "*.gz" | ||
| ] | ||
| ] | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| process { | ||
| withName: RASTAIR_MBIAS { | ||
| publishDir = [ | ||
| [ | ||
| path: { "${params.outdir}/rastair/mbias" }, | ||
| mode: params.publish_dir_mode, | ||
| pattern: "*.txt" | ||
| ] | ||
| ] | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| process { | ||
| withName: RASTAIR_MBIAS_PARSER { | ||
| publishDir = [ | ||
| [ | ||
| path: { "${params.outdir}/rastair/mbias_parser" }, | ||
| mode: params.publish_dir_mode, | ||
| pattern: "*rastair_mbias_processed*" | ||
| ] | ||
| ] | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,11 @@ | ||
| process { | ||
| withName: CONVERT_TO_METHYLKIT { | ||
| publishDir = [ | ||
| [ | ||
| path: { "${params.outdir}/rastair/methylkit" }, | ||
| mode: params.publish_dir_mode, | ||
| pattern: "*.txt.gz" | ||
| ] | ||
| ] | ||
| } | ||
| } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,4 @@ | ||
| includeConfig "../modules/gunzip.config" | ||
| includeConfig "../modules/bismark_genomepreparation.config" | ||
| includeConfig "../modules/samtools_faidx.config" | ||
| includeConfig "../modules/bwameth_index.config" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,2 @@ | ||
| includeConfig "../modules/picard_addorreplacereadgroups.config" | ||
| includeConfig "../modules/picard_removeduplicates.config" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,2 @@ | ||
| includeConfig "../modules/methyldackel_extract.config" | ||
| includeConfig "../modules/methyldackel_mbias.config" |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,4 @@ | ||
| includeConfig "../modules/rastair_call.config" | ||
| includeConfig "../modules/rastair_mbias.config" | ||
| includeConfig "../modules/rastair_mbias_parser.config" | ||
| includeConfig "../modules/rastair_methylkit.config" |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -6,7 +6,7 @@ This document describes the output produced by the methylseq pipeline. | |
|
|
||
| Most of the plots are taken from the MultiQC report, which summarizes results at the end of the pipeline. | ||
|
|
||
| > NOTE: nf-core/methylseq contains two workflows - one for Bismark, one for bwa-meth. The results files produced will vary depending on which variant is run. | ||
| > NOTE: nf-core/methylseq contains three alignment workflows - one for Bismark, one for bwa-meth and one for bwa-mem. On top of that, there is an extra workflow to process conversion rates from TAPS data (protocol for positive methylation reading) through Rastair. The results files produced will vary depending on which variant is run. | ||
|
|
||
| The output directories listed below will be created in the results directory after the pipeline has finished. All paths are relative to the top-level results directory. | ||
|
|
||
|
|
@@ -85,6 +85,47 @@ bwameth/ | |
| └── logs | ||
| ``` | ||
|
|
||
| #### bwa-mem | ||
|
|
||
| ``` | ||
| bwamem/ | ||
| ├── bwamem | ||
| │ ├── alignments | ||
| │ └── deduplicated | ||
| ├── fastqc | ||
| │ ├── Ecoli_10K_methylated_1_fastqc.html | ||
| │ ├── Ecoli_10K_methylated_2_fastqc.html | ||
| │ └── zips | ||
| ├── multiqc | ||
| │ └── bwamem | ||
| ├── pipeline_info | ||
| │ ├── execution_report_2024-12-13_05-36-34.html | ||
| │ ├── execution_timeline_2024-12-13_05-36-34.html | ||
| │ ├── execution_trace_2024-12-13_05-36-34.txt | ||
| │ ├── nf_core_methylseq_software_mqc_versions.yml | ||
| │ ├── params_2024-12-13_05-36-43.json | ||
| │ └── pipeline_dag_2024-12-13_05-36-34.html | ||
| └── trimgalore | ||
| ├── fastqc | ||
| └── logs | ||
| ``` | ||
|
|
||
| #### rastair | ||
|
|
||
| ``` | ||
| rastair | ||
| ├── call | ||
| | └── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_call.tsv | ||
| ├── mbias | ||
| | └── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_mbias.tsv | ||
| ├── mbias_parser | ||
| │ ├── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_mbias_processed.txt | ||
| │ ├── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_mbias_processed.csv | ||
| │ └── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_mbias_processed.pdf | ||
| ├── methylkit | ||
| | └── Ecoli_10K_methylated.markdup.sorted_CpG.rastair_methylkit.txt.gz | ||
| ``` | ||
|
|
||
| ### Detailed Output Descriptions | ||
|
|
||
| ### Reference Genome Preparation | ||
|
|
@@ -174,6 +215,9 @@ _Note that bismark can use either use Bowtie2 (default) or HISAT2 as alignment t | |
| - `logs/samtools_stats/sample_stats.txt` | ||
| - Summary file giving lots of metrics about the aligned BAM file. | ||
|
|
||
| **bwa-mem output directory: `results/bwamem/alignments/`** | ||
| # TODO | ||
|
|
||
| ### Deduplication | ||
|
|
||
| This step removes alignments with identical mapping position to avoid technical duplication in the results. Note that it is skipped if `--save_align_intermeds`, `--skip_deduplication` or `--rrbs` is specified when running the pipeline. | ||
|
|
@@ -196,6 +240,9 @@ This step removes alignments with identical mapping position to avoid technical | |
| - `logs/sample.sorted.markDups_metrics.txt` | ||
| - Log file giving summary statistics about deduplication. | ||
|
|
||
| **bwa-mem output directory: `results/bwamem/deduplicated/`** | ||
| # TODO | ||
|
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. TODO |
||
|
|
||
| ### Methylation Extraction | ||
|
|
||
| The methylation extractor step takes a BAM file with aligned reads and generates files containing cytosine methylation calls. It produces a few different output formats, described below. | ||
|
|
@@ -231,6 +278,9 @@ Filename abbreviations stand for the following reference alignment strands: | |
| - `sample.bedGraph` | ||
| - Methylation statuses in [bedGraph](http://genome.ucsc.edu/goldenPath/help/bedgraph.html) format. | ||
|
|
||
| **bwa-mem / TAPS workflow output directory: `results/rastair/`** | ||
| # TODO | ||
|
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. TODO |
||
|
|
||
| ### Targeted Sequencing | ||
|
|
||
| If `--run_targeted_sequencing` is set to `true`, the pipeline performs additional analysis for targeted sequencing experiments. | ||
|
|
@@ -241,6 +291,7 @@ BedGraph files are filtered using the BED file passed to `--target_regions_file` | |
|
|
||
| **Bismark output directory: `results/bismark/methylation_calls/bedGraph/`** | ||
| **bwa-meth output directory: `results/methyldackel/`** | ||
| # TODO: implement filtering by `--target_regions_file` in bwa-mem | ||
|
|
||
| - `*.targeted.bedGraph` | ||
| - Methylation statuses in [bedGraph](http://genome.ucsc.edu/goldenPath/help/bedgraph.html) format, limited to the positions in the target regions BED file. | ||
|
|
@@ -254,6 +305,10 @@ BedGraph files are filtered using the BED file passed to `--target_regions_file` | |
| - `*.CollectHsMetrics.coverage_metrics` | ||
| - Text-based statistics showed also in the MultiQC report. | ||
|
|
||
|
|
||
| ### Rastair | ||
| # TODO | ||
|
|
||
| ### Bismark Reports | ||
|
|
||
| Bismark generates a HTML reports describing results for each sample, as well as a summary report for the whole run. | ||
|
|
||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
TODO