Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merging template updates 2.1 #56

Merged
merged 74 commits into from
Aug 25, 2021
Merged
Changes from 1 commit
Commits
Show all changes
74 commits
Select commit Hold shift + click to select a range
8e43acd
Template update for nf-core/tools version 1.12
nf-core-bot Nov 19, 2020
2e7540d
Template update for nf-core/tools version 1.12.1
nf-core-bot Dec 3, 2020
225a827
Template update for nf-core/tools version 1.13
nf-core-bot Mar 18, 2021
be7bbe1
Template update for nf-core/tools version 1.13.1
nf-core-bot Mar 19, 2021
ceebac0
Template update for nf-core/tools version 1.13.2
nf-core-bot Mar 23, 2021
0f9ba31
Template update for nf-core/tools version 1.13.3
nf-core-bot Mar 25, 2021
b4b1d3a
Template update for nf-core/tools version 1.14
nf-core-bot May 11, 2021
fc85b5d
Template update for nf-core/tools version 2.0
KevinMenden Jul 13, 2021
db29c73
Template update for nf-core/tools version 2.0.1
nf-core-bot Jul 13, 2021
e19ae65
Template update for nf-core/tools version 2.1
nf-core-bot Jul 27, 2021
798cea3
barebone template update 2.1
d4straub Aug 2, 2021
e578b4f
fix CHANGELOG.md
d4straub Aug 2, 2021
239a48f
fix linting
d4straub Aug 2, 2021
45cc713
update description and README
d4straub Aug 2, 2021
b6d1a8f
update schema and version
d4straub Aug 3, 2021
5afb220
change samplesheet to csv and fix parsing
d4straub Aug 3, 2021
2cc2e70
fix linting
d4straub Aug 3, 2021
335e175
add skewer
d4straub Aug 3, 2021
b513931
fix linting
d4straub Aug 3, 2021
c64afa0
add more modules
d4straub Aug 4, 2021
0941b47
from samplesheet csv to unicycler channels all fine
d4straub Aug 5, 2021
4034bcf
add skewer version
d4straub Aug 6, 2021
b7cee69
add prokka, kraken2 and quast
d4straub Aug 6, 2021
6c89976
fix linting
d4straub Aug 6, 2021
90392bd
samplesheet from tsv to csv again
d4straub Aug 6, 2021
8f72103
use nf-core prokka module
d4straub Aug 6, 2021
f72bc41
add dfast
d4straub Aug 6, 2021
090f175
modify test profiles
d4straub Aug 9, 2021
bfa5b42
create assembly channel
d4straub Aug 9, 2021
008f01e
fix test_long_miniasm
d4straub Aug 9, 2021
0d26ae8
add canu but test fails because reads do not overlap
d4straub Aug 9, 2021
e92471f
add miniasm assembly
d4straub Aug 9, 2021
b6fc4d7
canu test_long: bacass_long_miniasm.csv instead bacass_long.csv
d4straub Aug 9, 2021
94a0f7d
add consensus (racon) for miniasm
d4straub Aug 9, 2021
6644546
fix racon version output
d4straub Aug 9, 2021
931c9c4
fix test_long
d4straub Aug 13, 2021
a6e62a6
add medaka
d4straub Aug 13, 2021
d00f08b
add nanopolish
d4straub Aug 16, 2021
a0df16f
adjust resources
d4straub Aug 16, 2021
00bc5fd
fix test_long_miniasm
d4straub Aug 16, 2021
7d3fc0a
activate medaka for test_long_miniasm
d4straub Aug 16, 2021
5b91536
fix miniasm assembly
d4straub Aug 17, 2021
08fcfe8
fix kraken2 and activate for test_long_miniasm
d4straub Aug 17, 2021
206c6e6
adjust output folders
d4straub Aug 17, 2021
82ba5a9
update CHANGELOG
d4straub Aug 18, 2021
71e42db
fix kraken2 input
d4straub Aug 18, 2021
df6a5bd
fix kraken2 output
d4straub Aug 18, 2021
aefcd1e
fix kraken2 output 2
d4straub Aug 18, 2021
e3a071f
fix linting
d4straub Aug 18, 2021
54ead7f
refine output folders
d4straub Aug 18, 2021
05b54d0
add change in --krakendb
d4straub Aug 19, 2021
1f8718c
add zenodo doi
d4straub Aug 19, 2021
a58fd8e
change dfast output to folder DFAST
d4straub Aug 19, 2021
05ac341
update CITATTIONS
d4straub Aug 19, 2021
5091db8
update README
d4straub Aug 19, 2021
d6b9840
update output docs
d4straub Aug 19, 2021
34fc917
remove TODO
d4straub Aug 19, 2021
ad96a76
fix dfast
d4straub Aug 19, 2021
9c0b316
fix picoqc
d4straub Aug 19, 2021
911186a
fix linting and pycoqc
d4straub Aug 19, 2021
097938e
update output with medaka and nanopolish
d4straub Aug 19, 2021
5ecd08e
remove samtools files from results folder
d4straub Aug 19, 2021
e7b9ec2
update parameter descriptions
d4straub Aug 19, 2021
545763a
remove parallel from nanopolish
d4straub Aug 20, 2021
6c6f01a
fix pycoqc
d4straub Aug 20, 2021
787ae45
amend output and update usage
d4straub Aug 20, 2021
1df85e5
remove tabs
d4straub Aug 20, 2021
8ba97d6
fix nanopolish input
d4straub Aug 20, 2021
ae2a20f
fix nanopolish input
d4straub Aug 20, 2021
1455d9d
remove unused samplesheet check and solve some TODOs
d4straub Aug 20, 2021
61df69d
remove any connection to igenomes
d4straub Aug 23, 2021
a1baa68
add full size test dataset
d4straub Aug 23, 2021
97a1ea8
removed unused parameter from test_long
d4straub Aug 23, 2021
002fcd0
ignore igenomes_base to avoid frequent warning with institutional pro…
d4straub Aug 23, 2021
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
update description and README
d4straub committed Aug 2, 2021
commit 45cc7132342e44be1a75debd7e94ccd544280631
22 changes: 14 additions & 8 deletions README.md
Original file line number Diff line number Diff line change
@@ -3,7 +3,7 @@
[![GitHub Actions CI Status](https://github.com/nf-core/bacass/workflows/nf-core%20CI/badge.svg)](https://github.com/nf-core/bacass/actions?query=workflow%3A%22nf-core+CI%22)
[![GitHub Actions Linting Status](https://github.com/nf-core/bacass/workflows/nf-core%20linting/badge.svg)](https://github.com/nf-core/bacass/actions?query=workflow%3A%22nf-core+linting%22)
[![AWS CI](https://img.shields.io/badge/CI%20tests-full%20size-FF9900?labelColor=000000&logo=Amazon%20AWS)](https://nf-co.re/bacass/results)
[![Cite with Zenodo](http://img.shields.io/badge/DOI-10.5281/zenodo.XXXXXXX-1073c8?labelColor=000000)](https://doi.org/10.5281/zenodo.XXXXXXX)
[![Cite with Zenodo](http://img.shields.io/badge/DOI-10.5281/zenodo.2669428-1073c8?labelColor=000000)](https://doi.org/10.5281/zenodo.2669428)

[![Nextflow](https://img.shields.io/badge/nextflow%20DSL2-%E2%89%A521.04.0-23aa62.svg?labelColor=000000)](https://www.nextflow.io/)
[![run with conda](http://img.shields.io/badge/run%20with-conda-3EB049?labelColor=000000&logo=anaconda)](https://docs.conda.io/en/latest/)
@@ -16,8 +16,7 @@

## Introduction

<!-- TODO nf-core: Write a 1-2 sentence summary of what data the pipeline is for and what it does -->
**nf-core/bacass** is a bioinformatics best-practice analysis pipeline for Simple bacterial assembly and annotation pipeline..
**nf-core/bacass** is a bioinformatics best-practice analysis pipeline for simple bacterial assembly and annotation. The pipeline is able to assemble short reads, long reads, or a mixture of short and long reads (hybrid assembly).

The pipeline is built using [Nextflow](https://www.nextflow.io), a workflow tool to run tasks across multiple compute infrastructures in a very portable manner. It uses Docker/Singularity containers making installation trivial and results highly reproducible. The [Nextflow DSL2](https://www.nextflow.io/docs/latest/dsl2.html) implementation of this pipeline uses one container per process which makes it much easier to maintain and update software dependencies. Where possible, these processes have been submitted to and installed from [nf-core/modules](https://github.com/nf-core/modules) in order to make them available to all nf-core pipelines, and to everyone within the Nextflow community!

@@ -26,10 +25,18 @@ On release, automated continuous integration tests run the pipeline on a full-si

## Pipeline summary

<!-- TODO nf-core: Fill in short bullet-pointed list of the default steps in the pipeline -->
### Short Read Assembly

1. Read QC ([`FastQC`](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/))
2. Present QC for raw reads ([`MultiQC`](http://multiqc.info/))
This pipeline is primarily for bacterial assembly of next-generation sequencing reads. It can be used to quality trim your reads using [Skewer](https://github.com/relipmoc/skewer) and performs basic sequencing QC using [FastQC](https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). Afterwards, the pipeline performs read assembly using [Unicycler](https://github.com/rrwick/Unicycler). Contamination of the assembly is checked using [Kraken2](https://ccb.jhu.edu/software/kraken2/) to verify sample purity.

### Long Read Assembly

For users that only have Nanopore data, the pipeline quality trims these using [PoreChop](https://github.com/rrwick/Porechop) and assesses basic sequencing QC utilizing [NanoPlot](https://github.com/wdecoster/NanoPlot) and [PycoQC](https://github.com/a-slide/pycoQC).
The pipeline can then perform long read assembly utilizing [Unicycler](https://github.com/rrwick/Unicycler), [Miniasm](https://github.com/lh3/miniasm) in combination with [Racon](https://github.com/isovic/racon) or [Canu](https://github.com/marbl/canu). Long reads can be polished using specified Fast5 files with [NanoPolish](https://github.com/jts/nanopolish).

### Hybrid Assembly

For users specifying both short read and long read (NanoPore) data, the pipeline can perform a hybrid assembly approach utilizing [Unicycler](https://github.com/rrwick/Unicycler), taking the full set of information from short reads and long reads into account.

## Quick Start

@@ -75,8 +82,7 @@ For further information or help, don't hesitate to get in touch on the [Slack `#

## Citations

<!-- TODO nf-core: Add citation for pipeline after first release. Uncomment lines below and update Zenodo doi and badge at the top of this file. -->
<!-- If you use nf-core/bacass for your analysis, please cite it using the following doi: [10.5281/zenodo.XXXXXX](https://doi.org/10.5281/zenodo.XXXXXX) -->
If you use nf-core/bacass for your analysis, please cite it using the following doi: [10.5281/zenodo.2669428](https://doi.org/10.5281/zenodo.2669428)

<!-- TODO nf-core: Add bibliography of tools and data used in your pipeline -->
An extensive list of references for the tools used by the pipeline can be found in the [`CITATIONS.md`](CITATIONS.md) file.
2 changes: 1 addition & 1 deletion assets/email_template.html
Original file line number Diff line number Diff line change
@@ -4,7 +4,7 @@
<meta http-equiv="X-UA-Compatible" content="IE=edge">
<meta name="viewport" content="width=device-width, initial-scale=1">

<meta name="description" content="nf-core/bacass: Simple bacterial assembly and annotation pipeline.">
<meta name="description" content="nf-core/bacass: Simple bacterial assembly and annotation">
<title>nf-core/bacass Pipeline Report</title>
</head>
<body>
50 changes: 50 additions & 0 deletions modules/local/abc.nf
Original file line number Diff line number Diff line change
@@ -0,0 +1,50 @@
// Import generic module functions
include { initOptions; saveFiles; getSoftwareName } from './functions'

params.options = [:]
options = initOptions(params.options)

process DADA2_FILTNTRIM {
tag "$meta.id"
label 'process_medium'
publishDir "${params.outdir}",
mode: params.publish_dir_mode,
saveAs: { filename -> saveFiles(filename:filename, options:params.options, publish_dir:getSoftwareName(task.process), publish_id:'') }

conda (params.enable_conda ? "bioconductor-dada2=1.18.0" : null)
if (workflow.containerEngine == 'singularity' && !params.singularity_pull_docker_container) {
container "https://depot.galaxyproject.org/singularity/bioconductor-dada2:1.18.0--r40h5f743cb_0"
} else {
container "quay.io/biocontainers/bioconductor-dada2:1.18.0--r40h5f743cb_0"
}

input:
tuple val(meta), path(reads), val(trunclenf), val(trunclenr)

output:
tuple val(meta), path("*.filter_stats.tsv"), emit: log
tuple val(meta), path("*.filt.fastq.gz") , emit: reads
path "*.version.txt" , emit: version
path "*.args.txt" , emit: args

script:
def software = getSoftwareName(task.process)
def in_and_out = meta.single_end ? "\"${reads}\", \"${meta.id}.filt.fastq.gz\"" : "\"${reads[0]}\", \"${meta.id}_1.filt.fastq.gz\", \"${reads[1]}\", \"${meta.id}_2.filt.fastq.gz\""
def trunclenf = trunclenf[1].toInteger()
def trunclenr = trunclenr[1].toInteger()
def trunc_args = meta.single_end ? "truncLen = $trunclenf" : "truncLen = c($trunclenf, $trunclenr)"
"""
#!/usr/bin/env Rscript
suppressPackageStartupMessages(library(dada2))
out <- filterAndTrim($in_and_out,
$trunc_args,
$options.args,
compress = TRUE,
multithread = $task.cpus,
verbose = TRUE)
out <- cbind(out, ID = row.names(out))
write.table( out, file = "${meta.id}.filter_stats.tsv", sep = "\t", row.names = FALSE, quote = FALSE)
write.table(paste('filterAndTrim\t$trunc_args','$options.args',sep=","), file = "filterAndTrim.args.txt", row.names = FALSE, col.names = FALSE, quote = FALSE)
write.table(packageVersion("dada2"), file = "${software}.version.txt", row.names = FALSE, col.names = FALSE, quote = FALSE)
"""
}
2 changes: 1 addition & 1 deletion nextflow.config
Original file line number Diff line number Diff line change
@@ -158,7 +158,7 @@ manifest {
name = 'nf-core/bacass'
author = 'Andreas Wilm, Alexander Peltzer'
homePage = 'https://github.com/nf-core/bacass'
description = 'Simple bacterial assembly and annotation pipeline.'
description = 'Simple bacterial assembly and annotation'
mainScript = 'main.nf'
nextflowVersion = '!>=21.04.0'
version = '1.1.1'
2 changes: 1 addition & 1 deletion nextflow_schema.json
Original file line number Diff line number Diff line change
@@ -2,7 +2,7 @@
"$schema": "http://json-schema.org/draft-07/schema",
"$id": "https://raw.githubusercontent.com/nf-core/bacass/master/nextflow_schema.json",
"title": "nf-core/bacass pipeline parameters",
"description": "Simple bacterial assembly and annotation pipeline.",
"description": "Simple bacterial assembly and annotation",
"type": "object",
"definitions": {
"input_output_options": {