Skip to content

Releases: nf-cmgg/germline

v1.1.0 - Glorious Ghent

14 Mar 13:32
20d4fda
Compare
Choose a tag to compare

! WES support has been temporarily disabled, but is planned to be re-added in the next major release !

New Features

  1. Added support for samples that aren't part of a family. Just leave the ped and family_id input fields in the samplesheet empty for a sample to be treated like this. This sample will go through exactly the same workflow but will be emitted as a single-sample VCF.
  2. Added dump functionality to lots of channels.
  3. Added the dbsnp option to GATK HaplotypeCaller. use --dbsnp and --dbsnp_tbi to supply these VCFs.
  4. Added the vcf_extract_somalier subworkflow to the pipeline. This also creates PED files inferred from the input multi-sample VCF.
  5. Added a validation subworkflow. All files that have a VCF in the truth_vcf column of the input samplesheet will be validated against this VCF. This can be turned off by supplying the --validate false flag to the pipeline run.

Improvements

  1. Improved the scatter/gather logic. This is now done with goleft indexsplit to define chunks of even coverage. The genotyping scattering now happens with bedtools makewindows. This creates chunks of even regions from the merged BED files for the family. By passing a padding of about 20 bps to the genotype tools, we make sure all variants on the edges of these regions are also genotyped. Duplicates are removed later when running bcftools concat
  2. Refactored a lot of the code to maintain the same style over the whole pipeline.
  3. Updated the minimum Nextlow version to 22.10.5 to make sure S3 staging works perfectly.
  4. The post_processing subworklow has been renamed to the better suiting joint_genotyping subworkflow. reblockgvcf has been moved to germline_variant_calling and the filter and reheadering has been moved to the main workflow.
  5. Merging VCFs of the same family now happens with GATK GenomicsDBImport instead of GATK MergeGVCFs or bcftools merge. This gives more reliable results.
  6. Improved the handling of vcfanno
  7. The PED headers can now be added to all the output VCFs that are part of a family instead of only those that were given a PED file as input. The PED file used is created using somalier relate. This feature can be turned on using the --add_ped true argument. This doesn't happen by default.

Bug fixes

  1. Fixed some issues when both the ped and family_id were given for a sample.
  2. Fixed the PED input for rtgtools_pedfilter (-9 isn't recognized as unknown by the tool. Now these will be automatically converted to 0 before this tool)
  3. Fixed issues with the DBsnp index not being created correctly
  4. Fixed wrongly formed joins and added checks for mismatches and duplicates

v1.0.1 - Happy Hollebeke

07 Oct 14:51
38817d0
Compare
Choose a tag to compare

Changes

  • Upgraded to nf-core v2.6 template

Fixes

  • Fixed the ensemblvep version (was 104.1 before and is now 105.0)
  • Updated the label of gatk4/calibratedragstrmodel to process_high to match the requirements for bigger inputs

v1.0.0 - Beautiful Bruges

03 Oct 15:08
2295f15
Compare
Choose a tag to compare

The first full release of the pipeline!