Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ligate won't treat offspring as scaffolded unless parents are still in data #108

Open
kkellysci opened this issue Aug 22, 2024 · 0 comments

Comments

@kkellysci
Copy link

kkellysci commented Aug 22, 2024

I am phasing a large (n=172k) sample of parents and offspring (some duos, some trios), but I only need the phased genotypes for the offspring.

I start by running the phasing jobs (with the chromosomes in chunks) using shapeit5 with --pedigree on a HPC cluster. The initial whole-sample output files are written to a local scratch filesystem, where they will be deleted immediately after the job finishes. I then use bcftools view -S to subset these results to just the offspring, and save that smaller results chunk on the cluster's network filesystem where files will persist past the end of the job.

After all jobs have finished running, I try to use ligate with the --pedigree flag on the offspring-only results chunks. Despite using --pedigree, it detects the offspring samples as non-scaffolded, haplotype order gets swapped, and sometimes chunks from the maternal and paternal haplotypes are incorrectly combined as if they were in phase.

Is the behavior of ligate for a file where 100% of the samples are scaffolded just the same as bcftools concat -a -d all, or would there still be a reason to prefer ligate? If there's still a reason to prefer ligate, then is there a way to get it to treat offspring as scaffolded (eg. refrain from swapping haplotypes around) even when parents are no longer in the data?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant