You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I'm new to SHAPEIT and am trying to use it to phase my array data.
Initially I installed SHAPEIT5 from conda on a docker image and ran phase_common on a few chromosomes already (such as chr1, chr19, chr22) using AWS instances.
Documentation says that for running on chromosome X the only difference from the other chromosomes is the presence of a text file containing the haploid samples with --haploids argument. Despite that, I haven't been able to run phasing on chromosome X. Describing my first test:
Docker image with installation from bioconda;
Instance has 32 Gb memory and 4 vcpus;
I am using 1000G reference panel, pre-processed with bcftools:
Remove CNVs/SVs;
Keep only PASS variants;
Left-normalize and decompose multiallelic;
Calculate tags AC and AN.
Male samples from the reference panel were also added to the haploids.txt file.
Input vcf is the Dragen Array vcf output files merged to get a multisample data from 24 samples and also processed for left-normalization, decompose and tags AC+AN with bcftools.
I think I may be missing some silly detail... Ideally I would get the result for the full chromosome to get through with imputation... So I ran out of ideas. Can you help me? Thank you!
The text was updated successfully, but these errors were encountered:
Regions X:1-27440787, X:24026048-50656185 and X:47743520-88764754 went straight to that same segmentation fault error.
The others, X:78359646-115258184, X:112836207-139441710 and X:137290831-1000000000 returned sites removed from the main panel because they are not on reference panel and so had no variants to be phased:
Reading genotype data:
* VCF/BCF scanning done (0.06s)
+ Variants [#sites=0 / region=X:137290831-1000000000]
- 4762 sites removed in main panel [not in reference panel]
ERROR: No variants to be **phased!**
All those chunks do have variants in the input file.
Hello! I'm new to SHAPEIT and am trying to use it to phase my array data.
Initially I installed SHAPEIT5 from conda on a docker image and ran
phase_common
on a few chromosomes already (such as chr1, chr19, chr22) using AWS instances.Documentation says that for running on chromosome X the only difference from the other chromosomes is the presence of a text file containing the haploid samples with
--haploids
argument. Despite that, I haven't been able to run phasing on chromosome X. Describing my first test:bcftools
:haploids.txt
file.bcftools
.Resulting log:
I don't believe it is a memory issue, since chromosome 1 and the others were processed on the same conditions without any problems.
I tried many solutions already... some of them:
haploids.txt
containing only haploids for the main input file.None of the above worked. I could however get a result when running on one or two positions... This worked:
I think I may be missing some silly detail... Ideally I would get the result for the full chromosome to get through with imputation... So I ran out of ideas. Can you help me? Thank you!
The text was updated successfully, but these errors were encountered: