Skip to content

ClinSig changes for some halpotypes in join_data.R - become wrong! #46

@raymond301

Description

@raymond301

This is the result of join_data.R for multi

chrom pos ref alt measureset_type measureset_id rcv allele_id symbol hgvs_c clinical_significance clinical_significance_ordered pathogenic benign conflicted
2 166277030 T G Haplotype 30359 RCV000023304 39315 SCN9A NM_002977.3:c.2794A>C Benign/Likely benign Pathogenic 0 1 0

It's wrong: NM_002977.3(SCN9A):c.[2794A>C;2971G>T] – Haplotype

Looking at each step:

clinvar_table_raw.multi.tsv - Correct
clinvar_table_normalized.multi.tsv.gz - Correct
clinvar_allele_trait_pairs.multi.tsv.gz - Correct
clinvar_alleles_grouped.multi.tsv.gz - Correct
clinvar_alleles_combined.multi.tsv.gz - WRONG

There are 20 halpotype variants with this same issue:

measureset_type measureset_id rcv allele_id
Haplotype 1631 RCV000001698 16670
Haplotype 5706 RCV000006060 20745
Haplotype 5813 RCV000006169 20852
Haplotype 7239 RCV000007661 22244
Haplotype 13065 RCV000013940 28104
Haplotype 13399 RCV000014336;RCV000014337 28436
Haplotype 16318 RCV000017711;RCV000201276 31357
Haplotype 16876;16877 RCV000018372;RCV000018373 31916
Haplotype 4297 RCV000004533;RCV000004534;RCV000004535;RCV000004536 38384
Haplotype 4297 RCV000004533;RCV000004534;RCV000004535;RCV000004536 38385
Haplotype 4816 RCV000005085 38434
Haplotype 9398 RCV000010000 38447
Haplotype 9407 RCV000010010 38448
Haplotype 16318;217371 RCV000017711;RCV000201276;RCV000201278 38476
Haplotype 30359 RCV000023304 39315
Haplotype 38571 RCV000021985 46849
Haplotype 402236 RCV000454199 98655
Haplotype 218894 RCV000203245 137950
Haplotype 225143 RCV000210779 227037
Haplotype 188053 RCV000167863 255673

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions