Develop an N3C mapping metadata schema to support downstream reproducibility and analyses #37

Open
DaveraGabriel opened this issue Jun 10, 2020 · 3 comments
Labels
Harmonization & Analytics Issues which involve both Data Ingestion & Harmonization & Analytics workstreams

Comments

@DaveraGabriel
Collaborator

Per @hlehmann17 in a team meeting: Just a thought that this problem has the flavor of the derived data we discussed in the OMOP mapping, where I thought we agreed to bump derivations to the Palantir phase. The “indices” that Harold mentioned, as well as computable phenotypes and cohort definitions, need a place between DI&H and analysis.
Could we record a mapping type version or method along with the source data, that is, metadata about the maps? I am advocating for making all these definitions explicit and helping the analysts organize themselves for transparency, reproducibility, and reuse. The goal is not to build more data but to preserve traceability. We are doing a data harmonization step, and what we have done must be transparent and reproducible. So perhaps that is the key goal: can someone else verify and reproduce whatever we do to the data to get it ready for analysis? This is a subset of the derived data problem.

@DaveraGabriel
Collaborator Author

Melissa Haendel provided links to the A Simple Standard for Sharing Ontology Mappings (SSSOM) GitHub repository as a starting point and reference for N3C mapping metadata.

@DaveraGabriel
Collaborator Author

Maps created for the DI&H pipeline require versioning and change management. The version of the maps used to transform data payloads is required metadata for OMOP data sets shared with sites after DI&H processing.
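
As a rough illustration of what versioned map metadata could look like, here is a minimal Python sketch. Everything in it is an assumption made for illustration: the `MapVersionRecord` class, its field names, the `fingerprint` helper, and the example codes and version strings are hypothetical, and none of them come from SSSOM or from the actual DI&H pipeline. The idea is only that each map carries a version, a method, and a content hash, so that a shared data set can state exactly which maps produced it and someone else can verify that.

```python
from dataclasses import dataclass, asdict
import hashlib
import json


@dataclass(frozen=True)
class MapVersionRecord:
    """Hypothetical metadata record describing one version of a mapping table."""
    map_name: str           # illustrative identifier for the map
    map_version: str        # version label assigned under change management
    mapping_method: str     # how the map was produced (manual, automated, ...)
    source_data_model: str  # model the source codes come from
    target_data_model: str  # model the codes are mapped to
    content_sha256: str     # hash of the map rows, used to verify the content


def fingerprint(map_rows):
    """Return a SHA-256 hex digest of the map rows that ignores row order."""
    canonical = json.dumps(sorted(map_rows), separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def describe_map(name, version, method, source_model, target_model, map_rows):
    """Build the metadata record that would travel with a transformed data set."""
    return MapVersionRecord(
        map_name=name,
        map_version=version,
        mapping_method=method,
        source_data_model=source_model,
        target_data_model=target_model,
        content_sha256=fingerprint(map_rows),
    )


# Made-up rows: (source code, target code). Not real vocabulary content.
rows = [("LOCAL:001", "TARGET:A"), ("LOCAL:002", "TARGET:B")]
record = describe_map(
    "example-condition-map", "1.0.0", "manual-curation",
    "site-local", "OMOP", rows,
)
print(json.dumps(asdict(record), indent=2))
```

A record like this could be stored next to each transformed payload. Recomputing the hash from the map rows and comparing it with `content_sha256` would then give a simple check that the stated map version is the one that was actually applied.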

@DaveraGabriel
Collaborator Author

See also "Management and communication of CDM mapping heuristics" #17. This seems to be the same issue.

6 participants