Support multiple participant ID columns in tabular data #286
Labels
flag:discuss
Flag issue that needs to be discussed before it can be implemented.
someday
Not a priority right now, but we want to keep this around to think or discuss more.
We keep encountering TSVs that have a participant ID column without the
sub-
prefix required for BIDS subdirectories.This makes sense, as most likely the phenotypic/tabular data will have preceded the BIDS conversion step for most datasets.
Right now, the only option for these kinds of tabular data to work with the Neurobagel CLI is if the data owner adds another column for the BIDS participant IDs, or modifies their existing participant ID column. This is probably not ideal, and also means that we are sometimes creating extra work for data owners only if they have imaging data in addition to phenotypic data.
We should consider having the CLI handle this work by automatically prepending
sub-
onto a given participant ID column if the prefixes don't already exist, before doing the imaging-tabular subject matching check.The text was updated successfully, but these errors were encountered: