|
| 1 | +# The validation context |
| 2 | + |
| 3 | +The core structure of the validator is the `context`, |
| 4 | +a namespace that aggregates properties of the dataset (the `dataset` variable, above) |
| 5 | +and the current file being validated. |
| 6 | + |
| 7 | +Its type can be described as follows: |
| 8 | + |
| 9 | +```typescript |
| 10 | +Context: { |
| 11 | + // Dataset properties |
| 12 | + dataset: { |
| 13 | + dataset_description: object |
| 14 | + datatypes: string[] |
| 15 | + modalities: string[] |
| 16 | + // Lists of subjects as discovered in different locations |
| 17 | + subjects: { |
| 18 | + sub_dirs: string[] |
| 19 | + participant_id: string[] |
| 20 | + phenotype: string[] |
| 21 | + } |
| 22 | + } |
| 23 | + |
| 24 | + // Properties of the current subject |
| 25 | + subject: { |
| 26 | + // Lists of sessions as discovered in different locations |
| 27 | + sessions: { |
| 28 | + ses_dirs: string[] |
| 29 | + session_id: string[] |
| 30 | + phenotype: string[] |
| 31 | + } |
| 32 | + } |
| 33 | + |
| 34 | + // Path properties |
| 35 | + path: string |
| 36 | + entities: object |
| 37 | + datatype: string |
| 38 | + suffix: string |
| 39 | + extension: string |
| 40 | + // Inferred property |
| 41 | + modality: string |
| 42 | + |
| 43 | + // Inheritance principle constructions |
| 44 | + sidecar: object |
| 45 | + associations: { |
| 46 | + // Paths and properties of files associated with the current file |
| 47 | + aslcontext: { path: string, n_rows: integer, volume_type: string[] } |
| 48 | + ... |
| 49 | + } |
| 50 | + |
| 51 | + // Content properties |
| 52 | + size: integer |
| 53 | + |
| 54 | + // File type-specific content properties |
| 55 | + columns: object |
| 56 | + gzip: object |
| 57 | + json: object |
| 58 | + nifti_header: object |
| 59 | + ome: object |
| 60 | + tiff: object |
| 61 | +} |
| 62 | +``` |
| 63 | + |
| 64 | +To take an example, in a minimal dataset containing only a single subject's T1-weighted image, |
| 65 | +the context for that image might be: |
| 66 | + |
| 67 | +```yaml |
| 68 | +dataset: |
| 69 | + dataset_description: |
| 70 | + Name: "Example dataset" |
| 71 | + BIDSVersion: "1.10.0" |
| 72 | + DatasetType: "raw" |
| 73 | + datatypes: ["anat"] |
| 74 | + modalities: ["mri"] |
| 75 | + subjects: |
| 76 | + sub_dirs: ["sub-01"] |
| 77 | + participant_id: null |
| 78 | + phenotype: null |
| 79 | + |
| 80 | +subject: |
| 81 | + sessions: { ses_dirs: null, session_id: null, phenotype: null } |
| 82 | + |
| 83 | +path: "/sub-01/anat/sub-01_T1w.nii.gz" |
| 84 | +entities: |
| 85 | + subject: "01" |
| 86 | +datatype: "anat" |
| 87 | +suffix: "T1w" |
| 88 | +extension: ".nii.gz" |
| 89 | +modality: "mri" |
| 90 | + |
| 91 | +sidecar: |
| 92 | + MagneticFieldStrength: 3 |
| 93 | + ... |
| 94 | +associations: {} |
| 95 | + |
| 96 | +size: 22017017 |
| 97 | +nifti_header: |
| 98 | + dim: 3 |
| 99 | + voxel_sizes: [1, 1, 1] |
| 100 | + ... |
| 101 | +``` |
| 102 | + |
| 103 | +Fields from this context can be queried using object dot notation. |
| 104 | +For example, `sidecar.MagneticFieldStrengh` has the integer value `3`, |
| 105 | +and `entities.subject` has the string value `"01"`. |
| 106 | +This permits the use of boolean expressions, such as |
| 107 | +`sidecar.RepetitionTime == nifti_header.pixdim[4]`. |
| 108 | + |
| 109 | +As the validator validates each file in turn, it constructs a new context. |
| 110 | +The `dataset` property remains constant, |
| 111 | +while a new `subject` property is constructed when inspecting a new subject directory, |
| 112 | +and the remaining properties are constructed for each file, individually. |
| 113 | + |
| 114 | +## Context definition |
| 115 | + |
| 116 | +The validation context is largely dictated by the [schema], |
| 117 | +and the full type generated from the schema definition can be found in |
| 118 | +[jsr:@bids/schema/context](https://jsr.io/@bids/schema/doc/context/~/Context). |
| 119 | + |
| 120 | +## Context construction |
| 121 | + |
| 122 | +The construction of a validation context is where BIDS concepts are implemented. |
| 123 | +Again, this is easiest to explain with pseudocode: |
| 124 | + |
| 125 | +```python |
| 126 | +def buildFileContext(dataset, file): |
| 127 | + context = namespace() |
| 128 | + context.dataset = dataset |
| 129 | + context.path = file.path |
| 130 | + context.size = file.size |
| 131 | + |
| 132 | + fileParts = parsePath(file.path) |
| 133 | + context.entities = fileParts.entities |
| 134 | + context.datatype = fileParts.datatype |
| 135 | + context.suffix = fileParts.suffix |
| 136 | + context.extension = fileParts.extension |
| 137 | + |
| 138 | + context.subject = buildSubjectContext(dataset, context.entities.subject) |
| 139 | + |
| 140 | + context.sidecar = loadSidecar(file) |
| 141 | + context.associations = namespace({ |
| 142 | + association: loadAssociation(file, association) |
| 143 | + for association in associationTypes(file) |
| 144 | + }) |
| 145 | + |
| 146 | + if isTSV(file): |
| 147 | + context.columns = loadColumns(file) |
| 148 | + if isNIfTI(file): |
| 149 | + context.nifti_header = loadNiftiHeader(file) |
| 150 | + ... # And so on |
| 151 | + |
| 152 | + return context |
| 153 | +``` |
| 154 | + |
| 155 | +The heavy lifting is done in `parsePath`, `loadSidecar` and `loadAssociation`. |
| 156 | +`parsePath` is relatively simple, but `loadSidecar` and `loadAssociation` |
| 157 | +implement the BIDS [Inheritance Principle]. |
| 158 | + |
| 159 | +[Inheritance Principle]: https://bids-specification.readthedocs.io/en/stable/common-principles.html#the-inheritance-principle |
0 commit comments