You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: harpy/reports/align_stats.qmd
+57-31
Original file line number
Diff line number
Diff line change
@@ -395,7 +395,6 @@ rm(hs_mult)
395
395
rm(valids)
396
396
rm(invalids)
397
397
rm(non_singletons)
398
-
#knitr::knit_exit()
399
398
```
400
399
```{r imports}
401
400
covfile <- params$coverage
@@ -484,7 +483,7 @@ list(
484
483
485
484
```{r}
486
485
#| content: valuebox
487
-
#| title: "Mol. Average Depth"
486
+
#| title: "Linked Depth"
488
487
list(
489
488
color = "#d4d4d4",
490
489
value = round(mol_global_avg,2)
@@ -493,7 +492,7 @@ list(
493
492
494
493
```{r}
495
494
#| content: valuebox
496
-
#| title: "Mol. Stdev Depth"
495
+
#| title: "Stdev Linked Depth"
497
496
list(
498
497
color = "#d4d4d4",
499
498
value = round(mol_global_sd,2)
@@ -503,7 +502,8 @@ list(
503
502
## Distdesc header
504
503
::: {.card title="Alignment Depth Distribution"}
505
504
These are the frequencies of interval coverage across all **`r windowskb` kilobase** intervals for all contigs.
506
-
For visual clarity, the distribution of alignment-depths (not molecule depths) is truncated at the 99% quantile, which is **`r q99`** for the alignment data.
505
+
For visual clarity, the distributions are truncated at the 99% quantile, which is **`r q99`** for the alignments
BX barcode validity is classified into one of three categories:
760
763
761
-
valid BX
762
-
: a complete BX barcode was present in the read (i.e. no 00 for any segments)
764
+
Valid
765
+
: A complete BX barcode was present in the read (i.e. no 00 for any segments)
766
+
767
+
Invalid
768
+
: A barcode was present in the read, but it contained 00 in at least one of the barcode segments
769
+
770
+
Missing
771
+
: There is no barcode in the read. For technical reasons this is usually equivalent to `invalid`
772
+
773
+
Linked-read data is specific for the definition of a "molecule":
774
+
775
+
Unique/Inferred Molecules
776
+
: Given linked-read barcode information, the original piece of DNA from which the sequenced fragments are considered to originate from.
777
+
778
+
Inferred Sequence
779
+
: While somewhat similar to "inferred molecule", the inferred sequence describes the original DNA fragment that was put on the sequencer. If the fragment was longer than the sequencer could fully sequence, e.g. 400bp fragment and the sequencer can only sequence 300bp, then the inferred sequence is 400bp long, even though only 300bp are represented in the sequence data. If the entire fragment was sequenced, then the inferred length and sequence lengths should be identical.
780
+
781
+
There are several kinds of "coverage" when working with linked-read data:
782
+
783
+
Aligned Depth/Coverage
784
+
: The standard interpretation of depth comparing the number of aligned base-pairs to the genome or contigs
785
+
786
+
Molecule Coverage
787
+
: The coverage breadth or depth of sequences onto _unique molecules_ (rather than the genome), as inferred from linked-read barcodes
788
+
789
+
Linked Depth/Coverage
790
+
: The coverage breadth or depth that _includes unsequenced gaps between linked sequences_ that are associated with a single unique molecule. For example, if two 300bp paired-end reads share the same barcode and map 2000bp apart, the calculation *includes* the 1400bp between the sequences as if they were present. This is similar to _inferred sequences_ described above, except spanning across linked sequences rather than within a paired-end sequence.
763
791
764
-
invalid BX
765
-
: a barcode was present in the read, but it contained 00 in at least one of the barcode segments
0 commit comments