Skip to content

Files

Latest commit

4ecf576 · Apr 24, 2022

History

History
11 lines (8 loc) · 903 Bytes

File metadata and controls

11 lines (8 loc) · 903 Bytes

Identifying Poor Quality Sequencing Cycles and Corresponding Bases from DNA Sequencing Reads

Navami Shenoy Dec 27, 2021

This Python project focuses on identifying poor quality sequencing cycles and deducing the corresponding unidentified bases (i.e. bases reported as 'N' during the sequencer reads). The raw sequence data used here has been sourced from Ajay et al (2011) and contains the first 1000 reads from the whole-genome sequence derived from the blood sample of a human male individual.

Jupyter Notebook:

http://localhost:8888/lab/tree/OneDrive/Documents/GitHub/identify-poor-quality-sequencing-cycles-and-bases/Identifying%20Poor%20Quality%20Sequencing%20Cycles%20and%20Bases.ipynb

Reference:

  1. Ajay, S. S., Parker, S. C., Abaan, H. O., Fajardo, K. V. F., & Margulies, E. H. (2011). Accurate and comprehensive sequencing of personal genomes. Genome research, 21(9), 1498-1505.