Identifying Poor Quality Sequencing Cycles and Corresponding Bases from DNA Sequencing Reads

Navami Shenoy Dec 27, 2021

This Python project focuses on identifying poor quality sequencing cycles and deducing the corresponding unidentified bases (i.e. bases reported as 'N' during the sequencer reads). The raw sequence data used here has been sourced from Ajay et al (2011) and contains the first 1000 reads from the whole-genome sequence derived from the blood sample of a human male individual.