Skip to content

Python bioinformatics project to detect poor quality sequencing cycles and deduce the identity of identified bases.

Notifications You must be signed in to change notification settings

n-shenoy/identify-poor-quality-sequencing-cycles-and-bases

Repository files navigation

Identifying Poor Quality Sequencing Cycles and Corresponding Bases from DNA Sequencing Reads

Navami Shenoy Dec 27, 2021

This Python project focuses on identifying poor quality sequencing cycles and deducing the corresponding unidentified bases (i.e. bases reported as 'N' during the sequencer reads). The raw sequence data used here has been sourced from Ajay et al (2011) and contains the first 1000 reads from the whole-genome sequence derived from the blood sample of a human male individual.

Jupyter Notebook:

http://localhost:8888/lab/tree/OneDrive/Documents/GitHub/identify-poor-quality-sequencing-cycles-and-bases/Identifying%20Poor%20Quality%20Sequencing%20Cycles%20and%20Bases.ipynb

Reference:

  1. Ajay, S. S., Parker, S. C., Abaan, H. O., Fajardo, K. V. F., & Margulies, E. H. (2011). Accurate and comprehensive sequencing of personal genomes. Genome research, 21(9), 1498-1505.

About

Python bioinformatics project to detect poor quality sequencing cycles and deduce the identity of identified bases.

Topics

Resources

Stars

Watchers

Forks