DNA base-calling using polynomial classifiers

  • Omniyah G. Mohammed
  • Khaled T. Assaleh
  • Ghaleb A. Husseini
  • Amin F. Majdalawieh
  • Scott R. Woodward
  • American University of Sharjah
  • Sorenson Molecular Genealogy Foundation

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

Base-calling is one of many problems that can be solved using pattern recognition, that is, classifying raw data into classes based on prior or statistical information extracted from the data. In this paper, we propose a new framework that uses polynomial classifiers to model electropherogram traces obtained from ABI sequencing machines in order to perform base-calling. First, pre-processing, which includes segmented normalization and peak sharpening, is performed to reduce imperfections in the trace caused by the sequencing chemistry. Discriminative feature vectors are then extracted from the chromatogram traces and expanded to a higher-dimensional space by second-order polynomial expansion. A linear classifier is then trained on the expanded vectors and used to classify the bases. The chromatogram traces chosen for analysis belong to Homo sapiens, Saccharomyces mikatae and Drosophila melanogaster. Simulation results indicated an accuracy of up to 99.2% when testing three different chromatogram traces of about 600 to 800 bases each. The proposed model's performance was compared to the existing standards, ABI and PHRED, in terms of insertion, deletion and substitution errors. The simulations indicated that the model performs comparably to or slightly better than ABI in terms of deletion and insertion errors. Moreover, the polynomial classifier produced negligible substitution errors compared to ABI. The polynomial classifier was also observed to perform comparably to PHRED in terms of deletion and substitution errors. The results demonstrate the potential of this model for base-calling.
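
The expansion and classification steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the feature vectors, the training criterion, or the pre-processing, so the sketch assumes a hypothetical four-value feature vector per peak (one intensity per dye channel), synthetic toy data, and a least-squares fit to one-hot targets as the linear classifier. The names `poly2_expand`, `train`, and `predict` are illustrative only.

```python
from itertools import combinations_with_replacement

import numpy as np

CLASSES = "ACGT"  # one class per base


def poly2_expand(X):
    """Second-order polynomial expansion of each row of X.

    Output columns: constant 1, every x_i, and every product x_i * x_j (i <= j).
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    cols = [np.ones(n)]
    cols += [X[:, i] for i in range(d)]
    cols += [X[:, i] * X[:, j]
             for i, j in combinations_with_replacement(range(d), 2)]
    return np.column_stack(cols)


def train(X, labels):
    """Fit linear weights on the expanded features (assumed criterion:
    least squares against one-hot class targets)."""
    P = poly2_expand(X)
    T = np.array([[1.0 if lab == c else 0.0 for c in CLASSES]
                  for lab in labels])
    W, *_ = np.linalg.lstsq(P, T, rcond=None)
    return W


def predict(W, X):
    """Call the base whose class score is largest for each feature vector."""
    scores = poly2_expand(X) @ W
    return "".join(CLASSES[k] for k in scores.argmax(axis=1))


# Synthetic toy data (an assumption, not real trace data): the channel of
# the true base carries a strong signal, the other channels a weak one.
rng = np.random.default_rng(0)
labels = "".join(rng.choice(list(CLASSES), size=200))
X = 0.04 * rng.random((200, 4))
for row, base in enumerate(labels):
    X[row, CLASSES.index(base)] += 1.0

W = train(X, labels)
called = predict(W, X)
accuracy = sum(a == b for a, b in zip(called, labels)) / len(labels)
print(f"training accuracy on toy data: {accuracy:.3f}")
```

With four input features, the expansion yields 15 terms (1 constant, 4 linear, 10 second-order), so the classifier stays linear in its weights even though the decision boundary is quadratic in the original features. Real traces would need the pre-processing and feature extraction described in the abstract before this step.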

Original language: English
Title of host publication: 2010 IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Print): 9781424469178
DOIs
State: Published - 2010
Externally published: Yes
Event: 2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010 - Barcelona, Spain
Duration: 18 Jul 2010 to 23 Jul 2010

Publication series

Name: Proceedings of the International Joint Conference on Neural Networks

Conference

Conference: 2010 6th IEEE World Congress on Computational Intelligence, WCCI 2010 - 2010 International Joint Conference on Neural Networks, IJCNN 2010
Country/Territory: Spain
City: Barcelona
Period: 18/07/10 to 23/07/10
