Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.

High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the...

Full description

Bibliographic Details
Main Authors: Sami Kanderian, Lingxia Jiang, Ivor Knight
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4659556?pdf=render
id doaj-6491620846374f0e8584055ff1c5054d
record_format Article
spelling doaj-6491620846374f0e8584055ff1c5054d2020-11-25T02:06:25ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-011011e014329510.1371/journal.pone.0143295Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.Sami KanderianLingxia JiangIvor KnightHigh Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst.We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately.Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability.We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument.http://europepmc.org/articles/PMC4659556?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Sami Kanderian
Lingxia Jiang
Ivor Knight
spellingShingle Sami Kanderian
Lingxia Jiang
Ivor Knight
Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
PLoS ONE
author_facet Sami Kanderian
Lingxia Jiang
Ivor Knight
author_sort Sami Kanderian
title Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
title_short Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
title_full Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
title_fullStr Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
title_full_unstemmed Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves.
title_sort automated classification and cluster visualization of genotypes derived from high resolution melt curves.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst.We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately.Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability.We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument.
url http://europepmc.org/articles/PMC4659556?pdf=render
work_keys_str_mv AT samikanderian automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves
AT lingxiajiang automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves
AT ivorknight automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves
_version_ 1724934090910597120