Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.

In single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections...

Full description

Bibliographic Details
Main Authors: Yaofang Xu, Jiayi Wu, Chang-Cheng Yin, Youdong Mao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC5154524?pdf=render
id doaj-d871b6309c7a4b92935d9a98b37351a9
record_format Article
spelling doaj-d871b6309c7a4b92935d9a98b37351a92020-11-25T02:47:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-011112e016776510.1371/journal.pone.0167765Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.Yaofang XuJiayi WuChang-Cheng YinYoudong MaoIn single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections of distinct orientations. Due to background noise in single-particle images and uncertainty of molecular orientations, traditional K-means clustering algorithm may classify images into wrong classes and produce classes with a large variation in membership. Overcoming these limitations requires further development on clustering algorithms for cryo-EM data analysis. We propose a novel unsupervised data clustering method building upon the traditional K-means algorithm. By introducing an adaptive constraint term in the objective function, our algorithm not only avoids a large variation in class sizes but also produces more accurate data clustering. Applications of this approach to both simulated and experimental cryo-EM data demonstrate that our algorithm is a significantly improved alterative to the traditional K-means algorithm in single-particle cryo-EM analysis.http://europepmc.org/articles/PMC5154524?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Yaofang Xu
Jiayi Wu
Chang-Cheng Yin
Youdong Mao
spellingShingle Yaofang Xu
Jiayi Wu
Chang-Cheng Yin
Youdong Mao
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
PLoS ONE
author_facet Yaofang Xu
Jiayi Wu
Chang-Cheng Yin
Youdong Mao
author_sort Yaofang Xu
title Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
title_short Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
title_full Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
title_fullStr Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
title_full_unstemmed Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
title_sort unsupervised cryo-em data clustering through adaptively constrained k-means algorithm.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2016-01-01
description In single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections of distinct orientations. Due to background noise in single-particle images and uncertainty of molecular orientations, traditional K-means clustering algorithm may classify images into wrong classes and produce classes with a large variation in membership. Overcoming these limitations requires further development on clustering algorithms for cryo-EM data analysis. We propose a novel unsupervised data clustering method building upon the traditional K-means algorithm. By introducing an adaptive constraint term in the objective function, our algorithm not only avoids a large variation in class sizes but also produces more accurate data clustering. Applications of this approach to both simulated and experimental cryo-EM data demonstrate that our algorithm is a significantly improved alterative to the traditional K-means algorithm in single-particle cryo-EM analysis.
url http://europepmc.org/articles/PMC5154524?pdf=render
work_keys_str_mv AT yaofangxu unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm
AT jiayiwu unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm
AT changchengyin unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm
AT youdongmao unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm
_version_ 1724751896632098816