Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.
In single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2016-01-01
|
Series: | PLoS ONE |
Online Access: | http://europepmc.org/articles/PMC5154524?pdf=render |
id |
doaj-d871b6309c7a4b92935d9a98b37351a9 |
---|---|
record_format |
Article |
spelling |
doaj-d871b6309c7a4b92935d9a98b37351a92020-11-25T02:47:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032016-01-011112e016776510.1371/journal.pone.0167765Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm.Yaofang XuJiayi WuChang-Cheng YinYoudong MaoIn single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections of distinct orientations. Due to background noise in single-particle images and uncertainty of molecular orientations, traditional K-means clustering algorithm may classify images into wrong classes and produce classes with a large variation in membership. Overcoming these limitations requires further development on clustering algorithms for cryo-EM data analysis. We propose a novel unsupervised data clustering method building upon the traditional K-means algorithm. By introducing an adaptive constraint term in the objective function, our algorithm not only avoids a large variation in class sizes but also produces more accurate data clustering. Applications of this approach to both simulated and experimental cryo-EM data demonstrate that our algorithm is a significantly improved alterative to the traditional K-means algorithm in single-particle cryo-EM analysis.http://europepmc.org/articles/PMC5154524?pdf=render |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yaofang Xu Jiayi Wu Chang-Cheng Yin Youdong Mao |
spellingShingle |
Yaofang Xu Jiayi Wu Chang-Cheng Yin Youdong Mao Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. PLoS ONE |
author_facet |
Yaofang Xu Jiayi Wu Chang-Cheng Yin Youdong Mao |
author_sort |
Yaofang Xu |
title |
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. |
title_short |
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. |
title_full |
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. |
title_fullStr |
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. |
title_full_unstemmed |
Unsupervised Cryo-EM Data Clustering through Adaptively Constrained K-Means Algorithm. |
title_sort |
unsupervised cryo-em data clustering through adaptively constrained k-means algorithm. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2016-01-01 |
description |
In single-particle cryo-electron microscopy (cryo-EM), K-means clustering algorithm is widely used in unsupervised 2D classification of projection images of biological macromolecules. 3D ab initio reconstruction requires accurate unsupervised classification in order to separate molecular projections of distinct orientations. Due to background noise in single-particle images and uncertainty of molecular orientations, traditional K-means clustering algorithm may classify images into wrong classes and produce classes with a large variation in membership. Overcoming these limitations requires further development on clustering algorithms for cryo-EM data analysis. We propose a novel unsupervised data clustering method building upon the traditional K-means algorithm. By introducing an adaptive constraint term in the objective function, our algorithm not only avoids a large variation in class sizes but also produces more accurate data clustering. Applications of this approach to both simulated and experimental cryo-EM data demonstrate that our algorithm is a significantly improved alterative to the traditional K-means algorithm in single-particle cryo-EM analysis. |
url |
http://europepmc.org/articles/PMC5154524?pdf=render |
work_keys_str_mv |
AT yaofangxu unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm AT jiayiwu unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm AT changchengyin unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm AT youdongmao unsupervisedcryoemdataclusteringthroughadaptivelyconstrainedkmeansalgorithm |
_version_ |
1724751896632098816 |