The estimation of probability distribution for factor variables with many categorical values.

With recent developments of data technology in biomedicine, factor data such as diagnosis codes and genomic features, which can have tens to hundreds of discrete and unorderable categorical values, have emerged. While considered as a fundamental problem in statistical analyses, the estimation of pro...

Full description

Bibliographic Details
Main Authors: Minhyeok Lee, Yeong Seon Kang, Junhee Seok
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2018-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC6108477?pdf=render