kamila: Clustering Mixed-Type Data in R and Hadoop

In this paper we discuss the challenge of equitably combining continuous (quantitative) and categorical (qualitative) variables for the purpose of cluster analysis. Existing techniques require strong parametric assumptions, or difficult-to-specify tuning parameters. We describe the kamila package, w...

Full description

Bibliographic Details
Main Authors: Alexander H. Foss, Marianthi Markatou
Format: Article
Language:English
Published: Foundation for Open Access Statistics 2018-02-01
Series:Journal of Statistical Software
Subjects:
R
Online Access:https://www.jstatsoft.org/index.php/jss/article/view/2812