Using the Decision Trees to Improve Variable Selection for Building Composite Indicators

The established variable selection methods for building composite indicators have strong limitations with respect to the results obtained. Some of them focus on getting an index structure with a high alpha reliability and/or a high percentage of the total data variance explained. These methods are l...

Full description

Bibliographic Details
Main Authors: Adrian Oțoiu, Emilia Țițan
Format: Article
Language:English
Published: Czech Statistical Office 2020-09-01
Series:Statistika: Statistics and Economy Journal
Subjects:
Online Access:https://www.czso.cz/documents/10180/125507865/32019720q3_otoiu_analyses.pdf/1df5f23d-9d37-4eaf-a68d-e06156d1a7a2?version=1.1
Description
Summary:The established variable selection methods for building composite indicators have strong limitations with respect to the results obtained. Some of them focus on getting an index structure with a high alpha reliability and/or a high percentage of the total data variance explained. These methods are likely to omit variables with strong explanatory power, and lead to an unsatisfactory classification of countries. Decision trees can also be used in selecting variables that are the most relevant for building composite indicators. An example of variable selection for building a composite indicator, which compares results using Cronbach's coefficient alpha, factor analysis, and decision trees, shows that the latter method yields comparable, or better results. Using cluster analysis on the selected variables, we show that the decision tree variable shortlist has better discrimination power than those obtained with the other methods, even in the presence of outliers and missing values.
ISSN:0322-788X
1804-8765