Using the Decision Trees to Improve Variable Selection for Building Composite Indicators

The established variable selection methods for building composite indicators have strong limitations with respect to the results obtained. Some of them focus on getting an index structure with a high alpha reliability and/or a high percentage of the total data variance explained. These methods are l...

Full description

Bibliographic Details
Main Authors: Adrian Oțoiu, Emilia Țițan
Format: Article
Language:English
Published: Czech Statistical Office 2020-09-01
Series:Statistika: Statistics and Economy Journal
Subjects:
Online Access:https://www.czso.cz/documents/10180/125507865/32019720q3_otoiu_analyses.pdf/1df5f23d-9d37-4eaf-a68d-e06156d1a7a2?version=1.1
id doaj-cb7a6491413f4831ad08cb0b2e07ea43
record_format Article
spelling doaj-cb7a6491413f4831ad08cb0b2e07ea432020-11-25T02:36:00ZengCzech Statistical OfficeStatistika: Statistics and Economy Journal0322-788X1804-87652020-09-011003296308Using the Decision Trees to Improve Variable Selection for Building Composite IndicatorsAdrian Oțoiu0Emilia Țițan1Bucharest University of Economic Studies, Bucharest, RomaniaBucharest University of Economic Studies, Bucharest, RomaniaThe established variable selection methods for building composite indicators have strong limitations with respect to the results obtained. Some of them focus on getting an index structure with a high alpha reliability and/or a high percentage of the total data variance explained. These methods are likely to omit variables with strong explanatory power, and lead to an unsatisfactory classification of countries. Decision trees can also be used in selecting variables that are the most relevant for building composite indicators. An example of variable selection for building a composite indicator, which compares results using Cronbach's coefficient alpha, factor analysis, and decision trees, shows that the latter method yields comparable, or better results. Using cluster analysis on the selected variables, we show that the decision tree variable shortlist has better discrimination power than those obtained with the other methods, even in the presence of outliers and missing values.https://www.czso.cz/documents/10180/125507865/32019720q3_otoiu_analyses.pdf/1df5f23d-9d37-4eaf-a68d-e06156d1a7a2?version=1.1composite indicatorsdecision treescronbach’s alphafactor analysisindicator methodologyentrepreneurship indicators
collection DOAJ
language English
format Article
sources DOAJ
author Adrian Oțoiu
Emilia Țițan
spellingShingle Adrian Oțoiu
Emilia Țițan
Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
Statistika: Statistics and Economy Journal
composite indicators
decision trees
cronbach’s alpha
factor analysis
indicator methodology
entrepreneurship indicators
author_facet Adrian Oțoiu
Emilia Țițan
author_sort Adrian Oțoiu
title Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
title_short Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
title_full Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
title_fullStr Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
title_full_unstemmed Using the Decision Trees to Improve Variable Selection for Building Composite Indicators
title_sort using the decision trees to improve variable selection for building composite indicators
publisher Czech Statistical Office
series Statistika: Statistics and Economy Journal
issn 0322-788X
1804-8765
publishDate 2020-09-01
description The established variable selection methods for building composite indicators have strong limitations with respect to the results obtained. Some of them focus on getting an index structure with a high alpha reliability and/or a high percentage of the total data variance explained. These methods are likely to omit variables with strong explanatory power, and lead to an unsatisfactory classification of countries. Decision trees can also be used in selecting variables that are the most relevant for building composite indicators. An example of variable selection for building a composite indicator, which compares results using Cronbach's coefficient alpha, factor analysis, and decision trees, shows that the latter method yields comparable, or better results. Using cluster analysis on the selected variables, we show that the decision tree variable shortlist has better discrimination power than those obtained with the other methods, even in the presence of outliers and missing values.
topic composite indicators
decision trees
cronbach’s alpha
factor analysis
indicator methodology
entrepreneurship indicators
url https://www.czso.cz/documents/10180/125507865/32019720q3_otoiu_analyses.pdf/1df5f23d-9d37-4eaf-a68d-e06156d1a7a2?version=1.1
work_keys_str_mv AT adrianotoiu usingthedecisiontreestoimprovevariableselectionforbuildingcompositeindicators
AT emiliatitan usingthedecisiontreestoimprovevariableselectionforbuildingcompositeindicators
_version_ 1724802025859842048