Ensemble Performance in terms of Diversity and Performance of Individual Classifier Systems

博士 === 國立清華大學 === 資訊工程學系 === 96 === Combining multiple classifier systems (MCS’) has been shown to outperform single classifier system. It has been demonstrated that improvement for ensemble performance depends on either the diversity among or the performance of individual systems. A variety of dive...

Full description

Bibliographic Details
Main Authors: Yun-Sheng Chung, 鍾允昇
Other Authors: Chuan Yi Tang
Format: Others
Language:en_US
Published: 2007
Online Access:http://ndltd.ncl.edu.tw/handle/26130685567803696848
Description
Summary:博士 === 國立清華大學 === 資訊工程學系 === 96 === Combining multiple classifier systems (MCS’) has been shown to outperform single classifier system. It has been demonstrated that improvement for ensemble performance depends on either the diversity among or the performance of individual systems. A variety of diversity measures and ensemble methods have been proposed and studied. It remains a challenging problem to estimate the ensemble performance in terms of the performance of and the diversity among individual systems. In this paper, we establish upper and lower bounds for (a) majority voting ensemble performance with disagreement diversity measure Dis, (b) weighted majority voting performance in terms of weighted average performance and weighted disagreement diversity, and (c) plurality voting ensemble performance with entropy diversity measure ‾D . Bounds for these three cases are shown to be tight using the concept of a performance distribution pattern (PDP) for the input set. As a consequence of our previous results on diversity equivalence, (a) can be extended to several other diversity measures. Moreover, we showed in the case of (a) that when ‾ P is big enough, the ensemble performance Pm resulting from a maximum (information-theoretic) entropy PDP is an increasing function with respect to the disagreement diversity Dis. Eight experiments using data sets from various applications domains are conducted to demonstrate the complexity, richness, and diverseness of the problem in estimating the ensemble performance.