A Practically Competitive and Provably Consistent Algorithm for Uplift Modeling

Randomized experiments have been critical tools of decision making for decades. However, subjects can show significant heterogeneity in response to treatments in many important applications. Therefore it is not enough to simply know which treatment is optimal for the entire population. What we need...

Full description

Bibliographic Details
Main Authors: Zhao, Yan (Contributor), Fang, Xiao (Contributor), Simchi-Levi, David (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Massachusetts Institute of Technology. Institute for Data, Systems, and Society (Contributor)
Format: Article
Language:English
Published: Institute of Electrical and Electronics Engineers (IEEE), 2018-08-22T18:03:46Z.
Subjects:
Online Access:Get fulltext
LEADER 02026 am a22002173u 4500
001 117478
042 |a dc 
100 1 0 |a Zhao, Yan  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Institute for Data, Systems, and Society  |e contributor 
100 1 0 |a Zhao, Yan  |e contributor 
100 1 0 |a Fang, Xiao  |e contributor 
100 1 0 |a Simchi-Levi, David  |e contributor 
700 1 0 |a Fang, Xiao  |e author 
700 1 0 |a Simchi-Levi, David  |e author 
245 0 0 |a A Practically Competitive and Provably Consistent Algorithm for Uplift Modeling 
260 |b Institute of Electrical and Electronics Engineers (IEEE),   |c 2018-08-22T18:03:46Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/117478 
520 |a Randomized experiments have been critical tools of decision making for decades. However, subjects can show significant heterogeneity in response to treatments in many important applications. Therefore it is not enough to simply know which treatment is optimal for the entire population. What we need is a model that correctly customize treatment assignment base on subject characteristics. The problem of constructing such models from randomized experiments data is known as Uplift Modeling in the literature. Many algorithms have been proposed for uplift modeling and some have generated promising results on various data sets. Yet little is known about the theoretical properties of these algorithms. In this paper, we propose a new tree-based ensemble algorithm for uplift modeling. Experiments show that our algorithm can achieve competitive results on both synthetic and industry-provided data. In addition, by properly tuning the 'node size' parameter, our algorithm is proved to be consistent under mild regularity conditions. This is the first consistent algorithm for uplift modeling that we are aware of. 
655 7 |a Article 
773 |t 2017 IEEE International Conference on Data Mining (ICDM)