Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments

Abstract Background Risk adjustment models are employed to prevent adverse selection, anticipate budgetary reserve needs, and offer care management services to high-risk individuals. We aimed to address two unknowns about risk adjustment: whether machine learning (ML) and inclusion of social determi...

Full description

Bibliographic Details
Main Authors: Jeremy A. Irvin, Andrew A. Kondrich, Michael Ko, Pranav Rajpurkar, Behzad Haghgoo, Bruce E. Landon, Robert L. Phillips, Stephen Petterson, Andrew Y. Ng, Sanjay Basu
Format: Article
Language:English
Published: BMC 2020-05-01
Series:BMC Public Health
Subjects:
Online Access:http://link.springer.com/article/10.1186/s12889-020-08735-0
id doaj-60c193c62923486293fc35cfcafb53eb
record_format Article
spelling doaj-60c193c62923486293fc35cfcafb53eb2020-11-25T03:31:05ZengBMCBMC Public Health1471-24582020-05-0120111010.1186/s12889-020-08735-0Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan paymentsJeremy A. Irvin0Andrew A. Kondrich1Michael Ko2Pranav Rajpurkar3Behzad Haghgoo4Bruce E. Landon5Robert L. Phillips6Stephen Petterson7Andrew Y. Ng8Sanjay Basu9Department of Computer Science, Stanford UniversityDepartment of Computer Science, Stanford UniversityDepartment of Statistics, Stanford UniversityDepartment of Computer Science, Stanford UniversityDepartment of Computer Science, Stanford UniversityDepartment of Healthcare Policy, Harvard Medical SchoolCenter for Professionalism & Value in Health Care, American Board of Family Medicine FoundationRobert Graham Center, American Academy of Family PhysiciansDepartment of Computer Science, Stanford UniversityCenter for Primary Care, Harvard Medical SchoolAbstract Background Risk adjustment models are employed to prevent adverse selection, anticipate budgetary reserve needs, and offer care management services to high-risk individuals. We aimed to address two unknowns about risk adjustment: whether machine learning (ML) and inclusion of social determinants of health (SDH) indicators improve prospective risk adjustment for health plan payments. Methods We employed a 2-by-2 factorial design comparing: (i) linear regression versus ML (gradient boosting) and (ii) demographics and diagnostic codes alone, versus additional ZIP code-level SDH indicators. Healthcare claims from privately-insured US adults (2016–2017), and Census data were used for analysis. Data from 1.02 million adults were used for derivation, and data from 0.26 million to assess performance. Model performance was measured using coefficient of determination (R2), discrimination (C-statistic), and mean absolute error (MAE) for the overall population, and predictive ratio and net compensation for vulnerable subgroups. We provide 95% confidence intervals (CI) around each performance measure. Results Linear regression without SDH indicators achieved moderate determination (R2 0.327, 95% CI: 0.300, 0.353), error ($6992; 95% CI: $6889, $7094), and discrimination (C-statistic 0.703; 95% CI: 0.701, 0.705). ML without SDH indicators improved all metrics (R2 0.388; 95% CI: 0.357, 0.420; error $6637; 95% CI: $6539, $6735; C-statistic 0.717; 95% CI: 0.715, 0.718), reducing misestimation of cost by $3.5 M per 10,000 members. Among people living in areas with high poverty, high wealth inequality, or high prevalence of uninsured, SDH indicators reduced underestimation of cost, improving the predictive ratio by 3% (~$200/person/year). Conclusions ML improved risk adjustment models and the incorporation of SDH indicators reduced underpayment in several vulnerable populations.http://link.springer.com/article/10.1186/s12889-020-08735-0Risk estimationMachine learningSocial determinants of health
collection DOAJ
language English
format Article
sources DOAJ
author Jeremy A. Irvin
Andrew A. Kondrich
Michael Ko
Pranav Rajpurkar
Behzad Haghgoo
Bruce E. Landon
Robert L. Phillips
Stephen Petterson
Andrew Y. Ng
Sanjay Basu
spellingShingle Jeremy A. Irvin
Andrew A. Kondrich
Michael Ko
Pranav Rajpurkar
Behzad Haghgoo
Bruce E. Landon
Robert L. Phillips
Stephen Petterson
Andrew Y. Ng
Sanjay Basu
Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
BMC Public Health
Risk estimation
Machine learning
Social determinants of health
author_facet Jeremy A. Irvin
Andrew A. Kondrich
Michael Ko
Pranav Rajpurkar
Behzad Haghgoo
Bruce E. Landon
Robert L. Phillips
Stephen Petterson
Andrew Y. Ng
Sanjay Basu
author_sort Jeremy A. Irvin
title Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
title_short Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
title_full Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
title_fullStr Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
title_full_unstemmed Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
title_sort incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments
publisher BMC
series BMC Public Health
issn 1471-2458
publishDate 2020-05-01
description Abstract Background Risk adjustment models are employed to prevent adverse selection, anticipate budgetary reserve needs, and offer care management services to high-risk individuals. We aimed to address two unknowns about risk adjustment: whether machine learning (ML) and inclusion of social determinants of health (SDH) indicators improve prospective risk adjustment for health plan payments. Methods We employed a 2-by-2 factorial design comparing: (i) linear regression versus ML (gradient boosting) and (ii) demographics and diagnostic codes alone, versus additional ZIP code-level SDH indicators. Healthcare claims from privately-insured US adults (2016–2017), and Census data were used for analysis. Data from 1.02 million adults were used for derivation, and data from 0.26 million to assess performance. Model performance was measured using coefficient of determination (R2), discrimination (C-statistic), and mean absolute error (MAE) for the overall population, and predictive ratio and net compensation for vulnerable subgroups. We provide 95% confidence intervals (CI) around each performance measure. Results Linear regression without SDH indicators achieved moderate determination (R2 0.327, 95% CI: 0.300, 0.353), error ($6992; 95% CI: $6889, $7094), and discrimination (C-statistic 0.703; 95% CI: 0.701, 0.705). ML without SDH indicators improved all metrics (R2 0.388; 95% CI: 0.357, 0.420; error $6637; 95% CI: $6539, $6735; C-statistic 0.717; 95% CI: 0.715, 0.718), reducing misestimation of cost by $3.5 M per 10,000 members. Among people living in areas with high poverty, high wealth inequality, or high prevalence of uninsured, SDH indicators reduced underestimation of cost, improving the predictive ratio by 3% (~$200/person/year). Conclusions ML improved risk adjustment models and the incorporation of SDH indicators reduced underpayment in several vulnerable populations.
topic Risk estimation
Machine learning
Social determinants of health
url http://link.springer.com/article/10.1186/s12889-020-08735-0
work_keys_str_mv AT jeremyairvin incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT andrewakondrich incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT michaelko incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT pranavrajpurkar incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT behzadhaghgoo incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT bruceelandon incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT robertlphillips incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT stephenpetterson incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT andrewyng incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
AT sanjaybasu incorporatingmachinelearningandsocialdeterminantsofhealthindicatorsintoprospectiveriskadjustmentforhealthplanpayments
_version_ 1724573758014881792