Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Kshirsagar, Rohun; Hsu, Li-Yen; Chaturvedi, Vatshank; Greenberg, Charles H.; McClelland, Matthew; Mohan, Anushadevi; Shende, Wideet; Tilmans, Nicolas P.; Frigato, Renzo; Guo, Min; Chheda, Ankit; Trotter, Meredith; Ray, Shonket; Lee, Arnold; Alvarado, Miguel

Computer Science > Computers and Society

arXiv:2009.10990 (cs)

[Submitted on 23 Sep 2020 (v1), last revised 27 Feb 2021 (this version, v2)]

Title:Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Authors:Rohun Kshirsagar, Li-Yen Hsu, Vatshank Chaturvedi, Charles H. Greenberg, Matthew McClelland, Anushadevi Mohan, Wideet Shende, Nicolas P. Tilmans, Renzo Frigato, Min Guo, Ankit Chheda, Meredith Trotter, Shonket Ray, Arnold Lee, Miguel Alvarado

View PDF

Abstract:Health insurance companies cover half of the United States population through commercial employer-sponsored health plans and pay 1.2 trillion US dollars every year to cover medical expenses for their members. The actuary and underwriter roles at a health insurance company serve to assess which risks to take on and how to price those risks to ensure profitability of the organization. While Bayesian hierarchical models are the current standard in the industry to estimate risk, interest in machine learning as a way to improve upon these existing methods is increasing. Lumiata, a healthcare analytics company, ran a study with a large health insurance company in the United States. We evaluated the ability of machine learning models to predict the per member per month cost of employer groups in their next renewal period, especially those groups who will cost less than 95\% of what an actuarial model predicts (groups with "concession opportunities"). We developed a sequence of two models, an individual patient-level and an employer-group-level model, to predict the annual per member per month allowed amount for employer groups, based on a population of 14 million patients. Our models performed 20\% better than the insurance carrier's existing pricing model, and identified 84\% of the concession opportunities. This study demonstrates the application of a machine learning system to compute an accurate and fair price for health insurance products and analyzes how explainable machine learning models can exceed actuarial models' predictive accuracy while maintaining interpretability.

Comments:	Accepted for publication in The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), in the Innovative Applications of Artificial Intelligence track. This is the extended version with some stylistic fixes from the first posting and complete author list
Subjects:	Computers and Society (cs.CY); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2009.10990 [cs.CY]
	(or arXiv:2009.10990v2 [cs.CY] for this version)
	https://doi.org/10.48550/arXiv.2009.10990

Submission history

From: Rohun Kshirsagar [view email]
[v1] Wed, 23 Sep 2020 08:07:33 UTC (514 KB)
[v2] Sat, 27 Feb 2021 22:47:22 UTC (357 KB)

Computer Science > Computers and Society

Title:Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computers and Society

Title:Accurate and Interpretable Machine Learning for Transparent Pricing of Health Insurance Plans

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators