Research on the Application of Machine Learning Hierarchical Analysis in Identifying Medical Insurance Fund Irregularity Risks

doi:10.19546/j.issn.1674-3830.2026.1.004

China Health Insurance ›› 2026, Vol. 0 ›› Issue (1) : 30-37. DOI: 10.19546/j.issn.1674-3830.2026.1.004

Author information +

History +

Abstract

Objective: This study explores how to utilize machine learning technology to enhance the identification accuracy and regulatory efficiency of medical insurance violations, given the new challenges faced by medical insurance fund supervision, in order to promote the stability and sustainable development of the fund. Methods: To effectively identify complex and concealed patterns of medical insurance violations, this study adopts a hierarchical analysis method based on machine learning, processes and integrates structured and unstructured medical insurance data from patients with three high-volume disease entities (breast malignancy, coronary heart disease, and acute pancreatitis) in a tertiary hospital in City C, Sichuan Province from 2023 to 2024, and analyzes potential risks of medical insurance violations. Results: Risk outliers are highly concentrated in cross-groupings of specific ages and lengths of hospital stays, accurately identifying high-risk populations that require key monitoring. The abnormal cost structure exhibits two differentiated patterns: high proportion (indicating overtreatment) and low proportion (indicating insufficient service), revealing the concealment and dynamic adaptability of medical insurance violations. This method provides an effective technical path for constructing an intelligent monitoring system, implementing precise risk warnings, and promoting the transformation of medical insurance governance models toward prior warning and in-process intervention. Conclusion: This study verifies the effectiveness and application value of the hierarchical analysis method combined with machine learning technology in accurately identifying potential risks of medical insurance violations. In the future, efforts should be made to promote cross-departmental data governance and deep integration, continuously optimize the interpretability and adaptability of algorithm models, strengthen the construction of composite talent teams, and closely integrate with payment method reforms such as DRG/DIP, iteratively build a more intelligent, efficient, and robust comprehensive governance system for medical insurance funds. This will systematically strengthen the safety defense line of the fund and promote the high-quality and sustainable development of the medical security cause.

Key words

medical insurance fund / violation / risk control / machine learning

Cite this article

EndNote

Ris (Procite)

Bibtex

Download Citations

Research on the Application of Machine Learning Hierarchical Analysis in Identifying Medical Insurance Fund Irregularity Risks[J]. China Health Insurance. 2026, 0(1): 30-37 https://doi.org/10.19546/j.issn.1674-3830.2026.1.004

References

[1] 国家医疗保障局. 2024年全国医疗保障事业发展统计公报[EB/OL]. (2025-07-14)[2025-12-26]. https://www.nhsa.gov.cn/art/2025/7/14/art_7_17248.html.
[2] 丁杨军,钱钢.基于大数据的医保审计优化路径研究[J]. 卫生经济研究,2023,40(05):47-50.
[3] 沈建美,杨浩宸,冯其祥,等.基于大数据的基本医疗保险欺诈预警模型构建与治理策略研究[C]//清华大学经济管理学院中国保险与风险管理研究中心,武汉大学经济与管理学院,宁波大学商学院.2024中国保险与风险管理国际年会论文集.湖南大学金融与统计学院,2024:346-368.
[4] 刘莹,锁凌燕.基于机器学习方法的商业医疗险赔付预测研究——引入健康行为偏好的新视角[J]. 华中师范大学学报(人文社会科学版),2023,62(04):81-93.
[5] LIOU F, TANG Y, CHEN J.Detecting hospital fraud and claim abuse through diabetic outpatient services[J]. Health care management science, 2008,11(4):353-358.
[6] 林源. 基于BP神经网络的新农合欺诈识别实证研究——以定点医疗机构欺诈滥用为中心[J]. 云南师范大学学报(哲学社会科学版),2015,47(03):117-128.
[7] HUBICK K.Artificial neural networks in Australia[M]//Department of Industry, Technology and Commerce. Canberra: CPN Publications, 1992.
[8] ORTEGA PA, FIGUEROA CJ, RUZ GA.A medical claim fraud/abuse detection system based on data mining: a case study in Chile[C]//In proceedings of International Conference on Data mining. Las Vegas, 2006.
[9] 闫春,李亚琪,孙海棠.基于蚁群算法优化随机森林模型的汽车保险欺诈识别研究[J]. 保险研究,2017(06):114-127.
[10] LI Y, YAN C, LIU W, et al.A principle component analysis-based random forest with the potential nearest neighbor method for automobile insurance fraud identification[J]. Applied soft computing, 2018,70:1000-1009.
[11] 李秀芳,黄志国,陈孝伟.Bagging集成方法在保险欺诈识别中的应用研究[J]. 保险研究,2019(04):66-84.
[12] 林源,李连友.基于PSD-LDA模型的新农合欺诈风险测度实证研究[J]. 财经理论与实践,2014,35(05):18-23.
[13] 刘鹏. 基于Spark机器学习实现医疗保险关联频繁模式的欺诈行为挖掘技术探讨[J]. 中国数字医学,2019,14(05):15-18.
[14] 王坤,付钰,段雪源,等.基于深度学习的SDN异常流量分布式检测方法[J]. 通信学报,2024, 45(11):114-130.
[15] 马振涛,文晓初.健康中国战略下我国商业健康保险定位价值、发展趋势及问题建议[J]. 中国医疗保险,2025(03):92-106.
[16] 李金灿,徐珂琳,於州,等.大数据技术在医保反欺诈中的应用[J]. 中国医疗保险,2021(01):48-52.
[17] 张子健. 异地就医医保基金协同监管机制研究[J]. 中国医疗保险,2025(04):75-81.
[18] 卢婧然. 基本医疗保险基金支出监管的问题与对策研究[D]. 咸阳:西北农林科技大学,2024.