Research on the Application of Machine Learning Hierarchical Analysis in Identifying Medical Insurance Fund Irregularity Risks

李佳瑾; 杨芳; 崔欢欢; 王晓昕; 马良; 杨宇航; 滕世伟; 冯海欢

China Health Insurance (中国医疗保险), 2026, Vol. 0, Issue (1): 30-37. DOI: 10.19546/j.issn.1674-3830.2026.1.004

Abstract

Objective: Medical insurance fund supervision faces new challenges. This study examines how machine learning can improve the accuracy of identifying medical insurance violations and the efficiency of supervision, in support of the fund's stability and sustainable development.

Methods: To identify complex and concealed patterns of violation, the study applies a machine learning hierarchical analysis method to structured and unstructured medical insurance data for patients with three high-volume diseases (breast malignancy, coronary heart disease, and acute pancreatitis) at a Grade A tertiary hospital in City C, Sichuan Province, from 2023 to 2024. The data are processed level by level and their features integrated to analyze potential violation risks.

Results: Risk outliers are highly concentrated in cross-groupings of specific age ranges and lengths of hospital stay, which pinpoints the high-risk populations that warrant priority monitoring. Abnormal cost structures show two distinct patterns: a high proportion (suggesting overtreatment) and a low proportion (suggesting insufficient service), revealing how concealed and dynamically adaptive violations are. The method offers an effective technical path for building intelligent monitoring systems, issuing precise risk warnings, and shifting medical insurance governance toward advance warning and in-process intervention.

Conclusion: The study verifies the effectiveness and practical value of hierarchical analysis combined with machine learning for accurately identifying potential medical insurance violation risks. Future work should advance cross-departmental data governance and deep integration, continue to improve the interpretability and adaptability of the models, strengthen the development of interdisciplinary talent, and align closely with payment reforms such as DRG/DIP, so as to iteratively build a more intelligent, efficient, and robust comprehensive governance system for medical insurance funds. This would systematically strengthen fund security and promote the high-quality, sustainable development of medical security.
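The abstract does not state which algorithm produced the risk outliers, so the following is only a minimal sketch of the general idea of stratified outlier screening, not the authors' method. It assumes a plain interquartile-range (IQR) rule applied within each disease × age band × length-of-stay band, and flags cost shares that are unusually high or unusually low. The field names (`id`, `disease`, `age`, `los_days`, `drug_share`), the band edges, and the multiplier `k` are all hypothetical.

```python
from bisect import bisect_right
from collections import defaultdict
from statistics import quantiles

# Hypothetical band edges; the abstract does not give the paper's actual strata.
AGE_EDGES = [45, 65]  # bands: <45, 45-64, >=65 years
LOS_EDGES = [8, 15]   # bands: <8, 8-14, >=15 days of stay


def flag_outliers(records, k=1.5, min_size=4):
    """Flag records whose cost share is an IQR outlier within its stratum.

    Each record is a dict with keys id, disease, age, los_days, drug_share.
    Returns {record id: "high" or "low"} for flagged records only.
    """
    # Level 1: group records by disease, age band, and length-of-stay band.
    strata = defaultdict(list)
    for r in records:
        key = (
            r["disease"],
            bisect_right(AGE_EDGES, r["age"]),
            bisect_right(LOS_EDGES, r["los_days"]),
        )
        strata[key].append(r)

    # Level 2: apply the IQR fence separately inside each stratum.
    flags = {}
    for group in strata.values():
        if len(group) < min_size:  # too few cases for stable quartiles
            continue
        q1, _, q3 = quantiles([r["drug_share"] for r in group], n=4)
        iqr = q3 - q1
        lower, upper = q1 - k * iqr, q3 + k * iqr
        for r in group:
            if r["drug_share"] > upper:
                flags[r["id"]] = "high"  # pattern the abstract links to overtreatment
            elif r["drug_share"] < lower:
                flags[r["id"]] = "low"   # pattern the abstract links to insufficient service
    return flags
```

Computing the fences per stratum rather than over the whole population is what lets a value that is ordinary for one age and length-of-stay group stand out in another. A production system would likely replace the IQR rule with a learned model and tune the bands to the data.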

Cite this article

李佳瑾, 杨芳, 崔欢欢, 王晓昕, 马良, 杨宇航, 滕世伟, 冯海欢. Research on the Application of Machine Learning Hierarchical Analysis in Identifying Medical Insurance Fund Irregularity Risks[J]. China Health Insurance. 2026, 0(1): 30-37. https://doi.org/10.19546/j.issn.1674-3830.2026.1.004

CLC number: F840.684; C913.7
