Objective: The paper analyzed, summarized the characteristics of medical insurance data quality by taking the statistical process of “drug costs included in the payment scope” of negotiated drugs as an example, providing references for medical insurance data cleaning, application. Methods: Through the medical insurance settlement system of a city, this study collected medical insurance settlement records of national negotiated products within the agreement periodfrom January2018to September2024as the target dataset, supplemented with several auxiliary datasets by Internet search. A data quality assessment framework covering completeness, standardization, consistency was constructed. Results: In the completeness dimension, the main data problems were missing disease diagnosis codes, names, involving29.6%, 29.7% of record items. In the standardization dimension, 66.2% of record items had non-standard one-to-many correspondence between drug codes, names. There were problems of abnormal medical institution coding content, irregularities in the correspondence between medical institution codes, names.60.3% of record items had disease diagnosis codes that did not conform to standards. There were48non-basic medical insurance participants, involving405record items. In the consistency dimension, 4.9% of record items had payment prices that exceeded the payment standard by ±10%;1.4% of pharmacy purchase data items corresponded to retail pharmacies outside the “dual-channel” recognition scope;78record items did not qualify for medical insurance coverage of patient age or prescription hospital conditions. Conclusion: There are quality issues in medical insurance data, such as missing data, irregular application of medical security information coding system, confusion of patient identity, logic error due to suspected improper payment of medical insurance. It is suggested to accelerate the exploration of appropriate uniform, operational statistical, cleaning rules which are suitable for medical insurance, promote the efficacy of medical insurance data for decision-making
Key words
medical insurance data /
data quality /
medical insurance negotiation /
medical insurance payment scope /
real world data
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}
References
[1] RILEY G F.Administrative and claims records as sources of health care cost data[J].Medical care, 2009, 47(7): S51-S55.
[2] TRICCO A C, PHAM B, RAWSON N S B.Manitoba and Saskatchewan administrative health care utilization databases are used differently to answer epidemiologic research questions[J].Journal of clinical epidemiology, 2008, 61(2): 192-197.
[3] STROM B L.Overview of automated databases in pharmacoepidemiology[M].Oxford: Wiley-Blackwell, 2012: 158-162.
[4] HOFFMANN F.Review on use of German health insurance medication claims data for epidemiological research[J].Pharmacoepidemiology and drug safety, 2009, 18(05): 349-356.
[5] KREIS K, NEUBAUER S, KLORA M, et al.Status and perspectives of claims data analyses in Germany—a systematic review[J].Health policy, 2016, 120(2): 213-226.
[6] KIM J A, YOON S, KIM L Y, et al.Towards actualizing the value potential of Korea health insurance review and assessment (HIRA) data as a resource for health research: strengths, limitations, applications, and strategies for optimal use of HIRA data[J].Journal of Korean medical science, 2017, 32(5): 718-728.
[7] HSING A W, IOANNIDIS J P A.Nationwide population science: Lessons from the Taiwan national health insurance research database[J].JAMA internal medicine, 2015, 175(9): 1527-1529.
[8] TYREE P T, LIND B K, LAFFERTY W E.Challenges of using medical insurance claims data for utilization analysis[J].American journal of medical quality: the official journal of the American college of medical quality, 2006, 21(4): 269-275.
[9] KORAM N, DELGADO M, STARK J H, et al.Validation studies of claims data in the Asia-Pacific region: a comprehensive review[J].Pharmacoepidemiology and drug safety, 2019, 28(2): 156-170.
[10] BEHRENDT C A, HEIDEMANN F, RIESS H C, et al.Registry and health insurance claims data in vascular research and quality improvement[J].Zeitschrift fur Gefasskrankheiten, 2017, 46(1): 11-15.
[11] 中华人民共和国中央人民政府.全国统一医保信息平台建成[EB/OL].(2022-05-12)[2024-12-30].https://www.gov.cn/xinwen/2022-05/12/content_5689783.htm.
[12] 国家医疗保障局.关于印发医疗保障定点医疗机构等信息业务编码规则和方法的通知[EB/OL].(2019-10-08)[2024-12-30].https://www.nhsa.gov.cn/art/2019/10/8/art_104_6461.html.
[13] 任燕,梁远波,刘密密,等.构建博鳌乐城创新药械真实世界数据研究体系基于我国首个使用境内真实世界数据获批上市的医疗器械研究的探讨[J].中国循证医学杂志,2020,20(10):1117-1123.
[14] 李宛亭,乔佳慧,孟令全,等.真实世界数据存在的问题与质量提升对策研究[J].中国新药杂志, 2021, 30(13):1160-1163.
[15] 汤真清,汪铭涵,金春林,等.医保大数据应用研究进展[J].中国医疗保险,2023(8):36-39.
[16] 冷家骅,陈治水.医院医保大数据建设的问题与展望——以某三甲医院为例[J].中国医疗保险,2018(6):59-62.
[17] 文云霞.关于医保数据赋能医保数据分析业务应用的思考[J].山西财税,2024(7):46-47.
[18] 董昊.医保大数据赋能服务提质增效的思考[J].中国医疗保险,2024(10):10-14.
[19] 中华人民共和国中央人民政府.2024年版国家医保药品目录出炉,新增91种药品——让参保人用上更多新药好药[EB/OL].(2024-11-29)[2024-12-26].https://www.gov.cn/zhengce/202411/content_6989980.htm.
[20] 国家医疗保障局.关于公布《2022年国家基本医疗保险、工伤保险和生育保险药品目录调整工作方案》及相关文件的公告[EB/OL].(2022-06-29)[2024-12-24].https://www.nhsa.gov.cn/art/2022/6/29/art_109_8342.html.
[21] 国家医疗保障局.关于公布《谈判药品续约规则》及《非独家药品竞价规则》的公告[EB/OL].(2023-07-21)[2024-12-24].https://www.gov.cn/zhengce/zhengceku/202307/content_6893726.htm.
[22] 国家医疗保障局.关于公布《2024年国家基本医疗保险、工伤保险和生育保险药品目录调整工作方案》及申报指南等文件的公告[EB/OL].(2024-06-28)[2024-12-24].https://www.nhsa.gov.cn/art/2024/6/28/art_109_13084.html.
[23] 杨莹,毛宗福.医疗机构大数据与药物政策评价——以药品集中带量采购为例[M]//中国互联网医疗发展报告(2022—2023)——医疗健康大数据.北京:社会科学文献出版社,2023:67-85.
[25] 苑顺,张乐,杨莹,等.“双通道”高值药品院内外使用特征及影响因素分析[J].中国医疗保险,2024(05):16-25.
[26] 耿鑫,杨莹,文小桐,等.贝达喹啉治疗耐多药结核的临床综合评价[J].中华结核和呼吸杂志,2023,46(06):572-579.
[27] 申远,杨莹,毛李宁,等.我国公立医疗机构国家谈判药品采购使用现状基于13个地区的实证数据分析[J].中国卫生资源,2022,25(03):283-290.
[28] 国家医疗保障局.基本医疗保险用药管理暂行办法[EB/OL].(2020-07-31)[2024-12-24].https://www.nhsa.gov.cn/art/2020/7/31/art_53_3388.html.
[29] 国家市场监督管理总局,国家标准化管理委员会.信息技术数据质量评价指标(GB/T 36344—2018)[S/OL].北京:中国标准出版社,2018.http://c.gb688.cn/bzgk/gb/showGb?type=online&hcno=D12140EDFD3967960F51BD1A05645FE7.
[30] 国家药品监督管理局药品审评中心.《用于产生真实世界证据的真实世界数据指导原则(试行)》的通告[EB/OL].(2021-04)[2024-11-13].https://www.cde.org.cn/main/news/viewInfoCommon/2a1c437ed54e7b838a7e86f4ac21c539.
[31] 关心淼,朱燕姿,刘浩,等.我国医疗标准及规范中真实世界数据治理要求及技术要点[J].协和医学杂志,2024:1-11.
[32] 张泽丹,王斌,张雯,等.中医药真实世界数据质量评估方法进展[J].科技导报,2024,42(21): 13-19.
[33] BIAN J, LYU T, LOIACONO A, et al.Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data[J].Journal of the American Medical Informatics Association, 2020, 27(12): 1999-2010.
[34] SYED R, EDEN R, MAKASI T, et al.Digital health data quality issues: systematic review[J].Journal of medical internet research, 2023, 25: e42615.
[35] MASHOUFI M, AYATOLLAHI H, KHORASANI-ZAVAREH D, et al.Data quality in health care: main concepts and assessment methodologies[J].Methods of information in medicine, 2023, 62(1-2): 5-18.
[36] 许涤龙,叶少波.统计数据质量评估方法研究述评[J].统计与信息论坛,2011,26(07):3-14.
[37] 卢二坡.统计数据质量评估方法述评[J].统计与决策,2006(23):70-71.
[38] 姬卫东,李琳,张振,等.互联互通背景下医疗数据治理面临的问题与对策[J].中国数字医学,2021,16(11):6-11.
[39] 阮彤,邱加辉,张知行,等.医疗数据治理——构建高质量医疗大数据智能分析数据基础[J].大数据,2019,5(1):12-24.
[40] 冯世民.定点零售药店医保合规法律风险及建议[J].中国医疗保险,2024(7):76-81.
Funding
国家自然科学基金“医保战略性购买视角下慢性病多层次门诊用药保障的经济效应与健康效应研究”(72404098); 中国博士后科学基金项目“价值导向下慢性病门诊用药保障与老年人健康产出:作用机制与政策优化”(2024M761028); 湖北省博士后创新人才培养项目(2024HBBHCXB019); 国家资助博士后研究人员计划(GZC20240534)