基于机器学习下构建新生儿高胆红素血症风险预测模型的应用研究

doi:10.20274/j.cnki.issn.1674-3865.2026.01.003

中国中西医结合儿科学 ›› 2026, Vol. 18 ›› Issue (1): 12-17.doi: 10.20274/j.cnki.issn.1674-3865.2026.01.003

基于机器学习下构建新生儿高胆红素血症风险预测模型的应用研究

丁雪, 崔明明(), 周梦婕, 娄溪萌

250061 济南，山东中医药大学中医儿科学专业研究生（丁雪，周梦婕，娄溪萌）
250061 济南，山东中医药大学儿科教研室（崔明明）

收稿日期:2025-08-19 修回日期:2025-10-17 出版日期:2026-02-25 上线日期:2026-02-25
通讯作者: 崔明明 E-mail:cmmc321321@163.com
作者简介:丁雪（1999-），女，山东中医药大学2023级硕士研究生在读。研究方向：儿童肺系疾病及新生儿疾病研究
基金资助:
2022年山东省医药卫生科技发展计划项目(鲁卫函〔2022〕467号);山东省“齐鲁扁仓中医药人才”培育项目(鲁卫中医药科教字〔2025〕2号);山东中医药大学附属医院“青年名中医”培养项目(省中人字〔2024〕60号)

An applied study on constructing a neonatal hyperbilirubinemia risk prediction model based on machine learning

Xue DING, Mingming CUI(), Mengjie ZHOU, Ximeng LOU

Shandong University of Traditional Chinese Medicine，Jinan 250061, China

Received:2025-08-19 Revised:2025-10-17 Published:2026-02-25 Online:2026-02-25
Contact: Mingming CUI E-mail:cmmc321321@163.com
Supported by:
Shandong Provincial Medical and Health Science and Technology Development Program Project

摘要/Abstract

摘要：

目的通过调查新生儿高胆红素血症发病高危因素，利用多种机器学习方法建立和评估新生儿高胆红素血症风险预测模型，为防治新生儿高胆红素血症研究提供依据。方法收集山东中医药大学附属医院儿科门诊及病房的新生儿，采用问卷调查方式收集新生儿高胆红素血症的数据,采用Python3中的Scikitlearn机器学习软件进行统计分析，运用Logistic回归算法进行筛选高胆红素血症相关指标。基于纵向体检队列资料抽样产生不同样本量的模拟数据，对数据运用6种机器学习算法建立新生儿高胆红素血症风险预测模型，采用受试者工作特征曲线下面积验证模型鉴别新生儿高胆红素血症的能力和准确性。结果在6种机器算法中，随机森林模型综合效果最佳，其风险预测模型重要性特征排名较高为孕期疾病、新生儿是否患有新生儿溶血病、出生后异常症状（感染）、是否为早产儿。结论基于机器学习下构建的新生儿高胆红素血症风险预测模型对于新生儿高胆红素血症的防治具有一定的临床诊断价值；根据其结果生成的新生儿黄疸管理系统可对高风险患儿加强管理及监测，减少或预防并发症的发生。

关键词: 新生儿高胆红素血症, 预测模型, 机器学习, 风险评估

Abstract:

Objective To investigate high-risk factors for neonatal hyperbilirubinemia(NHB) and establish and evaluate a NHB risk prediction model using multiple machine learning methods, providing evidence for NHB prevention and treatment research. Methods The neonates from the pediatric outpatient and inpatient departments of Shandong University of Traditional Chinese Medicine Affiliated Hospital were enrolled. Data on neonatal hyperbilirubinemia were collected via questionnaire surveys. Statistical analysis was performed using the Scikitlearn machine learning software in Python3, with the Logistic regression algorithm employed to screen for hyperbilirubinemia-related indicators. Simulated data with varying sample sizes were generated from longitudinal physical examination cohort samples. Six machine learning algorithms were applied to establish neonatal hyperbilirubinemia risk prediction models. The ability and accuracy of these models to distinguish neonatal hyperbilirubinemia were validated using the area under the receiver operating characteristic curve(ROC). Results Among the six machine learning algorithms, the Random Forest model demonstrated the best overall performance. Key features ranking highly in this risk prediction model included pregnancy diseases, neonatal hemolytic disease, abnormal postnatal symptoms(infection), and preterm birth status. Conclusion The machine learning-based risk prediction model for neonatal hyperbilirubinemia holds clinical diagnostic value for its prevention and management. The resulting neonatal jaundice management system enables enhanced monitoring and management of high-risk infants, thereby reducing or preventing complications.

Key words: Neonatal hyperbilirubinemia, Prediction model, Machine learning, Risk assessment

中图分类号:

R722.19

丁雪, 崔明明, 周梦婕, 娄溪萌. 基于机器学习下构建新生儿高胆红素血症风险预测模型的应用研究[J]. 中国中西医结合儿科学, 2026, 18(1): 12-17.

Xue DING, Mingming CUI, Mengjie ZHOU, Ximeng LOU. An applied study on constructing a neonatal hyperbilirubinemia risk prediction model based on machine learning[J]. Chinese Pediatrics of Integrated Traditional and Western Medicine, 2026, 18(1): 12-17.

图/表 5

表1

表2

表3

表4

图2

参考文献 32

[1]	王继,刘合作.新生儿高胆红素血症围生期影响因素分析[J].现代实用医学, 2023, 35(7):925-927.
[2]	涂阳阳, 原新慧, 李宇宁. 新生儿胆红素脑病的发病机制及诊治进展[J]. 医学综述, 2021，27(16): 3160-3166.
[3]	中华医学会儿科学分会新生儿学组,《中华儿科杂志》编辑委员会. 新生儿高胆红素血症诊断和治疗专家共识[J]. 中华儿科杂志,2014,52(10):745-748.
[4]	American Academy of Pediatrics Subcommittee on Hyperbilirubinemia. Management of hyperbilirubinemia in the newborn infant 35 or more weeks of gestation[J]. Pediatrics， 2004，114(1):297-316.
[5]	熊丽辉,高婷婷,王承湘,等.基于机器算法的脑梗死多模态“脉-病”预测模型研究[J/OL].世界中医药,1-11[2025-12-31].
[6]	程丽,刘琼娜,蒋晶晶.妊娠期糖尿病血糖控制水平与新生儿高胆红素血症发病的相关性[J].中国妇幼健康研究,2024,35(2):64-73.
[7]	Dey SK, Islam S, Jahan I, et al. Association of hyperbilirubinemia requiring phototherapy or exchange transfusion with hearing impairment among admitted term and late preterm newborn in a NICU[J]. Mymensingh Med J， 2020，29(2):405-413.
[8]	Mutlu M, Aslan Y, Kader Ş, et al. Preventive effects of probiotic supplementation on neonatal hyperbilirubinemia caused by isoimmunization[J]. Am J Perinatol， 2020，37(11):1173-1176.
[9]	恽雯昕,范建霞. 孕前糖尿病对妊娠的影响[J]. 中国实用妇科与产科杂志,2025,41(4):400-404.
[10]	Yang ST, Liu FC, Chen HL. Comparison of transcutaneous and serum bilirubin before, under, and after phototherapy in term and late-preterm infants[J]. Kaohsiung J Med Sci, 2019, 35(11): 715-724.
[11]	Soliman A, Salama H, Al Rifai H, et al. The effect of different forms of dysglycemia during pregnancy on maternal and fetal outcomes in treated women and comparison with large cohort studies[J]. Acta Biomed, 2018, 89(S5): 11-21.
[12]	郑芳慧,张阳,邹丽.高龄肥胖女性再次妊娠代谢性疾病的预防和管理[J].中国实用妇科与产科杂志,2023,39(6):593-597.
[13]	田青,李晓东,查文清,等.新生儿高胆红素血症危险因素的Logistic分析[J].实用医学杂志,2011,27(14):2560-2562.
[14]	李翠莹,李小薇.胎儿新生儿溶血病实验室检测专家共识[J].临床输血与检验, 2021, 23(1):20-23.
[15]	程晨,张怡,陈怡静,等.吸收IgG抗-AB抗体后IgG抗-A/抗-B抗体效价预测ABO-胎儿新生儿溶血病的价值[J].中国实验血液学杂志,2024,32(6):1903-1908.
[16]	Li P, Pang LH, Liang HF, et al. Maternal IgG anti-A and anti-B titer levels screening in predicting ABO hemolytic disease of the newborn: a meta-analysis[J]. Fetal Pediatr Pathol, 2015, 34(6): 341-350.
[17]	贾丹,蒋莎,李娜,等.妊娠期ABO抗体阳性的研究进展[J].中国免疫学杂志, 2019, 35(18):2302-2306.
[18]	周晋宇,沈茹,吴翰欣,等.云南地区283例ABO系统新生儿溶血病检测结果分析[J].中国实验血液学杂志,2025,33(3):881-885.
[19]	刘凡,严争,危夷,等. 新生儿高胆红素血症的危险因素分析(附365例报告)[J]. 福建医药杂志,2016,38(6):16-17.
[20]	王秀兰.新生儿高胆红素血症的病因分析及临床处理方法探讨[J].临床医药文献电子杂志,2016,3(31):6158，6160.
[21]	陈秀, 李业瑜, 许立伦. 新生儿高胆红素血症影响因素分析[J]. 中国儿童保健杂志, 2014，22(1): 77-79.
[22]	舒春兰,邓爱果,晏菲琴,等.足月新生儿高胆红素血症影响因素的流行病学调查[J].江西医药,2022,57(10):1681-1683.
[23]	刘岩,杨春侠,张莹.新生儿黄疸128例病因探讨及中西医结合疗效分析[J].中国现代医生,2021,59(35):70-73.
[24]	左爽, 李景, 华子瑜. 1990—2019年全球新生儿黄疸疾病负担分析[J]. 中国当代儿科杂志, 2023，25(10): 1008-1015.
[25]	Lin Q, Zhu D, Chen C, et al. Risk factors for neonatal hyperbilirubinemia: a systematic review and meta-analysis[J]. Transl Pediatr， 2022，11(6):1001-1009.
[26]	吴菲,冯向春,付蓉,等.新生儿高胆红素血症相关影响因素Logistic回归分析[J].河北医科大学学报,2018,39(3):351-354.
[27]	Cai Y, Li X, Wang P, et al. Predictive factors for readmission due to neonatal hyperbilirubinemia: A retrospective case-control study[J]. PLoS One， 2025，20(4):e0320767.
[28]	Vidavalur R, Devapatla S. Trends in hospitalizations of newborns with hyperbilirubinemia and kernicterus in United States: an epidemiological study[J]. J Matern Fetal Neonatal Med, 2022, 35(25): 7701-7706.
[29]	黄家虎, 孙建华. 新生儿高胆红素血症病因的研究进展[J]. 医学综述, 2021，27(4): 680-684.
[30]	Zhang K, Fan S, Lv A, et al. Integrated analysis of microbiota with bile acids for the phototherapy treatment of neonatal jaundice[J]. Arch Med Sci， 2021，19(2):401-410.
[31]	Lin C, Lin Y, Xiao R, et al. Bifidobacterium species associated with breastfeeding alleviate neonatal hyperbilirubinaemia via the gut microbiota-α-linolenic and linoleic acid metabolism-enterohepatic circulation axis[J]. Microbiome， 2025，13(1):187.
[32]	Li Z, Zhang Y, Luo X, et al. Dynamic relationships between bilirubin concentrations and the gut microbiota in the neonatal period: A pilot prospective cohort study[J]. Pediatr Investig， 2025，9(4):347-360.

变量	非高胆红素血症（n=103）	高胆红素血症（n=118）	χ²值	P值
孕期疾病数（个）			68.712	0.000
0	42(40.0)	31(26.3)
1	36(34.3)	0(0.0)
2	27(25.7)	87(73.7)
流产或早产史			2.936	0.087
否	61(58.1)	55(46.6)
是	44(41.9)	63(53.4)
妊娠高血压或妊娠糖尿病的病史			4.911	0.027
否	73(69.5)	65(55.1)
是	32(30.5)	53(44.9)

变量	非高胆红素血症（n=103）	高胆红素血症（n=118）	χ²值	P值
分娩方式			12.376	0.000
剖宫产	13(12.4)	38(32.2)
顺产	92(87.6)	80(67.8)
新生儿性别			0.005	0.942
女	60(57.1)	68(57.6)
男	45(42.9)	50(42.4)
出生体质量			0.656	0.720
<2 500 g	36(34.3)	41(34.7)
2 500～3 500 g	45(42.9)	55(46.6)
>3 500 g	24(22.8)	22(18.7)

变量	非高胆红素血症（n=103）	高胆红素血症（n=118）	χ²值	P值
是否为早产儿			0.176	0.675
否	54(51.4)	64(54.2)
是	51(48.6)	54(45.8)
出生后异常症状（贫血、感染）			4.930	0.026
无	73(69.5)	97(82.2)
感染	3(2.9)	17(14.4)
贫血	27(25.7)	4(3.4)
贫血合并感染	2(1.9)	0(0.0)
新生儿溶血病			28.442	0.000
否	80(76.2)	117(99.2)
是	25(23.8)	1(0.8)
肌酸激酶			0.176	0.675
正常	54(51.4)	64(54.2)
偏高	51(48.6)	54(45.8)
乳酸脱氢酶			1.919	0.383
正常	31(29.5)	34(28.8)
偏低	22(20.9)	17(14.4)
偏高	52(49.6)	67(56.8)

项目	真阳性	假阴性	假阳性	真阴性	召回率	精确率	准确率	F1值	特异度	阳性预测值	阴性预测值	ROC曲线下面积	灵敏度	ROC曲线下面积的95%CI
Logistic回归	22	11	7	27	0.667	0.759	0.731	0.710	0.794	0.759	0.711	0.806	0.667	0.693～0.919
决策树	30	3	7	27	0.909	0.811	0.851	0.857	0.794	0.811	0.900	0.852	0.909	0.750～0.954
SVM	30	3	3	31	0.909	0.909	0.910	0.909	0.912	0.909	0.912	0.960	0.909	0.905～1.000
随机森林	30	3	3	31	0.909	0.909	0.910	0.909	0.912	0.909	0.912	0.981	0.909	0.948～1.000
XGBOOST	28	5	6	28	0.848	0.824	0.836	0.836	0.824	0.824	0.848	0.948	0.848	0.905～0.991
GBDT	30	3	5	29	0.909	0.857	0.881	0.882	0.853	0.857	0.906	0.949	0.909	0.906～0.992

基于机器学习下构建新生儿高胆红素血症风险预测模型的应用研究

An applied study on constructing a neonatal hyperbilirubinemia risk prediction model based on machine learning

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

图/表 5

参考文献 32

相关文章 9

编辑推荐

Metrics

本文评价

[1]	李叶, 宛玉欢, 李卓. 颅内占位性病变患儿术后营养不良风险预测模型的构建及验证研究[J]. 中国中西医结合儿科学, 2025, 17(4): 330-335.
[2]	周赛君, 宋青青, 杨美玉. 先天性心脏病患儿术后发生肺部感染的危险因素及预测模型构建[J]. 中国中西医结合儿科学, 2025, 17(2): 178-183.
[3]	阳波, 滕思思. 儿童体外循环手术压力性损伤发生风险因素及预测模型构建[J]. 中国中西医结合儿科学, 2024, 16(6): 521-525.
[4]	全卉, 宋青青, 易青梅. 川崎病患儿并发冠脉损伤的预测模型构建[J]. 中国中西医结合儿科学, 2024, 16(5): 436-439.
[5]	吴绍霞, 沈广礼, 吴海霞, 刘继鹏, 王洪洲, 朱美云. 基于临床资料和一氧化氮相融合的毛细支气管炎发生反复喘息的预测模型构建及验证[J]. 中国中西医结合儿科学, 2024, 16(3): 215-221.
[6]	赵健翔, 吴振起, 王雪峰, 王子, 褚亚奇, 游毅. 基于机器学习的注意力缺陷多动障碍风险预测研究[J]. 中国中西医结合儿科学, 2024, 16(2): 130-136.
[7]	吕玮. 经皮黄疸仪监测新生儿胆红素210例分析[J]. 中国中西医结合儿科学, 2012, 4(5): 445-446.
[8]	陶静. 瑞氏综合征患儿应用护理风险评估1例体会[J]. 中国中西医结合儿科学, 2012, 4(4): 383-384.
[9]	应雅丽. 大黄茵陈甘草汤联合妈咪爱治疗新生儿高胆红素血症42例疗效观察[J]. 中国中西医结合儿科学, 2012, 4(3): 224-225.