
浏览全部资源
扫码关注微信
上海中医药大学中药学院(上海 201203)
张能仪,女,本科,主要从事中药学研究工作
王天明,高级实验师;E-mail:wtmtcm@126.com
收稿:2025-09-19,
纸质出版:2026-02-10
移动端阅览
张能仪,李园园,王天明.机器学习算法在中药活性成分筛选中的应用[J].上海中医药杂志,2026,60(2):1-10.
ZHANG Nengyi,LI Yuanyuan,WANG Tianming.Application of machine learning algorithms in screening of active components in Chinese materia medica[J].Shanghai Journal of Traditional Chinese Medicine,2026,60(2):1-10.
张能仪,李园园,王天明.机器学习算法在中药活性成分筛选中的应用[J].上海中医药杂志,2026,60(2):1-10. DOI: 10.16305/j.1007-1334.2026.z20250919002.
ZHANG Nengyi,LI Yuanyuan,WANG Tianming.Application of machine learning algorithms in screening of active components in Chinese materia medica[J].Shanghai Journal of Traditional Chinese Medicine,2026,60(2):1-10. DOI: 10.16305/j.1007-1334.2026.z20250919002.
中药作为复杂的化学体系,其活性成分筛选长期面临成分多样性高、作用机制不明等挑战。近年来,机器学习算法技术的介入为中药现代化研究提供了新范式。系统梳理机器学习算法的分类并探讨其在中药活性成分筛选中的应用场景、技术挑战与优化策略。
Chinese materia medica (CMM), as a complex chemical system, has long faced challenges in active components screening, such as the high compositional diversity and unclear mechanisms of action. In recent years, the integration of machine learning algorithms has provided a new paradigm for the modernization of CMM research. This paper systematically reviews the classification of machine learning algorithms and explores their application scenarios, technical challenges, and optimization strategies in the screening of active components from CMM.
巩亦萍 , 顾志荣 , 郭引兄 , 等 . 中药药效物质基础研究方法与思路的进展与展望 [J]. 中成药 , 2025 , 47 ( 4 ): 1211 - 1216 .
许晴 , 李智 , 万梅绪 , 等 . 中药活性成分筛选新技术研究进展 [J]. 药物评价研究 , 2021 , 44 ( 7 ): 1541 - 1547 .
刘晓光 , 李梅 . 基于深度学习的药物-靶标相互作用预测研究综述 [J]. 智能系统学报 , 2024 , 19 ( 3 ): 494 - 524 .
丁梦 , 谭旺晓 , 张笑 , 等 . 基于靶点的中药活性成分发现技术研究进展 [J]. 中国中药杂志 , 2025 , 50 ( 13 ): 3645 - 3656 .
SUN J , DONG Q X , WANG S W , et al . Artificial intelligence in psychiatry research, diagnosis, and therapy [J]. Asian J Psychiatr , 2023 , 87 : 103705 .
张君宇 , 方格 , 陈家旭 , 等 . 基于机器学习方法构建气滞证及亚型证辨证决策网络模型 [J]. 辽宁中医药大学学报 , 2025 , 27 ( 7 ): 175 - 180 .
MENG X , LANG Y , LI X , et al . Progress in the application of AI in the standardization of traditional Chinese medicine: A review based on machine learning and deep learning [J]. Pharmac Res Modern Chin Med , 2025 , 16 : 100639 .
TANG Y P , XU D Q , YUE S J , et al . Modern research thoughts and methods on bio-active components of TCM formulae [J]. Chin J Nat Med , 2022 , 20 ( 7 ): 481 - 493 .
何丽君 , 顾承真 , 马黄璜 , 等 . 基于色谱技术对淫羊藿成分及指纹图谱研究 [J]. 海峡药学 , 2023 , 35 ( 2 ): 61 - 65 .
文星星 , 沈萍 , 张涛 , 等 . 生物色谱法在中药研究中的应用进展 [J]. 工业微生物 , 2023 , 53 ( 6 ): 81 - 83 .
HE Y , ZHAO X , YU M , et al . Affinity ultrafiltration mass spectrometry for screening active ingredients in traditional Chinese medicine: A review of the past decade (2014-2024) [J]. Molecules , 2025 , 30 ( 3 ): 608 .
黄晨存 , 严晟 , 赵元 , 等 . 赤芍抗凝血活性成分的筛选 [J]. 中国药物化学杂志 , 2023 , 33 ( 3 ): 181 - 190 .
赵东方 , 何荣景 , 侯旭东 , 等 . 源于天然的酶抑制剂高效发现及评价新技术:进展与展望 [J]. 上海中医药大学学报 , 2021 , 35 ( 1 ): 1 - 11, 19 .
赵珊 , 张文青 , 张欣悦 , 等 . 基于药物靶标识别中药活性成分的研究方法及应用 [J]. 药物评价研究 , 2023 , 46 ( 7 ): 1578 - 1586 .
PATEL L , SHUKLA T , HUANG X , et al . Machine learning methods in drug discovery [J]. Molecules , 2020 , 25 ( 22 ): 5277 .
刘利梅 , 陈晓晋 , 孙世伟 , 等 . 深度学习在药物活性预测研究中的应用 [J]. 生物化学与生物物理进展 , 2022 , 49 ( 8 ): 1498 - 1519 .
DAINA A , ZOETE V . Testing the predictive power of reverse screening to infer drug targets, with the help of machine learning [J]. Commun Chem , 2024 , 7 ( 1 ): 105 .
OBAIDO G , MIENYE I D , EGBELOWO O F , et al . Supervised machine learning in drug discovery and development: Algorithms, applications, challenges, and prospects [J]. Machine Learn Applic , 2024 , 17 : 100576 .
JIN K , RONG C , CHANG J . Application of machine learning algorithms in drug screening [J]. J Complex Health Sci , 2023 , 6 ( 2 ): 35 - 38 .
CERVANTES J , GARCIA-LAMONT F , RODRIGUEZ-MAZAHUA L , et al . A comprehensive survey on support vector machine classification: Applications, challenges and trends [J]. Neurocom , 2020 , 408 : 189 - 215 .
ROGRIGUEZ-PEREZ R , BAJORATH J . Evolution of support vector machine and regression modeling in chemoinformatics and drug discovery [J]. J Comput Aided Mol Des , 2022 , 36 ( 5 ): 355 - 362 .
LU Y , YE T , ZHENG J . Decision tree algorithm in machine learning [C]// IEEE. Proceedings of 2022 IEEE International Conference on Advances in Electrical Engineering and Computer Applications (AEECA) . Dalian : IEEE , 2022 : 1014 - 1017 .
SALMAN H A , KALAKECH A , STEITI A . Random forest algorithm overview [J]. Babylon J Machine Learn , 2024 , 2024 : 69 - 79 .
LIND A P , ANDERSON P C . Predicting drug activity against cancer cells by random forest models based on minimal genomic information and chemical properties [J]. PLoS One , 2019 , 14 ( 7 ): e0219774 .
BENTEJAC C , CSORGO A , MARTINEZ-MUNOZ G . A comparative analysis of gradient boosting algorithms [J]. Art Intel Rev , 2021 , 54 ( 3 ): 1937 - 1967 .
STAUFFER R , ZEILEIS A , MAYR G J . Long‑term foehn reconstruction combining unsupervised and supervised learning [J]. Int J Clim , 2024 , 44 ( 16 ): 5890 - 5901 .
YANG Y , ZHOU D X . Optimal rates of approximation by shallow reLUk neural networks and applications to nonparametric regression [J]. Construc Approx , 2024 , 62 : 329 - 360 .
HADIBY S , ALI Y M B . FNN based-virtual screening using 2D pharmacophore fingerprint for activity prediction in drug discovery [J]. Int J Comput Intel App , 2022 , 21 ( 3 ): 2250019 .
LIANG L , LIU Y , KANG B , et al . Large-scale comparison of machine learning algorithms for target prediction of natural products [J]. Brief Bioinform , 2022 , 23 ( 5 ): bbac359 .
AHMED H , SHEDEED H A , HAMAD S , et al . Feedforward neural network in cancer treatment response prediction [C] //ICAICV. Proceedings of The International Conference on Artificial Intelligence and Computer Vision . Cham : Springer Nature Switzerland , 2023 : 119 - 128 .
GLIELMO A , HUSIC B E , RODRIGUEZ A , et al . Unsupervised learning methods for molecular simulation data [J]. Chem Rev , 2021 , 121 ( 16 ): 9722 - 9758 .
NAEEM S , ALI A , ANAM S , et al . An unsupervised machine learning algorithms: Comprehensive review [J]. Int J Comput Dig Sys , 2023 , 13 ( 1 ): 911 - 921 .
ECKHARDT C M , MADJAROVA S J , WILLIAMS R J , et al . Unsupervised machine learning methods and emerging applications in healthcare [J]. Knee Surg Sports Traumatol Arthrosc , 2023 , 31 ( 2 ): 376 - 381 .
江启煜 , 曾慧妍 . 基于人工智能和组学数据驱动的中药潜在机制新型分析预测方法 [J]. 中国组织工程研究 , 2025 , 29 ( 35 ): 7552 - 7561 .
LEE K , JANG J , SEO S , et al . Drug-likeness scoring based on unsupervised learning [J]. Chem Sci , 2022 , 13 ( 2 ): 554 - 565 .
秦智慧 , 李宁 , 刘晓彤 , 等 . 无模型强化学习研究综述 [J]. 计算机科学 , 2021 , 48 ( 3 ): 180 - 187 .
SHEIKHOLESLAMI M , MAZROUEI N , GHEISARI Y , et al . DrugGen enhances drug discovery with large language models and reinforcement learning [J]. Sci Rep , 2025 , 15 ( 1 ): 13445 .
ALZUBAIDI L , ZHANG J , HUMAIDI A J , et al . Review of deep learning: concepts, CNN architectures, challenges, applications, future directions [J]. J Big Data , 2021 , 8 ( 1 ): 53 .
AHMED S F , ALAM M S B , HASSAN M , et al . Deep learning modelling techniques: current progress, applications, advantages, and challenges [J]. Art Intel Rev , 2023 , 56 ( 11 ): 13521 - 13617 .
KIMBER T B , CHEN Y , VOLKAMER A . Deep learning in virtual screening: Recent applications and developments [J]. Int J Mol Sci , 2021 , 22 ( 9 ): 4435 .
ZHANG P , ZHANG D , ZHOU W , et al . Network pharmacology: towards the artificial intelligence-based precision traditional Chinese medicine [J]. Brief Bioinform , 2024 , 25 ( 1 ): bbad518 .
TILBORG D V , GRISONI F . Traversing chemical space with active deep learning for low-data drug discovery [J]. Nat Comput Sci , 2024 , 4 ( 10 ): 786 - 796 .
NGUYEN-VO T H , DO T T T , NGUYEN B P . Multitask learning on graph convolutional residual neural networks for screening of multitarget anticancer compounds [J]. J Chem Inform Model , 2024 , 64 ( 18 ): 6957 - 6968 .
DAS P , MAZUMDER D H . MLCNNF: a multi-label convolutional neural network framework for predicting adverse COVID drug reactions from the chemical structure [J]. IEEE Trans Comput Biol Bioinform , 2025 , 22 ( 1 ): 285 - 297 .
SUN M , ZHAO S , GILVARY C , et al . Graph convolutional networks for computational drug development and discovery [J]. Brief Bioinform , 2020 , 21 ( 3 ): 919 - 935 .
XIA W , XIAO J , BIAN H , et al . Deep learning-based construction of a drug-like compound database and its application in virtual screening of HsDHODH inhibitors [J]. Methods , 2024 , 225 : 44 - 51 .
ZOU J , ZHAO L , SHI S . Generation of focused drug molecule library using recurrent neural network [J]. J Mol Model , 2023 , 29 ( 12 ): 361 .
LIU X , YE K , VAN VLIJMEN H W T , et al . An exploration strategy improves the diversity of de novo ligands using deep reinforcement learning: a case for the adenosine A 2A receptor [J]. J Cheminform , 2019 , 11 ( 1 ): 35 .
CARPENTER K , PILOZZI A , HUANG X . A pilot study of multi-input recurrent neural networks for drug-kinase binding prediction [J]. Molecules , 2020 , 25 ( 15 ): 3372 .
ZHANG H , LIU X , CHENG W , et al . Prediction of drug-target binding affinity based on deep learning models [J]. Comput Biol Med , 2024 , 174 : 108435 .
WANG X , LIU J , ZHANG C , et al . SSGraphCPI: A novel model for predicting compound-protein interactions based on deep learning [J]. Int J Mol Sci , 2022 , 23 ( 7 ): 3780 .
NARGIS T , SHAHA S A N , HAZAREENA A . An approach in prediction of medicine side effects using RNN networks [C]//IEEE. Proceedings of 2024 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics (DISCOVER) . Wuhan : IEEE , 2024 : 416 - 421 .
CAI H , ZHANG H , ZHAO D , et al . FP-GNN: a versatile deep learning architecture for enhanced molecular property prediction [J]. Brief Bioinform , 2022 , 23 ( 6 ): 1 - 12 .
ZHANG S , JIN Y , LIU T , et al . SS-GNN: A simple-structured graph neural network for affinity prediction [J]. ACS Omega , 2023 , 8 ( 25 ): 22496 - 22507 .
XIONG J , XIONG Z , CHEN K , et al . Graph neural networks for automated de novo drug design [J]. Drug Discov Today , 2021 , 26 ( 6 ): 1382 - 1393 .
WANG Y , SU Y , ZHAO K , et al . A deep learning drug screening framework for integrating local-global characteristics: A novel attempt for limited data [J]. Heliyon , 2024 , 10 ( 14 ): e34244 .
BOONPALIT K , CHUNTAKARUK H , KINCHAGAWAT J , et al . Pre-training strategy for antiviral drug screening with low-data graph neural network: A case study in HIV-1 K103N reverse transcriptase [J]. J Comput Chem , 2025 , 46 ( 1 ): e27514 .
NAVEED H , KHAN A U , QIU S , et al . A comprehensive overview of large language models [J]. ACM Trans Int Sys Tech , 2025 , 16 ( 5 ): 1 - 72 .
ZHENG Y , KOH H Y , JU J , et al . Large language models for drug discovery and development [J]. Patterns , 2025 , 6 ( 10 ): 101346 .
CHOI Y S , HAN S K , KIM J , et al . ConPlex: a server for the evolutionary conservation analysis of protein complex structures [J]. Nucleic Acids Res , 2010 , 38 ( Web Server issue ): W450 - W456 .
马璐 , 田国祥 , 耿辉 , 等 . 中药系统药理数据库TCMSP及其分析应用简介 [J]. 中国循证心血管医学杂志 , 2020 , 12 ( 12 ): 1413 - 1416 .
XUE T , GONG S , XIE T H , et al . Development status and trend of traditional Chinese medicine database [J]. TMR Mod Herb Med , 2022 , 5 : 19 .
GAO K , LIU L , LEI S , et al . HERB 2.0: an updated database integrating clinical and experimental evidence for traditional Chinese medicine [J]. Nucleic Acids Res , 2025 , 53 ( D1 ): D1404 - D1414 .
RYZHKOV F V , RYZHKOVA Y E , ELINSON M N . Python tools for structural tasks in chemistry [J]. Mol Divers , 2025 , 29 ( 4 ): 3733 - 3752 .
KORSHUNOVA M , GINSBURG B , TROPSHA A , et al . OpenChem: a deep learning toolkit for computational chemistry and drug design [J]. J Chem Inform Model , 2021 , 61 ( 1 ): 7 - 13 .
CARRACEDO-REBOREDO P , LINARES-BLANCO J , RODRIGUEZ-FERNANDEZ N , et al . A review on machine learning approaches and trends in drug discovery [J]. Comput Struct Biotechnol J , 2021 , 19 : 4538 - 4558 .
王曦廷 , 李彧 , 张澜 , 等 . 基于机器学习的抗纤维化中药化合物筛选研究 [J]. 北京中医药大学学报 , 2019 , 42 ( 1 ): 30 - 36 .
YANG S , SHEN Y , LU W , et al . Evaluation and identification of the neuroprotective compounds of Xiaoxuming Decoction by machine learning: a novel mode to explore the combination rules in traditional Chinese medicine prescription [J]. Biomed Res Int , 2019 , 2019 ( 1 ): 6847685 .
XIONG W P , ZHU Y M , ZENG Q X , et al . Dose-effect relationship analysis of TCM based on deep Boltzmann machine and partial least squares [J]. Math Biosci Eng , 2023 , 20 ( 8 ): 14395 - 14413 .
HUANG Z Y , WANG L J , WANG J J , et al . Hispaglabridin B,a constituent of liquorice identified by a bioinformatics and machine learning approach,relieves protein-energy wasting by inhibiting forkhead box O1 [J]. Br J Pharmacol , 2019 , 176 ( 2 ): 267 - 281 .
GUO F , TANG X , ZHANG W , et al . Exploration of the mechanism of traditional Chinese medicine by AI approach using unsupervised machine learning for cellular functional similarity of compounds in heterogeneous networks, XiaoErFuPi granules as an example [J]. Pharmacol Res , 2020 , 160 : 105077 .
ZHAO Y , MAO S , LI Y , et al . Rapid screening of active ingredients in traditional Chinese medicine based on bar-form-diagram fingerprint combined with spectrum-effect relationship [J]. J Ethnopharmacol , 2025 , 351 : 120137 .
TIAN M , MA X , NIE L , et al . Screening of bioactive compounds and deep learning-driven quality control of Angong Niuhuang Pills [J]. J Ethnopharmacol , 2025 , 351 : 120095 .
PAN Y , CAI H , YE F , et al . ElixirSeeker: A machine learning framework utilizing fusion molecular fingerprints for the discovery of lifespan-extending compounds [J]. Aging Cell , 2025 , 24 ( 8 ): e70116 .
WANG H , LIU G , XU M . A novel method for predicting the interaction between Chinese herbal compounds and targets [C]//SPIE. Proceedings of Third International Conference on Biomedical and Intelligent Systems (IC-BIS 2024) . Nanchang : SPIE , 2024 : 677 - 683 .
JOSHI C P , BALDI A , KUMAR N , et al . Harnessing network pharmacology in drug discovery: an integrated approach [J]. Naunyn Schmiedebergs Arch Pharmacol , 2025 , 398 ( 5 ): 4689 - 4703 .
NOOR F , ASIF M , ASHFAQ U A , et al . Machine learning for synergistic network pharmacology: a comprehensive overview [J]. Brief Bioinform , 2023 , 24 ( 3 ): bbad120 .
MA J , WEI P , XU X , et al . Machine learning-assisted analysis of serum metabolomics and network pharmacology reveals the effective compound from herbal formula against alcoholic liver injury [J]. Chin Med , 2025 , 20 ( 1 ): 48 .
GONG Z , CHE Q , HU M , et al . Integrating multi-dimensional data to reveal the mechanisms and molecular targets of baikening granules for treatment of pediatric influenza [J]. Front Mol Biosci , 2025 , 12 : 1637980 .
ZHENG X , SHI C , XIE Y , et al . Bioactive components of Jiedu Sangen decoction against colorectal cancer: A novel and comprehensive research strategy for natural drug development [J]. Phytomedicine , 2025 , 142 : 156795 .
MOHAMMED S , BUDACH L , FEUERPFEIL M , et al . The effects of data quality on machine learning performance on tabular data [J]. Inform Sys , 2025 , 132 : 102549 .
张泽丹 , 王斌 , 张雯 , 等 . 中医药真实世界数据质量评估方法进展 [J]. 科技导报 , 2024 , 42 ( 21 ): 13 - 19 .
陈健 , 陈启龙 . 网络药理学在中医药研究中的现状及思考 [J]. 上海中医药大学学报 , 2021 , 35 ( 5 ): 1 - 6 .
ZENG J , JIA X . Systems theory-driven framework for AI integration into the holistic material basis research of traditional Chinese medicine [J]. Engineering , 2024 , 40 : 28 - 50 .
VO A H , VAN VLEET T R , GUPTA R R , et al . An overview of machine learning and big data for drug toxicity evaluation [J]. Chem Res Toxicol , 2019 , 33 ( 1 ): 20 - 37 .
WINTER P M , EDER S , WEISSENBOCK J , et al . Trusted artificial intelligence: Towards certification of machine learning applications [J]. arXiv , 2021 , 2103 : 16910 .
MURDOCH W J , SINGH C , KUMBIER K , et al . Definitions, methods, and applications in interpretable machine learning [J]. Proc Natl Acad Sci U S A , 2019 , 116 ( 44 ): 22071 - 22080 .
RUDIN C , CHEN C , CHEN Z , et al . Interpretable machine learning: Fundamental principles and 10 grand challenges [J]. Stat Surv , 2022 , 16 : 1 - 85 .
STIGLIC G , KOCBBEK P , FIJACKO N , et al . Interpretability of machine learning-based prediction models in healthcare [J]. WIREs Data Mining Knowl Discov , 2020 , 10 ( 5 ): e1379 .
SONG Z , CHEN G , CHEN C Y C . AI empowering traditional Chinese medicine? [J]. Chem Sci , 2024 , 15 ( 41 ): 16844 - 16886 .
苗明三 , 谢逸轩 . 论精准医学视角下的“精准中药” [J]. 上海中医药杂志 , 2023 , 57 ( 5 ): 31 - 36 .
王志强 , 薛启隆 , 刘新龙 , 等 . 基于数值模拟的中药制药减碳绿色优化策略 [J]. 上海中医药杂志 , 2025 , 59 ( 9 ): 1 - 8 .
0
浏览量
0
下载量
0
CSCD
0
CNKI被引量
关联资源
相关文章
相关作者
相关机构
沪公网安备 31011502015228号