Released: 2026-04-01
This is the first public release of TCMDATA, a comprehensive R toolkit for Traditional Chinese Medicine (TCM) network pharmacology research.
search_herb(): Query herb–compound–target associations by herb name (Chinese, Pinyin, or Latin)search_target(): Reverse lookup—find herbs targeting specific genessearch_disease(): Query DisGeNET (via DOSE) for disease-associated genes; supports UMLS CUI, exact name, and fuzzy matchingsearch_gene_disease(): Reverse lookup—find all diseases associated with given gene symbols or Entrez IDssearch_geo_datasets(): Query NCBI GEO for expression datasets (GDS/GSE) by disease keyword; filters by organism and dataset type
resolve_cid(): Resolve compound names to PubChem CIDsgetprops(): Retrieve compound properties (MW, LogP, TPSA, etc.)compound_similarity(): Tanimoto similarity search against PubChemdownload_ligand_structure(): Download molecular structures (SDF/MOL2/PDB)convert_structure(): Convert between molecular file formats
get_ppi(): Retrieve STRING PPI networks (wrapper aroundclusterProfiler::getPPI())prepare_herb_graph(): Build herb–compound–target networksppi_subset(): Filter PPI networks by confidence scorecompute_nodeinfo(): Calculate 17+ centrality metrics (degree, betweenness, closeness, MCC, DMNC, BN, EPC, radiality, Stress, etc.)rank_ppi_nodes(): Rank nodes by multiple topological featuresppi_knock(): Evaluate network robustness via drug-attack simulation
run_louvain(): Louvain community detectionrun_MCL(): Markov Clustering (MCL)run_mcode(): MCODE algorithm for dense subgraph detection
herb_enricher(): Herb-based over-representation analysis (ORA)- Compatible with GO/KEGG via clusterProfiler
run_ml_screening(): Run 6 ML algorithms with 3 validation modes-
LASSO, Elastic Net, Ridge
-
Random Forest + Boruta
-
SVM-RFE
-
XGBoost
-
get_ml_consensus(): Consensus feature selection across methodsplot_ml_roc(),plot_ml_venn(): Visualize ML results
tcm_setup(): Configure LLM provider (OpenAI, Anthropic, Gemini, Deepseek, aihubmix, and 10+ others)tcm_interpret(): Interpret enrichment results, PPI data, or free textdraft_result_paragraph(): Generate manuscript-ready result paragraphstcm_interpret_schema(): Structured output with custom schemascreate_tcm_agent(),make_tcm_function(): Build reusable AI functions
tcm_agent(): One-shot natural language entry point with automatic task routing to appropriate toolstcm_chat(): Interactive multi-turn chat session with built-in commands (/help,/artifacts,/history,/model,/stats,/clear,/quit)route_tcm_task(): Rule-based task router supporting 11 task types (herb_lookup, disease_lookup, enrichment, ppi_analysis, ml_screening, pubmed, geo_search, visualization, interpretation, etc.) with Chinese and English keyword patternscreate_tcm_task_agent(): Create a configured aisdk agent with tools and skills for custom workflowscreate_tcm_tools(): Generate 36 aisdk Tool objects wrapping TCMDATA functions; supports filtering by task_type or tool_namestcm_init_skills(): Copy bundled skills to local directory for customizationtcm_use_skills()/tcm_reset_skills()/tcm_skill_dir(): Skill directory management
get_pubmed_data(): Search PubMed for TCM–disease literatureget_pubmed_table(): Export results as publication tables
tcm_sankey(): Herb–compound–target Sankey diagramsggdot_sankey(): Combined dot-Sankey plots for enrichmentggdock(): Molecular docking affinity heatmapsgglollipop(): Lollipop plots for enrichment resultsplot_node_heatmap(): PPI centrality heatmapsradar_plot(): Radar charts for multi-dimensional comparisonsgo_barplot(),gocircle_plot(): GO enrichment visualizationsggvenn_plot(): Venn diagramsupsetplot(): UpSet plot for gene set intersection visualization (wrapper aroundaplotExtra::upset_plot())
gutMGene: Gut microbiota–metabolite associationstf_targets: Transcription factor–target regulation pairsdn_gcds,dn_otp: Diabetic nephropathy reference datasets (GeneCards & Open Targets)deg_earlydn: DESeq2 results for early diabetic nephropathy (GSE142025)demo_ppi: Example PPI network for tutorialscovid19: COVID-19 case study dataset