HyMM: Hybrid method for disease-gene prediction by integrating multiscale module structures

Motivation: Identifying disease-related genes is an important problem in computational biology. Module structure is widespread in biomolecular networks, and complex diseases are usually thought to be caused by perturbations of local neighborhoods in these networks, so module structure can provide useful insights into disease-related genes. However, mining module structure and using it effectively remain challenging in tasks such as disease-gene prediction.

Results: We propose a hybrid disease-gene prediction method that integrates multiscale module structure (HyMM) and uses information from local to global scales to predict disease-related genes more effectively. HyMM extracts module partitions from local to global scales by multiscale modularity optimization with exponential sampling, and estimates the disease relatedness of genes in each partition from the abundance of disease-related genes within modules. A probabilistic model for integrating gene rankings then combines the multiple predictions derived from the multiscale module partitions and from network propagation, and a parameter estimation strategy based on functional information further enhances HyMM's predictive power. In a series of experiments, we show the importance of module partitions at different scales, verify that HyMM performs stably and well compared with eight other state-of-the-art methods, and show that parameter estimation improves its performance further.

Conclusions: The results confirm that HyMM is an effective framework for integrating multiscale module structure to enhance the prediction of disease-related genes, which may provide useful insights for the study of multiscale module structure and its application to tasks such as disease-gene prediction.
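
The module-based scoring idea in the Results paragraph can be illustrated with a toy example. This is only a minimal sketch, not the HyMM implementation: it assumes that "abundance" means the fraction of known disease genes in a module at a single scale, and the names P, seed, abundance and score are hypothetical. The statistic actually used by HyMM may differ.

```matlab
% Toy data (hypothetical): 6 genes, 2 modules at one scale.
% P(i,k) = 1 if gene i belongs to module k.
P = [1 0; 1 0; 1 0; 0 1; 0 1; 0 1];
seed = [1; 1; 0; 0; 0; 0];              % genes 1 and 2 are known disease genes

moduleSize  = sum(P, 1)';               % number of genes in each module
moduleSeeds = P' * seed;                % number of known disease genes in each module
abundance   = moduleSeeds ./ moduleSize; % assumed statistic: fraction of disease genes

score = P * abundance;                  % each gene inherits the abundance of its module
score(seed > 0) = -Inf;                 % known genes are not candidates
[~, order] = sort(score, 'descend');    % candidate ranking, best first

disp(order');
```

In this toy case gene 3 ranks first because it shares a module with both known disease genes. Repeating the same scoring over partitions at several scales yields one ranking per scale, which is the kind of input a rank-integration step would combine.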

Requirements

MATLAB 2016 or later

Codes

#main_HyMM.m: cross-validation code.
This code supports parallel execution. To disable it, change "parfor" to "for".

#A_HyMM.m: the recommended HyMM algorithm in the study.
A_HyMM(COM_Dataset, AdjGfG, AdjGfD, AdjDfD, DisIDset, plus_method_set, RankMergeMethod)
% Input:
% COM_Dataset: a table recording the partition matrices
% AdjGfG: associations between genes (G) and genes (G)
% AdjGfD: associations between genes (G) and diseases (D)
% AdjDfD: associations between diseases (D) and diseases (D)
% DisIDset: disease index
% plus_method_set: baseline algorithms. If plus_method_set is given, the results of the baseline algorithms are also output.
% e.g. plus_method_set = {'RWRH'};
% RankMergeMethod: aggregation method
% Output:
% TableScores: a table whose variables record the scores of genes.
% COM_Dataset: the preprocessed multiscale partitions, which makes the partition information easier to reuse later, e.g. for cross-validation.
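
To make the expected input shapes concrete, the sketch below builds tiny association matrices. It rests on assumptions the README does not state: that rows of AdjGfD index genes and columns index diseases (as the name suggests), that the matrices may be sparse, that DisIDset holds column indices of diseases, and that the outputs are returned in the order listed above. The sizes nG and nD are hypothetical.

```matlab
% Hypothetical toy sizes: 5 genes, 3 diseases.
nG = 5;
nD = 3;

% Gene-gene associations: symmetric nG-by-nG matrix (a simple chain of genes).
AdjGfG = sparse([1 2 3 4], [2 3 4 5], 1, nG, nG);
AdjGfG = AdjGfG + AdjGfG';

% Gene-disease associations: assumed nG-by-nD (rows = genes, columns = diseases).
AdjGfD = sparse([1 2 4], [1 1 3], 1, nG, nD);

% Disease-disease associations: symmetric nD-by-nD matrix.
AdjDfD = sparse([1 2], [2 3], 1, nD, nD);
AdjDfD = AdjDfD + AdjDfD';

DisIDset = [1 3];               % assumed: indices of the diseases to predict for
plus_method_set = {'RWRH'};     % baseline named in the README

% Consistency checks on the assumed dimensions.
assert(isequal(size(AdjGfG), [nG nG]));
assert(isequal(size(AdjGfD), [nG nD]));
assert(isequal(size(AdjDfD), [nD nD]));

% The call itself needs the repository on the path, a real COM_Dataset table
% (for example from the demo .mat file) and a valid RankMergeMethod value,
% so it is left commented out here:
% [TableScores, COM_Dataset] = A_HyMM(COM_Dataset, AdjGfG, AdjGfD, AdjDfD, ...
%                                     DisIDset, plus_method_set, RankMergeMethod);
```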

Dataset

A demo dataset is provided in the file: data/demoDataSet&PPICOM_ModCM_delta=0.2.mat
This dataset includes:
(1) disease-gene associations, disease-disease associations and gene-gene associations;
(2) multiscale module partition matrices.
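
Because the README does not list the variable names stored in the demo file, one way to explore it is to inspect the file before using it. The sketch below uses only standard MATLAB functions; it assumes the current folder is the repository root, and the names dataFile, info and S are hypothetical.

```matlab
% Path taken from the README. The file name contains '&' and '=',
% so it must stay inside a quoted string.
dataFile = fullfile('data', 'demoDataSet&PPICOM_ModCM_delta=0.2.mat');

if exist(dataFile, 'file') == 2
    % List the stored variables without loading them into the workspace.
    info = whos('-file', dataFile);
    for k = 1:numel(info)
        fprintf('%-25s %-10s %s\n', info(k).name, info(k).class, mat2str(info(k).size));
    end
    % Load everything into a struct so the workspace is not overwritten.
    S = load(dataFile);
else
    warning('Demo data not found at %s. Run this from the repository root.', dataFile);
end
```

The printed names and sizes show which variables correspond to the association matrices and to the partition table described above.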

Results

Results are saved automatically to the directory: results.

Cite

If you use HyMM in your research, please cite:
Xiang, et al., HyMM: Hybrid method for disease-gene prediction by integrating multiscale module structure, bioRxiv (2021), doi: 10.1101/2021.04.30.442111.

Contact

Email: [email protected]

Repository: xiangju0208/DGP_HyMM