PB-ML: Poisson-Boltzmann Machine Learning Model

Details can be found in the paper: "Poisson-Boltzmann based machine learning (PBML) model for electrostatic analysis", https://arxiv.org/abs/2312.11482.
We first trained different DNN architectures based on 367 features of 4294 PDBBind protein data, using the following 448 combinations of hyperparameters:

epochs = [50, 100, 150, 200, 250, 300, 400, 500, 600, 700, 800, 900, 1000, 1500],
layers = ['2048,2048,512,512', '8000,8000,8000,8000', '500,500,500', '500,250,250', '12000,12000,12000', '2048,5000,8000,2048','12000,8000,8000,10000,5000','8192,8192,2048,2048,4096,4096'],
batch_sizes = [50, 100, 200, 400],
and No. 364 model performed the best with a minimum test MAPE (mean absolute percentage error) at 0.004023. This model is trained from Keras with batch_size = 400, epochs = 900, layers = [367,500,500,500,1], and is applied on the 195 test proteins and compared with MIBPB solver at a mesh size = 0.5, as illustrated in Figure 4 in the paper.

In this repo, the python script "feature.py" generates the graph features, VDW and Coulomb force features, in a total number of 75.
The python script "generate_feature.py" gathers all 367 features including the above, other protein features and GB features, as described in the paper.
The python script "run364.py" was written for running 195 test proteins in a parallel manner. As our train and test datasets are still private, some functions and comment lines which are serving the test dataset can be neglected. The original dataset might be shared upon request. This script now has the following functions:

it calls the softwares, i.e., the GB solver: bornRadius, the ESES solver: MS_Intersection;
it calls the above two sciptes to prepare the features for each protein;
it uses the trained No.364 model to predict the solvation energy difference between MIBPB at mesh size 0.2 and GB solvation energy;
it has now been update to predict Barnase and Barstar binding simulation, the corresponding datasets are in prep_bind/data-set2;
it uses the X_train file containing the 367 features of 4294 proteins for data normalization.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
__pycache__		__pycache__
prep_bind		prep_bind
saved_model/364		saved_model/364
README.md		README.md
X_train.txt		X_train.txt
feature.py		feature.py
generate_feature.py		generate_feature.py
run364.py		run364.py
training.pyc		training.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PB-ML: Poisson-Boltzmann Machine Learning Model

About

Releases

Packages

Languages

yangxinsharon/PB-ML

Folders and files

Latest commit

History

Repository files navigation

PB-ML: Poisson-Boltzmann Machine Learning Model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages