Skip to content

Commit c2bbe26

Browse files
committed
Setup information added to docs
1 parent 5e377e5 commit c2bbe26

File tree

1 file changed

+55
-1
lines changed

1 file changed

+55
-1
lines changed

docs/README.md

+55-1
Original file line numberDiff line numberDiff line change
@@ -52,8 +52,62 @@ More in detail, each glowworm is represented as a 3D-axis object in its center o
5252

5353
## 2. Setup a simulation
5454

55-
TBC
55+
In LightDock versions prior to 0.5.0, this step was optional. From version 0.5.0, a setup step is mandatory for any simulation. This new Lightdock setup reduces complexity of the main program and it will allow the user to apply distance restraints.
56+
57+
If you execute `lightdock_setup` several options will appear:
58+
59+
```bash
60+
usage: lightdock_setup [-h] [--seed_points STARTING_POINTS_SEED]
61+
[-ft ftdock_file] [--noxt] [-anm] [--seed_anm ANM_SEED]
62+
[-anm_rec ANM_REC] [-anm_lig ANM_LIG]
63+
receptor_pdb_file ligand_pdb_file swarms glowworms
64+
lightdock_setup: error: too few arguments
65+
66+
```
67+
68+
The mandatory arguments are the PDB file containing the receptor structure, the PDB file containing the ligand structure, the number of swarms of the simulations and the number or glowworms of each swarm in this order.
69+
70+
In the original publication, for each complex of the [Protein-Protein Benchmark v5](https://zlab.umassmed.edu/benchmark/) analyzed, the number of swarms and glowworms were:
71+
72+
```
73+
Number of swarms: 400
74+
Number of glowworms per swarm: 300
75+
```
76+
77+
We found reasonable those values for number of swarms and glowworms per swarm in order to ensure a certain density and exhaustiveness of sampling, despite for smaller molecules, a smaller number of swarms could be completely valid.
78+
79+
Below there is a description of the rest of accepted paramenters of `lightdock_setup`:
80+
81+
- **seed_points** *STARTING_POINTS_SEED*: An integer can be specified as the seed used in the random number generator of the initial random poses of the ligand.
82+
- **ft** *ftdock_file*: LightDock can read the output of the venerable [FTDock](http://www.sbg.bio.ic.ac.uk/docking/ftdock.html) software in order to use the FTDock rigid-body predictions as the starting poses of the LightDock simulation. In order to do so, `lightdock_setup` classifies the different FTDock predictions according to its translation into the corresponding swarm over the surface of the receptor.
83+
- **noxt**: If this option is enabled, LightDock ignores OXT atoms. This is useful for several scoring functions which don't understand this special type of atom.
84+
- **anm**: If this option is enabled, the ANM mode is activated and backbone flexibility is modeled using ANM (via ProDy).
85+
- **seed_anm** *ANM_SEED*: An integer can be specified as the seed used in the random number generator of ANM normal modes extents.
86+
- **anm_rec** *ANM_REC*: The number of non-trivial normal modes calculated for the recepetor in the ANM mode.
87+
- **anm_lig** *ANM_LIG*: The number of non-trivial normal modes calculated for the ligand in the ANM mode.
88+
89+
### 2.1. Results of the setup
90+
91+
After the execution of `lightdock_setup` script, several files and directories will be generated in the root of your docking project:
92+
93+
- `init`: A directory containing initialization data, see below:
94+
- `cluster_centers.pdb`: A file in PDB format containing dummy atoms which coordinates correspond to a swarm center.
95+
- `initial_positions_X.dat`: One file like this for each swarm (where X indicates the id of the swarm), containing the translation, quaternion and normal modes extents (if ANM is activated) for each of the glowworms in this swarm.
96+
- `starting_positions_X.pdb`: Same as before, but a PDB file with the glowworm translation vector expressed as a dummy atom.
97+
- `starting_poses_X.bild`: Same as before, but this time every glowworm pose in rotational space is expressed as a BILD 3D object which can be readed by UCSF Chimera. More information about the format can be found [here](https://www.cgl.ucsf.edu/chimera/docs/UsersGuide/bild.html).
98+
- `swarm_0, ..., swarm_(N-1)`: For each of the N swarms specified by the `swarms` argument, a directory is created. Note that the first directory is called `swarm_0` and not `swarm_1`.
99+
- `setup.json`: A file with a summary of the setup step in JSON format. This file will be necessary for running the simulation in the next step.
100+
- `lightdock_*.pdb`: LightDock parses the input PDB structures and creates two new PDB files, one for the receptor and one for the ligand.
101+
- `*.xyz.npy`: Two files with this extension, one for the receptor and one for the ligand, which contains information about the minimum ellipsoid containing each of the structures in NumPy format.
102+
- `lightdock_rec.nm.npy`and `lightdock_lig.nm.npy`: If ANM is enabled, two files are created containing the normal modes information calculated by the ProDy library.
103+
104+
### 2.2. Tips and tricks
105+
106+
- As a general rule of thumb, the receptor structure is the bigger molecule and the ligand the smaller. With bigger and smaller, the metric used is the longest diameter of the minimum ellipsoid containing the molecule. A script called `lgd_calculate_diameter.py` can be found in `$LIGHTDOCK_HOME/bin/support` path in order to calculate an approximation of that metric.
107+
108+
- If the `init` directory exists, `lightdock_setup` makes use of the information contained in that directory.
56109

110+
- The file `setup.json` is intended for reproducibility of the results, but not to be modified by the user.
57111

58112
## 3. Run a simulation
59113

0 commit comments

Comments
 (0)