Using Autodock 4 and Vina With Autodocktools: A Tutorial: Written by Ruth Huey, Garrett M. Morris and Stefano Forli
Using Autodock 4 and Vina With Autodocktools: A Tutorial: Written by Ruth Huey, Garrett M. Morris and Stefano Forli
with AutoDockTools:
A Tutorial
8 December 2011
1
Contents
USINGAUTODOCK4 ANDVINAWITHAUTODOCKTOOLS:
A TUTORIAL................................................................................... 1
Written by Ruth Huey, Garrett M. Morris and Stefano Forli
.............................................................................................................................1
The Scripps Research Institute
Molecular Graphics Laboratory
10550 N. Torrey Pines Rd.
La Jolla, California 92037-1000
USA ....................................................................................................................1
8 December 2011
.............................................................................................................................1
Contents..........................................................................2
Introduction.....................................................................4
Exercise One: PDB Files Are Not Perfect: Editing a PDB File
......................................................................................13
Procedure:..................................................................................14
2
Files for Exercises:..........................................................30
Input Files:.................................................................................30
Results Files...............................................................................30
Ligand.............................................................................................................. 30
Macromolecule.................................................................................................30
AutoGrid.......................................................................................................... 30
AutoDock......................................................................................................... 30
VINA................................................................................................................30
Modifier........................................................................................................31
Pick.......................................................................................................... 31
Rotate.......................................................................................................31
Shift.............................................................................................................. 31
Key............................................................................................................... 31
R..........................................................................................................31
3
Introduction
This tutorial will introduce you to docking using the AutoDock suite
of programs. We will use a Graphical User Interface called
AutoDockTools, or ADT, that helps a user easily set up the two
molecules for docking, launches the external number crunching jobs in
AutoDock, and when the dockings are completed also lets the user
interactively visualize the docking results in 3D.
FAQ – Frequently Asked Questions
Yes, for both the macromolecule and the ligand, you should
always add hydrogens, compute Gasteiger charges and then
you must merge the non-polar hydrogens. Polar hydrogens are
hydrogens that are bonded to electronegative atoms like
oxygen and nitrogen. Non-polar hydrogens are hydrogens
bonded to carbon atoms.
You need one AutoGrid map for every atom type in the ligand
plus an electrostatics map and a desolvation map. E.g.: for
ethanol, C2H5OH, you would need C, OA and HD maps plus an
electrostatics ‘e’ map plus a desolvation ‘d’ map.
Yes, if you do not know where the ligand binds, you can build
a grid volume that is big enough to cover the entire surface of
the protein, using a larger grid spacing than the default value of
0.375Å, and more grid points in each dimension. Then you can
perform preliminary docking experiments with AutoDock to
see if there are particular regions of the protein that are
preferred by the ligand. This is sometimes referred to as
“blind docking”.
An AutoDock search for the best ways to fit two molecules together
constitutes a docking. The docking consists of a series of independent
searches or runs which results in a final ‘pose’ or conformation of the
ligand. Each conformation is the lowest energy found during a search.
An AutoDock experiment generally includes clustering of
conformations based on their similarity. The two molecules to fit
together and which search method to use are specified in the docking
parameter file (DPF). The results are written to a docking log file
(DLG). Reading a docking log or a set of docking logs into ADT is
Note: the dlg file contains many details that
the first step in analyzing the results of docking experiments.
are output as AutoDock parses the input files
and reports what it finds. For example, when In this exercise we will use the file imatinib_1iep_receptor.dlg from
AutoDock opens each AutoGrid map, it
reports opening the map file and how many a previous AutoDock calculation. In it, the imatinib (Gleevec ©
data points it read in. When it parses the input Novartis), a drug mainly used for the treatment of chronic
ligand file, it reports building various internal
data structures. After the input phase, myelogenous leukemia (CML), is docked to the c-Abl kinase domain.
AutoDock begins the specified number of The key results in a docking log are the docked conformations found at
runs. It reports which run number it is
starting; it may report specifics about each the end of each run, the energies of these docked structures and their
generation or simulated annealing cycle. similarities to each other. The similarity of docked structures is
After completing the runs, AutoDock begins
an analysis phase of the conformational measured by computing the root-mean-square-deviation, rmsd,
similarity of the dockings. At the very end, it between the coordinates of the atoms and creating a clustering of the
reports a summary of the time taken and
outputs the words ‘Successful Completion’. conformations based on these rmsd values.
The level of output detail is controlled by the
parameter “outlev” in the docking parameter
file. For dockings using the LGA algorithm, A visual examination of the interactions between the docked ligand
minimal output (‘outlev 0’) is recommended. and the macromolecule is used to evaluate the chemical
reasonableness of the docking.
AUTODOCKProcedure:
First, you need to choose the AutoDock log file you would like to
Note: If there were a previous Docking in the
viewer, you would be asked whether you want Analyze. This command opens a file browser that lets you choose a
to add this DLG to the previous Docking file with the extension .dlg Navigate into the Results directory and
instance. This can be done when the same
AutoGrid map files, ligand, and DPF files choose imatinib_1iep_receptor.dlg. Improve the display of the
were used for both docking experiments. ligand by clicking on the circle under S&B and the diamond under
Also, WARNING messages from the docking
log can be viewed in the python shell. To do Atom in the Dashboard Widget (Appendix 2).
so, open the python shell and type
mv.docked.warnings
Reading a docking log creates a Docking in the viewer and displays
the ligand in the lowest-energy conformation. Each conformation has a
docked energy value, a binding energy value, and per atom
electrostatic and vdw energy values. During the docking calculation,
AutoDock computes the free energy of binding and reports a detailed
energy breakdown.
ADT reports how many docked conformations were read in from the
DLG and tells you to how to visualize the docked conformations or
‘states’.
For example, the lowest energy conformation in the first bar is 1_1
while the lowest in second bar is 2_1. The height of the bar represents
how many conformations are in that cluster. Clicking on a bar makes
that cluster the current sequence for the ligand’s conformation player
CP, and its color changes to red. [See Appendix 3 for more details on
the CP.]
First, display the ligand in its lowest energy conformation using the
Conformation Player.
Examine the interactions between the ligand and nearby atoms in the
receptor:
Now that we have shown you the results of docking, we’re going to go
back to the very start to show you how to set up molecules for docking
and how to prepare the input files for AutoGrid and AutoDock.
VINAProcedure:
Protein Data Bank (PDB) files may have a variety of problems that
need to be corrected before they can be used in AutoDock. These
potential problems include missing atoms, added waters, more than
one molecule, chain breaks, alternate locations etc.
Note: Commands available via the Position the cursor over PMV Molecules in the Dashboard
Dashboard can also be invoked using the
menus at the top of the PMV GUI. Here (Appendix 2) and click with the right mouse button. This will
you could use this menu sequence to open open a file browser, showing all the files in the current
the file browser:
File->Read Molecule directory. If you are still in the Results directory, navigate up
See Appendix 2 for information about the one level by clicking on the button on the top-right of the Read
Dashboard. Molecule widget. Select 1iep_receptor.pdb and click on
Open.
In the Dashboard click on the square under the “S”, and note
all the subsets that are available.
5. Adding hydrogens
Procedure:
Note: We already added hydrogens to In the “Ligand File for AutoDock 4:” widget, click on the small
imatinib.mol2. You must always add
hydrogens to your ligand before you select it
bar at the right of PDBQT files: (*.pdbqt) to display a list of
to be the ligand. file type choices. Click on MOL2 files: to show all the files in
this directory and choose imatinib.mol2. Click on Open.
After the ligand is loaded in the viewer, ADT initializes it. This
process involves a number of steps:
• ADT detects whether the ligand already has charges or not.
If not, ADT computes Gasteiger charges; remember that
for the Gasteiger calculation to work correctly, the ligand
must have all hydrogen atoms added, including both polar
and non-polar ones. If the charges are all zero, ADT will
try to add charges. It checks whether the total charge per
residue is an integer.
ADT determines which atom fits its idea of the best root and
marks it with a green sphere.
This best root is the atom in the ligand with the smallest largest
sub-tree. In the case of a tie, if either atom is in a cycle, it is
picked to be root. If neither atom is in a cycle, the first found
is picked. (If both are in a cycle, the first found is picked). As
you might imagine, this can be a slow process for large ligands.
The rigid portion of the molecule includes this root atom and
all atoms connected to it by non-rotatable bonds (which we will
examine in the next section.) You can visualize the current
root portion with Ligand Torsion Tree
Show Root Expansion (and hide this with Ligand
Torsion Tree Show/Hide Root Marker).
5. Hide the root marker and the ligand before going on:
Procedure:
AUTODOCKProcedure:
Note: Alternatively, if you plan to use the Choosing the ligand opens the AutoGpf Ligand widget that
same macromolecule with a variety of
different ligands, you might choose to allows you to modify the types of maps to be calculated, and to
calculate all the maps you would eventually choose whether to model possible hydrogen bonding.
need via
Set Map Types-> Directly.
Close this widget with the Accept button.
• This has menu buttons at the top: File, Center, View and
Help.
File
This menu lets you close the Grid Options Widget,
which also causes the grid box to disappear. You can
Close saving current values to keep your changes
or Close w/out saving to forget your changes.
Center
This menu lets you set the center of the grid box in four
ways: Pick an atom, Center on ligand,
Center on macromolecule or On a named
atom.
View
• You will also notice it has a thumbwheel that lets you adjust
the spacing between the grid points.
• There are also entries and thumbwheels that let you change the
location of the center of the grid.
Vina grid definition requires the center coordinates x,y,z and the sizes
in Angstrom. Those values can be obtained from the Grid widget used
for the AutoGrid setup (see step 2) by changing the grid resolution to
1.00 Angstroms. Try to define a grid box the most similar to the one
used by AutoDock. For this purpose, use the same center as in the step
2, and visually inspect that the number of points 17, 24, 21 are
satisfactory.
Exercise Five: Starting AutoGrid 4
Procedure:
2. Launch
Please note that you can easily start a job from the command line by cd
into the directory containing the input files and typing:
The docking parameter file tells AutoDock which map files to use, the
ligand molecule to move, what its center and number of torsions are,
where to start the ligand, the flexible residues to move if sidechain
motion in the receptor is to be modeled, which docking algorithm to
use and how many runs to do. It usually has the file extension, “.dpf”.
Four different docking algorithms are currently available in AutoDock:
SA, the original Monte Carlo simulated annealing; GA, a traditional
Darwinian genetic algorithm; LS, local search; and GALS, which is a
hybrid genetic algorithm with local search. The GALS is also known
as a Larmarckian genetic algorithm, or LGA, because children are
allowed to inherit the local search adaptations of their parents.
Each search method has its own set of parameters, and these must be
set before running the docking experiment itself. These parameters
include what kind of random number generator to use, step sizes, etc.
The most important parameters affect how long each docking will run.
In simulated annealing, the number of temperature cycles, the number
of accepted moves and the number of rejected moves determine how
long a docking will take. In the GA and GALS, the number of energy
evaluations and the number of generations affect how long a docking
will run. ADT lets you change all of these parameters, and others not
mentioned here. See Appendix 4 Docking Parameters for more
information about individual keywords.
Procedure:
Note: If you are setting up a docking with
flexible residues , here you must select the
name of the file that contains the rigid
1. Docking Macromolecule Set Rigid Filename…
receptor, e.g. 1iep_receptor_rigid.pdbqt
Select 1iep_receptor.pdbqt in the file browser that opens.
Note: ADT allows you to change the 5. Docking Output Lamarckian GA…
parameters for any of the four possible
docking algorithms at any time. You commit
to a specific algorithm only when you write We specify the name of the DPF we are about to write out here.
the docking parameter file. This file will contain docking parameters and instructions for
a), Lamarckian Genetic Algorithm (LGA) docking also known
as a Genetic Algorithm-Local Search (GA-LS).
Type in imatinib_1iep_receptor.dpf and click on Save.
In general you should know that AutoDock and Vina must be run in
the directory where the macromolecule, ligand, GPF and DPF files or
config files are to be found.
AUTODOCK4Procedure:
2. Launch
Please note that you can easily start a job from the command line by cd
into the directory containing the input files and typing:
% autodock4 –p imatinib_1iep_receptor.dpf –l
imatinib_1ieb_receptor.dlg
VINAProcedure:
cd C:\User\%USERNAME%\Desktop\MGL\day00\vina\input
InputFiles:
1iep_receptor.pdb
imatinib.pdb
ResultsFiles
Ligand
imatinib.pdbqt (7 torsions)
Macromolecule
1iep_receptor.pdbqt
AutoGrid
1iep_receptor.gpf
1iep_receptor.glg
1iep_receptor.*.map
1iep_receptor.maps.fld,1iep_receptor.maps.xyz
AutoDock
imatinib_1iep_receptor.dpf
imatinib_1iep_receptor.dlg
VINA
1iep_receptor.pdbqt
imatinib.pdbqt
imatinib_1iep_receptor.conf
imatinib_out*.pdbqt
Appendix 1: PMV Basics
You might have a three-button mouse. If so, the mouse buttons can be
used alone or with a modifier key to perform different operations. To
zoom the molecule (make the molecule look bigger or smaller) in the
viewer window, press and hold down the <Shift> key and then click and
drag with the middle mouse button. To summarize what the mouse buttons
do:
Note: as you translate a molecule out in the Z You can also press the following keys in the viewer window to
dimension, it will disappear into fog which is used
for depth-cueing.
change the view of the molecule:
Key Action
R Reset view
Note: By default, the Viewer’s current object is D Toggle on/off Depth-cueing (blends molecule into
root so you will not see any changes here if you background farther away)
toggle between transform root and transform
current object. The DejaVu GUI lets you change
the current object.
T Toggle between transform root (i.e. scene) and transform
the Viewer’s current object.
Appendix 2: Dashboard Widget
Tree Widget
Note: Clicking on a shape - rectangle, circle, The Tree Widget on the left lists all molecules currently loaded in
square or diamond - under a command causes
the command linked to the shape to be applied to PMV. Click on the arrows to navigate between molecules ,
each node in the corresponding row. If the chains , residues and atoms . Clicking on a shape in
shape is off (colored white), the command will
be applied to nodes and the shape will be colored one of the columns in the right section executes the PMV
red. If the shape is on (colored red), clicking on command corresponding to the label at the top of the column on
the command button will undo the command and
the shape will be colored white. Circles are used the group of nodes corresponding to the row. There 16 different
for display commands, squares for label commands that can be executed this way - gray rectangle
commands and diamonds for color commands.
Coloring can be replaced by a different coloring (Show/Hide), select/unselect (Sel.), display lines (Lines), display
scheme but cannot be undone. The gray CPK (CPK), display sticks and balls (S&B), display secondary
rectangle is used for show/hide and the white
rectangle for select. structure (Rib.), display molecular surface (MS), display labels
(Lab.), color by atom type (Atom), color by molecule (Mol), color
by chain (Chain), color by residue according Rasmol (RAS), color
by residue according Shapely (SHA), color according to David
Goodsell colors (DG), color by secondary structure element type
(Sec.Str.) and color by instance (Inst).
Note: A selection in the Tree is used to build a The Sel: entry in the top left corner of the Dashboard can be used
group of nodes to be the target for commands
linked to shapes. It is not the same as the current to select entries in the Tree using a Pmv compound selector. Nodes
selection in the Viewer. It can be selected using matching the specified string will be selected. Selected nodes are
the appropriate rectangles….
outlined with a yellow selection box. When a shape is clicked for
a selected node, the corresponding command is applied to all
currently selected nodes. Hovering over this entry shows samples
of the required syntax.
The option menu on the top allows the user to specify whether
commands should be applied to the backbone atoms only (BB), the
side chain atoms only (SC), the sidechain atoms and CA atoms
(SC+CA) or the full molecular fragment (ALL). This setting can
be overridden for each column (CMD).
Next, a tour of the Set Play Options widget and its buttons:
• Show Info opens and closes a separate panel Conformation # Info
Note: building hydrogen bonds
requires that the receptor molecule be which displays additional information about current conformation
present in the viewer and that you (see following).
have either chosen it using:
Analyze Macromolecule
• Build H-bonds turns on and off building and displaying hydrogen
Choose or read it in specifically bonds between the macromolecule and the ligand in its current
using conformation.
Analyze Macromolecule
Open…
• Color by allows you to choose how to color the ligand from a list
of available coloring schemes.
• Make clust RMS ref sets the reference coordinates for RMS to
those of the current conformation. [This RMS value is shown in Info
panel as clRMS]
• Choose mol for RMS ref lets you select a different molecule
from list of those in Viewer to use as reference for a new RMS
computation.
• Write All writes a formatted file for each set of coordinates in the
current sequence. This uses default filenames based on the id of
each Conformation.
These 4 radiobuttons are used to set the current play mode. [Note that at any
time, the current endFrame and the current startFrame depend on the
direction of play.]
• once and stop plays from the current conformation in the current
direction up to and including the endFrame.
Choose Conformation widget has list of ids for each conformation in the
current sequence list. Double clicking on an entry in this list updates the
ligand to the corresponding conformation. This widget is closed by clicking
on the checkbutton Show Conf List in Set Play Options widget.
Ki=exp((deltaG*1000.)/(Rcal*TK)
where deltaG is docking energy, Rcal is 1.98719 and TK is 298.15
• rseed1 and rseed2 are the specific random number seeds used for
current conformation’s docking run.
Appendix 4: Docking Parameters
'fld': grid data field file created by AutoGrid and readable by AVS.
'map': filename for the first AutoGrid affinity grid map of the 1st
atom type. Repeated for all atom types specified in ‘ligand_types’.
'axisangle0: initial axis and angle for rotation Qx, Qy, Qz, Qang or
random.
'rmstol': the rms deviation tolerance for cluster analysis, carried out
after multiple docking runs. If two conformations have an rms less
than this tolerance, they will be placed in the same cluster. The
structures are ranked by energy, as are the clusters.
SimulatedAnnealingSpecificParameters:
'select': State selection flag. ‘m’ minimum state is selected or ‘l’ last
state. Default is m.
GeneticAlgorithmSpecificParameters:
LocalSearchSpecificParameters:
ClusteringKeywords: