Autodock Protocol For Protein - Ligand Docking
Autodock Protocol For Protein - Ligand Docking
Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, USA. Correspondence should be addressed
to A.J.O. ([email protected]).
Computational docking can be used to predict bound conformations and free energies of binding for small-molecule ligands
to macromolecular targets. Docking is widely used for the study of biomolecular interactions and mechanisms, and it is applied
to structure-based drug design. The methods are fast enough to allow virtual screening of ligand libraries containing tens of
thousands of compounds. This protocol covers the docking and virtual screening methods provided by the AutoDock suite of
programs, including a basic docking of a drug molecule with an anticancer target, a virtual screen of this target with a small
ligand library, docking with selective receptor flexibility, active site prediction and docking with explicit hydration. The entire
2016 Macmillan Publihers Limited. All rights reserved.
INTRODUCTION
Computational docking is widely used for the study of protein based on empirical free energies of binding, is used to score poses
ligand interactions and for drug discovery and development. quickly at each step of the conformation search.
Typically, the process starts with a target of known structure, Both of these are serious limitations, and users must use
such as a crystallographic structure of an enzyme of medicinal tools such as molecular dynamics or free-energy perturbation
interest. Docking is then used to predict the bound conformation if a more realistic conformational search or energy prediction
and binding free energy of small molecules to the target. Single is necessary. These tools are complementary to computational
docking experiments are useful for exploring the function of docking methods, as docking methods generally search a larger
the target, and virtual screening, in which a large library of conformational space, but more advanced methods can predict
compounds are docked and ranked, may be used to identify conformation and energy more accurately within a local area of
new inhibitors for drug development. the conformational landscape.
AutoDock is a suite of free open-source software for the Advanced docking methods may be used to improve results
computational docking and virtual screening of small molecules in cases in which the limitations of requiring a rapid method
to macromolecular receptors. The suite currently includes several for energy evaluation are too restrictive. For instance, many
complementary tools: docking methods use a rigid model for the receptor, which
often leads to improper results for proteins with an appreci-
AutoDock Vina: a turnkey computational docking program able induced fit upon binding. AutoDock includes a method
that is based on a simple scoring function and rapid gradient- for treating a selection of receptor side chains explicitly, to
optimization conformational search1. account for limited conformational changes in the receptor.
AutoDock: a computational docking program based on an In addition, ordered water molecules often mediate interac-
empirical free-energy force field and rapid Lamarckian genetic tions between ligands and receptors, and advanced methods for
algorithm search method2,3. treating selected waters explicitly have been implemented in
Raccoon2: an interactive graphical tool for virtual screening AutoDock. Both of these advanced methods are demonstrated
and analysis4. in this protocol.
AutoDockTools (ADT): an interactive graphical user interface Many reports have compared the performance of popular
(GUI) for coordinate preparation, docking and analysis5. docking methods such as AutoDock (recently reviewed by Sousa
AutoLigand: a program for predicting optimal sites of ligand et al.7). Different methods can achieve different success rates
binding on receptors6. depending on specific targets, but in general they all perform sim-
ilarly when tested on a series of diverse proteinligand complexes:
The AutoDock suite, including source code, is freely available, and they all perform well for the prediction of bound complexes for
it has been widely used in research and drug discovery. drug-sized molecules, with estimates of free energies of binding
with errors of roughly 23 kcal mol1, provided that there is no
Comparisons with other methods significant motion required in the receptor. Better results may be
A variety of academic and commercial methods for computational obtained by tuning the docking method for a particular system
ligand docking are currently available (see Sousa et al.7 for an or moving to more sophisticated and computationally intensive
extensive review of current methods). Most of these methods sim- parameterizations of the system.
plify the problem in two ways to make the computation tractable.
First, the conformational space is reduced by limitations imposed Applications of the protocol
on the system, such as a rigid receptor and fixed bond angles and The AutoDock suite is used in numerous laboratories: a recent
lengths in the ligand. Second, a simplified scoring function, often search on PubMed yielded over 1,000 citations in the past year.
have been developed in parallel, to respond to two different parameterization of the AutoDock scoring function is available
needs. Development began with AutoDock 2,3,5,21,22, and it to the user, to allow tuning for particular systems if desired. For
continues to be the platform for experimentation in docking instance, methods for incorporating explicit solvents and for
methods. AutoDock Vina was developed more recently to fulfill predicting conformations of covalent complexes were developed
the need for a turnkey docking method that does not require by modifying the AutoDock potentials23,24.
extensive expert knowledge from users1. It is highly optimized
to perform docking experiments using well-tested default Raccoon and virtual screening. Virtual screening is rapidly
methods. Both methods are currently freely available. AutoDock becoming the primary application of computational dock-
Vina is fast and effective for most systems, whereas AutoDock ing methods, with many successes in the discovery of new lead
is available for systems that require additional methodological compounds for pharmaceutical development25. The idea is
enhancements. to screen a large library of available ligands to identify a small
Both methods are designed to be generic computational docking subset for purchase and experimental testing.
tools, by accepting coordinate files for receptor and ligand and pre- Raccoon is a graphical user interface that is designed to
dicting optimal docked conformations. Typically, users start with streamline the steps of a virtual screening and analysis of the
receptor coordinates from crystallography or NMR spectroscopy results. These include the following:
and with ligand coordinates generated from SMILES (Simplified
Molecular-Input Line-Entry System) strings or other methods. Automated server connection manager and installation of
Because the search methods are stochastic, a set of optimal docking services (such as AutoDock Vina).
docked conformations is predicted, and then they are typically Ligand library for upload and management of large ligand
clustered spatially to analyze the consistency of the results. collections.
Highly clustered results are an indication that the conforma- Receptor management from multiple targets and flexible
tional search procedure is exhaustive enough to ensure coverage residues.
of the accessible conformational space. Because of the stochastic Graphical interface for docking parameter setup.
nature of the search, the method cannot ensure that a global Graphical management of jobs on computational resources.
minimum has been found. For this reason, it is important to Automated retrieval and preprocessing of results to extract
use re-docking experiments with known complexes of similar features of interest.
conformational complexity to evaluate the docking protocol User-friendly filtering of virtual screening results based on
being used. properties and interactions.
AutoDock and AutoDock Vina currently use several simplifica- Export of filtered results.
tions that affect the results that are obtained. The most significant
simplification is the use of a rigid receptor. This approximation Coordinate preparation with ADT. Successful docking and
reduces the size of the conformational space, which allows it to virtual screening require careful attention to the quality of the
be searched reliably, and it reduces the computational effort of coordinates used for receptors and ligands. Both AutoDock and
scoring each trial conformation. When applying these docking AutoDock Vina use a simplified representation of the molecules,
methods to a given receptor, it is important to consider the possi- which is included in a modified Protein Data Bank (PDB) file
ble effects of this limitation, and if the system includes significant format, termed PDBQT:
receptor motion a number of methods may be used, including
the following: United atom representation: both methods require coordinate
sets that include the polar hydrogen atoms. AutoDock uses polar
Using receptor structures taken from receptorligand complexes, hydrogen coordinates during docking; AutoDock Vina uses them
in which there is some expectation that the receptor is in the to assign the hydrogen bonding state of the heteroatoms, but it
relevant conformation. does not use explicit hydrogens during the docking.
of libraries of compounds. logical and medicinal interest, and it is expected to perform con-
Preparation of coordinate files (Steps 14) is arguably the most sistently with this type of system, as described in more detail in
important aspect of the process, as the quality of coordinates will the ANTICIPATED RESULTS below.
affect all results from docking. After preparation, take a crucial Systems that deviate from these design parameters will give
look at the coordinate files and examine the protonation state variable results, and they should be approached with caution.
and charges (particularly metals, if present) to see whether they Users most often encounter two significant limitations. First,
are consistent with knowledge of the system. For instance, ADT users often want to dock very large ligands, such as decapep-
does not provide any charges for metal ions; therefore, charges tides. These ligands present too many degrees of freedom, and
need to be manually added in the PDBQT file using a text editor. docking methods are not able to search the accessible confor-
Receptor coordinate files deposited in the PDB often have many mational space. Most often, the best way to solve this problem
challenges that need to be addressed. Double-check to make sure is to break it down into smaller pieces. Second, the protein targets
that you have the proper biological unit, that essential residues often show significant conformational flexibility, which is
or loops are not missing from the coordinate set, that you have not modeled in the AutoDock suite, apart from the selective
chosen only one set of coordinates in which there are alternate side-chain motion, including in this protocol. This problem is
conformations in the deposition and that you have included only typically approached through the generation of conformations
essential cofactors and structural waters. of the protein by other methods (such as molecular dynamics)
before docking.
Advanced methods. The default methods used in AutoDock
and AutoDock Vina are highly effective for typical drug-like About the protocol
ligands, and they have been widely used for applications such This protocol describes many of the steps involved in a typical
as virtual screening25. However, several refinements have been drug discovery project. The target for the protocol is the kinase
developed to approach problems that present challenges with domain of the proto-oncogene tyrosine protein kinase c-Abl.
the default methods. The protein is an important target for cancer chemotherapyin
particular, the treatment of chronic myelogenous leukemia. This
Receptor flexibility: arguably the greatest limitation in these types example was chosen because it demonstrates the use of the proto-
of docking methods is the rigid model of the receptor. AutoDock col on an important cancer target with an approved drug, Gleevec
and AutoDock Vina both allow limited flexibility of selected (imatinib), and it can show the impact of the drug-resistant muta-
receptor side chains, as described in this protocol. For systems tions of the target. The protocol covers the following:
with larger motions of loops or domains, the relaxed complex
method26 has shown success by sampling a variety of receptor Re-docking experiments using the structure of c-Abl with
conformations using molecular dynamics and then performing the anticancer drug imatinib (PDB entry 1iep (ref. 27)), using
docking simulations on these snapshots. AutoDock Vina and AutoDock.
Explicit hydration: interactions between biological molecules are Virtual screening with Raccoon2 of a library of compounds
often mediated by ordered water molecules. We have developed against c-Abl, using protein coordinates from PDB entry 1iep.
a method that uses the existing version of AutoDock but modi- Cross-docking of imatinib with c-Abl coordinates from PDB
fies the force field to model explicit water molecules. The ligand entry 1fpu (ref. 28), modeling flexibility in a threonine that
is decorated with an ensemble of water molecules, which may interacts with the drug.
or may not then contribute to the interaction based on a modi- Prediction of optimal ligands for c-Abl using AutoLigand.
fied energy evaluation grid. In tests, this hydration has shown An example of docking with explicit water molecules, which can
improvement in the prediction of bound conformations of improve results for fragment-based drug design.
MATERIALS
EQUIPMENT Hardware and software
Starting data Computer: Linux, Macintosh or Windows PC; Internet access
Coordinate file for receptor (in a variety of formats, including pdb, For virtual screening with Raccoon, a Linux cluster/HPC with either a
mol2, cif and sdf) PBS or SGE scheduler
Coordinate file for ligand (in a variety of formats, including pdb, mol2, Text editor
cif and sdf) AutoDock: http://autodock.scripps.edu (information on installation is
Several files are available in Supplementary Data for use as a tutorial available at http://autodock.scripps.edu/downloads/autodock-registration/
for each of the protocols: 1iep_receptorH.pdb (coordinates of c-Abl autodock-4-2-download-page/)
kinase domain from PDB entry 1iep, with hydrogen atoms added in AutoDock Vina: http://vina.scripps.edu (information on installation is
AutoDockTools), 1iep_ligandH.pdb (coordinates of imatinib from available at http://vina.scripps.edu/manual.html#faq)
PDB entry 1iep, with hydrogen atoms added in AutoDockTools), AutoDockTools (part of MGLTools): http://mgltools.scripps.edu
1fpu_receptorH.pdbqt (coordinates of c-Abl kinase domain from (information on installation is available at http://mgltools.scripps.edu/
PDB entry 1fpu, in PDBQT format), imatinib.pdbqt (coordinates of downloads)
imatinib in PDBQT format) and NCIdivII_subset (a folder that includes AutoLigand is part of AutoDockTools
499 compounds from the ZINC library, formatted as PDBQT files) Raccoon is available at http://autodock.scripps.edu/resources/raccoon
2016 Macmillan Publihers Limited. All rights reserved.
PROCEDURE
Coordinate preparation with ADT TIMING 10 min
1| Generate the ligand coordinate file. A coordinate set that includes hydrogen atoms is required. This may be obtained in
a variety of ways, including with experimental coordinates from the PDB (http://www.rcsb.org/) or Cambridge Crystallographic
Database (http://www.ccdc.cam.ac.uk/), or via structure generation methods such as the CACTUS server (http://cactus.nci.
nih.gov/translate/). The file 1iep_ligandH.pdb is provided for use as a tutorial for this protocol (all example files are
supplied in Supplementary Data). This file includes ligand coordinates taken from PDB entry 1iep, to which all hydrogen
atoms have been added and manually adjusted to the known protonation state. Start ADT (Fig. 1) and set the working
directory by clicking File Preferences Set. Type your working directory path name into the Startup Directory box
and click Set. Click Dismiss at the bottom of the window.
2| Read the atomic coordinates. To do this, first select Ligand Input Open and use the Files of type menu to
choose PDB files. Click on your coordinate file (in this case 1iep_ligandH.pdb) and click Open. ADT will read the coordi-
nates, add charges if necessary, merge nonpolar hydrogens and assign appropriate atom types. At this point, the ligand will
be displayed in the viewer window, with aromatic carbons in green. Click OK on the popup to continue.
CRITICAL STEP If your coordinate set does not include hydrogen positions, click File Read Molecule and choose your
coordinate file. Next, click Edit Hydrogens Add to add all hydrogens by default. Select Ligand Input Choose to
choose the ligand molecule.
3| Prepare a PDBQT file by selecting Ligand TorsionTree DetectRoot; this will define the center of the torsion tree.
Selecting Ligand TorsionTree ChooseTorsions will launch a window that allows choice of torsional degrees of freedom.
Rotatable bonds are in green, rigid portions are in red and potentially rotatable bonds that are currently set as not rotatable
(such as the peptide bond at the center of imatinib) are
in magenta. Clicking on bonds will switch the rotation
flexibility on and off. When this step has been completed, PMV menus
click Done. Click Ligand Output SaveAsPDBQT, and ADT menus
then select Save to write the file 1iep_ligandH.pdbqt.
? TROUBLESHOOTING
Interactive Viewer
Figure 1 | AutoDockTools (ADT). ADT is built within the Python
Molecule Viewer (PMV). ADT commands for coordinate preparation,
docking and analysis are available through menus on the lower toolbar.
PMV commands for higher-level visualization are available on the upper
tool bar. The Dashboard controls representation, colors and labeling of
molecular objects that are displayed in the Interactive Viewer panel.
For more information on the capabilities of PMV and ADT, see http://
mgltools.scripps.edu/documentation.
use a text editor to remove water, ligands, cofactors, ions and so on that should not be included in the receptor.
The file 1iep_receptorH.pdb is provided for use as a tutorial for this protocol, and it includes receptor coordinates that are
taken from PDB entry 1iep. Open the file by selecting Grid Macromolecule Open, and use the Files of type menu to
choose all files. Click on your coordinate filein this case 1iep_receptorH.pdband click Open. ADT will read coordinates,
add charges, merge nonpolar hydrogens and assign appropriate atom types. Click OK to accept the changes. A window will
pop up to write the PDBQT file. Click Save to write the file 1iep_receptorH.pdbqt.
CRITICAL STEP If your coordinate set does not include hydrogen positions, Click File Read Molecule and choose your
coordinate file. Next, select Edit Hydrogens Add to add all hydrogens by default. Click Grid Macromolecule
Choose to choose the receptor molecule.
? TROUBLESHOOTING
Option
(Step 5) Method Description
A Single-docking experiment with AutoDock Vina Basic docking method for study of a single ligand with a single receptor
B Single-docking experiment with AutoDock Basic docking method for study of a single ligand with a single
receptor, with explicit calculation of affinity maps
C Virtual screening with Raccoon2 and AutoDock Vina Virtual screen of a library of ligands with a single receptor, often
used for drug discovery
D AutoDock Vina with flexible side chains Docking method for a single ligand with a single receptor, incorporat-
ing limited receptor flexibility
E Active site prediction with AutoLigand Method for analysis of receptor binding sites, for prediction of
druggable sites
F Docking with explicit waters Advanced docking method for a single ligand with a single receptor
incorporating explicit bridging water molecules
basis of the ligand. Other options are available, or the values may be changed manually with the thumbwheels. When
this step has been completed, choose File CloseSavingCurrent in the window Docking Output VinaConfig and
write the configuration file (default name: config.txt) by clicking Save.
(iii) Run AutoDock Vina. The imatinib ligand used in this protocol is challenging, and Vina will occasionally not find the
correct pose with the default parameters. Vina provides a parameter called Exhaustiveness to change the amount
of computational effort used during a docking experiment. The default exhaustiveness value is 8; increasing this to
about 24 will give a more consistent docking result. There are two ways to run AutoDock Vina: from ADT or from the
command line. From ADT, select Run RunAutoDockVina, and in the popup window use the Browse buttons to
specify the path to the AutoDock Vina executable file vina and the path to the configuration file. Edit the Cmd line
to change the exhaustiveness, such as
./vina --config config.txt --exhaustiveness=24
Click the Launch button to run the program.
Programs in the AutoDock suite may also be run at the command line. To do this, instead open a terminal window
and change to the directory that contains the coordinate files and configuration file. Next, issue the command listed
above. This command assumes that the AutoDock Vina executable vina is also located in the same directory.
2016 Macmillan Publihers Limited. All rights reserved.
Running AutoDock Vina will write a docked coordinate file 1iep_ligandH_out.pdbqt and also present docking
information to the terminal window. The predicted free energy of binding should be about 13 kcal mol1 for poses
that are similar to the crystallographic pose. With exhaustiveness set to 24, Vina will most often give a single docked
pose with this energy. With the lower default exhaustiveness, several poses flipped end to end, with less favorable
energy, may be reported.
? TROUBLESHOOTING
(iv) To visualize the results from AutoDock Vina (Fig. 2), launch ADT and set the working directory (see Step 1). Select
Analyze Dockings OpenAutoDockVinaResult and choose the output file from Step 5A(iii). Choose the default
of Single molecule with multiple conformations and click OK. This will show coordinates for each docked result.
Use the arrow keys on the keyboard to scroll through the poses. Click Analyze Macromolecule Open to choose
the receptor coordinate file to visualize the receptor 1iep_receptorH.pdbqt. Select File ReadMolecule to choose the
ligand coordinate file 1iep_ligandH.pdb, visualize the crystallographic location of the ligand and compare it with
the docked conformation.
Analyze Dockings ShowInteractions will launch a visualization that highlights interactions between the
ligand and the receptor.
(B) Single docking experiment with AutoDock TIMING 60 min
(i) Generate a grid parameter file for AutoDock that specifies the PDBQT files for the receptor and parameters for
generating the atomic affinity maps. Start ADT and set the default working directory (see Step 1). Select Ligand
Input Open and choose the ligand PDBQT file (1iep_ligandH.pdbqt). Select Grid Macromolecule Open and
choose the receptor PDBQT file (1iep_receptorH.pdbqt). Set the map types by selecting Grid SetMapTypes
ChooseLigand and choosing the ligand PDBQT file
(1iep_ligandH.pdbqt). Define the search space by
a
selecting Grid GridBox, which will open a window
for specifying the center and size of the search space.
Use Center CenteronLigand to define a minimal
box. When finished, choose File CloseSavingCurrent. AutoDock
Finally, select Output SaveGPF to save the AutoDock Vina
path to the AutoGrid executable file and the path to the grid parameter file. After you specify the grid parameter file,
ADT will suggest a name for the log file. Click the Launch button to run AutoGrid.
Alternatively, at the command line, open a terminal window and change to the directory that contains the coordi-
nate files and grid parameter file. Then issue the following command (this command assumes that the AutoGrid execut-
able autogrid4 is also located in the same directory):
./autogrid4 -p 1iep.gpf -l 1iep.glg
? TROUBLESHOOTING
(iii) (Optional) Visualize AutoGrid maps in ADT (see Step 5E). Start ADT and set the default working directory (see Step 1).
In the top toolbar, select Grid3D Read and choose a map file (such as 1iep_receptorH.C.map for the carbon affinity
map). In the popup control panel, click on the file in the Grid Name panel and then click on the Isocontour button;
ADT will display a histogram of map values. Change the max value to 0.0 and press return (or enter). Shift-click in
the histogram to calculate an isocontour; you can then drag the bar in the histogram right and left to change the
contour level. Values in the range of 0.5 to 0.1 are effective. The menu option File ReadMolecule can be used
to display the receptor.
(iv) Generate the docking parameter file that specifies the PDBQT file for the ligand and parameters for the
2016 Macmillan Publihers Limited. All rights reserved.
docking simulation. Select Docking Macromolecule SetRigidFilename and choose the receptor PDBQT file
(1iep_receptorH.pdbqt). Select Docking Ligand Choose, choose the ligand PDBQT file (1iep_ligandH.pdbqt)
and accept the default ligand docking parameters, which will randomize the pose of the ligand before docking.
Select Docking SearchParameters GeneticAlgorithmParameters and use the default parameters for most
drug-sized ligands, except set Number of GA Runs to 50. Additional docking parameters are specified with Docking
DockingParameters; use default parameters for most drug-sized ligands. Finally, write the docking parameter file
by selecting Docking Output LamarckianGA, which is typically given the extension .dpf (use 1iep.dpf).
(v) Run AutoDock, either from ADT or at the command line. To run from ADT, select Run RunAutoDock, and in the
popup window use the Browse buttons to specify the path to the AutoDock4 executable file and the path to
the docking parameter file. After you specify the docking parameter file, ADT will suggest a name for the log file.
Click the Launch button to run the program.
Alternatively, at the command line, open a terminal window and change to the directory that contains the
coordinate files and docking parameter file. Then issue the following command (this command assumes that the
AutoGrid executable autodock4 is also located in the same directory):
./autodock4 -p 1iep.dpf -l 1iep.dlg
? TROUBLESHOOTING
(vi) Visualize AutoDock results (Fig. 2a). Start ADT and set the default working directory (see Step 1), and to access the
results select Analyze Dockings Open and choose the 1iep.dlg file. There are several options in ADT for visual-
izing results: select Analyze Conformations
PlayRankedByEnergy and click on the arrow buttons
to scroll through conformations, or select Analyze
Docking Load to open a clickable window with
ranked conformations and energies.
? TROUBLESHOOTING
(C) Virtual screening with Raccoon2 and AutoDock
Vina TIMING 60 min
(i) Start Raccoon2 and configure the server. Raccoon
is designed to run virtual screening on a large
computational resource, such as a Linux cluster.
When you add a new server, you must configure
the connection and install one or more docking
services. Launch Raccoon2 (Fig. 3) and create a
new server connection by clicking the three-gear
icon to open the Connection Manager. Several
options need to be set: server name (a name to
identify the resource); address (the host name
or IP address of the server); and a username
and password. Click Save and close the Figure 3 | Raccoon2. Tabs at the top allow the user to choose each of the
Connection Manager. steps for setting up, running and analyzing a virtual screen.
(iv) Configure AutoDock Vina docking parameters. In the CONFIG tab, load a receptor structure in the 3D viewer by
selecting files in the Receptor list. Use Center and Size thumbwheels to define the search space, or place the mouse
cursor over boxes and type numeric values. Modify docking parameters if desired, or load the config.txt file created
in Step 5A(i).
(v) Perform the virtual screening calculation. In the JOB
MANAGER tab, ensure that all Cluster submission
requirements are green, and then click Submit.
Several options will need to be specified: set
Project to new and type a descriptive name of
your choice; set Experiment to new and type a
descriptive name of your choice; and click OK to
start the calculation.
To follow the status of the docking, right-click
on the experiment name and select update status.
When one or more jobs in the experiment have
been completed, download them individually by
right-clicking on the job name, or download them
together by right-clicking on the experiment, and
select Download results.
(vi) Filter and analyze the results. To assist with filtering
and selection, Raccoon calculates docking pose
properties such as interaction, score and ligand
efficiency. In the ANALYSIS DATA SOURCE tab,
click on Add results Select downloaded results,
and browse to the results directory. Choose the
receptor file to be used for processing results, and
type a descriptive log file name or select the config.
txt file generated at Step 5A(i).
The filter panel allows a variety of filters to be
applied to the results (Fig. 4):
Energy will show poses within an energy range;
Ligand Efficiency will select poses within a range
of ligand efficiency; and with Target Interactions,
use + to filter by selected interactions with
receptor atoms.
Figure 4 | Raccoon result filtering. The Data Source window (top) has
several options for filtering the results of a virtual screen. Two interaction
filters are applied here. The Visualization window (bottom) allows
visualization of the filtered results, and checkboxes can be used to select
the desired compounds for export.
For the c-Abl virtual screen described in this protocol, click on the + button and add an HB acceptor filter
with MET318, and then press return (note: the interface currently requires that the residue name be in all capital
letters). Click on the + button and add an HB any filter with THR315, and then press enter. Switch the menu from
match any to match all and update the filters. This will select the small number of molecules that show both of
these interactions. The THR319 position of c-Abl is known to mutate to obtain resistance. To find molecules that
do not interact with this position, click on the filter button for THR315 added above, and switch the filter from
wanted (green) to not wanted (red).
? TROUBLESHOOTING
(vii) Export results. Once a set of ligands is filtered, select interesting ones by clicking on the button in the Results panel.
Use the EXPORT tab to export the selected candidates.
(D) AutoDock Vina with flexible side chains TIMING 60 min
(i) Generate receptor coordinate files. The receptor coordinates are split into two PDBQT files: one for the rigid portion
and one for the flexible side chains. As with the rigid docking protocols, the method requires a receptor coordinate
file that includes all hydrogen atoms. This protocol describes the cross-docking of imatinib to c-Abl in PDB entry 1fpu,
treating Thr315 as flexible.
Start ADT and set the default working directory (see Step 1), select Grid Macromolecule Open and
2016 Macmillan Publihers Limited. All rights reserved.
The first step is to calculate the energy maps. Start ADT and set the default working directory (see Step 1). In Grid
Macromolecule Open, select a coordinate file for receptor (1iep_receptorH.pdbqt). By default, ADT will name the map
files on the basis of this file name. In Grid SetMapTypes Directly, edit the atom list to be C HD OA.
In Grid GridBox, adjust the size to cover the entire protein, and set the spacing to 1 . For 1iep, use a size of
60, 60, 70 (put the mouse cursor over the dial, type in values, and press return (or enter)), and save the parameters
using File CloseSavingCurrent. Write the grid parameter file with Grid Output Save GPF with the filename
1iep.gpf. Run AutoGrid at the command line (see Step 5B(ii)):
./autogrid4 -p 1iep.gpf -l 1iep.glg &
(ii) AutoLigand is not installed by default, and it must be loaded into ADT: select File BrowseCommands. Next, in
Select a Package, select AutoDockTools; in Select a Module, select AutoLigand Command; click Load Select Module
and then click OK. This will add an AutoLigand entry to the ADT Compute menu.
(iii) AutoLigand may be run in two modes. This protocol will generate ligand envelopes individually, by manually picking
a seed point and a volume in ADT. This mode is appropriate if you know the location of the active site of your target.
AutoLigand may also be run at the command line to scan the entire surface of the protein, predicting the location of
optimal binding sites. This mode is described in more detail on the AutoDock website.
2016 Macmillan Publihers Limited. All rights reserved.
Select File ReadMolecule and read 1iep_receptorH.pdbqt (from Step 4), and then select File ReadMolecule
and read 1iep_ligandH.pdbqt from Step 3 (the ligand coordinates are not used by AutoLigand, but they will be used
to locate the active site in this protocol). Start the AutoLigand computation with Compute AutoLigand
RunAutoLigand. This opens a window that allows you to choose the seed point and size of the envelope. Use the
sliders to place the yellow seed point marker near the ligand/active site, or shift-click on an atom in the ligand to
place the marker there. Use the default of 100 for the volume of the envelope. Start the AutoLigand calculation by
clicking OK. When AutoLigand finishes, a pop-up window will display the results. The envelope will be displayed in
the viewer (Fig. 5), and a PDB file will be written with coordinates of the envelope.
(F) Docking with explicit waters TIMING 60 min
(i) This method adds dummy atoms to the ligand
that correspond to all possible sites of hydration. a b
A modified AutoGrid map is then used during docking,
giving a favorable score when the water is well placed
and omitting the water if it overlaps with the receptor.
A final script analyzes the docked results, retaining
only those waters in appropriate positions.
This protocol assumes that the ligand and receptor
have been prepared for a standard AutoDock docking.
Two coordinate files, ligand.pdbqt and protein.pdbqt,
created using the same protocol as that described in
Steps 14 in the above example, have been provided
for this protocol, along with a modified parameter file
for the force field, a modified docking parameter file
and several Python scripts for performing each step. Carbon affinity map
To add water positions to the ligand, type at the
command line: c d
$ python wet.py -i ligand.pdbqt
This script adds the dummy W atoms to the ligand
PDBQT file, saving it to the file ligand_HYDRO.pdbqt.
(ii) Calculate the default atomic grid maps. Start ADT
and set the default working directory (see Step 1).
Select Ligand Input Open and choose the
ligand PDBQT file (ligand.pdbqt), and select Grid Macromolecule Open and choose the receptor PDBQT file
(receptor.pdbqt). Set the map types in Grid SetMapTypes Directly and ensure that the HD and OA types are in
the list. Use Grid GridBox to specify the center and size of the search space. In the popup window, use Center
CenteronLigand to define a minimal box. When finished, save the parameters using File CloseSavingCurrent. Write
the grid parameter file using Output SaveGPF, with the name protein.gpf. Run AutoGrid (see Step 5B(ii) for more
information) with Run RunAutoGrid or at the command line with
./autogrid4 -p protein.gpf -l protein.glg.
(iii) Generate the W map. If standard filenames are used for the maps, only the receptor name must be specified for the
script that generates the map for the water energy evaluation:
$ python mapwater.py -r protein.pdbqt -s protein.W.map
Box 3 | Modified docking parameter file for docking with explicit waters
autodock_parameter_version 4.2 #version control
2016 Macmillan Publihers Limited. All rights reserved.
(iv) Create a modified docking parameter file. Select Docking Macromolecule SetRigidFilename and choose the
receptor PDBQT file (protein.pdbqt). Select Docking Ligand Choose and choose the ligand PDBQT file
(ligand_HYDRO.pdbqt), and accept the default ligand docking parameters, which will randomize the pose of the
ligand before docking. Set the search parameters in Docking SearchParameters GeneticAlgorithmParameters,
changing Number of GA Runs to 50, and use defaults on the remaining parameters. In Docking DockingParameters,
use default parameters for most drug-sized ligands.
Write the docking parameter file using Docking Output LamarckianGA, with the name ligand_HYDRO_protein.dpf.
The standard docking parameter file must be modified manually in two ways: parameter_file AD4_water_forcefield.
dat must be added near the top and protein.W.map must be added to the list of maps. Both of these changes are
highlighted in Box 3.
? TROUBLESHOOTING
(v) Run AutoDock (see Step 5B(v) for more information) in ADT with Run RunAutoDock or at the command line with
./autodock4 -p ligand_HYDRO_protein.dpf -l ligand_HYDRO_protein.dlg.
(vi) Extract and score the results at the command line with
2016 Macmillan Publihers Limited. All rights reserved.
? TROUBLESHOOTING
Troubleshooting advice can be found in Table 1.
3 Incorrect protonation ADT currently uses Babel to add hydrogen Double-check the PDBQT file and manually curate
of the ligand positions, which makes occasional errors errors in protonation
4 Missing coordinates Experimental coordinate sets are heterogeneous. Double-check the PDBQT file and manually
in the receptor Common problems include missing side chains curate to include the proper biological unit.
and loops, asymmetric units that include In general, use the biological unit and omit
only a portion of the biological unit, alternate ligands and waters except for strongly bound
conformations, essential cofactors and structural prosthetic groups
waters
5A(iii) Vina does not run ADT Start Vina GUI or command line is not Locate the Vina executable on your computer,
using the proper path to the Vina executable, and use the appropriate path name. Refer
or file is incorrect type for the CPU or operating to online installation notes to locate the
system executable
No low-energy Grid box does not enclose the active site Choose an appropriate location for the grid
conformations are found box in ADT
5B(ii) AutoGrid does not run ADT Run AutoGrid GUI or command line is not Locate the AutoGrid executable on your
using the proper path name to the AutoGrid4 computer, and use the appropriate path name.
executable, or file is incorrect type for the CPU Refer to online installation notes to locate the
or operating system executable
5B(v) AutoDock does not run ADT Run AutoDock GUI or command line is not Locate the AutoDock executable on your
using the proper path name to the AutoDock4 computer, and use the appropriate path name.
executable, or file is incorrect type for the CPU Refer to online installation notes to locate the
or operating system executable
5B(vi) Re-docking experiments System has too many degrees of freedom for Use longer search protocols, or simplify the
do not reproduce the AutoDock conformational search system by reducing degrees of freedom (docking
known pose in parts or rigidifying part of the ligand)
(continued)
5C(i) Problems with selection Right-click is used in several places in Use System Preferences on Macintosh
Raccoon for selection (control-click on computers to enable right-clicking for the
Macintosh computers) mouse or trackpad
5C(vi) No results selected Residue names are case sensitive Use all capitals, such as THR315
during filtering by
interaction
5D(iv) No low-energy Grid box does not enclose all of the side chains Choose a larger grid box that encloses the
conformations are found binding side and all of the flexible side chains
5F(iv) Error message about Water dummy atom parameters not defined Ensure that the modified files are used
missing W parameters in force field parameter file and/or docking
parameter files
2016 Macmillan Publihers Limited. All rights reserved.
TIMING
The indicated timing of each step is a rough estimatethe actual times will depend on the complexity of the system being
docked and the equipment being used for the computation.
Steps 14, coordinate preparation with ADT: 10 min
Step 5A, docking with AutoDock Vina: 30 min
Step 5B, docking with AutoDock: 60 min
Step 5C, virtual screening with Raccoon2 and AutoDock Vina: 60 min
Step 5D, flexible side-chain docking with AutoDock Vina: 60 min
Step 5E, active site prediction with AutoLigand: 30 min
Step 5F, docking with explicit hydration: 60 min
ANTICIPATED RESULTS
AutoDock Vina (Step 5A) will provide coordinates for one or more optimized poses for the ligand (Fig. 2a). In our tests
of the docking of imatinib with c-Abl, we found that the default docking parameters are sufficient to give a consistent
solution in most cases. The conformational flexibility of this system is at the limit of the default docking protocol,
which may be indicated by a docking result with multiple less favorable poses. For challenging systems with high degrees
of conformational flexibility, the exhaustiveness parameter can be used to perform additional docking simulations, often
giving more consistent results. This is described in Step 5A(iii). When analyzing results from AutoDock Vina, note that
it uses the input hydrogen positions to assign hydrogen-bonding types to heteroatoms, but it does not optimize them
during docking simulation, so the hydrogen positions in the output pose are in random conformations.
The number of evaluations used in AutoDock will determine the exhaustiveness of the conformational search
(Step 5B). For ligands with 14 torsional degrees of freedom, short (250,000) or medium (2,500,000) lengths should
provide adequate searching, but for larger ligands, such as imatinib, used in this protocol, a longer search is needed,
with 10,000,00025,000,000 evaluations. The clustering analysis is the best way to determine whether the simulation
has adequately searched the available conformation space: perform multiple docking simulations and confirm that the
best conformation is found multiple times.
Virtual screening with Raccoon2 allows the docking and ranking of tens of thousands of compounds to a macromolecular
target (Step 5C). Analysis and filtering is one of the most challenging aspects of virtual screening, as the goal is to identify
a few promising leads from a large body of docked results. We have obtained effective results by filtering on the basis of
predicted docking energy combined with the presence of key interactions in the system. The addition of this type of expert
knowledge greatly improves the success rate when compounds are tested.
In our hands, roughly half of the systems we have used for docking simulations will give useful results using the default
rigid model for the receptor. In other cases, protein motion will cause problems for the prediction of reasonable poses.
AutoDock and AutoDock Vina may be configured to dock ligands with selected receptor residues treated explicitly as flexible
(Step 5D). Docking with flexible side chains is difficult and often requires additional computational effort. The test system
included with this protocol is an example. PDB entry 1fpu includes a structure of Abl with an inhibitor similar to imatinib.
Threonine 315 in the structure is flipped relative to that in 1iep, and it is unable to form a hydrogen bond with the drug.
The protocol performs a cross-docking experiment of imatinib docking to Abl from 1fpu, treating Thr315 as flexible. For
best results, AutoDock Vina needs to be run with a more exhaustive search to find the proper pose. For systems with larger
motion of loops or domains, separate docking simulations may be run for different conformations of the protein, obtained
from multiple experimental structures or dynamics simulations of individual structures.
We have found explicit hydration (Step 5F) to be particularly useful for the docking of small ligands and fragment
molecules, as water-mediated interactions may form a larger portion of their interactions than for larger ligands.
In the example included here (Fig. 6), the pyridine ring on nicotine forms a water-mediated interaction with the
acetylcholine-binding protein. When the default methods in AutoDock are used, the ligand is placed in the proper position,
but two conformations are found with very similar predicted association energies, with the pyridine in the observed
position in one and rotated by 180 in the other. In the hydrated docking experiment, two water dummy atoms are placed
2016 Macmillan Publihers Limited. All rights reserved.
on the ligand: one interacting with the pyridine and the other interacting with a tertiary amino group. The best pose
places the pyridine and associated water in the experimentally observed position and deletes the other water molecule,
as it clashes with the protein.
AutoLigand analyzes the atomic affinity maps to predict the optimal locations for substrate binding (Step 5E).
The computation time is dependent on the size of the envelope, and typical drug-sized molecules fall in the range
of 50500 3 grid points. The method combines information from three mapscarbon, oxygen and hydrogento predict
the optimal atom type at each point. The oxygen map also represents likely locations for nitrogen atoms that accept
hydrogen bonds, as the force field parameters for nitrogen atom types are similar to those for oxygen. We have used
AutoLigand as a tool to identify the regions of a given ligand that are providing the most affinity, and to identify locations
on lead molecules that would provide the most additional affinity during functionalization.
Note: Any Supplementary Information and Source Data files are available in the 8. Goodsell, D.S., Lauble, H., Stout, C.D. & Olson, A.J. Automated docking
online version of the paper. in crystallography: analysis of the substrates of aconitase. Proteins 17,
110 (1993).
Acknowledgments This work was supported by the US National Institutes 9. Goodsell, D.S., Morris, G.M. & Olson, A.J. Automated docking
of Health (grant R01 GM069832 to A.J.O.). This is manuscript number 29118 of flexible ligands: applications of AutoDock. J. Mol. Recognit. 9,
from the Scripps Research Institute. 15 (1996).
10. Soares, T.A., Goodsell, D.S., Briggs, J.M., Ferreira, R. & Olson, A.J.
AUTHOR CONTRIBUTIONS All authors contributed equally to this work. Docking of 4-oxalocrotonate tautomerase substrates: implications for the
D.S.G. and S.F. authored the protocol manuscript with extensive input from the catalytic mechanism. Biolpolymers 50, 319328 (1999).
other authors, based on tutorials developed by all authors. All authors have been 11. Rosenfeld, R.J. et al. Automated docking of ligands to an artificial
instrumental in development of the AutoDock suite and training of users. active site: augmenting crystallographic analysis with computer modeling.
J. Comput. Aided Mol. Des. 17, 525536 (2003).
COMPETING FINANCIAL INTERESTS The authors declare no competing financial 12. Brik, A. et al. Rapid diversity-oriented synthesis in microtiter plates
interests. for in situ screening of HIV protease inhibitors. ChemBioChem 4,
12461248 (2003).
Reprints and permissions information is available online at http://www.nature. 13. Brik, A. et al. 1,2,3-Triazole as a peptide surrogate in the rapid synthesis
com/reprints/index.html. of HIV-1 protease inhibitors. ChemBioChem 6, 11671169 (2005).
14. Perryman, A.L. et al. Fragment-based screen against HIV protease.
1. Trott, O. & Olson, A.J. AutoDock Vina: improving the speed and accuracy Chem. Biol. Drug Des. 75, 257268 (2010).
of docking with a new scoring function, efficient optimization, and 15. Cosconati, S. et al. Structure-based virtual screening and biological
multithreading. J. Comput. Chem. 31, 455461 (2010). evaluation of Mycobacterium tuberculosis adenosine 5-phosphosulfate
2. Goodsell, D.S. & Olson, A.J. Automated docking of substrates to proteins reductase inhibitors. J. Med. Chem. 51, 66276630 (2008).
by simulated annealing. Proteins 8, 195202 (1990). 16. Perryman, A.L. et al. A virtual screen discovers novel, fragment-sized
3. Morris, G.M. et al. Automated docking using a Lamarckian genetic inhibitors of Mycobacterium tuberculosis InhA. J. Chem. Inf. Model. 55,
algorithm and an empirical binding free energy function. J. Comput. 645659 (2015).
Chem. 19, 16391662 (1998). 17. Cosconati, S. et al. Identification of novel beta-secretase inhibitors
4. Forli, S. Raccoon for Processing Virtual Screening <http://autodock.scripps. through inclusion of protein flexibility in virtual screening calculations.
edu/resources/raccoon> (2013). FASEB J. 22, 791798 (2008).
5. Morris, G.M. et al. AutoDock4 and AutoDockTools4: automated docking 18. Cosconati, S. et al. Protein flexibility in virtual screening: the BACE-1
with selective receptor flexibility. J. Comput. Chem. 30, 27852791 case study. J. Chem. Inf. Model. 52, 26972704 (2012).
(2009). 19. Jiang, H. et al. Stabilizers of the Max homodimer identified in virtual
6. Harris, R., Olson, A.J. & Goodsell, D.S. Automated prediction of ligand screening inhibit Myc function. Mol. Pharmacol. 76, 491502
ligand-binding sites in proteins. Proteins 70, 15061517 (2008). (2009).
7. Sousa, S.F. et al. Protein-ligand docking in the new millenniuma 20. Leonard, S.E., Garcia, F.J., Goodsell, D.S. & Carroll, K.S. Redox-based
retrospective of 10 years in the field. Curr. Med. Chem. 20, 22962314 probes for protein tyrosine phosphatases. Angew. Chem. Int. Ed. Engl. 50,
(2013). 44234427 (2011).