Bioluminate User Manual
Bioluminate User Manual
BioLuminate 1.0
User Manual
Schrödinger Press
BioLuminate User Manual Copyright © 2012 Schrödinger, LLC. All rights reserved.
While care has been taken in the preparation of this publication, Schrödinger
assumes no responsibility for errors or omissions, or for damages resulting from
the use of the information contained herein.
Schrödinger software includes software and libraries provided by third parties. For
details of the copyrights, and terms and conditions associated with such included
third party software, see the Legal Notices, or use your browser to open
$SCHRODINGER/docs/html/third_party_legal.html (Linux OS) or
%SCHRODINGER%\docs\html\third_party_legal.html (Windows OS).
This publication may refer to other third party software not included in or with
Schrödinger software ("such other third party software"), and provide links to third
party Web sites ("linked sites"). References to such other third party software or
linked sites do not constitute an endorsement by Schrödinger, LLC or its affiliates.
Use of such other third party software and linked sites may be subject to third
party license agreements and fees. Schrödinger, LLC and its affiliates have no
responsibility or liability, directly or indirectly, for such other third party software
and linked sites, or for damage resulting from the use thereof. Any warranties that
we make regarding Schrödinger products and services do not apply to such other
third party software or linked sites, or to the interaction between, or
interoperability of, Schrödinger products and services and such other third party
software.
In addition to the use of italics for names of documents, the font conventions that are used in
this document are summarized in the table below.
Sans serif Project Table Names of GUI features, such as panels, menus,
menu items, buttons, and labels
Monospace $SCHRODINGER/maestro File names, directory names, commands, envi-
ronment variables, command input and output
Italic filename Text that the user must replace with a value
Sans serif CTRL+H Keyboard keys
uppercase
Links to other locations in the current document or to other PDF documents are colored like
this: Document Conventions.
In descriptions of command syntax, the following UNIX conventions are used: braces { }
enclose a choice of required items, square brackets [ ] enclose optional items, and the bar
symbol | separates items in a list from which one item must be chosen. Lines of command
syntax that wrap should be interpreted as a single command.
File name, path, and environment variable syntax is generally given with the UNIX conven-
tions. To obtain the Windows conventions, replace the forward slash / with the backslash \ in
path or directory names, and replace the $ at the beginning of an environment variable with a %
at each end. For example, $SCHRODINGER/maestro becomes %SCHRODINGER%\maestro.
Keyboard references are given in the Windows convention by default, with Mac equivalents in
parentheses, for example CTRL+H (H). Where Mac equivalents are not given, COMMAND
should be read in place of CTRL. The convention CTRL-H is not used.
In this document, to type text means to type the required text in the specified location, and to
enter text means to type the required text, then press the ENTER key.
References to literature sources are given in square brackets, like this: [10].
Chapter 1
Chapter 1: Introduction
This manual documents the unique tools and capabilities of BioLuminate, and provides refer-
ences to other documents for the related tools. A brief description of the tool set is given below,
with links to the relevant parts of this manual or of other manuals. The descriptions are classi-
fied by function. These tools are divided between the Tools menu, where the action does not
take much time and may be interactive, and the Tasks menu, where a job may need to be run
that takes a larger amount of time.
• Protein Structure Quality Viewer (Tools → Protein Structure Quality): Show reports on
deviations of protein parameters from standard values, in graphical and tabular form. See
Chapter 3.
• Residue Analysis (Tasks → Residue Analysis): Calculate energetic and other properties
of residues. See Chapter 4.
• Consensus Visualization (Tools → Protein Consensus Viewer): Locate consensus waters,
counter ions and ligands in a set of homologs to a reference protein. See Chapter 5.
• Reactive Protein Residues (Tools → Reactive Residue Identification): Identify residues
that are prone to specified reactions, by matching sequence patterns and some structural
information. See Chapter 6.
• Aggregation Surface (Tasks → Aggregation Surface): Predict regions on a protein surface
that have a propensity for aggregation. See Chapter 7.
• Low Mode Vibrational Sampling (Tasks → Low Normal Mode Analysis): Locate and visu-
alize large-scale vibrational motions in a protein. See Chapter 8.
• SiteMap (Tools → Binding Site Identification): Locate druggable sites on a protein. See the
SiteMap User Manual.
• Protein Preparation Wizard (Tools → Protein Preparation): Prepare proteins for modeling
by assigning bonds, fixing structural defects, removing unwanted parts, assigning proton-
ation and tautomeric states, and refining the structure. See the Protein Preparation Guide.
• Simple Homology Modeling (Tasks → Simple Homology Modeling): Predict the structure
of proteins using homology modeling, where the homology is high and the alignment of
the query and the template is straightforward. See Chapter 11.
• Structure Prediction (Tasks → Advanced Homology Modeling): Predict the structure of
single-chain or multi-chain proteins, including multimers, by homology modeling. See
Chapter 3 through Chapter 5 of the Prime User Manual.
• Refinement (Tasks → Loop + Sidechain Prediction): Refine protein structures by per-
forming predictions of selected side chains or loops. Also performs minimizations. See
Chapter 6 of the Prime User Manual.
• Peptide Helicity (Tasks → Peptide Alpha Helicity): Predict the stability of alpha helices for
small peptide sequences, using molecular dynamics. See Chapter 9.
• Align Binding Sites (Tools → Binding Site Alignment): Align the sites on a set of proteins
at which drug-like molecules can bind. See Chapter 7 of the Prime User Manual.
• Protein Structure Alignment (Tools → Protein Structure Alignment): Structurally align two
or more proteins, using secondary structure information as well as coordinates. See
Chapter 7 of the Prime User Manual.
• Superposition (Tools → Superposition): Align two or more structures by minimizing the
RMSD of a selected set of atoms. See Section 10.3 of the Maestro User Manual.
Linux:
To run any Schrödinger program on a Linux platform, or start a Schrödinger job on a remote
host from a Linux platform, you must first set the SCHRODINGER environment variable to the
installation directory for your Schrödinger software. To set this variable, enter the following
command at a shell prompt:
Once you have set the SCHRODINGER environment variable, you can run programs and utilities
with the following commands:
$SCHRODINGER/program &
$SCHRODINGER/utilities/utility &
You can start the BioLuminate interface with the following command:
Windows:
The primary way of running Schrödinger applications on a Windows platform is from a graph-
ical interface. To start the BioLuminate interface, double-click on the BioLuminate icon, on a
BioLuminate project, or on a structure file; or choose Start → All Programs → Schrodinger-
2012 > BioLuminate. You do not need to make any settings before starting BioLuminate or
running programs. The default working directory is the Schrodinger folder in your documents
folder (Documents on Windows 7/Vista, My Documents on XP).
If you want to run applications from the command line, you can do so in one of the shells that
are provided with the installation and that have the Schrödinger environment set up:
Mac:
The primary way of running Schrödinger software on a Mac is from a graphical interface. To
start the BioLuminate interface, click its icon on the dock. If there is no BioLuminate icon on
the dock, you can put one there by dragging it from the SchrodingerSuite2012 folder in your
Applications folder. This folder contains icons for all the available interfaces. The default
working directory is the Schrodinger folder in your Documents folder ($HOME/Documents/
Schrodinger).
Running software from the command line is similar to Linux—open a terminal window and
run the program. You can also start BioLuminate from the command line in the same way as on
Linux. The default working directory is then the directory from which you start BioLuminate.
You do not need to set the SCHRODINGER environment variable, as this is set in your default
environment on installation. If you need to set any other variables, use the command
Chapter 2
The BioLuminate interface is a customized form of the Maestro interface that is specially
designed for biologics use. It inherits most of the capabilities of the Maestro interface (though
organized differently), and it has features of its own.
This chapter focuses on the features that are unique to BioLuminate. Summaries of the main
Maestro features are given, with references to the Maestro User Manual for details. If you have
never used Maestro, you should be able to gain a basic understanding of its operation from this
chapter.
If you prefer to use the standard Maestro interface, you can do so. Most of the capabilities of
BioLuminate are available from the BioLuminate submenu of the Applications menu, and some
of them are on the Tools menu.
• Menu bar. This is at the top of the window on Linux and Windows, and is the menu bar
on the Mac.
• Manager toolbar. This toolbar is just below the menu bar on Linux and Windows, and at
the top of the window on the Mac. Each label on this toolbar displays or hides another
toolbar. By default they are all hidden, as much of their function is available in the Toggle
Table. See Section 2.4 of the Maestro User Manual for details of the toolbars.
• Toggle Table. This dockable panel is displayed on the right side of the main window. You
can undock it from the main window and redock it with the docking button.
Many other panels are also dockable. You can change the docking behavior in the Prefer-
ences panel (Edit → Settings → Preferences, or CTRL+,)
• Workspace. This is the large black area that occupies the main part of the main window.
It is where structures are displayed, along with any associated objects such as surfaces
and text labels.
• Status bar. This bar is below the Workspace. At the left is a button that displays informa-
tion on what jobs are running, which you can click to open the Monitor panel for detailed
information on your jobs. When the pointer is not over an atom in the Workspace, the sta-
tus bar gives information on the contents of the Workspace. When the pointer is over an
atom, the status bar gives information on the identity of the atom. For more information,
see Section 2.5 of the Maestro User Manual.
• Auto-help. This orange-yellow bar at the bottom of the main window gives tips on the
current action that can be performed in the Workspace.
There are several other components of the main window that can be displayed when needed, by
choosing Edit → Settings and then choosing the component. These components include the
Sequence Viewer, which displays the sequences of proteins that are in the Workspace; the Find
toolbar (also opened with CTRL+F, F), which you can use to find structural components in
the Workspace, like chains or residues; and the Clipping Planes window, which shows a top
view of the Workspace and the planes where the structures in the Workspace are clipped for
display.
To add structures to the project, you can import them from an external source, such as a file or
the Protein Data Bank (PDB). To import structures from a file, choose File → Import Struc-
tures. To get structures directly from the PDB (either from a local copy or from the web),
choose File → Get PDB. Details of both of these methods for importing can be found in
Section 3.1 of the Maestro User Manual.
To see a list of all the entries in the project, you can open the Project Table panel with CTRL+T
(T) or Window → Show → Project Table. This panel lists the entries in the project with their
properties, and provides ways of doing actions on the entries and their properties, such as
management tasks, sorting, grouping, plotting, import and export of entries and properties. Full
details of the operation of the Project Table can be found in Chapter 9 of the Maestro User
Manual. The menu organization in the Project Table panel in the BioLuminate interface is a
little different from that in the standard Maestro interface: the Table menu in the standard inter-
face is split between the Table menu and the Tools menu in the BioLuminate interface.
When the toggle table is displayed, a set of shortcut (or context) menus is also available in the
Workspace, which you open by right-clicking. These menus offer the same functions as the
toggles.
The interaction with the Workspace provided by the Toggle Table panel is very similar to the
operation of PyMOL. If you are familiar with PyMOL, this interface should be easy to learn. If
you are familiar with Maestro but not with PyMOL, you can close the Toggle Table panel and
use the standard Maestro interaction with the Workspace.
The features of the Toggle Table panel are described in detail in the following subsections.
Note: The terminology used in the Toggle Table panel is the PyMOL terminology, which is
somewhat different from that used in standard Maestro.
• Reset—Reset the view to the default view, in which the view axes are aligned with the
coordinate axes of the structure.
• Zoom—Change the view of the Workspace so that all atoms fit inside the Workspace
area.
• Orient—Orient the Workspace structures by translating and rotating them so that the cen-
ter of mass is at the origin, the largest principal axis lies on the x axis, and the second-
largest principal axis lies on the y axis.
Note: This operation changes the coordinates of the structures, not just the coordinates
of the view.
• Draw—Save an image of the Workspace to a file in TIFF, JPEG, or PNG format. (Same as
File → Save Image.)
• The All row: Actions taken in this row apply to all entries in the Toggle Table.
• Entry rows: These rows apply to a single project entry that is currently in the Workspace.
The name of the row is the Title property of the entry. Entry rows are deleted when the
entry is excluded from the Workspace, and a new row is added when an entry is added to
the Workspace.
• Selection rows: When a group of atoms is selected, a selection row is added to the table.
Actions in this row apply to the group of atoms that was selected to create this row and
even apply after those atoms are no longer selected. The selection row named (Selection)
always refers to the most recent group of selected atoms. If a new selection is made while
a selection row is active, that row now refers to the new set of selected atoms. Only one
selection can be active at a time.
Selection rows can be renamed: renamed selection rows always refer to the same set of
atoms regardless of subsequent Workspace selections. To rename a selection row, choose
A → Rename Selection. Selection row names are always enclosed in parentheses.
Selection rows are deleted when any atoms they refer to are removed from the Work-
space. To delete a selection row, choose A → Delete Selection.
Using the A button submenus, selection rows can also be duplicated, copied, and
extracted to define a new entity that is independent of the objects from which the selec-
tion was originally derived.
Some operations or menu items change based on whether they are being applied to the All row,
an entry row, or a selection row. The descriptions below primarily describe the behavior for
entry rows. When a behavior changes for the All row or a selection row, the change is noted
after the description.
Clicking the name of the All row or an entry row changes the visibility of that object in the
Workspace. When the object's visibility is off, the name is dimmed. This is a quick way of
showing or hiding the atoms in entries. Hiding atoms does not remove them from the Work-
space, so any action taken on the entry or the entire Workspace applies to the hidden atoms as
well as the visible atoms.
For instance, if the Workspace contains ten entries and only one entry is visible, choosing
Clean from the A menu in the All row operates on all ten structures. This may take a consider-
able amount of time to complete and lead to unexpected results. Similarly, if a panel imports
structures from the Workspace, it imports all ten structures rather than just the visible structure.
Clicking the name of the current selection row deselects the selected atoms, but the selection
remains defined. Clicking the name of any other selection row selects the atom group that the
row refers to, and any currently selected atoms that are not part of this atom group are dese-
lected.
• Zoom—Change the view of the Workspace so that all the atoms in the structure fit inside
and fill the Workspace area. In the All row, this action is equivalent to clicking the Zoom
button at the top of the panel.
• Orient—Orient the structure by translating and rotating it so that the center of mass is at
the origin, the largest principal axis lies on the x axis, and the second-largest principal
axis lies on the y axis. When you apply this operation to the selection, it is the center of
mass and principal axes of the selection that are used, but the entire structure is reori-
ented.
Note: This operation changes the coordinates of the structures, not just the coordinates
of the view (the camera angle).
• Center—Center the structure in the Workspace, by translating the structure so that its
centroid is at the center of the Workspace.
• Origin—Set the center around which rotation is performed to the centroid of the structure.
The centroid need not be at the Workspace origin.
• Simple—Show proteins as ribbons (C alpha trace) colored by chain, ligands and bound
receptor as sticks, and solvent, disulfides and ions as lines. Atom colors are not changed.
• Simple (no solvent)—Same as Simple, but no waters are shown.
• Ball and Stick—Show atoms and bonds in ball and stick, with no protein ribbons.
• B-Factor—Show the protein as tubes with residues colored by the B-factors of the resi-
dues, in a relative scheme that ranges from shades of blue, through green and yellow to
red.
• Technical—Show atoms as sticks, colored with rainbow colors by residue position, and
show polar contacts (hydrogen bonds) as yellow dotted lines.
• Ligand—Show proteins as ribbons colored with rainbow colors by residue position, and
ligands as lines. All protein atoms within 5 Å of the ligand are shown as lines, with car-
bons colored with rainbow colors by residue position. Waters are shown as sticks, polar
contacts are shown as yellow dotted lines, and the view is zoomed in to the atoms shown.
• Ligand Sites—This submenu shows variations on the Ligand preset that alter the way the
protein or region around the ligand is shown:
• Cartoon—Show proteins as cartoons rather than ribbons.
• Solid surface—Show the molecular surface of the protein around the ligand as an
opaque surface, colored by the nearest non-hydrogen atom.
• Transparent surface—Show the molecular surface of the protein around the ligand
as a semi-transparent surface, colored by the nearest non-hydrogen atom; show
atoms and bonds as sticks.
• Dot surface—Show the molecular surface of the protein around the ligand as a dot
surface, colored by the nearest non-hydrogen atom; show atoms and bonds as sticks.
• Mesh surface—Show the molecular surface of the protein around the ligand as a
mesh surface, colored by the nearest non-hydrogen atom; show atoms and bonds as
sticks.
• Pretty—Show proteins as cartoons colored with rainbow colors by residue position and
ligands as sticks.
• Pretty (with solvent)—Same as Pretty but waters are shown as ball and stick.
• Publication—Same as Pretty, but protein helices are two-sided.
• Publication (with solvent)—Same as Pretty (with solvent), but protein helices are two-
sided.
• Protein Interface—Color ribbons and carbons by chain, show anything not at a protein
interface as cartoon ribbons, and interface residues as ball and stick. Non-carbon atoms
retain their previous coloring. Interface residues are residues in a chain with more than
300 atoms that are within 4.5 Å of another chain with more than 300 atoms.
• Antibody—Show everything as cartoon ribbons colored by antibody structure. The light
chain is colored in red hues, the heavy chain is colored in blue hues, and everything else
is colored green. Constant regions are dark hues and the CDR regions are bright hues.
The light chain L1-L3 loops are shaded orange to brown, while the heavy chain H1-H3
are shaded grey-blue to cyan.
• Default—Show everything as lines with default colors (colored by element with green
carbons).
• Within Object
• Involving Side Chains
• Involving Solvent
• Excluding Solvent
• Excluding Main Chain
• Excluding Intra-Main Chain
• Just Intra-Side Chain
• Just Intra-Main Chain
• To Other Atoms In Entry
• To Other Atoms In Entry Excluding Solvent
• To Any Atoms
• To Any Atoms Excluding Solvent
Each menu choice clears any previous choice before applying the new choice.
If you want more flexibility in choosing the atom groups between which polar contacts are
shown, you can use the Measurements panel. Choose Style → Measurements → H-Bonds
from the menu bar at the top of the Workspace to open this panel.
• All—All atoms.
• Polymer—Backbone and side-chain atoms.
• Organic—Ligand atoms.
• Solvent—Water atoms.
• Surface Residues—All residues with solvent-exposed surface area greater than 10 Å2.
• Protein Interface—Residues in a chain of more than 300 atoms that are within 4.5 Å of
another chain of more than 300 atoms.
Atom selections can also be generated by picking atom groups in the Workspace. To set the
kind of atom group you want to pick, choose Edit → Pick Mode then the atom group name
(Atoms, Residues, Chains, Molecules, or Entries), using the menu bar at the top of the Work-
space. You can also set the mode by typing the first letter of the name when the pointer is in the
Workspace. To pick an atom group, click on an atom in the Workspace that belongs to the
group. You can see information about the atom in the Status bar when you pause the pointer
over the atom.
The Symmetry Mates submenu items control the cutoff distance from the original structure for
which symmetry mate atoms should be displayed. Note that no matter how far the cutoff is
placed from the original structure, only the nearest-neighbor mates are created and shown. The
symmetry mates are created as separate, temporary entries (“scratch entries”) and are shown in
the toggle table. You can remove the symmetry mates by choosing Generate → Symmetry
Mates → Show None.
The Modify actions all have a choice of atom groups to which they apply.
• Around—select all atoms or residues within a given distance from the current set of
atoms, and deselect the current set of atoms. The distance is chosen from the submenu,
and can encompass atoms only or be filled to entire residues that have any atoms within
the chosen distance.
• Expand—Expand the current selection to include all atoms or residues within a given dis-
tance from the current set of atoms. The distance is chosen from the submenu, and can
encompass atoms only or be filled to entire residues that have any atoms within the cho-
sen distance.
• Extend—Expand the current selection to include all atoms or residues within a given
number of bonds from the current set of atoms.
• Invert—Deselect the current set of atoms and select all other atoms within a given atom
group. The atom groups can be chosen from the submenu:
• Within Objects—All atoms in the entry that are not part of the selection are selected.
• Within Chains—In each chain that contains selected atoms, the unselected atoms are
selected, and the selected atoms are deselected.
• Within Residues—In each residue that contains selected atoms, the unselected
atoms are selected, and the selected atoms are deselected.
• Within Molecules—In each molecule that contains selected atoms, the unselected
atoms are selected, and the selected atoms are deselected.
• Within Any—All atoms in the entire Workspace that are not part of the selection are
selected.
• Complete—Add all other atoms within a given atom group to the selection. The atom
groups can be chosen from the submenu.
• Residues—In each residue that contains selected atoms, all atoms are selected.
• Chains—In each chain that contains selected atoms, all atoms are selected.
• Objects—In each entry that contains selected atoms, all atoms are selected.
• Molecules—In each molecule that contains selected atoms, all atoms are selected.
• C-alphas—All alpha carbons for residues within the selection are selected. All other
atoms are deselected.
• Restrict to—Reduce the selection to only those atoms within a specific group that are cur-
rently selected. The available atom groups are:
• Object—Restrict the selection to atoms in a specific entry, chosen from the sub-
menu.
• Selection—Restrict the selection to atoms in a specific selection row, chosen from
the submenu.
• Visible—Restrict the selection to atoms that are visible.
• Polymer—Restrict the selection to backbone and side-chain atoms.
• Organic—Restrict the selection to ligand atoms.
• Solvent—Restrict the selection to water atoms.
• Inorganic—Restrict the selection to atoms other than H, C, N, O, F, P, S, Cl, Br, or I.
• Include—Include additional atom groups in the current selection. The available atom
groups are:
• Object—Include atoms in a specific entry, chosen from the submenu.
• Selection—Include atoms in a specific selection row, chosen from the submenu.
• Visible—Include all visible Workspace atoms.
• Exclude—Exclude specific atoms from the selection. The available atom groups are:
• Object—Exclude atoms in a specific entry, chosen from the submenu.
• Selection—Exclude atoms in a specific selection row, chosen from the submenu.
• Visible—Exclude atoms that are visible.
• Polymer—Exclude backbone and side-chain atoms.
• Organic—Exclude ligand atoms.
• Solvent—Exclude water atoms.
• Inorganic—Exclude atoms with element other than H, C, N, O, F, P, S, Cl, Br, or I.
For selection rows Delete Selection just removes the row from the Toggle Table. The atoms in
this selection group are deselected but otherwise remain unaltered.
You can change the name of a selection row with the Rename Selection action. If you do this
for the default selection row, the selection is preserved for future use as a named selection. You
can also rename named selection.
For entry rows, the action creates a new project entry below the entry that is the duplicate of the
entry. Both structures remain in the Workspace and are listed in the Toggle Table.
For selection rows, this action creates a duplicate selection row in the Toggle Table. No project
entry is created. This can be useful if you want to use a selection as the basis for another selec-
tion.
To clear the Workspace entirely, choose Remove Everything from Workspace in the All row.
To create a project entry by removing atoms from entries and placing them into a new entry,
you can choose Extract to New Project Entry in a selection row. The new project entry contains
the atoms in the selection, and the atoms are deleted from their current structure.
You can also add hydrogens from the main menu bar with Edit → Add Hydrogens or from the
Edit toolbar.
The computed properties are displayed in a window that opens, and you can copy and paste the
text. They are not stored in the Project Table.
These items appear on both the Show menu itself and on the As submenu. When you choose
from the Show menu, the representation is added to the display. When you choose from the As
submenu, the previous representation is replaced by the new choice.
For instance, a residue shown with lines and cartoon ribbons is shown as only ball and stick if
S → As → Ball and Stick is chosen, but is shown as cartoon and ball and stick if S → Ball and
Stick is chosen.
Two commands on the menu for entry rows and the All row create and display molecular
surfaces. The surface is created if it does not exist, otherwise the color and representation of
the surface is changed.
• Everything—Hide all features: atomic, ribbon, and surface representations and labels.
• Atoms—Hide atoms and bonds.
• Ribbon—Hide all ribbon representations.
• Cartoon—Hide all ribbon representations.
• Label—Hide labels. The labels remain defined and can be redisplayed.
• Nonbonded Atoms—Hide atoms with no attachments, such as Cl– ions.
• Mesh—Hide surfaces.
• Surface—Hide surfaces.
• Main Chain—Hide backbone atoms.
• Side Chain—Hide side-chain atoms.
• Waters—Hide water atoms.
• Hydrogens—Hide nonpolar or all hydrogens, as chosen from the submenu.
• Symmetry Mates—Removes all crystal symmetry mates from the Workspace. This is a
Workspace setting, so affects all symmetry mates in the Workspace.
• Polar Contacts—Remove polar contact (hydrogen bond) markers.
• All Others—Hide everything for all atoms in the Workspace other than the atoms defined
in the row.
Labels can be cleared from the Workspace with L → Clear. They can be hidden with H →
Label. If they are hidden, they can be redisplayed with S → Label, while if they are cleared,
they need to be created (usually by other Label menu commands) before they can be shown
again.
Residues—Label the first carbon atom in each residue with the three-letter PDB code and
residue number.
Chains—Label the first and last residue in each chain with the chain name.
The next set of commands offers a choice of the label content, including identifiers and
numeric properties.
• Other Properties—Submenu with other properties that can be used for labels:
• Formal Charge—Label each atom with its formal charge.
• Partial Charge (0.00)—Label each atom with its partial charge to two decimal
places.
• Partial Charge (0.0000)—Label each atom with its partial charge to four decimal
places.
• MacroModel Text Type—Label each atom with its MacroModel atom type.
• MacroModel Numeric Type—Label each atom with the numerical index for the Mac-
roModel atom type.
• Stereochemistry—Label each atom with E,Z and R,S stereochemistry.
• Atom Identifiers—Submenu of atom identifiers that can be used for labels.
• Color by Element—Color H, C, N, O and S atoms. Default colors are white for H, green
for C, blue for N, red for O and yellow for S. There are several choices for modifying the
color scheme on this submenu.
• Reset HNOS—Set H, N, O and S atoms to their default color. Carbon atoms remain
their current color.
• Custom Color {C}HNOS—Pick the color for carbon atoms and set H, N, O, and S
atoms to their default color. Clicking on the menu item opens a palette of colors to
choose from for carbon atoms, while selecting Recent Color Choices lists the most
recent colors chosen by this command.
• Custom Color {H}CNOS—Pick the color for hydrogen atoms and set C, N, O, and S
atoms to their default color. Clicking on the menu item opens a palette of colors to
choose from for hydrogen atoms, while selecting Recent Color Choices lists the
most recent colors chosen by this command.
• Color by Chain—Color atoms by chain:
• by Chain (Carbons)—Change the color of carbon atoms only.
• by Chain (Calpha)—Change the color of alpha carbon atoms only.
• by Chain—Change the color of all atoms.
• Chainbows—Each chain is colored with rainbow colors.
• Color by Substructure—Helices, sheets and loops are colored by the chosen color
scheme. All other atoms retain their current color.
• Color by Spectrum—Color all residues by a spectrum of colors.
• Rainbow (Carbons)—Color carbon atoms only with rainbow colors by residue posi-
tion. The chain is divided into segments, each of which has residues of the same
color.
• Rainbow (Calpha)—Color alpha carbon atoms only with rainbow colors by residue
position.
Most of the menu items are the same as those on the Toggle Table buttons or button menus. The
Selection shortcut menu has a Disable item, which turns off the selection. The Workspace
shortcut menu has Enable and Disable actions, which you can use to display or undisplay any
of the Workspace rows. These actions are also available from some of the submenus. This
menu also has a Select action, which you can use to create a selection from the visible
(displayed) atoms. It also allows you to operate on the visible atoms only or on all atoms in the
Workspace.
Chapter 3
The quality of a protein structure is often measured by deviations from values reported in the
PDB. You can analyze a protein and display tabular and graphical reports on its quality in the
Protein Structure Quality Viewer panel, which you open by choosing Tools → Protein Structure
Quality.
If there is a protein in the Workspace, it is analyzed when you open the panel. Otherwise, you
can display a protein in the Workspace and click Analyze Workspace to perform the analysis.
At the top of the panel, the protein table lists the chains in the protein that is analyzed along
with various measures of the overall structure quality. You can analyze multiple proteins and
they are all listed in the table, and you can select multiple chains in a single protein for
reporting, but you cannot select multiple proteins.
The remainder of the panel consists of two tabs that show different data: the Ramachandran
Plot tab, and the Protein Report tab.
Pausing the cursor over a point displays information for that residue at the top of the panel, and
highlights the residue in the Workspace. Clicking on a point selects the point and zooms the
Workspace image in to that residue, and highlights it with pale yellow markers. The point is
displayed as an outline instead of solid black. The residue information is displayed at the top of
the panel. Click again on the point to deselect it.
Below the plot, you can select options to change the appearance of the residues in the Work-
space structure for each of the three regions. The appearance is changed by coloring the resi-
dues and modifying the molecular representation. The coloring is applied when you analyze
the Workspace or change the color scheme, so you should set these options first, and then click
Analyze Workspace or change the color scheme. Deselecting any of these options does not
Figure 3.1. The Protein Structure Quality Viewer panel, Ramachandran Plot tab.
revert the color scheme to the original scheme, so you must change the scheme manually to
revert it.
Figure 3.2. The Protein Structure Quality Viewer panel, Protein Report tab.
The property is plotted as a function of the row number in the table in the area below the table.
The plot area has a toolbar, which is described in Section 3.3. The red dashed horizontal lines
and blue dashed vertical lines can be dragged to highlight a portion of the plot of interest,
which is given a white background, and the rest is gray. The atoms associated with the high-
lighted portion of the plot are interactively selected in the Workspace, and the rows for the
points in the highlighted region are selected in the table.
If you want to export the values in the table, click Export or Export All. Export exports the
current property table as a text file. Export All exports all property tables as a text file. Both
buttons open a file selector in which you can navigate to the location and name the file.
Reset
Reset the plot to the original pan and zoom settings.
Back
Display the previous view of the plot in the view history
Next
Display the next view of the plot in the view history
Pan/zoom
Pan the plot with the left mouse button, zoom with the right mouse button.
Zoom to rectangle
Drag out a rectangle on the plot to zoom in to that rectangle.
Configure subplots
Configure the margins and spacing of each plot in the panel.
Chapter 4
To open the Residue Analysis panel choose Tasks → Residue Analysis in the main window.
Before analyzing a protein, you should prepare it with the Protein Preparation Wizard. Calcu-
lating the energetic properties requires an all-atom structure with bond assignments. To
analyze the properties of a protein, first display it in the Workspace, and then click Analyze
Workspace. A job is run to calculate energetic properties. This job can take several minutes,
and progress is reported in a bar at the bottom of the panel.
When the job finishes, the table is filled in and the first property is plotted below. You can sort
the table by clicking on the heading of the column you want to sort by. A second click changes
the sort direction. The table columns are described in Table 4.1.
Column Description
Residue Residue identity: chain, residue number and insertion code, 3-letter name.
Hydropathy Hydropathy calculated using the Kyte-Doolittle scale [6], normalized by the
solvent accessible surface area.
Potential Energy Sum of the residue-based internal energy and the non-bonded interaction
energy (vdW, electrostatic) between the residue and the remainder of the sys-
tem.
Internal Energy Sum of energies arising from intra-residue bonded interactions (bonds,
angles, torsions) and intra-residue non-bonded interactions (vdW, electro-
static).
Interaction Energy Energy of interaction between this residue and all other atoms.
SASA (Non-polar) Solvent-accessible surface area of nonpolar atoms of this residue
SASA (Polar) Solvent-accessible surface area of polar atoms of this residue
SASA Total solvent-accessible surface area of this residue
Rotatable bonds Number of rotatable bonds in this residue
You can export the table data to a CSV file by clicking Export and then providing the file name
in the file selector that opens.
To highlight one or more residues in the Workspace, select the table rows. If you have Fit on
selection selected, the view zooms in (or out) so that these residues occupy most of the Work-
space.
To examine a particular property for all residues, you can make a plot of the property as a func-
tion of residue position. Choose the property from the Graph property option menu to display
the plot. You can use the plot to select residues in the table and highlight them in the Work-
space, by moving the dotted lines to enclose the residues you are interested in. For example,
you might want to select residues that have significantly larger or significantly smaller values
of the properties than the average, by moving the upper or the lower red dotted line to enclose
just those data points.
The toolbar provides tools for manipulation of the plot and for saving an image of the plot.
This is a generic toolbar, and some of the actions may not be useful in the current context. The
toolbar buttons are described below.
Reset
Reset the plot to the original pan and zoom settings.
Back
Display the previous view of the plot in the view history
Next
Display the next view of the plot in the view history
Pan/zoom
Pan the plot with the left mouse button, zoom with the right mouse button.
Zoom to rectangle
Drag out a rectangle on the plot to zoom in to that rectangle.
Configure subplots
Configure the margins and spacing of each plot in the panel.
If you want to analyze another protein, click Reset to clear all the panel data.
Chapter 5
The Consensus Visualization panel helps you to identify conserved waters, counter ions, and
ligands for a protein. To open this panel, choose Tools → Protein Consensus Viewer.
Homologs of the target protein are identified by a BLAST search. You can select a subset of
these homologs to determine the consensus between them for the locations of waters, counter
ions, and ligands. These homologs are aligned, both by sequence and by structure, to the target
protein. The consensus between the positions of the waters, counter ions and ligands is then
determined. Consensus analysis can help you to quickly identify moieties that are repeated
among multiple structures, such as structurally important waters that should be included in
modeling studies.
For a tutorial introduction to consensus visualization, see Chapter 5 of the BioLuminate Quick
Start Guide.
• Browse for File—Open a file browser in which you can navigate to the desired location
and select the file that contains the structure. The allowed file types are Maestro and
PDB.
• From PDB ID—Import the structure from the specified PDB ID. Opens the Enter PDB ID
dialog box, in which you can enter the PDB ID of the structure. The structure is retrieved
from a local copy of the PDB if it is available, or from the RCSB web site, depending on
the preference set for PDB retrieval.
• From Workspace—Import the structure that is displayed in the Workspace.
If you choose to prepare the protein beforehand in the Protein Preparation Wizard panel, you
should ensure that you do not delete any of the molecules for which you are seeking a
consensus. In particular, you might want to deselect Delete waters beyond N Å from het groups,
or make the distance large enough to ensure that you have the relevant waters.
If you already have a set of homologs, you can simply import them with the Import button.
Once they are imported you can align them structurally by clicking Align.
To run a BLAST search for homologs, click Find and Align Homologs. First, the Blast Search
Settings panel opens. You do not usually need to change the BLAST search settings. When you
are satisfied with the settings, click Start Job. A Job Progress dialog box replaces the Blast
Search Settings dialog box, and displays the log file from the BLAST search.
After a few minutes, the job finishes and the BLAST Search Results dialog box opens, with the
results of the search. The top ten results are selected in the table by default. You can change the
number of top rows to select by entering the number of rows in the text box below the table,
and clicking Select Top. You can also manually select rows in the table.
When you have finished selecting rows, click Incorporate Selected Rows. If you do not have a
local installation of the BLAST or PDB databases, the search is done on the web, and a
warning is displayed: “Multiple Sequence Viewer is attempting to access a remote server.
Would you like to continue?” You can select Do not ask this question again, to prevent it from
opening each time a structure is downloaded, then click OK.
If an information box opens stating that problems were found when importing a structure, you
can select Do not show this dialog again to prevent it from opening for each structure that has
problems, and click OK. The structures are imported without any preprocessing, so they might
have structural defects. For the purposes of this panel, it is generally acceptable to use struc-
tures from the PDB that have structural issues.
The homologs you selected are aligned, added to the sequence viewer, and displayed in the
Workspace. All atoms are marked in all of the homologs.
For each of the three types of molecules, you can perform the following actions:
• Choose whether to display all, only the consensus, or none of the molecule of the given
type, from the Display option menu. For example, viewing all the molecules gives an indi-
cation of whether a consensus exists, whereas viewing the consensus shows whether there
is a strong enough consensus to consider the molecule as conserved.
• Select and display residues near the molecules of the given type. You can choose to dis-
play residues near any of the molecules or only the consensus molecules. The action is
not performed until you click Select.
• Change the color of the residues that are near the molecules of the given type, by clicking
on the color button, and choosing a color in the color selector that opens.
When displaying the molecules, the identity of any consensus molecule can be ascertained by
moving the cursor over the structure in the Workspace and viewing the text in the status bar,
below the Workspace.
Chapter 6
You may need to identify reactive residues in a protein, so that you can mutate them to improve
the protein properties. This can be done in the Reactive Protein Residues panel, which you
open by choosing Tools → Reactive Residue Identification.
Reactive residues are identified by matching residue patterns in the sequence. Four patterns are
provided by default, for the common reactions: deamidation, oxidation, glycosylation, and
proteolysis. You can use these patterns or you can set up and use your own patterns.
To identify reactive residues, include the protein you want to analyze in the Workspace, and
click Analyze Workspace. The structure in the Workspace is analyzed to identify residues that
match the patterns.
The results are listed in the table, which shows the reaction type, the reactive residues identi-
fied, the solvent-accessible surface area of the reactive residues, their percentage exposure to
solvent, and their B-factors (if available). The B-factor shown is the average over all atoms in
the residue.
You can use the Show option menu to show only the results for a particular reaction type. You
can also apply filters on the percentage solvent exposure and the B-factor.
To sort the table by the values in a column, click on the column heading. Click again to change
the direction of the sort. The column by which the table is sorted is indicated by an arrow on
the right side of the heading, which also indicates the sort direction.
The reactive sites are marked with spheres in the Workspace. The spheres are colored
according to the reaction type, and the color legend for the spheres is given below the table.
When you select a table row, the residue is highlighted in the Workspace. If you have Fit on
selection selected, the view zooms in to that residue
If you want to define your own reactive groups, click Edit, to open the Edit Patterns dialog box.
This dialog box lists the patterns in a table, giving the pattern name, the definition, and a
hotspot index. To edit an existing pattern, double-click the table cell that you want to edit, and
enter the changes. The lower part of the panel explains the syntax for the patterns in the Defini-
tion column. The syntax is an extended PROSITE syntax, which allows you to specify
secondary structure and some properties:
• Standard IUPAC one-letter (upper case) codes are used for all amino acids.
• Lower case x is used for any amino acid.
• Each element of a pattern is separated with a - symbol.
• Residues that are permitted at a given position are listed between square brackets, e.g.
[ACT] means one of Ala, Cys, or Thr, or in other words, only Ala, Cys, or Thr can appear
at this position.
• Residues that are not permitted at a given position are listed between curly brackets, e.g.
{GP} means not Gly and not Pro, or in other words, any residue but Gly or Pro can
appear at this position.
• Repetition is indicated using parentheses, e.g. A(3) means Ala-Ala-Ala, G(2,4) means
between 2 to 4 Gly residues.
• The following lower case characters can be used for residue types:
• a—acidic residue: [DE]
• b—basic residue: [KR]
• o—hydrophobic residue: [ACFILPWVY]
• p—aromatic residue: [WYF]
• The following lower case characters can be used to restrict residue types by property:
• s—solvent-exposed residue
• h—residue in helical region
• e—residue in extended (beta strand) region
• f—flexible residue, defined as having a B-factor above the average over all residues
These four characters can be appended to a residue type to restrict the type of residue, e.g.
Ah means Ala in a helical region.
Some examples of valid and invalid patterns are given below, with comments.
The hotspot index is the index of the reactive residue in the pattern.
Chapter 7
Protein aggregation often occurs via hydrophobic regions. You can locate these regions using
the Aggregation Surface panel, which you open by choosing Tasks → Aggregation Surface.
Aggregation regions are defined by identifying clusters of exposed hydrophobic residues, and
creating a molecular surface that is colored red in the regions near these residues.
Once you have located potential aggregation regions on a protein, you might want to mutate
residues in these regions to reduce the tendency to aggregate.
• Entire entry containing selected atoms—Compute the surface for the entry that con-
tains the selected atoms. This is useful if you want to view how the aggregation sur-
face on one protein matches residues in another protein.
• Entire chain containing selected atoms—Compute the surface for the entire chains
that contain the selected atoms. This is useful for computing a surface for one chain
of a multi-chain protein, for example.
• Selected atoms and attached hydrogen atoms only—Compute the surface for the
selected atoms with their attached hydrogens only. This option allows you to calcu-
late the surface for only part of a protein chain, for example.
3. Enter a name in the Surface name text box, if you want to change the default name.
This name is used in the Manage Surfaces panel (Style → Create and Manage Surfaces
→ Surface Manager) to identify the surface.
4. Click Create Surface.
A progress bar is displayed above the buttons while the surface is being created. Surface
creation should take less than a minute.
When the surface is created, it is displayed in the Workspace and other surfaces are hidden. To
display more than one surface, use the Manage Surfaces panel (Window → Show → Surface
Manager).
The aggregation surface is a molecular surface that is created for a large probe molecule, repre-
senting part of a protein. It is colored red in the regions near the hydrophobic residue clusters.
The side chains of these residues and the alpha carbons are also colored red, and the remaining
backbone atoms are colored gray. The side chains are displayed in ball-and-stick representa-
tion, and the backbone is displayed as lines. Residues that are in contact with these residues are
displayed as lines in gray. All other residues are hidden, so you see only the residues that
contribute to the aggregation regions and very near neighbors. The surface is semi-transparent,
so you can see the atoms inside the surface. The residues involved in the clusters are colored
red in the Workspace sequence viewer.
Because the surface color scheme depends on the atom coloring, the surface is removed if you
change the atom color scheme. For a similar reason, the surfaces are removed when you close
the panel. However, if you don't make any changes, the atom visibility and the color scheme
for the residues are still in effect when you close the panel even though the surface is removed.
If you want to remove the surfaces and redisplay all atoms, click Delete Surfaces.
• Surface grid spacing—Set the grid spacing in angstroms for generation of the surface. A
smaller number results in a smoother surface, but takes longer to generate the surface.
• Probe radius—Set the radius of the probe for defining the surface. This is the radius of
the sphere that is rolled over the van der Waals surface to create a Connolly surface. The
large default radius is intended to model a protein probe. Hydrogens are included when
creating the surface. See Section 12.1.2 of the Maestro User Manual for details on the
construction of the surface.
• Transparency—Set the default transparency of the surface. The transparency can be
changed in the Surface Display Options dialog box—see Section 12.4.2 of the Maestro
User Manual.
• Radius to find neighbors—Specify the distance cutoff for finding hydrophobic neighbors
to identify aggregation regions. The distance between any side-chain heavy atom in one
hydrophobic residue and any side-chain heavy atom in another hydrophobic residue must
be less than this cutoff for the residues to be counted as neighbors.
• Hydrophobic neighbors required for site—Specify the minimum number of hydrophobic
neighbors required to include a residue (site) in an aggregation region.
• Buried residue SASA—Specify the maximum solvent-accessible surface area (SASA) for
a residue to be regarded as buried. Buried residues are not included in aggregation
regions. The SASA is measured for the heavy atoms only, and does not include hydro-
gens.
One way of doing this is to perform a set of mutations to reduce the size of the aggregation
regions. You can create a set of structures that have single mutations at selected sites, which
you choose from the residues in the aggregation regions, as follows:
The residues that were found to contribute to the aggregation region are listed in the table in the
Residues tab. You can then select any of them for mutation, define the mutations, and run the
job. See Chapter 13 for details of setting up a residue scanning job.
Another option is to mutate a single residue or a loop. Mutating a loop (“loop swap”) is useful
if you want to mutate more than one residue and the residues are adjacent. For this purpose,
you can use the Residue and Loop Mutation panel. The loop to change is defined by selecting
the residues in the Workspace. You can take advantage of the fact that you only have the aggre-
gation residues and their neighbors visible in the Workspace to choose the residues for the loop
to swap. See Chapter 12 for details of using this panel.
Chapter 8
Finding large-scale motions in proteins can provide information on the flexibility or rigidity of
parts of the protein, on domain movements, or give some clues to biochemical processes.
Trajectories from molecular dynamics simulations can show the large-scale motions of
proteins, but these simulations are not resolved into individual modes, and they require a large
amount of computational resources. The lowest vibrational modes of a protein can be deter-
mined and visualized using the Low Mode Vibrational Sampling panel. You can open this panel
by choosing Tasks → Low Normal Mode Analysis → Calculate or Visualize.
Before running the calculation, you should ensure that the protein is properly prepared, using
the Protein Preparation Wizard (Tools → Protein Preparation). You should remove waters and
solvent molecules so that you are analyzing just the protein (and its ligands, if any).
First, the input structure is minimized (using the PRCG method for a maximum of 10000 itera-
tions to a gradient convergence threshold of 0.05 kJ mol–1Å–1). This ensures that the structure
is at its minimum, which is important for generating the vibrational modes. The vibrational
modes are generated as a set of structures sampled at regular intervals along a full cycle of the
vibrational mode. The rotational and translational modes (the trivial modes) are discarded. The
vibrational modes are then visualized by displaying the structures in sequence as a “movie” in
the Workspace.
2. Set the number of frames to generate per vibrational cycle in the Number of frames per
mode text box.
Each frame is a snapshot of the structure at a particular point in the vibration between the
classical limits. The full cycle of a vibration is divided evenly to define the coordinates
used for the snapshots, so this number should be divisible by 4.
3. Set the maximum amplitude of vibration in the Vibrational amplitude text box.
This is the maximum displacement of the fastest-moving atom. For proteins, the fastest
moving atom could potentially move a large distance if the motion involves a long loop,
for example. This choice is somewhat arbitrary: you might for example want to exagger-
ate the motions to make them easier to see.
4. Click Run.
A job is started that generates the series of structures for each vibration. The limit on the
number of atoms that can be processed depends on the memory available on the machine, but
with 4GB of memory, it is about 5000. For example, 1ETT, with about 4800 atoms, runs
successfully. The job can take several hours to run, depending on the size of the protein.
When you have finished viewing the modes, you should clean up the structures in the project,
by clicking Remove Structures from Project.
Chapter 9
Determining the stability of alpha-helical peptides is important for their use as therapeutic
agents. You can obtain an estimate of the stability by using the Peptide Helicity panel, which
you can open by choosing Tasks → Peptide Alpha Helicity. The primary purpose of this panel is
to provide information on the relative stabilities of a series of small peptides, although you can
also obtain information on a single peptide.
The alpha helical tendency of one or more peptides is determined from a molecular dynamics
simulation by tracking i to i+4 hydrogen bond formation, and other indications of helical struc-
ture. The prediction is made entirely from the sequences, by building them as idealized alpha-
helices, and then performing a molecular dynamics simulation in water using simulated
annealing, to simulate experimental melting experiments. At the end of the simulation, aver-
ages are taken to determine the values of properties that can be used to determine the helicity
of the sequences.
The simulations are set up in the Start Simulations tab, and the results are presented in the
Results tab.
To run a simulation:
1. Select an option for the sequences to simulate under Use sequences from.
The options are Workspace (the included entries), Project Table (the selected entries),
File, or Sequences. If you choose File, enter the file name in the text box or click Browse
to navigate to and select the file. If you choose Sequences, type or paste the sequences
into the text box, separated by commas. Each sequence that you provide is simulated sep-
arately in a single job.
2. Click Start Simulations.
A Start dialog box opens, in which you can set the job name, select a host, and specify the
number of processors. Click Start in this dialog box to submit the job to a host for
execution.
Since the simulations are likely to take many hours, it is a good idea to run the job on a
multiprocessor host. The simulation uses a 3D domain decomposition of the simulation
box, so you can specify the number of processors for each dimension in the decomposi-
tion (labeled x, y, and z). See Section 3.10 of the Desmond User Manual for more
information. A good choice is 2 processors for each, on an 8-core CPU.
When the simulation finishes, you can review the results of the simulations in the Results tab.
Click Load Output File read the results, which are in a CSV file. A file selector opens, in which
you can navigate to and select the output CSV file. The results are presented in a table, whose
columns are described in Table 9.1. The averages are taken over the length of the simulation.
If you want to run another calculation, click Reset to clear all panel data.
Table 9.1. Columns of the Results table in the Peptide Helicity panel.
Column Description
Chapter 10
The question of whether one protein binds to another, and where, can be addressed by protein-
protein docking. Protein-protein docking in BioLuminate is performed using the Piper
program [5], under license from Boston University. The job can be set up in the Protein-Protein
Docking panel. In this panel you can set up jobs to dock two arbitrary proteins, dock an antigen
to an antibody, or dock one protein to itself to form a dimer or a trimer.
One protein is treated as the “receptor” and the other as the “ligand”. In the general case, it
does not matter which protein is treated as the receptor and which protein is treated as the
ligand. For antibody-antigen docking, the receptor is the antibody and the ligand is the antigen.
The algorithm samples all possible orientations of the two proteins, subject to whatever
constraints are applied. It uses a grid to locate the best poses of the two proteins, with a
maximum resolution in the poses of about 5°. The docking is performed as a rigid-body opti-
mization: there is no subsequent minimization of the interfacial region.
To prepare your protein for docking, you can use the Protein Preparation Wizard (see the
Protein Preparation Guide). When you prepare your structure, you should select Fill in missing
side chains using Prime to predict the side chains, and Fill in missing loops using Prime to
predict the loops. The loop prediction used in the protein preparation is the faster look-up
method, rather than the more extensive ab initio loop building. Both of these predictions can
take several minutes.
If you are concerned about the accuracy of the surface side chains or loops, you can do a more
extensive prediction in the Refinement panel (Tasks → Loop and Sidechain Prediction). If you
have access to the X-ray data, you could consider performing some refinement of the structure
with PrimeX, the X-ray refinement program. Choose Tasks → Advanced Tasks → Protein X-
Ray Refinement → Display Toolbar, and use the toolbar to perform the refinement tasks. See
the PrimeX User Manual for more information.
In the docking experiment, however, the accuracy of the methods may not be sufficient to
distinguish between conformations, so an extensive prediction of the surface side chains or
loops is probably not necessary. The presence of the side chain or loop in the right region is
likely to be more important than its exact conformation. If you want to test the effects of loop
or side chain conformations, you can generate multiple conformations and dock each of them.
Next, choose the receptor protein and the ligand protein. It does not matter which of your two
proteins you choose as the receptor and which you choose as the ligand. To select the struc-
tures, click Receptor or Ligand in the Protein Structures section, and choose a source from the
menu that is displayed. The menu items are the same for each menu:
• Browse for File—Opens a file selector so that you can browse to the location of the file
and select it. The structure is added to the project and to the Workspace.
• From the Workspace—Use the structure that is in the Workspace. You should ensure that
only the desired structure is included in the Workspace. If you have prepared the protein
with the Protein Preparation Wizard, the structure should already be in the Workspace.
• From PDB ID—Opens a dialog box in which you can specify a PDB ID. The protein you
specify is imported from the PDB, either from a local copy or from the web site. It is
added to the project and to the Workspace. Proteins from the PDB are likely to have
atoms missing.
When you import the structures, if hydrogens are missing, you are prompted to add them. If the
protein has multiple chains, you are prompted to choose the chains. You can choose more than
one chain for the docking.
When the protein is imported, the structure is added to the project and included in the Work-
space. If the structure is removed from the Workspace you can add it with the Include and
zoom button (eye icon).
To view the sequences of the proteins you imported in a sequence viewer, click View
Sequences.
When you have selected the proteins to dock, you can set two parameters to control the
docking. The first is the number of ligand orientations to the receptor that are sampled. You can
set the value in the Number of ligand rotations to probe box The default of 70,000 corresponds
approximately to sampling every 5° in the space of Euler angles, and is the maximum value
allowed. Decreasing the number of rotations generally degrades the results, but decreases the
run time.
The second parameter is the number of poses to return, which you can set in the Maximum
poses to return box. Each pose is the center of a cluster that results from clustering the top
1000 results of rigid docking of the ligand. If more clusters are found than the number of poses
to return, the clusters are ranked by size and poses are chosen from the largest clusters. If fewer
clusters are found than the maximum number of poses to return, one pose per cluster is
returned.
After setting the parameters, click Generate Models to run the docking job. A Start dialog box
opens, in which you can choose whether to append the results to the project (Append new
entries) or leave them on disk (Do not incorporate), choose a host to run the job on, name the
job, and start it. Docking jobs are run on a single processor. A typical docking job with the
default parameters takes several hours.
Constraints can be added in the Constraint section of the panel. Each constraint is represented
by a row in the table in this section. To add a new constraint row to the table, click Add
Constraint. All of the settings needed to define the constraint can be made in the cells of the
table.
Column Description
To define the constraint, you must choose the constraint type in the Type column, choose the
protein to apply it to in the Protein column, set a value in the Bonus column if it is an attractive
constraint, and then select the residues to apply the constraint to by using the Residues
column. You can select the residues in the Workspace, then choose From Workspace selection,
or you can choose Choose Workspace atoms, then use the Atom Selection dialog box to select
the residues for the constraint. See Section 6.5 of the Maestro User Manual for information on
using the Atom Selection dialog box.
To delete a constraint, click the red minus icon in the Actions column.
The dimer and the trimer are subject to symmetry constraints: the dimer must have a twofold
axis of rotation (C2 axis), and the trimer must have a threefold axis of rotation (C3 axis).
As for the other types of docking, you can choose the antibody and antigen in the Protein struc-
tures section. After prompting you to add missing hydrogens, the antibody is analyzed to
locate the CDR regions and determine which are the light and heavy chains. A progress bar is
displayed while the analysis is done. When the analysis finishes, another dialog box opens,
prompting you to select the chains to use for the antibody (receptor). You must select two
chains: a light chain and a heavy chain. When selecting the antigen, an alert box opens,
warning you about the chains. You can dismiss this box.
Apart from these changes, docking an antigen to an antibody is set up and run in the same way
as a general protein-protein docking job.
Chapter 11
If you have a protein sequence and want to build a homology model of the protein, there are
three ways that you can proceed in the BioLuminate interface.
Proteins for which you expect a high homology with the template and that require only a
straightforward alignment to the template can be modeled in the Homology Model panel. In the
default mode, any missing loops are predicted using a curated database of known loops in the
PDB. This approach is very fast and a full homology model can typically be generated using
this panel in 2-5 minutes. To open this panel, choose Tasks → Homology Modeling → Simple
Homology Modeling. The use of this panel is described below.
Proteins where the homology is not as high or where alignment of the template and the query is
required can be modeled either in the Multiple Sequence Viewer panel (opened with Tools →
Multiple Sequence Viewer), or with the Structure Prediction panel (opened with Tasks →
Homology Modeling → Advanced Homology Modeling). For information on using these panels,
see the Multiple Sequence Viewer document for the Multiple Sequence Viewer and the Prime
User Manual for the Structure Prediction panel.
For a tutorial introduction to homology modeling using the Homology Modeling panel, see
Chapter 6 of the BioLuminate Quick Start Guide. For a tutorial introduction to advanced
homology modeling, see the Prime Quick Start Guide.
2. Choose a source for the template structure on which to build the model, using the Tem-
plate button.
You can either read in a template structure from a Maestro file or a PDB file (Browse for
File), or you can run a BLAST search, as follows:
When the job finishes, the model is added to the Workspace, in cartoon representation. The
cartoon is colored by how the template was used: dark blue for residues for which all coordi-
nates were taken from the template, cyan for residues for which the backbone was taken from
the template, and red for residues that were entirely modeled, not using the template. The title
given to the model includes information on the query and the template.
If the job fails, it is likely that there is insufficient homology or poor alignment between the
reference and the template to build a model. In this case you should use the Advanced
Homology Modeling panel (Tasks → Homology Modeling → Advanced Homology Modeling) or
the Multiple Sequence Viewer (Tools → Multiple Sequence Viewer).
If you want to examine the quality of the model structure, click Examine Model Quality, to open
the Protein Structure Quality Viewer panel, where you can view reports on the protein structure,
a Ramachandran plot, and plots of protein properties
If you want to refine loops in the model, click Refine Loops. The Refinement panel opens with
the Refine loops task selected. You should only need to refine the loops if they were not
predicted from the template. Click Non-Template in the Refinement panel to list only the loops
that did not come from the template. See Chapter 6 of the Prime User Manual for information
on this panel.
Chapter 12
At some point in a workflow, you might want to mutate a single residue, or replace a single
loop with another loop. You can do this in the Residue and Loop Mutation panel, which you
open from the Tasks menu.
BioLuminate provides other tools for mutations of more than one residue. The Residue Scan-
ning panel allows you to mutate a protein at multiple sites to generate a set of proteins, each
with a single mutation—see Chapter 13 for information. If you want to mutate residues to or
from cysteine and break or form disulfide bonds, you should use the Cysteine Mutation
panel—see Chapter 14 for information.
The workflows for each of these mutations is summarized here and detailed in the following
sections.
To find a particular residue in the Workspace, you can use the Find tool. Type CTRL+F (F) in
the Workspace to display the Find toolbar below the Workspace. You can then choose Residue
number or Residue type from the Find menu to choose residues by number or type, then enter
the number in the text box or choose the type from the menu, and click the N or P button.
To find a residue in the sequence viewer in the Residue and Loop Mutation panel, enter the
residue letter in the Find Pattern text box, and click the arrow keys to step through the occur-
rences. You can also enter multiple residues to find a pattern, e.g. D-A-P. The pattern uses
PROSITE syntax, which is explained in the tool tip. When you have found the residue you
want, clear the Find Pattern text box, then click on the residue to select it, or simply select it in
the Workspace. The residues that are found are selected in the Workspace, so clicking on one
of them changes the selection to the desired residue.
If the sequence viewer doesn’t have a sequence in it, you can click Import and choose From
Workspace to load the sequence of the Workspace structure.
To begin editing a standard amino acid in the Workspace to produce a custom amino acid,
check Edit. The amino acid replaces the protein in the Workspace, with the backbone repre-
sented as ball and stick and the side chain as lines.
To add groups to the structure or to change elements, you can use the Build and the Fragments
toolbars. Click Build or Fragments on the Manager toolbar at the top of the main window to
display these toolbars. With the Fragments toolbar, you can select a fragment, then click on an
atom to replace that atom with the fragment. With the Build toolbar, you can sketch a structure
with the Draw tool, change the element with the Set Element tool. You might also want to add
hydrogens after sketching a structure, which you can do from the Edit toolbar with the Add H
tool. You should also use the Clean Up tool after sketching the structure and adding hydrogens,
to ensure that the structure is not distorted.
For more information on building structures, see Chapter 5 of the Maestro User Manual.
When you have finished editing, clear the Edit check box. A dialog box prompts you to provide
a 3-letter PDB name for the custom amino acid. The default is USR. You can edit the name in
the Three-letter PDB Name text box after creating a custom amino acid.
The text on the amino acid option menu is set to Custom when you finish editing.
After the mutation is defined, the mutation is reported in the panel, and a title for the mutated
structure is entered in the Mutated structure title text box. You can edit this title if you wish.
There are two types of refinement available: minimization and molecular dynamics simulation.
You can choose one or the other or both. The minimization is run first.
Minimization is the fastest option, and is a good choice if the mutated residue is in approxi-
mately the right conformation. If you choose to do a minimization, you can run it in the gas
phase or in implicit solvent. The residues minimized can be limited to just the altered residues,
all residues within a given distance of the altered residues, or the entire structure.
Molecular dynamics is much more time-consuming, but it can sample other parts of conforma-
tional space, particularly if you run simulated annealing. Once the structure is mutated and any
minimization is done, the protein is prepared for the simulation by adding explicit water mole-
cules. The Molecular Dynamics panel or the Simulated Annealing panel opens, and you can
make settings and run the simulation, which uses the Desmond molecular dynamics program.
For more information on these panels, see Chapter 3 of the Desmond User Manual.
If you decide you do not want to run the simulation, which can take many hours, you should
delete the entry group in the Project Table that was created for the simulation, as follows:
1. Select the entry group (click the row with the number in square brackets).
2. Choose Entry → Unlock.
3. Choose Entry → Delete.
For information on using the Project Table, see Chapter 9 of the Maestro User Manual.
The basic procedure is summarized below, and details are given in the following sections.
5. If desired, load a new structure and select residues to specify a starting set of residues for
the replacement loop.
By default, the replacement loop starts out identical to the original loop.
6. If desired, edit the replacement loop by clicking on residues in the table and specifying an
insertion, deletion or mutation at that point.
7. Choose the number of models to generate and the prediction method.
8. Click Accept.
The Insertions, Deletions, and Loop Swaps panel closes.
9. Click Mutate.
No refinement of the loop modification is offered, so you must perform any refinement inde-
pendently.
• Browse for File—Open a file browser in which you can navigate to the desired location
and select the file that contains the structure. The allowed file types are Maestro and
PDB.
• From PDB ID—Import the structure from the specified PDB ID. Opens the Enter PDB ID
dialog box, in which you can enter the PDB ID of the sequence. The sequence and struc-
tures are retrieved from a local copy of the PDB if it is available, or from the RCSB web
site, depending on the preference set for PDB retrieval.
• From Workspace—Import the structure that is displayed in the Workspace.
The sequence for the imported structure is added to the sequence viewer. This allows you to
import multiple structures, and choose which structure to take a loop from.
When you have the structure that you want to mutate, you can select the loop to mutate either
in the Workspace or in the sequence viewer in the panel.
Selecting the loop in the Workspace allows you to visually identify the loop. To ensure that you
are picking residues in the Workspace to define the loop, choose Edit → Pick Mode → Resi-
dues, or type the letter R with the pointer in the Workspace. You can then pick residues that
belong to the loop you are interested in. The residues that you pick are highlighted in the
Workspace (yellow dots) and in the sequence viewer (white letter on blue background).
Selecting the loop in the sequence viewer allows you to easily search for patterns in the
sequence, or to use the secondary structure assignment (SSA) to identify loops. The SSA has
no annotation where there are loops (the absence of any other secondary structure). To search
for patterns in the sequence, enter the pattern in the Find Pattern text box, with a dash sepa-
rating each residue from the next, e.g. D-A-P. The syntax is summarized in the tool tip for the
text box, and is given in more detail on page 39. You can use the arrow keys to step through the
patterns. All of the matches are selected in the Workspace, so you might have to select the loop
that you want in the Workspace, or by clearing the Find Pattern text box, then selecting the
residues to use.
When you select the residues for the loop to mutate, you should select at least two residues on
either side of the residues that you plan to modify. These extra residues are “stem” residues,
which are used by the Prime software when rebuilding the loop, to properly fit the new loop
onto the structure. For example, if a single residue is being deleted, the original loop should
consist of five residues - the residue to delete and the two residues on either side of it.
After you have selected the residues, click Workspace Selection in the Original loop section, to
register the selection and copy the structure for the loop mutation job. The original loop is used
by default for the replacement loop, which you can modify as described in the next section.
If you want to replace the loop with a loop from another structure, you can place another struc-
ture in the Workspace, and select a loop from this structure. If the structure is not already in the
sequence viewer, you can import it as described above. You can also select the loop in the same
way as for the original loop.
You might want to align the sequence for the replacement to the sequence for the original
structure, so that you can select the loop by its alignment. To do the alignment, make sure that
the original sequence is at the top of the sequence viewer (use the shortcut menu for the
sequence name to move it if necessary), and add the replacement sequence to the selected
sequences. You can then use either the pairwise alignment tool or the multiple alignment tool
to align the sequences (using ClustalW).
You can also do manual alignment, with locking and unlocking of gaps. See the Multiple
Sequence Viewer document for more information on doing manual alignment.
When you have selected the residues in the desired structure for the replacement loop, click
Workspace Selection in the Replacement loop selection. If you decide not to use this loop, you
can click Set to Original Loop to change the replacement loop back to the original loop struc-
ture.
When you choose to insert a residue, a new column is added to the table before or after the
current position. The cell in the Original row has three dashes, to indicate that there is no
residue in the sequence at this position. The cell in the Replacement row has the residue name
between two colons, to indicate that the chain and residue number are not yet assigned. Inser-
tion codes are added for the inserted residues when the job is run.
If you delete a residue, the cell in the Replacement row is set to three dashes, to indicate the
deletion.
Likewise, if you mutate a residue, the cell in the Replacement row has the new residue name
between two colons.
You can also choose whether to create the loops via a fast table-lookup method or via a full
Prime loop prediction, which builds loops by sampling multiple conformations and scoring
them, to produce the best loop structures. The fast method takes only a minute or so, where as
the thorough loop sampling can take hours. To make the choice, select Fast or Thorough under
Loop prediction. No additional refinement beyond these methods is done for new loop predic-
tions.
When you have finished selecting options, click the Accept button to return to the main
Residue and Loop Mutation panel. Click the Mutate button on that panel to begin the loop swap
job.
Chapter 13
This chapter describes how to scan a protein for potential residue mutations, generate mutated
structures, and compare the properties of the mutated structures. The mutation sites and the
mutations can be selected manually or selected automatically based on homology modeling or
3D structural criteria.
If you have not already opened the Residue Scanning panel, open it by choosing Tasks →
Residue Scanning → Perform Calculations in the main window. You can display the protein
before or after opening the panel.
The first part of the procedure is to analyze the protein, which you do by clicking Analyze
Workspace. In this process, you may be asked some questions:
• If your structure has not been prepared for modeling, you are asked if you want to use the
Protein Preparation Wizard to prepare the structure. You cannot proceed if you do not
have a protein that is an all-atom, 3D structure with bonding information. After preparing
the protein, display it in the Workspace again and click Analyze Workspace.
• If your structure has multiple chains, a dialog box opens, prompting you to choose a
chain. Mutations can only be performed on a single chain in the residue scanning run. If
you want to mutate multiple chains, you must mutate them in separate runs.
When the structure has been analyzed, the residues table is filled in with all the residues in the
chosen chain.
Instead of analyzing an entire protein or an entire chain, you can select residues and analyze
only the selected residues. To do this, select the residues in the Workspace structure, select
Analyze only selected Workspace residues in the Residue Scanning panel, then click Analyze
Workspace.
There are two main ways of selecting residues to mutate: manually, or based on analysis of
homologs to the structure of interest. Manual selection is described in this section, and using
homology is described in Section 13.3 on page 74.
• To select a residue for mutation, check the box in the Mutate column for that residue.
• You can also select residues for mutation and select the mutations at the same time—see
Section 13.2.2 for details.
When you select rows in the table, the Workspace view zooms in to the residues you have
selected. The selected residues are highlighted with green carbons, and the remaining residues
are dimmed. (You can adjust how far the view zooms in by making a setting in the Preferences
panel. Choose Edit → Settings → Preferences, select Fitting under Workspace in the tree on
the left, then enter a value in the Fit margin text box.)
The residue types include groups such as Neutral, Polar, Hydrophobic, Positive, and Nega-
tive. You can choose more than one residue or group from the list, and the new residues
are added to the list. Each residue type that you select is checked on the list.
3. Click Apply.
The Mutate check box is checked for any of the selected rows that are not already marked for
mutation, and the mutation is set to the residues that you chose from the option menu. The
mutations are listed in the Mutations column of the table.
A text message is displayed at the bottom of the tab that reads Will mutate M of N residues;
Number of mutations: K. The number of mutations is the total number of structures generated.
Only one site is mutated at a time, so the number of structures is linear in both the number of
sites mutated and the number of mutations at each site.
Figure 13.2. Option menu for defining mutations for a single residue.
The region around the mutation site that is relaxed during the calculations is defined by a cutoff
distance. Any residue that has an atom within the specified distance of a hypothetical Arg
residue at the mutation site is included in the refinement. A hypothetical Arg residue is used to
ensure that the set of residues refined is identical, regardless of the initial or mutated residue
identities. This ensures that comparisons of the properties of the mutated structures are not
affected by the choice of residues that are relaxed.
The method used for the refinement of the selected residues can be chosen from the Refine-
ment option menu. The choices are:
• Gas-phase minimization—Minimize the energy of the residues using the internal force-
field minimizer. This is the fastest method, but does not include solvation effects.
• Implicit solvent minimization—Minimize the energy of the residues with an implicit (con-
tinuum) solvation model, using Prime.
• Side-chain prediction—Prior to minimization, a thorough exploration of possible side
chain conformations is performed.
• Side-chain prediction (cbeta)—Prior to minimization, a thorough exploration of possible
side chain conformations, including the CA-CB torsion, is performed.
• Side-chain prediction (bb)—Prior to minimization, a thorough exploration of both poten-
tial side chain and potential backbone conformations is performed.
For systems that consist of more than one entity (chain), the binding affinity changes with
residue mutation can be calculated. The affinity changes are calculated between the chains that
constitute a “ligand” group and the remainder of the system. The ligand group is defined as the
chain for which mutations are being calculated (the chain you selected) plus any additional
chains you wish to add to the ligand group. The binding affinity is calculated with the Prime
MM-GBSA technology.
If you run the job on a multiprocessor host, you can divide the job into subjobs and distribute
them over multiple processors. The minimum work a subjob can do is to mutate one residue to
another residue, so you should not specify more subjobs than there are output structures. For
optimal load balancing, the number of subjobs should be a few times the number of processors.
A status bar showing the progress of the job is displayed above the Start button. The job takes
several minutes per residue mutation to run, depending on the refinement options.
At the end of the job, the BLAST Search Results dialog box opens, so that you can choose how
many of the homologs you want to use. You can do one of the following to select homologs:
When you have selected the homologs, click Incorporate Selected Rows to add the structures
to the sequence viewer.
• If you have a set of homologs in a file, click the Import button and choose Browse for file.
A file selector opens so you can locate and import the file.
• If you want to manually import structures from the PDB, click Import and choose From
PDB ID, then specify the PDB ID in the dialog box that opens.
• If the structures of the homologs are in the Workspace, click Import and choose From
Workspace.
To run the alignment job, click Align Homologs, or click the Multiple Alignment toolbar
button.
The alignment, which uses ClustalW, is usually very quick. A dialog box may be displayed
briefly before the results are incorporated. The alignment adds gaps in the sequence viewer as
necessary.
To align sequences manually, you can drag residues to the right or the left, to fill or create gaps.
If you have already created gaps that you want to preserve during another alignment, you can
click the Lock Gaps toolbar button. The gap symbol changes to - to indicate that the gap is
locked. To unlock the gaps again, click the Unlock Gaps toolbar button.
• Variability at position >/< N %—Select residues based on the percentage variability at the
residue position. You can apply a minimum or a maximum variability by choosing from
the option menu, and specify the percentage threshold in the text box.
• Variability at position >/< N residue types—Select residues based on the residue type vari-
ability at the residue position. Choose whether to apply a minimum or a maximum vari-
ability from the option menu, and specify the threshold for the number of residue types
that can vary in the text box.
• Ignore positions with group conservations—Ignore (do not select) residues that are
strongly conserved or that are either strongly or weakly conserved. Select the appropriate
option for strong conservation or both strong and weak conservation.
Strong conservation means that all residues at a particular position are in one of the fol-
lowing groups: STA, NEQK, NHQK, NDEQ, QHRK, MILV, MILF, HY, FYW. Weak
conservation means that all residues at a particular position are in one of the following
groups: CSA, ATV, SAG, STNK, STPA, SGND, SNDEQK, NDEQHK, NEQHRK,
FVLIM, HFY. These definitions are those used by ClustalW.
• Ignore parent sequence in applying above criteria—Ignore the parent (query) sequence
when applying the variability or conservation criteria. The selection is then based on the
variability in the homologs only.
• Parent residue different than N % homologs at position—Select residues for which the
parent residue is different from more than the specified percentage of homologs at the
residue position.
When residues are selected on the basis of homology, a default set of mutations is also defined.
The mutations for a given residue include all the variants found at that residue position.
• Solvent accessible surface area—Select this option to select residues by their solvent-
accessible surface area (SASA) relative to an isolated residue of the same type, and set a
threshold for the maximum or minimum allowed relative SASA. This option is useful for
locating surface or buried residues.
• Residue side chain makes no more than N interactions with protein—Select this option to
filter out residues whose side chains have multiple interactions with other protein resi-
dues, and set the maximum number of residues with which the side chain has interac-
tions.
• Residue side chain interacts/does not interact with molecule N—Select this option to
select residues by their interaction with a particular molecule. Choose whether to allow or
disallow the interaction from the option menu, and specify the molecule in the text box,
or select Pick and pick the molecule in the Workspace. This is useful for mutating resi-
dues near a ligand, or for mutating residues at protein-protein interfaces, for example.
Interactions are determined by a distance cutoff: any residue that has atoms within 4 Å of the
side chain is considered to interact.
If you want to examine results from a job that was completed earlier, you can open this panel
by choosing Tasks → Residue Scanning → View Results. You can then click Import to locate
the Maestro file (.maegz) that contains your results and import it into the current project.
The results are listed in the Mutations table. Each mutant is identified by the residue position
and the original and mutated residue names. The properties that were calculated for each
mutant are listed in the table. These properties are described briefly in Table 13.1. Some of
these properties are described in more detail in the sections below.
All properties are calculated after the refinement is performed, and so include relaxation of the
protein after mutation. The energy and stability properties are calculated for the ligand group
that you specified for the binding affinity calculations: the remainder of the protein is ignored.
You can sort the table columns by clicking on the column headings. You can plot any of these
properties against the mutation (table row) by choosing the property from the Graph property
option menu. If you want to export the table data as a CSV file, click Export, and navigate to a
location and name the file.
Property Description
You can select a region in the graph using the horizontal and vertical dashed lines, which can
be dragged to create the selection. The rows corresponding to the selected region of the graph
are highlighted in the table above, and the residues are highlighted in the Workspace.
If you select a table row, the view zooms in to the mutated residue. To display the original
structure, select Display original structure in grey. The parent protein is displayed and colored
grey. You can then see how the mutation is positioned in relation to the original residue.
ΔG1
R+L R·L
ΔG3 ΔG4
ΔG2
R+L' R·L'
where R is the receptor, L is the ligand in the parent, and L' is the mutated ligand. R+L and
R+L' represent the separated receptor and ligand. R·L and R·L' represent the receptor bound to
the ligand. The change in binding affinity is
Experiment measures ΔG1 and ΔG2, but it is ΔG3 and ΔG4 that are calculated, to optimize the
cancellation of error in the computational models. The calculations are done with Prime MM-
GBSA, which uses an implicit (continuum) solvation model.
ΔG1
L(u) L(f)
ΔG3 ΔG4
ΔG2
L'(u) L'(f)
where L(u) is the unfolded parent ligand, L(f) is the folded parent ligand, L'(u) is the unfolded
mutated ligand, and L'(f) is the folded mutated ligand. The change in stability is
Experiment measures ΔG1 and ΔG2, but it is ΔG3 and ΔG4 that are calculated, to optimize the
cancellation of error in the computational models. For the purpose of the model, the unfolded
ligand is represented as a tripeptide, Gly-X-Gly, where X is the residue that is mutated. The
assumption is that the remaining interactions in the unfolded state are negligible. The calcula-
tions are done with Prime MM-GBSA, which uses an implicit (continuum) solvation model.
Chapter 14
Disulfide bridges between cysteine residues add to the stability of a protein structure. Mutating
residues to form or break disulfide bridges offers a way of controlling the stability of a protein.
This chapter describes how to run a cysteine mutation calculation to locate and rank possible
disulfide bridges. The calculation is set up and run in the Cysteine Mutation panel, in the Run
tab, and the results are presented in the Results tab.
The protein must be analyzed to locate possible residue pairs that could be mutated to cyste-
ines, or to locate disulfide bridges that could be broken by mutation to other residues, which
you do by clicking Analyze Workspace.
Instead of analyzing an entire protein, you can analyze the Workspace selection. To do this,
select the desired residues in the Workspace structure, select Analyze only selected Workspace
residues in the Cysteine Mutation panel, then click Analyze Workspace.
If your structure has not been prepared for modeling, you are asked if you want to use the
Protein Preparation Wizard to prepare the structure. You cannot proceed if you do not have a
protein that is an all-atom, 3D structure with bonding information. After preparing the protein,
display it in the Workspace again and click Analyze Workspace.
When the structure has been analyzed, the Residue pairs for mutation table is filled in with all
the residue pairs that meet the criteria for forming or breaking a disulfide bridge. The criterion
for identifying potential cysteine pairs is a Cβ–Cβ distance between the residues that is less
than the distance specified in the panel. For Gly, the distance is taken from the alpha hydrogen.
Column Description
(Index) The first column contains the index of the residue pair. When the table is filtered
to show only certain residue pairs, this index remains the same (i.e. it is not a
table row number).
Type Mutation type, which can be one of the following:
X-X -> S-S: Mutation of two residues to Cys with formation of a disulfide bond.
S-X -> S-S: Mutation of one residue to Cys with formation of a disulfide bond to
a nearby cysteine
S-S -> S-X: Mutation of one Cys residue of a bonded pair to break a disulfide
bond.
Residue 1 Identity of the first residue in the pair, given by the chain letter, the residue num-
ber and insertion code, and the 3-letter residue name. For Cys-Cys pairs, this res-
idue is the residue that is mutated, so the pair is listed twice, in opposite order, to
allow selection of only one of the pair to mutate.
Residue 2 Identity of the second residue in the pair, given by the chain letter, the residue
number and insertion code, and the 3-letter residue name.
β-carbon Distance Distance in angstroms between the beta carbons (CB) of the two residues. In the
case of Gly, the distance is taken from the alpha hydrogen (HA) that would be
replaced by the beta carbon of the Cys.
The option menu contains individual residues, which you can select independently, an All
option to select all residues (except Pro), a None option to clear the list, and a Conserva-
tive option to select conserved residues (GLY, ILE, LEU, VAL, ALA). The residues that
you choose are displayed in the main part of the option menu; the complete list is shown
in a tooltip if it is too long. The table is updated for each selection that you make.
c. Use the Beta carbon distance cutoff text box
You can specify the maximum allowed distance between the beta carbons of the residues
in a pair. This distance is used to filter the table to show only residues with a smaller dis-
tance. For Gly, the distance is taken from the alpha hydrogen.
These display options allow you to restrict the list of residue pairs to those that are of interest,
so that it is easier to select the pairs in the table that you want to mutate. The job mutates only
the selected pairs in the table, so you must make a selection before you run the job.
If you have Cys-Cys pairs whose bond you want to break by mutating one of the cysteines to
another residue, you must also select the mutations, from the Cys -> X replacement residues
option menu. The option menu works in the same way as the X -> Cys replacement residues
option menu, described above.
• Residues within N Å—Optimize all residues that have atoms within the specified distance
of the mutated residue pair.
• Adjacent residues in sequence—Optimize the N residues next in the sequence on either
side of the residues in the pair, where N is selected from the option menu.
• None—Do not optimize any residues but the two residues in the pair.
When you have chosen the output option and named the job, click Start. The job is run locally.
The progress of the job is displayed in a status bar at the bottom of the panel.
The results of the mutation job are displayed in the Mutations table. The table columns are
described in Table 14.2.
Selecting a row in the table zooms in on the mutated pair in the Workspace. The mutated resi-
dues are displayed in ball-and-stick representation, and the rest of the structure uses a darker
color scheme. If you want to see the original residues as well, select Display original structure in
gray.
Column Description
Residues Chain name, residue number and insertion code of both residues in the mutated
pair.
Original 3-letter names of original residues in the pair.
Mutated 3-letter names of mutated residues.
Δ Ei Change in interaction energy between the residue pair and the rest of the protein
on mutation.
Δ Strain E Change in strain energy on mutation. The strain energy is the difference in internal
energy between the state of the residue pair in the protein and the relaxed state of
each residue or of the disulfide in the gas phase.
Δ Ei+Strain E Change in the sum of interaction energy and strain energy on mutation, equal to
Δ Ei + Δ Strain E.
Quality Classification of the quality of a predicted disulfide bond into Good, Medium,
Bad, based on the changes in interaction energy and strain energy on mutation.
Good Δ Ei < 0 and Δ Strain E < 20.
Bad Δ Ei > 40 or Δ Strain E > 40 or Δ Ei > 20 and Δ Strain E > 20
Medium Any other values
8. Select the pairs that you want to mutate in the table. Each pair is mutated independently
to produce a structure and an energy evaluation.
9. Decide whether you want to optimize residues near to the mutated residues by selecting
from the Gas phase optimization shell options. The mutated residues are always opti-
mized.
10. Click Start to run the mutation job. You can set the job name and decide where to send the
output in the dialog box that opens.
11. When the job finishes, go to the Results tab. If the results are not listed in the Mutations
table, click Load Results from Previous Run, navigate to and open the output Maestro file
from the job.
Chapter 15
The main task of antibody modeling in BioLuminate is to construct a homology model based
on a database of antibody templates. Once the model is constructed, it can be humanized if
required. A database of antibody templates is provided with BioLuminate, based on a new
analysis of antibodies in the PDB from 2010 [2], which you can modify or add to, or create
your own database. Antibody-antigen docking is covered in Section 10.5 on page 53.
The modeling is performed using the Antibody Prediction panel, which you open by choosing
Tasks → Antibody Modeling → Prediction in the main window.
The left part of the panel displays a diagram of the Fv region of an antibody. Clicking on the
light variable or heavy variable region in the diagram displays a menu, from which you can
choose the source of the sequence for this region. The choices are:
• Browse for File—Open a file browser in which you can navigate to the desired location
and select the file that contains the sequence.
• From Workspace—Use the sequence for the structure that is displayed in the Workspace.
Only one structure must be displayed.
• From Selected Entries in the Project Table—Use the sequence from the entry that is
selected in the Project Table. Only one entry must be selected.
• From PDB ID—Use the sequence from the specified PDB ID. Opens the Enter PDB ID
dialog box, in which you can enter the PDB ID of the sequence. The sequence is retrieved
from a local copy of the PDB if it is available, or from the RCSB web site, depending on
the preference set for PDB retrieval.
• Enter/Paste New Sequence—Type or paste in the sequence. Opens the Sequence Editor
dialog box, in which you can name the sequence and type it or paste it in, as a string of
single-letter codes.
Figure 15.2. The Antibody Prediction panel showing the import menu.
If you intend to model only part of an antibody and use the input structure for the rest, you
should ensure that you import a sequence that has an associated structure. This means that you
can choose any of the menu items except the last.
When the protein is read in, it is analyzed to find the chains and the loops. If there is more than
one possible chain that could be used (for example, in a dimer), a dialog box opens, in which
you can choose one of the chains. When the analysis finishes and you have chosen a chain if
requested, the region in the diagram is colored to indicate that the chain is assigned. When both
chains have been chosen, the text prompting you to import the two chains is no longer
displayed.
You can view the sequences in a sequence viewer at any time by clicking View Sequences.
You can choose one or more databases to use when modeling an antibody. The default is the
database in the installation. If you want to select the databases, click Configure Active Data-
bases, to open the Database Configuration dialog box. This dialog box has a table of databases
that you can choose from. You can do the following:
• Add a database to the list, by clicking Add Database and navigating to the database in the
file selector that opens.
• Select a database to use, by checking the check box in the Active column.
• Remove a database from the list, by clicking the button in the Actions column.
When you have finished modifying the list and choosing the active databases, click OK.
If you want to use input coordinates, select Input coordinates. The sequences that you imported
for the light and heavy regions must of course be associated with a structure, and an error
message is posted if there is no structure available.
If you want to use a homology model, select Homology. To run the search, click Search. When
the search finishes, the table in the lower part of the Framework Search tab is filled in with the
results, in order of their score. The table columns are described in Table 15.1. You can select a
single result in the table to use as the template. The table columns are described below.
Column Description
When you have selected a template or chosen to use input coordinates, click Accept to accept
the choice and move on to the next stage. The Basic Loop Model tab is displayed automatically,
after a short pause.
Figure 15.4. The Antibody Prediction panel after searching for templates.
First, choose a property from the Available properties list at the top of the panel. The list shows
the property name, the family (category) it belongs to, and the range of values, for numeric
properties. You can limit the list to a particular property family by choosing from the Show
family option menu. If you type in the Property text box, a completion list is displayed below it,
from which you can choose a property.
Once you have chosen a property, it is displayed in the Property text box. You can then use the
text boxes and menus to the right to define the restrictions on the values of this property. Click
Add to add the filter to the Filtering definitions and criteria list. You can add multiple criteria or
definitions, and each of them is applied to the databases. The number of structures in the data-
base that match the filters is reported at the end of the list. Property values are case-sensitive,
so for example 1FSK does not match 1fsk.
If you want to filter on multiple properties, you can choose another property, set up the restric-
tions on its value, and click Add to add the filter to the list. The filters are cumulative (implicit
AND), so the resulting structures are those that match both filters.
As an example, to include structures with properties in a specified range (all values from a
minimum to a maximum inclusive):
As another example, to filter our structures for which a property has values in a given range:
The six CDR loops are listed in a table that provides check boxes for selecting the source of
input coordinates. To choose the source, check the appropriate box. By default all loops are
predicted from the database. Accepting the default loop assignments is usually the best option.
If you want to choose individual clusters for one or more loops, click View Clusters. This opens
the Loop Clusters dialog box, which shows you the clusters for each loop, and allows you to
select a cluster to use in the model. It also allows you to set a similarity cutoff for the loop to be
used in the model. The procedure is described in more detail in Section 15.1.6 on page 98.
Figure 15.6. The Antibody Prediction panel, Basic Loop Model tab.
Otherwise, the selection of the loop is done by sorting the clusters by the size of the cluster,
then locating the largest cluster in the list that has a member whose loop similarity to that of the
query is greater than the loop similarity cutoff. The cluster member with the greatest similarity
to the query is the one that is used to build the loop.
You can build more than one model for the structure, by setting the desired number in the
Number of models text box. If more than one model is requested, a series of diverse models is
returned. The first model returned is usually the most likely to be correct.
To generate the loop model or models, click Generate Initial Models. These are called “initial
models” because you might want to refine the models, particularly the H3 loop. To assess the
quality of a generated model, you can examine it in the Workspace (click View in Workspace),
or analyze it in the Structure Quality Viewer (click View in Structure Quality Viewer). If you
have multiple models, you can choose them from the Model to view option menu.
When you view a model in the Workspace, it is colored by residue with the following color
scheme:
• Blue—residues for which the full residue conformation was copied from the template.
• Cyan—residues for which the residue backbone conformation was copied from the tem-
plate, and the side chain was modeled (because there was a residue mutation in the tem-
plate relative to the query).
• Red—residues for which both the backbone and the side chain were modeled.
• Maroon—residues in the CDR loops.
When you view the models in the Protein Structure Quality Viewer, all models are listed in the
table at the top of the viewer, and the structures are colored in the Workspace according to the
Ramachandran plot regions.
When this dialog box is first opened, the antibody databases are searched for loops of the same
length as those in the query sequence for each of the six loops, and the loops are clustered
structurally. The dialog box may therefore take a short while to open. Subsequently these loops
and clusters are reused, so opening is much faster.
The clusters for the loop chosen from the Loop option menu are listed in the table, in order of
decreasing similarity to the query sequence. The list is restricted to the clusters that have a loop
whose similarity to the query loop is greater than the cutoff specified in the Minimum similarity
box. The table columns are described in Table 15.2.
You can set the minimum acceptable similarity between a loop from the database and the query
loop in the Minimum similarity box. Only the clusters that contain at least one loop whose simi-
larity to the query is greater than this threshold are used in building the model, whether in the
default automatic procedure or by manual selection of a cluster.
If the minimum cluster similarity is changed, this change applies to all loops. If no cluster is
found with a loop that has the minimum similarity, then the program automatically uses the
template loop in the database that has the greatest similarity to the query, no matter which
cluster it belongs to.
Column Description
Source PDB ID of the member of the given cluster that has the highest similarity to the
query. The template used for the framework region is also included in the list.
Max Similarity Similarity of the member of the cluster that has the highest similarity to the query.
Sequence Sequence of the query for this loop (top) and of the most similar cluster member
(bottom). Residues that differ are marked in red.
Members Number of members of the cluster.
The H3 loop, however, is often difficult to predict, and may need more advanced refinement.
You can carry out advanced loop refinement using the Advanced Loop Model tab. From this
Figure 15.8. The Antibody Prediction panel, Advanced Loop Model tab.
tab, you can use Prime protein refinement to re-predict selected loops. If you generated more
than one model, you should select the model you want to use from the Model to view option
menu before leaving the Basic Loop Model tab.
Some general guidelines for when to refine a loop with Prime in the Advanced Loop Model tab
are given below:
• If the loop is long (9 residues or more) and the homology to the template is good (similar-
ity above about 80%) then refinement with Prime is not usually necessary.
• If the loop sequence similarity is less than 40%, the basic model quality usually is very
poor and a Prime refinement is recommended.
• The quality of the Prime refinement is greater for shorter loops.
• When building a new H3 loop with new sequences on the native (crystal) structure of an
antibody, Prime refinement is recommended.
Accurate and detailed loop predictions using Prime can take hours for each loop. It is not
usually necessary to run an advanced loop prediction for any of the loops except the H3 loop.
The H3 loop is selected by default for an advanced prediction, as it is the hardest loop to
predict. The results are returned to the Project Table as new structures.
The input structure for the loop prediction is always the structure that came from the Basic
Loop Model tab. This means that you cannot do sequential loop predictions in this panel, but
you can use the Refinement panel to predict more than one loop (Tasks → Loop + Sidechain
Prediction). See Chapter 6 of the Prime User Manual for more information on refinement
tasks.
To choose the loop for prediction, select it in the table. Click Run Prime to run a Prime loop
prediction job for the selected loop. A dialog box opens, in which you can name the job, select
a host, and specify the maximum number of processors to use. The single, best loop prediction
is returned. The Prime loop prediction algorithm is automatically selected based on the length
of the loop.
The same controls as in the Basic Loop Model tab are present for viewing the structure.
15.1.8 Summary
The basic procedure for running a prediction using a homology model is as follows:
1. Import the sequences for the light and heavy chains: Click the part of the diagram for the
region you want to import, and choose a source from the list that is displayed.
2. Set up the antibody databases that you want to use to search for homologs for the frame-
work region and the loops:
3. (Optional) Click Configure Active Databases in the Framework Search tab to add data-
bases and select them in the table. The default database is the one from the installation.
4. (Optional) Click Define Search Limits to filter the databases by properties of the struc-
tures in the databases.
5. Select Homology and run the homology search by clicking Search.
6. Select the homolog in the results table that you want to use for the framework region, and
click Accept.
7. (Optional) In the Basic Loop Model tab, select the loop cluster you want to use for each
loop in the model by clicking View Clusters, and choosing a cluster in the Loop Clusters
panel. By default a cluster is chosen automatically, based on cluster size and a minimum
similarity criterion.
8. (Optional) Set the number of models of the antibody that you want to generate.
9. Click Generate Initial Models to generate the models.
10. (Optional) If you think that the H3 loop needs further refinement, select it in the
Advanced Loop Model tab, and click Run Prime.
You can also use coordinates from existing structures instead of homology models, for
example if you want to vary just one of the loops. The procedure is similar to that given above.
1. Import the sequences for the light and heavy chains: Click the part of the diagram for the
region you want to import, and choose a source from the list that is displayed.
2. Set up the antibody databases that you want to use to search for homologs for the frame-
work region and the loops:
3. (Optional) Click Configure Active Databases in the Framework Search tab to add data-
bases and select them in the table. The default database is the one from the installation.
4. Select Input coordinates and click Accept.
5. (Optional) In the Basic Loop Model tab, select the loop cluster you want to use for each
loop in the model by clicking View Clusters, and choosing a cluster in the Loop Clusters
panel. By default a cluster is chosen automatically, based on similarity, then cluster size.
6. (Optional) Check the boxes for the loops for which you want to use input coordinates.
7. (Optional) Set the number of models of the antibody that you want to generate.
8. Click Generate Initial Models to generate the models.
9. (Optional) If you think that the H3 loop needs further refinement, select it in the
Advanced Loop Model tab, and click Run Prime.
The mutations can be set up and run in the Humanize Antibodies panel, which you open by
choosing Tasks → Antibody Modeling → Humanize in the main window. The panel has two
tabs, one for setting up the criteria for choosing residues to mutate, and one for selecting the
residues and defining the mutants. When both these tasks are done, you can start the job to
mutate the residues.
Once the analysis is complete, the sequence is shown in the panel’s sequence viewer, colored
by residue type, with its secondary structure assignment and disulfide bond annotation. You
can choose which chain to display by using the Show in viewer options. This sequence is called
the “parent” sequence.
The path to the database that will be used for the search is displayed in the (noneditable) Anti-
body database text box. If you want to change the database, click Browse and navigate to the
database. The default database is a database of human antibody data.
To find the homologs, click Search Antibody Database for Homologs. The progress of the
search is shown in the status area at the bottom of the panel. When the search is done, the
homologs must be aligned to the parent sequence, so that selection of residues for mutation can
be done on the basis of matching residue positions. Click Align Homologs to perform the
multiple sequence alignment of the homologs to the parent.
Criteria that are based on homology use the variations in the residues at each residue position
among the homologs, or between the homologs and the parent. The variations found are used
as a basis for choosing default mutations. The criteria that you can set are:
• Variability at position >/< N %—Filter residues based on the percentage variability at the
residue position. Choose whether to apply a minimum or a maximum variability from the
option menu, and specify the percentage threshold in the text box.
• Variability at position >/< N residue types—Filter residues based on the residue type vari-
ability at the residue position. Choose whether to apply a minimum or a maximum vari-
ability from the option menu, and specify the threshold for the number of residue types
The criteria based on the 3D structure of the parent antibody include solvent-accessible surface
area (SASA) and interactions between residues. Interactions are determined by a distance
cutoff: any residue that has atoms within 4 Å of a given side chain is considered to interact
with it.
• Solvent accessible surface area—Select this option to filter residues by their solvent-
accessible surface area (SASA) relative to an isolated residue of the same type, and set a
threshold for the maximum or minimum allowed relative SASA. This option is useful for
locating surface (or buried) residues.
• Residue side chain makes no more than N interactions with protein—Select this option to
filter out residues whose side chains make multiple interactions with the protein, and set
the maximum number of such interactions.
• Residue side chain interacts/does not interact with molecule N—Select this option to filter
residues by their interaction with a selected molecule. Choose whether to allow or disal-
low the interaction from the option menu, and specify the molecule in the text box, or
select Pick and pick the molecule in the Workspace.
At the top of the Residues tab is a table that lists all the residues in the Workspace. The first
table column contains check boxes that you can use to select residues for mutation. The resi-
dues that were selected in the Humanization Criteria tab are already selected for mutation. The
second column identifies the residues. The third column specifies the mutations of the residues
that are selected for mutation.
To select other residues for mutation, you can select the check box in the Mutate column, or
you can select table rows and click Check Mutation Box for Selected Residues.
To define the mutations for a given residue, click in the Mutations column. A menu is displayed
in the column, from which you can select one or more residues to mutate to, or select groups of
residue types. When you make a selection from the list, the residue is checked in the list and is
added to the text box at the top of the menu. To close the list, click the arrow button at the right
of the text box, or click in some other location. To hide the menu, click somewhere else in the
table (for example, in the Residue column). The residue is then selected for mutation if it is not
already selected.
To define a common set of mutations for multiple residues, first select the residues in the table.
Then choose the residues from the For all selected rows, set mutations to menu, and click
Apply. This menu works in the same way as the menu in the Mutations column of the table.
By default, a set of mutations based on the variations seen in the homologs is chosen for the
residues you selected in the Humanization Criteria tab.
The number of mutations is reported below the Check Mutation Box for Selected Residues
button.
There are several methods available for refinement of the mutated (and other) residues, on the
Refinement menu:
• Gas-phase minimization—Minimize the energy of the residues using the internal force-
field minimizer. This is the fastest method, but does not include solvation effects.
• Implicit solvent minimization—Minimize the energy of the residues with an implicit (con-
tinuum) solvation model, using Prime.
• Side-chain prediction—Prior to minimization, a thorough exploration of possible side
chain conformations is performed.
• Side-chain prediction (cbeta)—Prior to minimization, a thorough exploration of possible
side chain conformations, including the CA-CB torsion, is performed.
• Side-chain prediction (bb)—Prior to minimization, a thorough exploration of both poten-
tial side chain and potential backbone conformations is performed.
With the exception of the first, the refinements are run with the Prime protein modeling
program.
As part of the refinement, you can calculate the binding affinity of selected chains to the rest of
the Workspace structure, by selecting Calculate binding affinity. The calculation is performed
with the MM-GBSA method in the Prime program. This option is not available with gas phase
minimization.
The chains are treated as the ligand (which in this situation can be a structure of any size, such
as an antibody). By default, the heavy and light chains of the antibody are selected, so the
binding affinity of the antibody to the rest of the Workspace is calculated, such as an antigen. A
choice of chains to treat as the “ligand”, derived from the analysis of the Workspace structure,
is offered in the option menu.
The Mutations table lists all the mutations that were generated, along with the changes in a
range of properties as a result of the mutation. The properties include SASA (total, polar, and
nonpolar), pKa, hydropathy, number of rotatable bonds, energy, potential energy, and stability.
These quantities are defined in Table 13.1 on page 79. You can sort the table columns by
clicking on the column heading. You can plot any of these properties against the mutation
(table row) by choosing the property from the Graph property option menu. If you want to
export the table data as a CSV file, click Export, and navigate to a location and name the file.
You can select a region in the graph using the horizontal and vertical dashed lines, which can
be dragged to create the selection. The rows corresponding to the selected region of the graph
are highlighted in the table above, and the residues are highlighted in the Workspace.
If you select a table row, the view zooms in to the mutated residue. To display the original
structure, select Display original structure in grey. The parent antibody is displayed and colored
grey. You can then see how the mutation is positioned in relation to the original residue.
15.2.9 Summary
1. Display the structure in the Workspace and analyze it, to identify residues and antibody
features (Analyze Workspace).
2. Load an antibody database that is used to search for homologs.
3. Run the search (Search Antibody Database for Homologs) and align the homologs (Align
Homologs).
4. (Optional) Select an option for the chain to show, and examine the alignment for that
chain.
5. Choose the regions that you want to substitute residues in.
6. Specify the criteria for automatic selection of residues, in the Selection by homology crite-
ria and Parent structure 3D criteria sections.
7. Click Select in the Residues Tab to select the residues that meet the criteria.
8. In the Residues tab, make any changes to the residues marked for mutation.
9. Choose the mutations for these residues by clicking in the Mutations column of the table
and selecting from the option menu that is displayed.
A default selection is included for the automatically selected residues that is based on the
variations seen in the homologs.
10. Click Start, make job settings, and start the job.
The mutated structures are incorporated into the project as new entries, and the Humanize
Antibody Viewer opens.
Structures that are added to a database are automatically characterized and curated. Antibody
structures and the light and heavy chains are identified using a similarity search against known
antibodies, and structures that do not meet the similarity criteria are rejected. Then the constit-
uent regions of the antibody chains, including the framework region, as well as the six hyper-
variable loops, are identified and annotated for use in subsequent predictions.
When you first open the panel, the default database is loaded. Normally this database is
installed by an administrator and you do not have write privileges, so it is opened read-only.
You can import this database into your own database if you want, as described below. It is only
necessary to do this if you want to modify the data in some way, because you can specify
multiple databases for modeling.
To open a database, or to create a new database, click Open/Create. A file selector opens, and
you can navigate to the desired location. If you want to open a database, select the database
from the list of files. It should have the extension .db. If you want to create a new database,
enter the name in the File name text box.
The structures in the database are shown in the antibody table. You can restrict the structures
that are listed in the table by searching for strings in the table and only showing the rows that
contain the string. By default, all visible text is searched, but you can change it by clicking
Select and choosing the columns you want to search on. To return to sorting all visible
columns, click Reset. The text box has a tool tip that explains the syntax for the search string,
which can include relational operators, wild cards, regular expressions, and some special
terms.
The table shows only a few columns by default. If you want to see all the columns, by Show
columns, select All. The full set of columns includes the identity (residue range) and length of
each loop, and a range of information from the originating PDB structure. If you want to export
the information in the columns to a CSV file, click Export Table, and navigate to a location and
name the file in the file selector that opens.
To delete structures from the database, select them in the table and click Delete Selected Rows.
If you want to clear the entire database, click Delete All Data. You should exercise care when
deleting rows or all data, as these functions are not reversible.
Structures can be added to the database from several sources, represented by buttons in the Add
structures to database section. In each case, the imported data is automatically processed for
you: the antibody chains are identified, the constituent regions of the chains are determined,
and all the pertinent information required for subsequent modeling is saved in a rapidly
accessed format. You need only supply the antibody structures in PDB (or Maestro) format.
The structures are filtered to ensure that only those structures that have characteristics of anti-
bodies are included in the database. You can alter the criteria for filtering structures in the Anti-
body Database - Advanced Options dialog box, which you open by clicking Advanced Options.
You can also select the file types that are presented when importing structures from files.
In the Minimum sequence identity section, you can specify cutoffs for determining whether a
structure should be included in the database, based on percentage sequence identity. If no chain
in the structure passes the tests, it is not included in the database. A chain must meet the FR
threshold and either the VL or VH threshold to be included in the database. If a chain meets
only one of the VL or VH thresholds, it is only included as a light or heavy variable chain.
For the framework regions, you can specify which of the four framework regions to use in the
similarity analysis, by selecting the options for FR1, FR2, FR3, or FR4 in the Framework
regions used in similarity analysis section.
References
Getting Help
• The docs folder (directory) of your software installation, which contains HTML and
PDF documentation. Index pages are available in this folder.
• The Schrödinger web site, http://www.schrodinger.com/, particularly the Support Center,
http://www.schrodinger.com/supportcenter, and the Knowledge Base, http://www.schro-
dinger.com/kb.
To get information:
• Pause the pointer over a GUI feature (button, menu item, menu, ...). In the main window,
information is displayed in the Auto-Help text box, which is located at the foot of the
main window, or in a tooltip. In other panels, information is displayed in a tooltip.
If the tooltip does not appear within a second, check that Show tooltips is selected under
General → Appearance in the Preferences panel, which you can open with CTRL+, (,).
Not all features have tooltips.
• Click the Help button in a panel or press F1 for information about a panel or the tab that is
displayed in a panel. The help topic is displayed in your browser.
• Choose Help → Online Help or press CTRL+H (H) to open the default help topic in your
browser.
• When help is displayed in your browser, use the navigation links or search the help in the
side bar.
• Choose Help → Manuals Index, to open a PDF file that has links to all the PDF docu-
ments. Click a link to open the document.
• Choose Help → Search Manuals to search the manuals. The search tab in Adobe Reader
opens, and you can search across all the PDF documents. You must have Adobe Reader
installed to use this feature.
E-mail: [email protected]
USPS: Schrödinger, 101 SW Main Street, Suite 1300, Portland, OR 97204
Phone: (503) 299-1150
Fax: (503) 299-4532
WWW: http://www.schrodinger.com
FTP: ftp://ftp.schrodinger.com
Generally, e-mail correspondence is best because you can send machine output, if necessary.
When sending e-mail messages, please include the following information:
4. Click Create.
An archive file is created in your working directory, and an information dialog box with
the name of the file opens. You can highlight and copy the name of the file.
5. Attach the file specified in the dialog box to your e-mail message.
6. Copy and paste any log messages from the window used to start Maestro (or the job) into
the email message,or attach them as a file.
• Windows: Right-click in the window and choose Select All, then press ENTER to
copy the text.
• Mac: Start the Console application (Applications → Utilities), filter on the applica-
tion that you used to start the job (Maestro, BioLuminate, Elements), copy the text.
If Maestro failed:
1. Open the Diagnostics panel.
• Windows: Start → All Programs → Schrodinger-2012 → Diagnostics
• Mac: Applications → Schrodinger2012 → Diagnostics
• Linux/command line: $SCHRODINGER/diagnostics
2. When the diagnostics have run, click Technical Support.
A dialog box opens, with instructions. You can highlight and copy the name of the file.
3. Attach the file specified in the dialog box to your e-mail message.
4. Attach the file maestro_error.txt to your e-mail message.
This file should be in the following location:
• Windows: %LOCALAPPDATA%\Schrodinger\appcrash
(Choose Start → Run and paste this location into the Open text box.)
• Mac: Documents/Schrodinger
• Linux: Maestro’s working directory specified in the dialog box (the location is
given in the terminal window).
5. On Windows, also attach the file maestro.EXE.dmp, which is in the same location as
maestro_error.txt.
SCHRÖDINGER ®