Q-SiteFinder     Ligand Binding Site Prediction

Help file

Contents
Description of Q-SiteFinder
Description of Pocket-Finder
Identification of Ligands
CHIME Interface
Java-Mage Interface
Prediction Accuracy
Manipulation of Output Using Rasmol
Citing Q-SiteFinder

Description of Q-SiteFinder
Q-SiteFinder is a new method of ligand binding site prediction. It works by binding hydrophobic (CH3) probes to the protein, and finding clusters of probes with the most favourable binding energy. These clusters are placed in rank order of the likelihood of being a binding site according to the sum total binding energies for each cluster.

Description of Pocket-Finder
Pocket-Finder is a pocket detection algorithm based on Ligsite written by Hendlich et al (1997). Pocket-Finder works by scanning a probe radius 1.6 angstoms along all gridlines of a grid resolution 0.9 angstroms surrounding the protein. The probe also scans cubic diagonals. Grid points are defined to be part of a site when the probe is within range of protein atoms followed by free space followed by protein atoms. Since the protein is scanned in seven directions, each grid point can be defined to be part of a site up to seven times. Grid points are only retained if they are defined to be part of a site at least five times. Pocket-Finder uses the same interface as Q-SiteFinder.

Identification of Potential Ligands
Both Q-SiteFinder and Pocket-Finder allow you to upload a PDB file or select one from the Protein Database. The proteins are initially scanned for ligands. If ligands are allowed to remain on the protein, they may interfere with binding site prediction. Potential ligands are identified. The option is available to retain atoms as protein (this would be applicable to covalently bound residues such as peripheral carbohydrates), treat the atoms as ligands or discard them from analysis completely. If atom groups are selected as ligands, the program analyses the precision of binding site prediction for these ligands.

CHIME Interface
The CHIME-based user interface.  Key to image: A (right): CHIME showing molecule. B (top middle): Selection list for the different predicted sites.  C (top left):  Option menu.  D (upper-middle left):  Coordinates of binding box around selected sites.  E (lower-middle left):  Site information.  F (bottom left):  Atoms of the predicted site

CHIME is a molecular graphics plugin that allows an Internet Browser to display PDB files. CHIME is freely available for download. Some computers (such as those running UNIX) are not currently able to use CHIME. Q-SiteFinder results can be downloaded and viewed in Rasmol. Instructions are given below.

The following information refers to the figure above.

  1. The protein is displayed using the CHIME plug-in. Initially, the first predicted site (turquoise) and any ligands (yellow) are displayed. To rotate the molecule, click and drag with the left mouse button. To zoom in or out, hold down the shift key and click and drag with the left mouse button. To rotate the molecule in a plane, hold down the shift key and click and drag with the right mouse button. To translate the molecule, hold down the control (ctrl) key and click and drag with the right mouse button.

  2. Predicted binding site selection is colour-coded according to the likelihood of being an actual binding site. Green is the most likely, followed by blue, purple and orange/brown.

  3. This is the main option window. All of the predicted sites can be selected or deselected. CPK colours can be toggled on and off. Ligands can be hidden. The sites can be shown in the probe-cluster form, or all the residues close to the probe sites can be displayed. The PDB file can be downloaded (instructions for manipulation using Rasmol are given below).

  4. This box displays the coordinates that define the binding-box around selected sites. This can be used as input for a docking program to ensure that ligands are only docked onto into the binding site. In the example shown, the binding site consists of predicted sites 1, 2 and 8.

  5. This box gives information about the selected predicted binding site. It gives an estimate of cavity volume, protein volume and also shows the binding precision of known ligands.

  6. All of the protein atoms close to a probe-cluster defining a site are shown here. The atom number, atom type, residue name, chain identifier and residue number are given. Each entry is clickable, and causes the selected atom to be displayed in CPK colours. They are invisible if the predicted sites are being displayed in CPK colours. These selected atoms are persistent, and are not affected by selection or deselection of sites. They are only deselected by clicking on 'hide atoms'.

Java-Mage Interface
The Java-based user interface.

Mage is a Java-based molecule viewer developed by the Richardson lab. To use Mage, you will need the Java 2 runtime environment. This is freely available from http://java.sun.com. The following information refers to the figure above.

  1. The protein is displayed using Mage. Initially, the first predicted site (turquoise) and any ligands (yellow) are displayed. To rotate the molecule, click and drag with the left mouse button. Predicted binding site selection is colour-coded according to the likelihood of being an actual binding site. Green is the most likely, followed by blue, purple and orange/brown.

  2. To zoom in or out, drag the button left or right. To display predicted sites as either favourably binding methyl probes, or residues close to the probes, click on the appropriate buttons.

  3. This is the information selection window. Note that unlike the CHIME interface, predicted site information must be selected separately from site selection in Mage. The PDB file can be downloaded (instructions for manipulation using Rasmol are given below).

  4. This box displays site information, including precision, estimated site volume, estimated protein volume and the coordinates that define the binding-box around selected sites. Binding box coordinates can be used as input for a docking program to ensure that ligands are only docked onto into the binding site.

  5. All of the protein atoms close to a probe-cluster defining a site are shown here. The atom number, atom type, residue name, chain identifier and residue number are given. These entries are not clickable, unlike for the CHIME interface.

Prediction Accuracy
Success rates

This data was obtained using 134 PDB entries that correspond to the GOLD data set. We defined success as having a precision of at least 25%. Precision is defined as the percentage of probe sites defined by a single cluster that are within 1.6 angstoms of an atom of a particular ligand. It is a measurement of how well the predicted site maps onto the ligand coordinates. When a single ligand gives a success in two separate sites, only the higher ranking site has been counted, since they are part of the same binding site. If two ligands are found in the same predicted binding site, only one success is counted. Separate ligands in separate sites are counted as successes separately.

Manipulation of Output Using Rasmol
The option of downloading the output for display in Rasmol has been made available. To select specific groups of atoms type "select XXX", where "XXX" is the group identifier. To visualise these atoms, type (or click on) "spacefill". To hide a selected group, type "select XXX" followed by "wireframe". The codes for the sites are as follows:

Ligand atoms

LIG

Sites defined by protein atoms

ZAA - Predicted site 1
ZAB - Predicted site 2
ZAC - Predicted site 3
ZAD - Predicted site 4
ZAE - Predicted site 5
ZAF - Predicted site 6
ZAG - Predicted site 7
ZAH - Predicted site 8
ZAI - Predicted site 9
ZAJ - Predicted site 10

Sites defined by probe clusters

YAA - Predicted site 1
YAB - Predicted site 2
YAC - Predicted site 3
YAD - Predicted site 4
YAE - Predicted site 5
YAF - Predicted site 6
YAG - Predicted site 7
YAH - Predicted site 8
YAI - Predicted site 9
YAJ - Predicted site 10

Citations

Laurie AT, Jackson RM (2005). Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites. Bioinformatics, 21: 1908-1916 Pubmed

In this publication, Q-SiteFinder is compared with Pocket-Finder. Q-SiteFinder has also been used in the following study of protein-protein interactions.

Burgoyne NJ, Jackson RM (2006). Predicting protein interaction sites: binding hot-spots in protein-protein and protein-ligand interfaces. Bioinformatics, 22: 1335-1342 Pubmed

Contacts

If you wish to know more about Q-SiteFinder please contact us.

BBSRC Logo
University of Leeds Logo

 

Content on this page is maintained by Jon Fuller