![]() |
Protein fold estimation from NMR distance restraints |
PRIDE-NMR Server descriptionThe PRIDE-NMR Server compares NOE data sets to H-H distributions back-calculated from protein 3D coordinates. The basic idea is similar to the PRIDE method, originally described in: Carugo O and Pongor S, Protein fold similarity estimated
by a probabilistic approach based on Cα-Cα distance comparison,
J. Mol. Biol (2002) 315:887-898
The PRIDE-NMR approach is described in: Angyan A, Perczel A, Pongor S and Gaspari Z, Fast protein fold estimation from NMR-derived distance restraints, Bioinformatics (2008) 24:272-275. Outline
The PRIDE-NMR method compares a single input histogram to those stored in a database. The input histogram is generated from the NOE dataset, where the bins representing different sequential distances contain the number of intrabackbone NOE restraints. The backbone is defined here as containing the amide H, Hα and Hβ atoms. This is necessary to render the method largely sequence-independent. The histograms in the database are similar, back-calculated H-H distributions using cutoff distances of 5, 6 and 7 Ångstroms. The comparison of the input and database histograms is done with contingency analysis analogously as described in Carugo and Pongor, 2002. The resulting probability is the PRIDE-NMR Score. It is possible to use multiple cutoff distances for the database search, in this case the scores obtained with different cutoffs will be averaged.
The server expects a NOE restraint list in X-PLOR / CNS format. (There are scripts to convert other formats into this, e.g. from xeasy .upl format you may use the upl2tbl.awk script by Eiso). Best recognized are files that contain one-line assign records, although every effort was made to read in X-PLOR / CNS datasets we found in databases. However, it might be that your file is not acceptable by the server - if you feel that it should be, please send an e-mail to Zoltán Gáspári. By definition, a high PRIDE-NMR score means high similarity between the NOE pattern and the back-calculated H-H distance distribution. However, even true positive hits may get a relatively low score (below 0.5). Nevertheless, in an ideal case (when the number of restraints per residue is sufficiently high and the pattern is informative), the first 10 hits should contain more than one homolog of the query protein (if any in the database). If you know to what SCOP family/superfamily your protein belongs and no positive hits are obtained, it may be worth to try several non-default options, i.e. turn on length filtering or choose a minimum sequential distance larger than 3. If you still do not get positive hits, than it might well be that the information content of the NOE restraints is not sufficient to represent the structure in question. Evaluation of the PRIDE-NMR methodDetailed evaluation data are available in xls format: pridenmreval.xls. These data correspond to the results described in the PRIDE-NMR paper. |
| Queries related to the PRIDE-NMR method should go to Zoltán Gáspári (szpari at chem.elte.hu) |