Evaluating template-based and template-free protein–protein complex structure prediction

Vreven, Thom; Hwang, Howook; Pierce, Brian G.; Weng, Zhiping

doi:10.1093/bib/bbt047

Abstract

We compared the performance of template-free (docking) and template-based methods for the prediction of protein–protein complex structures. We found similar performance for a template-based method based on threading (COTH) and another template-based method based on structural alignment (PRISM). The template-based methods showed similar performance to a docking method (ZDOCK) when the latter was allowed one prediction for each complex, but when the same number of predictions was allowed for each method, the docking approach outperformed template-based approaches. We identified strengths and weaknesses in each method. Template-based approaches were better able to handle complexes that involved conformational changes upon binding. Furthermore, the threading-based and docking methods were better than the structural-alignment-based method for enzyme–inhibitor complex prediction. Finally, we show that the near-native (correct) predictions were generally not shared by the various approaches, suggesting that integrating their results could be the superior strategy.

protein–protein structure, template-based prediction, protein–protein docking, ZDOCK, PRISM, COTH

INTRODUCTION

The interaction between pairs of proteins is critical in many biological processes, including enzyme inhibition, signaling pathways and the immune response. Although experiments have shown that most proteins interact with at least one other protein, the determination of atomic resolution structures of protein–protein complexes is laborious and not always successful. As an alternative to experimental approaches, computational algorithms have been developed to predict the bound structures of protein–protein complexes. These computational approaches can be divided into two main classes of algorithms—template-free or docking [1–12] and template-based [13–28]. The various approaches for predicting protein–protein complex structures were recently reviewed by Tuncbag et al. [29]. The docking approaches start with the unbound structures of the component proteins, which are typically obtained using x-ray crystallography or nuclear magnetic resonance (NMR), but can also be built using homology modeling. The translational and rotational space is then searched for favorable binding orientations. Searching the 6-dimensional space is computationally expensive, and often carried out with rapidly computable scoring functions and efficient grid-based search algorithms such as fast Fourier transform (FFT) [1, 4–6] or geometric hashing [3]. In contrast with docking approaches, which are based on the physical properties of the component proteins for the prediction, template-based algorithms use similarities with known complex structures for generating the prediction. A wide spectrum of template-based methods has been introduced in recent years, often with components adopted from monomer protein structure prediction approaches. The methods differ in the way similarity is defined, which can be based predominantly on sequence identity [19, 20], sequence-structure ‘threading’ [13, 26] or structural alignments, the last often for the interfacial regions only [22, 25, 27, 28].

Each approach has its strengths and weaknesses, with template-based approaches critically depending on template availability, and docking approaches being sensitive to conformational changes upon binding. In this work, we examined the strengths and weaknesses of the respective approaches, and identify aspects that affect the likelihood of success. Specifically, we compared the ZDOCK [7, 9, 30] algorithm for protein–protein docking that was developed in our lab with two template-based algorithms: COTH by Mukherjee and Zhang [26] and PRISM by Gursoy, Keskin and co-workers [25, 27]. COTH and PRISM represent two main approaches in template-based complex structure prediction, with COTH requiring sequences only and using threading to build the predictions, and PRISM relying on structural alignments of the interface regions to select the templates.

Finally, we assess the availability of templates in the Protein Data Bank (PDB) [31] for protein–protein complexes. The relationship between sequence identity and binding modes was addressed earlier by Aloy et al. [32]. Similar work on template availability, but with a focus on the structural space of the PDB, has recently been published by others [33, 34]. We developed a protocol (ZTEM) that uses sequence alignment and structural alignment to determine templates that match native complexes. ZTEM and COTH have in common that the binding partners are aligned (or threaded) globally, whereas PRISM performed local alignments.

METHODS

Dataset

For testing the algorithms, we used the complexes from a protein–protein docking benchmark developed earlier by our lab [35]. The latest version of the benchmark contains the bound and unbound structures of 176 protein–protein complexes, and is nonredundant at the SCOP family level [36]. We classify the complexes based on biochemical function and docking difficulty. According to biochemical function, we have 52 enzyme–inhibitor complexes, 25 antibody–antigen complexes and 99 other complexes (referred to as the ‘others’ category). Judged by docking difficulty, 121 are rigid-body, 30 are of medium difficulty and 25 are difficult.

Template-based methods are not suitable for antibody–antigen complexes. Multiple antibodies, which differ only in their complementarity-determining loops, can recognize a variety of epitopes on an antigen; thus, template-based approaches would result in false positives. Therefore, we excluded the antibody–antigen complexes from the analyses in this work.

COTH

COTH takes sequences as input and generates predictions for the complex structures based on threading a sequence onto a template structure [26]. COTH follows a two-stage procedure. In the first step, the sequences of both component proteins are threaded using a library of nonredundant complex templates. This yields a selection, typically 10, of templates that describe potential binding modes. In the second step, the sequences of the monomers are threaded separately, using a library of monomer templates. This yields a prediction for each monomer, which is then superposed onto the complex templates. To generate the COTH predictions, we used the Web server described in [26].

PRISM

PRISM takes structures of the unbound component proteins as input, and performs structural alignments of the surfaces of the monomers with a library of binding-interface templates. The library is constructed from the PDB complex structures and nonredundant. To improve predictive accuracy, the alignment results are subjected to various filters, such as a threshold for the alignment root mean square deviation (RMSD), a minimum number of matching residues and residue pairs between the template and the predicted structures, clashing thresholds and a matching template hotspot residue in the prediction. After the alignment and filtering, the predictions are refined and scored using FiberDock [37]. To generate the PRISM results, we used the collection of scripts provided by the authors [25], in combination with the required external programs [15, 37–39]. We used the larger of the two available template libraries, which contains 7922 interfaces.

ZDOCK

For the docking algorithm, we used ZDOCK3.0 [30], which was developed in our lab and includes the IFACE statistical pair potential [30]. ZDOCK is a grid-based rigid-body approach that uses FFT, and samples the three Euler angles with 6° or 15° spacing and the three translational degrees of freedom with an 1.2 Å spacing. For each set of rotational angles, only the best scoring translation is retained, which results in 3600 or 54 000 predictions for 15° or 6° rotational sampling, respectively. The predictions are subsequently ranked according to the ZDOCK scoring function. In the current work, we used 6° sampling, and we varied the total number of predictions considered per test case to match those of template-based methods, as described in the text below.

ZTEM

We developed the ZTEM (Zlab TEMplates) protocol to investigate the availability of templates in the PDB for protein–protein complexes. We applied ZTEM to the 151 enzyme–inhibitor and ‘other’ cases from the Benchmark. A BLAST [40] sequence alignment was used to search for matches to the sequences of protein structures in the PDB (downloaded Oct 4, 2012). Complex templates from the PDB were retained if they showed a sequence alignment within the BLAST significance threshold (E-value ≤ 10.0) for each chain of the query. Finally, the FAST structural alignment program [41] was used for superposing the query proteins onto the complex templates. Note that ZTEM only performs sequence alignments to find candidate templates. Recently, it was shown that complexes can possess structural similarity without sequence similarity [32, 33], and a structural alignment approach would result in more candidate templates. However, because the purpose of ZTEM is to provide a baseline using the simplest way of finding templates, we only considered sequence alignments in this work.

Scoring

We note that each of the methods that were compared uses its own scoring function, and the quality of the function can affect the performance of the method. Unfortunately, it is not straightforward to remove the scoring component of the method, or to use a single scoring function for all approaches. However, because the scoring functions presumably optimize the performance of the respective methods, we have assumed that the scoring function is an integral part of each method and that we could compare the overall performance of the methods.

RESULTS AND DISCUSSION

COTH

We could not test COTH using all the cases from our Benchmark for two reasons. First, COTH can only predict complexes formed by single-chain monomers. Second, we wanted to exclude predictions with both monomers having sequence identities >95% with the complex template (a template was allowed only if at least one of the monomers has <95% sequence identity to the target). The latter poses a problem, as the COTH Web server makes a fixed set of 10 predictions for each case, and does not allow sequence identity cutoffs between the input monomers and the templates to be specified. As a result, most cases yielded <10 COTH predictions that were considered valid (<95% sequence identity for at least one monomer) in our analysis. To avoid bias against cases that have few valid predictions, we only retained the Benchmark cases that had eight or more valid predictions, and for each of these cases included only the top eight valid predictions, ensuring that all retained Benchmark cases had the same number of predictions. Applying these filters, we retained 111 test cases (Table 1), of which 42 and 69 were of the enzyme–inhibitor and ‘other’ complex types, respectively. Seventy cases were rigid-body, and 23 and 18 cases were of the medium and difficult categories, respectively. Although this filter excluded a relative large number of rigid cases and complexes of the ‘other’ type, the remaining numbers are large enough to consider the test set well balanced.

Table 1:

Open in new tab

Summary of the number of hits found for COTH and ZDOCK (using 5 Å IRMSD cutoff for hits)

Complex type	All cases	COTH cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(8)^a cases with hits
Enzyme–inhibitor	42	13	13	18
‘Other’	69	6	5	14
Rigid-body	70	14	15	25
Medium difficulty	23	3	3	6
Difficult	18	2	0	1
Total cases	111	19	18	32
Cases with hits shared with COTH hits			4	7

Complex type	All cases	COTH cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(8)^a cases with hits
Enzyme–inhibitor	42	13	13	18
‘Other’	69	6	5	14
Rigid-body	70	14	15	25
Medium difficulty	23	3	3	6
Difficult	18	2	0	1
Total cases	111	19	18	32
Cases with hits shared with COTH hits			4	7

^aNumber of ZDOCK predictions considered for each case in parentheses.

Table 1:

Open in new tab

Summary of the number of hits found for COTH and ZDOCK (using 5 Å IRMSD cutoff for hits)

Complex type	All cases	COTH cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(8)^a cases with hits
Enzyme–inhibitor	42	13	13	18
‘Other’	69	6	5	14
Rigid-body	70	14	15	25
Medium difficulty	23	3	3	6
Difficult	18	2	0	1
Total cases	111	19	18	32
Cases with hits shared with COTH hits			4	7

Complex type	All cases	COTH cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(8)^a cases with hits
Enzyme–inhibitor	42	13	13	18
‘Other’	69	6	5	14
Rigid-body	70	14	15	25
Medium difficulty	23	3	3	6
Difficult	18	2	0	1
Total cases	111	19	18	32
Cases with hits shared with COTH hits			4	7

^aNumber of ZDOCK predictions considered for each case in parentheses.

We define a hit as a prediction with interface root mean square deviation (IRMSD) of ≤5 Å. When assessing docking approaches, we typically use a 2.5 Å cutoff, but template-based prediction has no or limited sampling in the conformational space and a looser cutoff is appropriate (for a discussion of complex prediction metrics see [42]). Furthermore, COTH predictions may not contain all the residues specified in the input, and therefore we require a hit to have at least 50% of the native interface residues present in the structure of each binding partner. Of the 111 test cases, 19 cases had at least one hit. Thirteen were enzyme–inhibitor, and six of the ‘other’ type. Thus, COTH has a much higher success rate (percentage of cases with hits; 31%) for enzyme–inhibitor complexes than for the ‘other’ category (9%), suggesting this a particular strength of COTH.

When we consider the highest ranked hit for each case, we see for 13 cases a hit was within the top 3 (out of 8) ranked predictions, and that for nearly half (9 out of 19) of the test cases one of the monomers had a >95% sequence identity with the complex template that was used for the prediction. These results show that, despite the COTH approach being based on threading, sequence identity is an important factor in the prediction of near-native complex structure, and that COTH’s ranking is able to distinguish true positives from false positives. Based on these findings, it seems that considering more predictions per test case would only moderately improve COTH’s performance.

Comparison of COTH with ZDOCK

For comparing COTH with ZDOCK, we used the 111 Benchmark cases retained in the COTH analysis. The top ranked ZDOCK prediction was a hit for 18 cases (again using 5 Å IRMSD cutoff for hit definition); thus, the overall performance of COTH with eight predictions per test case (19 cases with hits) is similar to ZDOCK performance with one prediction per case. Of the Benchmark cases for which ZDOCK found hits, 15 are of the rigid category, and 3 of the medium difficulty category (for COTH, we found 14 rigid, 3 medium and 2 difficult). Furthermore, 13 of the ZDOCK hits are enzyme–inhibitor cases, and 5 are of the ‘other’ type (for COTH, we found hits for 13 enzyme–inhibitor cases and 6 of the ‘other’ type). Thus, COTH with eight predictions per test case has similar performance to ZDOCK with the top ranked prediction considered; moreover, the two methods show similar patterns regarding the complex type and expected docking difficulty. The most notable difference is that ZDOCK produced no hits for cases of the difficult category while COTH predicted hits for two difficult cases. This agrees with the observation that rigid-body docking algorithms generally do not perform well when there are large conformational changes on forming the complex, whereas conformational changes should have a smaller impact on template-based approaches. Of the 19 COTH and 18 ZDOCK cases with hits, only 4 are shared. This suggests that ‘pooling’ the predictions of COTH and ZDOCK could yield a higher hit-to-prediction ratio than either of the approaches alone. When we allowed ZDOCK to make the same number of predictions as COTH for each test case, we obtained 32 cases with hits, representing a success rate >50% higher than that of COTH. The number of cases with hits shared between COTH and ZDOCK is still small compared with the total number of cases where COTH has hits (increases from 4 to 7), indicating that pooling approaches can still be beneficial when larger numbers of ZDOCK predictions are considered.

PRISM

In contrast with COTH, PRISM is based on structural alignment and is not limited to single-chain monomers. Therefore, we used all enzyme–inhibitor and ‘other’ test cases that contain single and multi-chain component proteins for the analysis. As suggested in [27], we relaxed some of the filters used in PRISM to increase the number of hits. First, we lifted the requirement that at least one predicted hotspot from the template has an equivalent residue in the prediction. Second, we reduced the number of residues that are required to match between the template and prediction to 12 (default is 15). Note that these changes are similar but not identical to those suggested in [27] (despite discussion with the authors of PRISM, we were not able to determine the relaxed filter settings to achieve the reported performance).

After removing the predictions for which both monomers have a sequence identity >95% with the complex template, we retained an average of 33 predictions for each case. There is, however, a large variation; e.g. PRISM produced 600 predictions for 1N2C. Using the 5 Å IRMSD cutoff, PRISM yielded hits for 26 cases, with 11 enzyme–inhibitor, and 15 of the ‘other’ type (Table 2). Separated by expected docking difficulty there were 21 rigid-body cases and 5 with medium or high difficulty. Of the Benchmark cases with hits, nearly half (11 out of 26) only had hits with a sequence identity >95% for one of the monomers. This shows that, despite PRISM being based on structural alignment, the sequence identity often is a determining factor for the identification of near-native complex structures.

Table 2:

Open in new tab

Summary of the number of hits found for PRISM and ZDOCK (using 5 Å IRMSD cutoff for hits)

Complex type	All cases	PRISM cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(33)^a cases with hits
Enzyme–inhibitor	52	11	14	25
‘Other’	99	15	11	32
Rigid-body	99	21	22	44
Medium difficulty	29	3	3	9
Difficult	23	2	0	4
Total	151	26	25	57
Cases with hits shared with PRISM hits			8	15

Complex type	All cases	PRISM cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(33)^a cases with hits
Enzyme–inhibitor	52	11	14	25
‘Other’	99	15	11	32
Rigid-body	99	21	22	44
Medium difficulty	29	3	3	9
Difficult	23	2	0	4
Total	151	26	25	57
Cases with hits shared with PRISM hits			8	15

^aNumber of ZDOCK predictions considered for each case in parentheses.

Table 2:

Open in new tab

Summary of the number of hits found for PRISM and ZDOCK (using 5 Å IRMSD cutoff for hits)

Complex type	All cases	PRISM cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(33)^a cases with hits
Enzyme–inhibitor	52	11	14	25
‘Other’	99	15	11	32
Rigid-body	99	21	22	44
Medium difficulty	29	3	3	9
Difficult	23	2	0	4
Total	151	26	25	57
Cases with hits shared with PRISM hits			8	15

Complex type	All cases	PRISM cases with hits	ZDOCK(1)^a cases with hits	ZDOCK(33)^a cases with hits
Enzyme–inhibitor	52	11	14	25
‘Other’	99	15	11	32
Rigid-body	99	21	22	44
Medium difficulty	29	3	3	9
Difficult	23	2	0	4
Total	151	26	25	57
Cases with hits shared with PRISM hits			8	15

^aNumber of ZDOCK predictions considered for each case in parentheses.

We note that the template library used in PRISM is designed based on the structure space of binding interfaces. When we excluded predictions based on sequence identity (both monomers having a sequence identity >95% with the complex template), we may effectively reduce the structure space of the template library that is relevant for the solution. This is illustrated by the following findings. Based on the 95% sequence identity cutoff, we exclude at least one prediction for 63 Benchmark cases. PRISM identified hits for 10% of these 63 cases. For the remaining 88 cases for which we did not need to exclude any predictions, PRISM reported hits for 23% of the cases. This is the result of the structure space of the template library constructed to be nonredundant.

Comparison of PRISM with ZDOCK and COTH

Among the set of Benchmark cases used for PRISM, the top ranked ZDOCK prediction was a hit for 25 cases, compared with 26 cases with hits obtained by PRISM. The difference in success rates for the enzyme–inhibitor and ‘other’ complex types is smaller for PRISM (21%−15% = 6%) than for ZDOCK (27%−11% = 16%). Thus, the performance of PRISM depends less on the type of complex than ZDOCK. Because ZDOCK and COTH showed similar success rates for the different complex types, PRISM’s performance also depends less on the complex type than COTH. Of the PRISM and ZDOCK cases with hits, only eight are shared, which again suggests that pooling the predictions from the two methods may be a successful way to obtain the optimum hit-to-predictions ratio. When we allow ZDOCK to make the same number of 33 predictions to make for each case as PRISM does on average, the number of cases with hits more than doubles. However, the PRISM program returns many predictions that are identical, and 33 predictions is an upper limit we used for this comparison. The number of hits that are shared by PRISM and ZDOCK with 33 predictions is still moderate, and a pooling approach still promising.

PRISM and COTH showed similar overall performance, and both found hits for cases in the ‘difficult’ docking category, whereas ZDOCK does not. This is as expected, as template-based methods are less sensitive to conformational changes than rigid-body docking approaches.

ZTEM for determining template availability

To provide a baseline of template-based prediction methods using sequence alignment, we investigated the availability of templates in the PDB. As with the other template-based methods, we excluded templates that had >95% sequence identity with both monomers from the Benchmark case (here we calculated the identity for the chains separately, and used the largest value when a monomer had multiple chains).

The results obtained with ZTEM are summarized in Table 3. For 53 cases, we found at least one template onto which the monomers could be superimposed to produce a near-native complex structure. This represents 35% of the Benchmark cases considered, which is considerably higher than the roughly 20% found with PRISM and COTH. This suggests that template-based methods may be improved further. Because the template search is entirely based on sequence identity, it again shows that sequence identity is a determining factor for the chance of a template resulting in a hit. Such information can be used to develop a confidence measure for template-based docking.

Table 3:

Open in new tab

Template availability using ZTEM (using 5 Å IRMSD cutoff for defining near-native structures)

Complex type	All cases	Number of cases with near-native templates
Enzyme–inhibitor	52	24
‘Other’	99	29
Rigid-body	99	39
Medium difficulty	29	10
Difficult	23	4
Total	151	53

Complex type	All cases	Number of cases with near-native templates
Enzyme–inhibitor	52	24
‘Other’	99	29
Rigid-body	99	39
Medium difficulty	29	10
Difficult	23	4
Total	151	53

Table 3:

Open in new tab

Template availability using ZTEM (using 5 Å IRMSD cutoff for defining near-native structures)

Complex type	All cases	Number of cases with near-native templates
Enzyme–inhibitor	52	24
‘Other’	99	29
Rigid-body	99	39
Medium difficulty	29	10
Difficult	23	4
Total	151	53

Complex type	All cases	Number of cases with near-native templates
Enzyme–inhibitor	52	24
‘Other’	99	29
Rigid-body	99	39
Medium difficulty	29	10
Difficult	23	4
Total	151	53

When we considered the complex types, we observed a large difference in potential to predict enzyme–inhibitor complexes and complexes of the ‘other’ type (46%−29% = 17%). This is comparable with COTH and ZDOCK. Thus, only PRISM’s performance depends less on the complex type, possibly because of the approaches we considered, that is the one that relies the most on structural alignment. As with COTH and PRISM, the ZTEM results suggest that medium and difficult Benchmark cases may be better suited for template-based approaches than rigid-body docking. Eventually protein flexibility analysis may be incorporated to assess the confidence level of the various approaches [43].

CONCLUSIONS

The performance we obtained for template-based methods based on threading and based on structural alignment are comparable. In addition, the template-based methods showed a similar performance as the docking method that was allowed a single prediction for each complex, but when the same number of predictions was allowed for each method, the docking approach outperformed template-based approaches. With the test cases separated by expected docking difficulty, template-based approaches were better able to handle complexes that involved conformational changes upon binding. When we separated the test cases by complex type, we observed that threading and docking approaches were somewhat better for enzyme–inhibitor structure prediction than structural alignment template-based prediction and the reverse for other test cases.

Most importantly, the set of correct predictions from one method only moderately overlapped with the set of correct predictions of another method. This suggests that integrating their results could be the superior strategy for obtaining useful predictions in practical situations. For such an approach to be successful, it would be essential to develop scoring or other confidence metrics that can compare complex structure predictions from different sources.

Finally, we want to stress that in the current work, we used the methods in their standard form, even though performance improvements could possibly be gained by additional computation. For example, rigid-body docking results are often re-ranked or refined using more accurate but slower to compute algorithms. In previous work, we developed the ZRANK and IRAD functions for re-ranking initial-stage docking predictions, and increased the chance to find a near-native structure [44, 45]. Structural refinement generally leads to more accurate predictions [37, 46], and improving the performance of docking approaches is a continuing effort in our lab [43, 47]. For template based methods, improvements can be achieved via algorithmic development or by extending the library of templates. For example, the PRISM interface dataset was constructed in 2006, and since then the number of entries in the PDB has more than doubled. Although the increase of the template dataset is likely <2-fold owing to structural redundancy, the addition of any templates should improve the performance.

Key points

Success rates of template-based and template-free methods for protein–protein complex structure prediction are similar.
Correct predictions are often not shared between the two types of approaches; thus, their results are complementary.
Each method has its strengths and weaknesses.

ACKNOWLEDGEMENTS

We thank Srayanta Mukherjee (University of Kansas Medical Center), Nurcan Tuncbag (M.I.T.) and Guray Kuzu (Koc University, Istanbul) for helpful discussions.

FUNDING

National Institutes of Health [R01 GM084884] (to Z.W.).

References

1

Katchalski-Katzir

E

Shariv

I

Eisenstein

M

et al.

Molecular-surface recognition - determination of geometric fit between proteins and their ligands by correlation techniques

,

Proc Natl Acad Sci USA

,

1992

, vol.

89

(pg.

2195

-

9

)

2

Vakser

IA

Protein docking for low-resolution structures

,

Protein Eng

,

1995

, vol.

8

(pg.

371

-

7

)

3

Norel

R

Lin

SL

Wolfson

HJ

et al.

Molecular-surface complementarity at protein-protein interfaces–the critical role played by surface normals at well places, sparse, points in docking

,

J Mol Biol

,

1995

, vol.

252

(pg.

263

-

73

)

4

Gabb

HA

Jackson

RM

Sternberg

MJE

Modelling protein docking using shape complementarity, electrostatics and biochemical information

,

J Mol Biol

,

1997

, vol.

272

(pg.

106

-

20

)

5

Vakser

IA

Matar

OG

Lam

CF

A systematic study of low-resolution recognition in protein-protein complexes

,

Proc Natl Acad Sci USA

,

1999

, vol.

96

(pg.

8477

-

82

)

6

Mandell

JG

Roberts

VA

Pique

ME

et al.

Protein docking using continuum electrostatics and geometric fit

,

Protein Eng

,

2001

, vol.

14

(pg.

105

-

13

)

7

Chen

R

Weng

Z

Docking unbound proteins using shape complementarity, desolvation, and electrostatics

,

Proteins

,

2002

, vol.

47

(pg.

281

-

94

)

8

Dominguez

C

Boelens

R

Bonvin

A

HADDOCK: a protein-protein docking approach based on biochemical or biophysical information

,

J Am Chem Soc

,

2003

, vol.

125

(pg.

1731

-

7

)

9

Chen

R

Li

L

Weng

Z

ZDOCK: an initial-stage protein-docking algorithm

,

Proteins

,

2003

, vol.

52

(pg.

80

-

7

)

10

Andrusier

N

Mashiach

E

Nussinov

R

et al.

Principles of flexible protein-protein docking

,

Proteins

,

2008

, vol.

73

(pg.

271

-

89

)

11

Lyskov

S

Gray

JJ

The RosettaDock server for local proteinprotein docking

,

Nucleic Acids Res

,

2008

, vol.

36

(pg.

W233

-

8

)

12

Ritchie

DW

Kozakov

D

Vajda

S

Accelerating and focusing protein-protein docking correlations using multi-dimensional rotational FFT generating functions

,

Bioinformatics

,

2008

, vol.

24

(pg.

1865

-

73

)

13

Lu

L

Lu

H

Skolnick

J

MULTIPROSPECTOR: an algorithm for the prediction of protein-protein interactions by multimeric threading

,

Proteins

,

2002

, vol.

49

(pg.

350

-

64

)

14

Aloy

P

Russell

RB

Interrogating protein interaction networks through structural biology

,

Proc Natl Acad Sci USA

,

2002

, vol.

99

(pg.

5896

-

901

)

15

Shatsky

M

Nussinov

R

Wolfson

HJ

A method for simultaneous alignment of multiple protein structures

,

Proteins

,

2004

, vol.

56

(pg.

143

-

56

)

16

Aloy

P

Bottcher

B

Ceulemans

H

et al.

Structure-based assembly of protein complexes in yeast

,

Science

,

2004

, vol.

303

(pg.

2026

-

9

)

17

Aytuna

AS

Gursoy

A

Keskin

O

Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces

,

Bioinformatics

,

2005

, vol.

21

(pg.

2850

-

5

)

18

Gunther

S

May

P

Hoppe

A

et al.

Docking without docking: ISEARCH-Prediction of interactions using known interfaces

,

Proteins

,

2007

, vol.

69

(pg.

839

-

44

)

19

Launay

G

Simonson

T

Homology modelling of protein-protein complexes: a simple method and its possibilities and limitations

,

BMC Bioinformatics

,

2008

, vol.

9

pg.

427

20

Kundrotas

PJ

Lensink

MF

Alexov

E

Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles

,

Int J Biol Macromol

,

2008

, vol.

43

(pg.

198

-

208

)

21

Chen

HL

Skolnick

J

M-TASSER: an algorithm for protein quaternary structure prediction

,

Biophys J

,

2008

, vol.

94

(pg.

918

-

28

)

22

Sinha

R

Kundrotas

PJ

Vakser

IA

Docking by structural similarity at protein-protein interfaces

,

Proteins

,

2010

, vol.

78

(pg.

3235

-

41

)

23

Kundrotas

PJ

Vakser

IA

Accuracy of protein-protein binding sites in high-throughput template-based modeling

,

Plos Comput Biol

,

2010

, vol.

6

pg.

e1000727

24

Ghoorah

AW

Devignes

MD

Smail-Tabbone

M

et al.

Spatial clustering of protein binding sites for template based protein docking

,

Bioinformatics

,

2011

, vol.

27

(pg.

2820

-

7

)

25

Tuncbag

N

Gursoy

A

Nussinov

R

et al.

Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM

,

Nat Protoc

,

2011

, vol.

6

(pg.

1341

-

54

)

26

Mukherjee

S

Zhang

Y

Protein-protein complex structure predictions by multimeric threading and template recombination

,

Structure

,

2011

, vol.

19

(pg.

955

-

66

)

27

Tuncbag

N

Keskin

O

Nussinov

R

et al.

Fast and accurate modeling of protein-protein interactions by combining template-interface-based docking with flexible refinement

,

Proteins

,

2012

, vol.

80

(pg.

1239

-

49

)

28

Sinha

R

Kundrotas

PJ

Vakser

IA

Protein docking by the interface structure similarity: how much structure is needed?

,

Plos One

,

2012

, vol.

7

pg.

e31349

29

Tuncbag

N

Gursoy

A

Keskin

O

Prediction of protein-protein interactions: unifying evolution and structure at protein interfaces

,

Phys Biol

,

2011

, vol.

8

pg.

035006

30

Mintseris

J

Pierce

B

Wiehe

K

et al.

Integrating statistical pair potentials into protein complex prediction

,

Proteins

,

2007

, vol.

69

(pg.

511

-

20

)

31

Berman

HM

Westbrook

J

Feng

Z

et al.

The protein data bank

,

Nucleic Acids Res

,

2000

, vol.

28

(pg.

235

-

42

)

32

Aloy

P

Ceulemans

H

Stark

A

et al.

The relationship between sequence and interaction divergence in proteins

,

J Mol Biol

,

2003

, vol.

332

(pg.

989

-

98

)

33

Garma

L

Mukherjee

S

Mitra

P

et al.

How many protein-protein interactions types exist in nature?

,

Plos One

,

2012

, vol.

7

pg.

e38913

34

Kundrotas

PJ

Zhu

ZW

Janin

J

et al.

Templates are available to model nearly all complexes of structurally characterized proteins

,

Proc Natl Acad Sci USA

,

2012

, vol.

109

(pg.

9438

-

41

)

35

Hwang

H

Vreven

T

Janin

J

et al.

Protein-protein docking benchmark version 4.0

,

Proteins

,

2010

, vol.

78

(pg.

3111

-

14

)

36

Murzin

AG

Brenner

SE

Hubbard

T

et al.

SCOP–a structural classification of proteins database for the investigation of sequences and structures

,

J Mol Biol

,

1995

, vol.

247

(pg.

536

-

40

)

Google Scholar

PubMed

OpenURL Placeholder Text

WorldCat

37

Mashiach

E

Nussinov

R

Wolfson

HJ

FiberDock: a web server for flexible induced-fit backbone refinement in molecular docking

,

Nucleic Acids Res

,

2010

, vol.

38

(pg.

W457

-

61

)

38

Pearson

WR

Lipman

DJ

Improved tools for biological sequence comparison

,

Proc Natl Acad Sci USA

,

1988

, vol.

85

(pg.

2444

-

8

)

39

Hubbard

SJ

Thornton

JM

‘NACCESS computer program’, 1993. Department of Biochemistry and Molecular Biology, University College of London, UK. Program is available at: http://www.bioinf.manchester.ac.uk/naccess/ (11 May 2004, date last accessed)

40

Altschul

SF

Gish

W

Miller

W

et al.

Basic local alignment search tool

,

J Mol Biol

,

1990

, vol.

215

(pg.

403

-

10

)

41

Zhu

JH

Weng

Z

FAST: a novel protein structure alignment algorithm

,

Proteins

,

2005

, vol.

58

(pg.

618

-

27

)

42

Gao

M

Skolnick

J

New benchmark metrics for protein-protein docking methods

,

Proteins

,

2011

, vol.

79

(pg.

1623

-

34

)

43

Hwang

H

Vreven

T

Whitfield

TW

et al.

A machine learning approach for the prediction of protein surface loop flexibility

,

Proteins

,

2011

, vol.

79

(pg.

2467

-

74

)

44

Pierce

B

Weng

Z

ZRANK: reranking protein docking predictions with an optimized energy function

,

Proteins

,

2007

, vol.

67

(pg.

1078

-

86

)

45

Vreven

T

Hwang

H

Weng

Z

Integrating atom-based and residue-based scoring functions for protein-protein docking

,

Protein Sci

,

2011

, vol.

20

(pg.

1576

-

86

)

46

Pierce

B

Weng

Z

A combination of rescoring and refinement significantly improves protein docking performance

,

Proteins

,

2008

, vol.

72

(pg.

270

-

9

)

47

Vreven

T

Hwang

H

Weng

Z

Exploring angular distance in protein-protein docking algorithms

,

PLoS ONE

,

2013

, vol.

8

pg.

e56645

Author notes

*These authors contributed equally to this work.

Download all slides

Month:	Total Views:
December 2016	2
January 2017	2
February 2017	8
March 2017	17
April 2017	6
May 2017	14
June 2017	1
July 2017	4
August 2017	5
September 2017	23
October 2017	3
November 2017	3
December 2017	18
January 2018	10
February 2018	8
March 2018	25
April 2018	22
May 2018	16
June 2018	17
July 2018	15
August 2018	21
September 2018	19
October 2018	26
November 2018	16
December 2018	26
January 2019	10
February 2019	20
March 2019	21
April 2019	21
May 2019	15
June 2019	21
July 2019	17
August 2019	10
September 2019	17
October 2019	21
November 2019	11
December 2019	12
January 2020	13
February 2020	15
March 2020	10
April 2020	17
May 2020	12
June 2020	18
July 2020	19
August 2020	18
September 2020	14
October 2020	10
November 2020	15
December 2020	9
January 2021	11
February 2021	12
March 2021	29
April 2021	21
May 2021	15
June 2021	9
July 2021	11
August 2021	7
September 2021	15
October 2021	18
November 2021	26
December 2021	11
January 2022	23
February 2022	24
March 2022	18
April 2022	15
May 2022	24
June 2022	8
July 2022	15
August 2022	20
September 2022	20
October 2022	25
November 2022	24
December 2022	12
January 2023	22
February 2023	12
March 2023	26
April 2023	32
May 2023	8
June 2023	4
July 2023	8
August 2023	13
September 2023	11
October 2023	19
November 2023	7
December 2023	20
January 2024	25
February 2024	27
March 2024	10
April 2024	23
May 2024	14
June 2024	25
July 2024	21
August 2024	19
September 2024	20
October 2024	9
November 2024	45

Article Contents

Evaluating template-based and template-free protein–protein complex structure prediction

Abstract

INTRODUCTION

METHODS

Dataset

COTH

PRISM

ZDOCK

ZTEM

Scoring

RESULTS AND DISCUSSION

COTH

Comparison of COTH with ZDOCK

PRISM

Comparison of PRISM with ZDOCK and COTH

ZTEM for determining template availability

CONCLUSIONS

ACKNOWLEDGEMENTS

FUNDING

References

Author notes

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

Article Contents

Evaluating template-based and template-free protein–protein complex structure prediction

Abstract

INTRODUCTION

METHODS

Dataset

COTH

PRISM

ZDOCK

ZTEM

Scoring

RESULTS AND DISCUSSION

COTH

Comparison of COTH with ZDOCK

PRISM

Comparison of PRISM with ZDOCK and COTH

ZTEM for determining template availability

CONCLUSIONS

ACKNOWLEDGEMENTS

FUNDING

References

Author notes

Citations

Views

Altmetric

Email alerts

Citing articles via

Latest

Most Read

Most Cited

This Feature Is Available To Subscribers Only