Abstract
The identification of interface residues involved in protein-protein interactions(PPIs) has broad application in rational drug design and metabolic etc. Here a Naïve Bayes classifier for PPIs prediction with features including protein sequence profile and residue accessible surface area was proposed. This method adequately used the character of Naïve Bayes classifier which assumed independence of the attributes given the class. Our test results on a diversity dataset made up of only hetero-complex proteins achieved 68.1% overall accuracy with a correlation coefficient of 0.201, 40.2% specificity and 49.9% sensitivity in identify interface residues as estimated by leave-one-out cross-validation. This result indicated that the method performed substantially better than chance (zero correlation). Examination of the predictions in the context of 3-dimensional structures of proteins demonstrated the effectiveness of this method in identifying protein-protein sites.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhou, H.-X., Qin, S.: Interaction-site prediction for protein complexes: a critical assessment. Bioinformatics 23(17), 2203–2209 (2007)
Ben-Shem, A., Frolow, F., Nelson, N.: Crystal structure of plant photosystem I. Nature 426, 630–635 (2003)
Lanman, J., Lam, T.T., Bames, S., Sakalian, M., Emmett, M.R., Marshall, A.G., Prevelige, J., Peter, E.: Identification of novel interactions in HIV-1 capsid protein assembly by high-resolution mass Spectrometry. J. Mol. Biol. 325, 759–772 (2003)
Trester-Zedlitz, M., Kamada, K., Burley, S.K., Fenyo, D., Chait, B.T., Muir, T.W.: A modular cross-linking approach for exploring protein interactions. J. Am. Chem. Soc. 125, 2416–2425 (2003)
Koike, A., Takagi, T.: Prediction of protein-protein interaction sites using support vector machines. Protein Engineering, Design & Selection 17(2), 165–173 (2004)
Chung, J.-L., Wang, W., Bourne, P.E.: Exploiting sequence and structure homologs to identify protein-protein binding sites. PROTEINS:Structure, Function, and Bioinformatics 62, 630–640 (2006)
Yan, C., Dobbs, D., Honavar, V.: A two-stage classifier for identification of protein-protein interface residues. Bioinformatics 20 (suppl. 1), i371-i378 (2004)
Ofran, Y., Rost, B.: Predicted protein-protein interaction sites from local sequence information. FEBS Letters 544, 236–239 (2003)
Chen, H., Zhou, H.-X.: Prediction of Interface residues in protein-protein complexes by a consensus neural network method: test against NMR data. PROTEINS: Structure, Function, and Bioinformatics 61, 21–35 (2005)
LI, M.-H., Lin, L., Wang, X.-L., Liu, T.: Protein-protein interaction site prediction based on conditional random fields. Bioinformatics 23(5), 597–604 (2007)
James, R., Bradford, C.J., Needham, A.J., Bulpitt, D.R.: Westhead: Insights into protein-protein interfaces using a Bayesian network prediction method. J. Mol. Biol. 362, 365–386 (2006)
Bradford, J.R., Westhead, D.R.: Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics 21(8), 1487–1494 (2005)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Boume, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)
Xennarios, I., Salwinski, L., Duan, X.J., Higney, P., Kim, S., Eisenberg, D.: DIP: The Database of Interacting Proteins. A research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 30, 303–305 (2002)
Chakrabarti, P., Janin, J.: Dissecting protein-protein recognition sites. PROTEINS: Structure, Function, and Genetics 47, 334–343 (2002)
Res, I., Mihalek, I., Lichtarge, O.: An evolution based classifier for prediction of protein interfaces without using protein structures. Bioinformatics 21(10), 2496–2501 (2005)
Dong, Q., Wang, X., Lin, L., Guan, Y.: Exploiting residue-level and profile-level interface propensities for usage in binding sites prediction of proteins. BMC Bioinformatics 8(147), 1–13 (2007)
Kabsch, W., Sander, C.: Dictionary of protein secondary structure: pattern of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)
Rost, B., Sander, C.: Conservation and prediction of solvent accessibility in protein families. PROTEINS: Structure, Function, and Genetics 20, 216–226 (1994)
Jones, S., Thomton, J.M.: Principles of protein-protein interactions. Proc. Natl. Acad. Sci. USA 93, 13–20 (1996)
Yan, C., Honavar, V., Dobbs, D.: Identification of interface residues in protease-inhibitor and antigen-antibody complexes: a support vector machine approach. Neural Comput. & Applic. 13, 123–129 (2004)
Dodge, C., Schneider, R., Sander, C.: The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res. 26, 313–315 (1998)
Christian Borgelt’s Webpages, http://www.borgelt.net//bayes.html
Honavar, V., Yan, C., Dobbs, D.: Predicting protein-protein interaction sites from amino acid sequence. Technical report ISU-CS-TR 02-11, Department of Computer Science, Iowa State University, pp. 2–11 (2002), http://archives.cs.iastate.edu/documents/disk0/00/00/02/88/index.html
Stubbs, M.T., Laber, B., Bode, W., Huber, R., Jerala, R., Lenarcic, B., Turk, V.: The refined 2.4 A X-ray crystal structure of recombinant human stefin B in complex with the cysteine proteinase papain: a novel type of proteinase inhibitor interaction. EMBO J. 9, 1939–1947 (1990)
Rees, D.C., Lipscomb, W.N.: Refined crystal structure of the potato inhibitor complex of carboxypeptidase A at 2.5 A resolution. J. Mol. Biol. 160, 475–498 (1982)
Deisenhofer, J.: Crystallographic refinement and atomic models of a human Fc fragment and its complex with fragment B of protein A from Staphylococcus aureus at 2.9- and 2.8-A resolution. Biochemistry 20, 2361–2370 (1981)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, C., Cheng, J., Su, S., Xu, D. (2008). Identification of Interface Residues Involved in Protein-Protein Interactions Using Naïve Bayes Classifier. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-88192-6_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88191-9
Online ISBN: 978-3-540-88192-6
eBook Packages: Computer ScienceComputer Science (R0)