nips nips2004 nips2004-95 nips2004-95-reference knowledge-graph by maker-knowledge-mining

95 nips-2004-Large-Scale Prediction of Disulphide Bond Connectivity


Source: pdf

Author: Jianlin Cheng, Alessandro Vullo, Pierre F. Baldi

Abstract: The formation of disulphide bridges among cysteines is an important feature of protein structures. Here we develop new methods for the prediction of disulphide bond connectivity. We first build a large curated data set of proteins containing disulphide bridges and then use 2-Dimensional Recursive Neural Networks to predict bonding probabilities between cysteine pairs. These probabilities in turn lead to a weighted graph matching problem that can be addressed efficiently. We show how the method consistently achieves better results than previous approaches on the same validation data. In addition, the method can easily cope with chains with arbitrary numbers of bonded cysteines. Therefore, it overcomes one of the major limitations of previous approaches restricting predictions to chains containing no more than 10 oxidized cysteines. The method can be applied both to situations where the bonded state of each cysteine is known or unknown, in which case bonded state can be predicted with 85% precision and 90% recall. The method also yields an estimate for the total number of disulphide bridges in each chain. 1


reference text

[1] V.I. Abkevich and E.I. Shankhnovich. What can disulfide bonds tell us about protein energetics, function and folding: simulations and bioinformatics analysis. J. Math. Biol., 300:975–985, 2000.

[2] A. Bairoch and R. Apweiler. The SWISS-PROT protein sequence database and its supplement TrEMBL. Nucleic Acids Res., 28:45–48, 2000.

[3] P. Baldi and G. Pollastri. Machine learning structural and functional proteomics. IEEE Intelligent Systems. Special Issue on Intelligent Systems in Biology, 17(2), 2002.

[4] P. Baldi and G. Pollastri. The principled design of large-scale recursive neural network architectures–dag-rnns and the protein structure prediction problem. Journal of Machine Learning Research, 4:575–602, 2003.

[5] P. Baldi and M. Rosen-Zvi. On the relationship between deterministic and probabilistic directed graphical models. 2004. Submitted.

[6] H. M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T. N. Bhat, H. Weissig, I. N. Shindyalov, and P. E. Bourne. The Protein Data Bank. Nucl. Acids Res., 28:235–242, 2000.

[7] S. Betz. Disulfide bonds and the stability of globular proteins. Proteins, Struct., Function Genet., 21:167–195, 1993.

[8] J. Clarke and A.R. Fersht. Engineered disulfide bonds as probes of the folding pathway of barnase - increasing stability of proteins against the rate of denaturation. Biochemistry, 32:4322– 4329, 1993.

[9] L. Demetrius. Thermodynamics and kinetics of protein folding: an evolutionary perpective. J. Theor. Biol., 217:397–411, 2000.

[10] P. Fariselli and R. Casadio. Prediction of disulfide connectivity in proteins. Bioinformatics, 17:957–964, 2001.

[11] P. Fariselli, P. L. Martelli, and R. Casadio. A neural network-based method for predicting the disulfide connectivity in proteins. In E. Damiani et al., editors, Knowledge based intelligent information engineering systems and allied technologies (KES 2002), volume 1, pages 464– 468. IOS Press, 2002.

[12] H.N. Gabow. An efficient implementation of Edmond’s algorithm for maximum weight matching on graphs. Journal of the ACM, 23(2):221–234, 1976.

[13] J.L. Klepeis and C.A. Floudas. Prediction of β-sheet topology and disulfide bridges in polypeptides. J. Comput. Chem., 24:191–208, 2003.

[14] T.A. Klink, K.J. Woycechosky, K.M. Taylor, and R.T. Raines. Contribution of disulfide bonds to the conformational stability and catalytic activity of ribonuclease A. Eur. J. Biochem., 267:566– 572, 2000.

[15] M. Matsumura et al. Substantial increase of protein stability by multiple disulfide bonds. Nature, 342:291–293, 1989.

[16] G. Pollastri, D. Przybylski, B. Rost, and P. Baldi. Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles. Proteins, 47:228–235, 2002.

[17] A. Vullo and P. Frasconi. Disulfide connectivity prediction using recursive neural networks and evolutionary information. Bioinformatics, 20:653–659, 2004.

[18] W.J. Wedemeyer, E. Welkler, M. Narayan, and H.A. Scheraga. Disulfide bonds and proteinfolding. Biochemistry, 39:4207–4216, 2000.