Prediction of Protein Relative Solvent Accessibility with Support Vector Machines and Long-Range Interaction 3D Local Descriptor

Hyunsoo Kim, Haesun Park

Research output: Contribution to journalArticlepeer-review

107 Scopus citations

Abstract

The prediction of protein relative solvent accessibility gives us helpful information for the prediction of tertiary structure of a protein. The SVMpsi method, which uses support vector machines (SVMs), and the position-specific scoring matrix (PSSM) generated from PSI-BLAST have been applied to achieve better prediction accuracy of the relative solvent accessibility. We have introduced a three-dimensional local descriptor that contains information about the expected remote contacts by both the long-range interaction matrix and neighbor sequences. Moreover, we applied feature weights to kernels in SVMs in order to consider the degree of significance that depends on the distance from the specific amino acid. Relative solvent accessibility based on a two state-model, for 25%, 16%, 5%, and 0% accessibility are predicted at 78.7%, 80.7%, 82.4%, and 87.4% accuracy, respectively. Three-state prediction results provide a 64.5% accuracy with 9%; 36% threshold. The support vector machine approach has successfully been applied for solvent accessibility prediction by considering long-range interaction and handling unbalanced data.

Original languageEnglish (US)
Pages (from-to)557-562
Number of pages6
JournalProteins: Structure, Function and Genetics
Volume54
Issue number3
DOIs
StatePublished - Feb 15 2004
Externally publishedYes

Keywords

  • Directed acyclic graph scheme
  • Long range interaction
  • PSSM
  • Protein structure prediction
  • Solvent accessibility
  • Support vector machines

Fingerprint

Dive into the research topics of 'Prediction of Protein Relative Solvent Accessibility with Support Vector Machines and Long-Range Interaction 3D Local Descriptor'. Together they form a unique fingerprint.

Cite this