Abstract
Computational elucidation of membrane protein (MP) structures is challenging partially due to lack of sufficient solved structures for homology modeling. Here, we describe a high-throughput deep transfer learning method that first predicts MP contacts by learning from non-MPs and then predicts 3D structure models using the predicted contacts as distance restraints. Tested on 510 non-redundant MPs, our method has contact prediction accuracy at least 0.18 better than existing methods, predicts correct folds for 218 MPs, and generates 3D models with root-mean-square deviation (RMSD) less than 4 and 5 Å for 57 and 108 MPs, respectively. A rigorous blind test in the continuous automated model evaluation project shows that our method predicted high-resolution 3D models for two recent test MPs of 210 residues with RMSD ∼2 Å. We estimated that our method could predict correct folds for 1,345–1,871 reviewed human multi-pass MPs including a few hundred new folds, which shall facilitate the discovery of drugs targeting at MPs. A deep transfer learning method is presented to predict membrane protein contact map by learning sequence-structure relationships from non-membrane proteins, which overcomes the challenge that there are not many solved membrane protein structures for deep learning model training. The predicted contacts are pretty accurate and can help predict correct folds and accurate 3D models for ∼40% and ∼20% of 510 non-redundant membrane proteins, respectively.
Original language | English (US) |
---|---|
Pages (from-to) | 202-211.e3 |
Journal | Cell systems |
Volume | 5 |
Issue number | 3 |
DOIs | |
State | Published - Sep 27 2017 |
Keywords
- co-evolution analysis
- deep learning
- deep transfer learning
- homology modeling
- membrane protein contact prediction
- membrane protein folding
- multiple sequence alignment
ASJC Scopus subject areas
- Pathology and Forensic Medicine
- Cell Biology
- Histology