TY - JOUR
T1 - A Literature Review of Gene Function Prediction by Modeling Gene Ontology.
AU - Zhao, Yingwen
AU - Wang, Jun
AU - Chen, Jian
AU - Zhang, Xiangliang
AU - Guo, Maozu
AU - Yu, Guoxian
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: Funding. This work was financially supported by Natural Science Foundation of China (61872300), Fundamental Research Funds for the Central Universities (XDJK2019B024 and XDJK2020B028), Natural Science Foundation of CQ CSTC (cstc2018-jcyjAX0228), and King Abdullah University of Science and Technology, under award number FCC/1/1976-19-01.
PY - 2020/5/12
Y1 - 2020/5/12
N2 - Annotating the functional properties of gene products, i.e., RNAs and proteins, is a fundamental task in biology. The Gene Ontology database (GO) was developed to systematically describe the functional properties of gene products across species, and to facilitate the computational prediction of gene function. As GO is routinely updated, it serves as the gold standard and main knowledge source in functional genomics. Many gene function prediction methods making use of GO have been proposed. But no literature review has summarized these methods and the possibilities for future efforts from the perspective of GO. To bridge this gap, we review the existing methods with an emphasis on recent solutions. First, we introduce the conventions of GO and the widely adopted evaluation metrics for gene function prediction. Next, we summarize current methods of gene function prediction that apply GO in different ways, such as using hierarchical or flat inter-relationships between GO terms, compressing massive GO terms and quantifying semantic similarities. Although many efforts have improved performance by harnessing GO, we conclude that there remain many largely overlooked but important topics for future research.
AB - Annotating the functional properties of gene products, i.e., RNAs and proteins, is a fundamental task in biology. The Gene Ontology database (GO) was developed to systematically describe the functional properties of gene products across species, and to facilitate the computational prediction of gene function. As GO is routinely updated, it serves as the gold standard and main knowledge source in functional genomics. Many gene function prediction methods making use of GO have been proposed. But no literature review has summarized these methods and the possibilities for future efforts from the perspective of GO. To bridge this gap, we review the existing methods with an emphasis on recent solutions. First, we introduce the conventions of GO and the widely adopted evaluation metrics for gene function prediction. Next, we summarize current methods of gene function prediction that apply GO in different ways, such as using hierarchical or flat inter-relationships between GO terms, compressing massive GO terms and quantifying semantic similarities. Although many efforts have improved performance by harnessing GO, we conclude that there remain many largely overlooked but important topics for future research.
UR - http://hdl.handle.net/10754/662881
UR - https://www.frontiersin.org/article/10.3389/fgene.2020.00400/full
UR - http://www.scopus.com/inward/record.url?scp=85084367562&partnerID=8YFLogxK
U2 - 10.3389/fgene.2020.00400
DO - 10.3389/fgene.2020.00400
M3 - Article
C2 - 32391061
SN - 1664-8021
VL - 11
JO - Frontiers in genetics
JF - Frontiers in genetics
ER -