TY - JOUR
T1 - Semantic prioritization of novel causative genomic variants
AU - Boudellioua, Imene
AU - Mohamad Razali, Rozaimi
AU - Kulmanov, Maxat
AU - Hashish, Yasmeen
AU - Bajic, Vladimir B.
AU - Goncalves-Serra, Eva
AU - Schoenmakers, Nadia
AU - Gkoutos, Georgios V.
AU - Schofield, Paul N.
AU - Hoehndorf, Robert
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: This research used the resources of the Computational Bioscience Research Center and the Supercomputing Laboratory at King Abdullah University of Science & Technology (KAUST) in Thuwal, Saudi Arabia.
PY - 2017/4/17
Y1 - 2017/4/17
N2 - Discriminating the causative disease variant(s) for individuals with inherited or de novo mutations presents one of the main challenges faced by the clinical genetics community today. Computational approaches for variant prioritization include machine learning methods utilizing a large number of features, including molecular information, interaction networks, or phenotypes. Here, we demonstrate the PhenomeNET Variant Predictor (PVP) system that exploits semantic technologies and automated reasoning over genotype-phenotype relations to filter and prioritize variants in whole exome and whole genome sequencing datasets. We demonstrate the performance of PVP in identifying causative variants on a large number of synthetic whole exome and whole genome sequences, covering a wide range of diseases and syndromes. In a retrospective study, we further illustrate the application of PVP for the interpretation of whole exome sequencing data in patients suffering from congenital hypothyroidism. We find that PVP accurately identifies causative variants in whole exome and whole genome sequencing datasets and provides a powerful resource for the discovery of causal variants.
AB - Discriminating the causative disease variant(s) for individuals with inherited or de novo mutations presents one of the main challenges faced by the clinical genetics community today. Computational approaches for variant prioritization include machine learning methods utilizing a large number of features, including molecular information, interaction networks, or phenotypes. Here, we demonstrate the PhenomeNET Variant Predictor (PVP) system that exploits semantic technologies and automated reasoning over genotype-phenotype relations to filter and prioritize variants in whole exome and whole genome sequencing datasets. We demonstrate the performance of PVP in identifying causative variants on a large number of synthetic whole exome and whole genome sequences, covering a wide range of diseases and syndromes. In a retrospective study, we further illustrate the application of PVP for the interpretation of whole exome sequencing data in patients suffering from congenital hypothyroidism. We find that PVP accurately identifies causative variants in whole exome and whole genome sequencing datasets and provides a powerful resource for the discovery of causal variants.
UR - http://hdl.handle.net/10754/623278
UR - http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005500
UR - http://www.scopus.com/inward/record.url?scp=85018348604&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1005500
DO - 10.1371/journal.pcbi.1005500
M3 - Article
C2 - 28414800
SN - 1553-7358
VL - 13
SP - e1005500
JO - PLOS Computational Biology
JF - PLOS Computational Biology
IS - 4
ER -