TY - JOUR
T1 - KAUST Metagenomic Analysis Platform (KMAP), enabling access to massive analytics of re-annotated metagenomic data
AU - Alam, Intikhab
AU - Kamau, Allan
AU - Ngugi, David Kamanda
AU - Gojobori, Takashi
AU - Duarte, Carlos M.
AU - Bajic, Vladimir B.
N1 - KAUST Repository Item: Exported on 2021-11-21
Acknowledged KAUST grant number(s): BAS/1/1606-01-01, FCC/1/1976-02-01, FCC/1/1976-17-01, FCC/1/1976-18-01
Acknowledgements: This work is supported by KAUST base funding Grant BAS/1/1606-01-01, KAUST CCF Grants FCC/1/1976-02-01 and FCC/1/1976-11-01, CARF Grants FCC/1/1976-17-01 and FCC/1/1976-18-01, and CBRC base line fund.
PY - 2021/6/1
Y1 - 2021/6/1
N2 - AbstractExponential rise of metagenomics sequencing is delivering massive functional environmental genomics data. However, this also generates a procedural bottleneck for on-going re-analysis as reference databases grow and methods improve, and analyses need be updated for consistency, which require acceess to increasingly demanding bioinformatic and computational resources. Here, we present the KAUST Metagenomic Analysis Platform (KMAP), a new integrated open web-based tool for the comprehensive exploration of shotgun metagenomic data. We illustrate the capacities KMAP provides through the re-assembly of ~ 27,000 public metagenomic samples captured in ~ 450 studies sampled across ~ 77 diverse habitats. A small subset of these metagenomic assemblies is used in this pilot study grouped into 36 new habitat-specific gene catalogs, all based on full-length (complete) genes. Extensive taxonomic and gene annotations are stored in Gene Information Tables (GITs), a simple tractable data integration format useful for analysis through command line or for database management. KMAP pilot study provides the exploration and comparison of microbial GITs across different habitats with over 275 million genes. KMAP access to data and analyses is available at https://www.cbrc.kaust.edu.sa/aamg/kmap.start.
AB - AbstractExponential rise of metagenomics sequencing is delivering massive functional environmental genomics data. However, this also generates a procedural bottleneck for on-going re-analysis as reference databases grow and methods improve, and analyses need be updated for consistency, which require acceess to increasingly demanding bioinformatic and computational resources. Here, we present the KAUST Metagenomic Analysis Platform (KMAP), a new integrated open web-based tool for the comprehensive exploration of shotgun metagenomic data. We illustrate the capacities KMAP provides through the re-assembly of ~ 27,000 public metagenomic samples captured in ~ 450 studies sampled across ~ 77 diverse habitats. A small subset of these metagenomic assemblies is used in this pilot study grouped into 36 new habitat-specific gene catalogs, all based on full-length (complete) genes. Extensive taxonomic and gene annotations are stored in Gene Information Tables (GITs), a simple tractable data integration format useful for analysis through command line or for database management. KMAP pilot study provides the exploration and comparison of microbial GITs across different habitats with over 275 million genes. KMAP access to data and analyses is available at https://www.cbrc.kaust.edu.sa/aamg/kmap.start.
UR - http://hdl.handle.net/10754/666521
UR - http://www.nature.com/articles/s41598-021-90799-y
UR - http://www.scopus.com/inward/record.url?scp=85107356540&partnerID=8YFLogxK
U2 - 10.1038/s41598-021-90799-y
DO - 10.1038/s41598-021-90799-y
M3 - Article
C2 - 34075103
SN - 2045-2322
VL - 11
JO - Scientific Reports
JF - Scientific Reports
IS - 1
ER -