TY - JOUR
T1 - SPECTRUM - A MATLAB Toolbox for Proteoform Identification from Top-Down Proteomics Data.
AU - Basharat, Abdul Rehman
AU - Iman, Kanzal
AU - Khalid, Muhammad Farhan
AU - Anwar, Zohra
AU - Hussain, Rashid
AU - Kabir, Humnah Gohar
AU - Tahreem, Maria
AU - Shahid, Anam
AU - Humayun, Maheen
AU - Hayat, Hira Azmat
AU - Mustafa, Muhammad
AU - Shoaib, Muhammad Ali
AU - Ullah, Zakir
AU - Zarina, Shamshad
AU - Ahmed, Sameer
AU - Uddin, Emad
AU - Hamera, Sadia
AU - Ahmad, Fayyaz
AU - Chaudhary, Safee Ullah
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledgements: We acknowledge the support provided by Osama Shiraz Shah for the fruitful discussions and suggestions during the development of the toolbox. This work was supported by HEC (21-320SRGP/R&D/HEC/2014, 20-2269/NRPU/R&D/ HEC/12/4792 and 20-3629/NRPU/R&D/HEC/14/585), Ignite (SRG-209), TWAS (RG 14-319 RG/ITC/AS_C) and LUMS (STG-BIO-1008, FIF-BIO-2052 and FIF-BIO-0255) grants.
PY - 2019/8/2
Y1 - 2019/8/2
N2 - Top-Down Proteomics (TDP) is an emerging proteomics protocol that involves identification, characterization, and quantitation of intact proteins using high-resolution mass spectrometry. TDP has an edge over other proteomics protocols in that it allows for: (i) accurate measurement of intact protein mass, (ii) high sequence coverage, and (iii) enhanced identification of post-translational modifications (PTMs). However, the complexity of TDP spectra poses a significant impediment to protein search and PTM characterization. Furthermore, limited software support is currently available in the form of search algorithms and pipelines. To address this need, we propose 'SPECTRUM', an open-architecture and open-source toolbox for TDP data analysis. Its salient features include: (i) MS2-based intact protein mass tuning, (ii) de novo peptide sequence tag analysis, (iii) propensity-driven PTM characterization, (iv) blind PTM search, (v) spectral comparison, (vi) identification of truncated proteins, (vii) multifactorial coefficient-weighted scoring, and (viii) intuitive graphical user interfaces to access the aforementioned functionalities and visualization of results. We have validated SPECTRUM using published datasets and benchmarked it against salient TDP tools. SPECTRUM provides significantly enhanced protein identification rates (91% to 177%) over its contemporaries. SPECTRUM has been implemented in MATLAB, and is freely available along with its source code and documentation at https://github.com/BIRL/SPECTRUM/.
AB - Top-Down Proteomics (TDP) is an emerging proteomics protocol that involves identification, characterization, and quantitation of intact proteins using high-resolution mass spectrometry. TDP has an edge over other proteomics protocols in that it allows for: (i) accurate measurement of intact protein mass, (ii) high sequence coverage, and (iii) enhanced identification of post-translational modifications (PTMs). However, the complexity of TDP spectra poses a significant impediment to protein search and PTM characterization. Furthermore, limited software support is currently available in the form of search algorithms and pipelines. To address this need, we propose 'SPECTRUM', an open-architecture and open-source toolbox for TDP data analysis. Its salient features include: (i) MS2-based intact protein mass tuning, (ii) de novo peptide sequence tag analysis, (iii) propensity-driven PTM characterization, (iv) blind PTM search, (v) spectral comparison, (vi) identification of truncated proteins, (vii) multifactorial coefficient-weighted scoring, and (viii) intuitive graphical user interfaces to access the aforementioned functionalities and visualization of results. We have validated SPECTRUM using published datasets and benchmarked it against salient TDP tools. SPECTRUM provides significantly enhanced protein identification rates (91% to 177%) over its contemporaries. SPECTRUM has been implemented in MATLAB, and is freely available along with its source code and documentation at https://github.com/BIRL/SPECTRUM/.
UR - http://hdl.handle.net/10754/656468
UR - http://www.nature.com/articles/s41598-019-47724-1
UR - http://www.scopus.com/inward/record.url?scp=85071164335&partnerID=8YFLogxK
U2 - 10.1038/s41598-019-47724-1
DO - 10.1038/s41598-019-47724-1
M3 - Article
C2 - 31375721
SN - 2045-2322
VL - 9
JO - Scientific reports
JF - Scientific reports
IS - 1
ER -