TY - GEN
T1 - Domain Randomization for Macromolecule Structure Classification and Segmentation in Electron Cyro-tomograms
AU - Che, Chengqian
AU - Xian, Zhou
AU - Zeng, Xiangrui
AU - Gao, Xin
AU - Xu, Min
N1 - KAUST Repository Item: Exported on 2020-10-01
Acknowledged KAUST grant number(s): BAS/1/1624, FCC/1/1976-18, FCC/1/1976-23, FCC/1/1976-25, FCC/1/1976-26
Acknowledgements: This work was supported in part by U.S. National Institutes of Health (NIH) grant P41 GM103712. XZ was supported by a fellowship from Carnegie Mellon University’s Center for Machine Learning and Health. XG was supported by the King Abdullah University of Science and Technology.
PY - 2020/2/7
Y1 - 2020/2/7
N2 - It is crucial to study and understand cellular processes. In recent years, Cellular Electron CryoTomography (CECT) serves as a powerful 3D imaging tool to visualize spatial structure of macromolecules inside the cell. However, it is challenging to analyze the macromolecular structures in a systematic way due to nature of the structural complexity of subcellular components. Existing computational and deep learning based approaches suffer from limited scalability, discrimination ability and lack of accurate annotated CECT data. Training with cheap simulated data can alleviate this problem while facing new challenges of bridging the 'reality gap' between synthetic training data and real testing data. In this paper, we tackle the tasks of macromolecule structure classification and segmentation in CECT images by adapting a simple but effective technique, domain randomization. We show that by combining deep neural models and domain randomization, we are able to achieve significant improvements of 35.21% and 46.34% in tasks of classification and semantic segmentation for real CECT data, comparing to the model trained only on syhthetic data that aims to faithfully reproduce real-world data distribution.
AB - It is crucial to study and understand cellular processes. In recent years, Cellular Electron CryoTomography (CECT) serves as a powerful 3D imaging tool to visualize spatial structure of macromolecules inside the cell. However, it is challenging to analyze the macromolecular structures in a systematic way due to nature of the structural complexity of subcellular components. Existing computational and deep learning based approaches suffer from limited scalability, discrimination ability and lack of accurate annotated CECT data. Training with cheap simulated data can alleviate this problem while facing new challenges of bridging the 'reality gap' between synthetic training data and real testing data. In this paper, we tackle the tasks of macromolecule structure classification and segmentation in CECT images by adapting a simple but effective technique, domain randomization. We show that by combining deep neural models and domain randomization, we are able to achieve significant improvements of 35.21% and 46.34% in tasks of classification and semantic segmentation for real CECT data, comparing to the model trained only on syhthetic data that aims to faithfully reproduce real-world data distribution.
UR - http://hdl.handle.net/10754/663480
UR - https://ieeexplore.ieee.org/document/8983110/
UR - http://www.scopus.com/inward/record.url?scp=85084341880&partnerID=8YFLogxK
U2 - 10.1109/BIBM47256.2019.8983110
DO - 10.1109/BIBM47256.2019.8983110
M3 - Conference contribution
SN - 9781728118673
SP - 6
EP - 11
BT - 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)
PB - IEEE
ER -