Abstract
Inosine is a prevalent RNA modification in animals and is formed when an adenosine is deaminated by the ADAR family of enzymes. Traditionally, inosines are identified indirectly as variants from Illumina RNA-sequencing data because they are interpreted as guanosines by cellular machineries. However, this indirect method performs poorly in protein-coding regions where exons are typically short, in non-model organisms with sparsely annotated single-nucleotide polymorphisms, or in disease contexts where unknown DNA mutations are pervasive. Here, we show that Oxford Nanopore direct RNA sequencing can be used to identify inosine-containing sites in native transcriptomes with high accuracy. We trained convolutional neural network models to distinguish inosine from adenosine and guanosine, and to estimate the modification rate at each editing site. Furthermore, we demonstrated their utility on the transcriptomes of human, mouse and Xenopus. Our approach expands the toolkit for studying adenosine-to-inosine editing and can be further extended to investigate other RNA modifications.
Original language | English (US) |
---|---|
Pages (from-to) | 833-844 |
Number of pages | 12 |
Journal | Nature Methods |
Volume | 19 |
Issue number | 7 |
DOIs | |
State | Published - Jul 1 2022 |
Externally published | Yes |
ASJC Scopus subject areas
- Biochemistry
- Cell Biology
- Molecular Biology
- Biotechnology