Abstract

C-to-U RNA editing enzymatically converts the base C to U in RNA molecules and could lead to nonsynonymous changes when occurring in coding regions. Hundreds to thousands of coding sites were recently found to be C-to-U edited or editable in humans, but the biological significance of this phenomenon is elusive. Here, we test the prevailing hypothesis that nonsynonymous editing is beneficial because it provides a means for tissue- or time-specific regulation of protein function that may be hard to accomplish by mutations due to pleiotropy. The adaptive hypothesis predicts that the fraction of sites edited and the median proportion of RNA molecules edited (i.e., editing level) are both higher for nonsynonymous than synonymous editing. However, our empirical observations are opposite to these predictions. Furthermore, the frequency of nonsynonymous editing, relative to that of synonymous editing, declines as genes become functionally more important or evolutionarily more constrained, and the nonsynonymous editing level at a site is negatively correlated with the evolutionary conservation of the site. Together, these findings refute the adaptive hypothesis; they instead indicate that the reported C-to-U coding RNA editing is mostly slightly deleterious or neutral, probably resulting from off-target activities of editing enzymes. Along with similar conclusions on the more prevalent A-to-I editing and m6A modification of coding RNAs, our study suggests that, at least in humans, most events of each type of posttranscriptional coding RNA modification likely manifest cellular errors rather than adaptations, demanding a paradigm shift in the research of posttranscriptional modification.

Introduction

RNA editing refers to a heterogeneous group of posttranscriptional modifications of RNAs that include insertion, deletion, and nucleotide conversions but exclude common forms of processing (e.g., splicing, 5′-capping, and 3′-polyadenylation) and conversions to nonstandard nucleotides (e.g., m6A) (Covello and Gray 1993; Nishikura 2006; Farajollahi and Maas 2010). The most prevalent editing of mRNAs transcribed from the animal nuclear genome is A-to-I (inosine) editing (Nishikura 2016). For instance, in humans, millions of sites undergo A-to-I editing, although the vast majority of these sites reside in noncoding regions (Li et al. 2009; Peng et al. 2012; Bazak et al. 2014). The physiological function of A-to-I editing is largely unknown, with evidence for functionality present for only a few incidences (Lomeli et al. 1994; Burns et al. 1997; Seeburg and Hartner 2003; Garrett and Rosenthal 2012; Nishikura 2016). Evolutionary analysis suggests that most observed A-to-I editing events in vertebrate coding regions are neutral or slightly deleterious, likely resulting from promiscuous activities of editing enzymes (Xu and Zhang 2014). However, analyses performed in fruit flies, squids, and octopuses suggest that RNA editing has been subjected to positive Darwinian selection in these lineages, although the selective agents remain unclear (Alon et al. 2015; Duan et al. 2017; Liscovitch-Brauer et al. 2017).

Evolutionary studies of RNA editing of animal nuclear genes have been limited to A-to-I editing, because other types of RNA editing are thought to be rare and hence not amenable to meaningful statistical analysis. A prime example is C-to-U editing, which is predominantly catalyzed by AID (activation-induced cytidine deaminase)/APOBEC (apolipoprotein B mRNA editing enzyme, catalytic polypeptide) proteins, encoded by a family of ten genes (APOBEC1, 2, 3 A–D, 3 F–H, and 4) in the human genome (Smith et al. 2012). Although C-to-U is the second most common RNA editing of animal nuclear genes and has been known for over 30 years (Chen et al. 1987; Powell et al. 1987), only a few examples are well documented in humans (Vu and Tsukahara 2017). Several recent transcriptomic studies of human C-to-U editing, however, have started to change this situation. Specifically, Sharma et al. (2015) identified ∼100 coding sites that are C-to-U edited in M1 macrophages. Furthermore, hypoxia was found to enhance C-to-U RNA editing in monocytes (Baysal et al. 2013), and >100 edited sites in coding regions were detected from hypoxia-induced monocytes (Sharma et al. 2015). Because the relatively high C-to-U editing in macrophages and monocytes is due to the upregulation of cytidine deaminase APOBEC3A (Sharma et al. 2015), Sharma et al. hypothesized that they could identify many editable sites by overexpressing APOBEC3A in any cell line. Indeed, they reported over 2,000 edited coding sites upon overexpression of APOBEC3A in the human embryonic kidney cell line HEK293T (Sharma et al. 2017). Although C-to-U editing in HEK293T is nonnatural because APOBEC3A is not normally expressed in this cell line, at least the identified highly edited sites should represent those that are edited in cell types that normally express APOBEC3A strongly.

Despite the progress in transcriptome-wide identification of C-to-U editing, its function remains unclear. Because C-to-U editing can alter the protein sequence when occurring in coding regions, it provides a means for enhancing proteomic diversity without altering the genome, which could be advantageous when different protein functions are needed in different tissues or developmental times. Furthermore, this means of regulation is thought to be particularly important at evolutionarily conserved sites where mutations altering the protein sequence have obviously been selectively purged perhaps due to pleiotropy (Reenan 2005; Gommans et al. 2009). Analyzing reported human C-to-U editing sites, we here test whether C-to-U editing is generally adaptive, as commonly thought, or is largely nonadaptive, as in the case of human A-to-I editing.

Results

Nonsynonymous Editing Is Less Frequent than Synonymous Editing

We respectively compiled human C-to-U coding RNA editing data from macrophages during the M1 activation phase, monocytes under hypoxia, and HEK293T cells under APOBEC3A overexpression (table 1). APOBEC3A is known to preferentially edit C when it is preceded with a U (Farajollahi and Maas 2010; Sharma et al. 2015, 2017; Kouno et al. 2017; Shi et al. 2017). Indeed, the vast majority (89.8–97.8%) of the edited sites in coding regions are located in the DNA motif TC (edited C is underlined) (table 1). For this reason, we focus on editing events in TC motifs in all analyses.

Table 1.

Data Sets Used in This Study.

Cell TypeConditionNumber of Edited Sites in CDSNumber of Edited Sites in TC Motifs (%)Number of Edited GenesReference
MacrophageM1 polarization9389 (95.7)85Sharma et al. (2015)
MonocyteHypoxia128115 (89.8)109Sharma et al. (2015)
HEK293TAPOBEC3A overexpression2,4552,401 (97.8)1,902Sharma et al. (2017)
Cell TypeConditionNumber of Edited Sites in CDSNumber of Edited Sites in TC Motifs (%)Number of Edited GenesReference
MacrophageM1 polarization9389 (95.7)85Sharma et al. (2015)
MonocyteHypoxia128115 (89.8)109Sharma et al. (2015)
HEK293TAPOBEC3A overexpression2,4552,401 (97.8)1,902Sharma et al. (2017)
Table 1.

Data Sets Used in This Study.

Cell TypeConditionNumber of Edited Sites in CDSNumber of Edited Sites in TC Motifs (%)Number of Edited GenesReference
MacrophageM1 polarization9389 (95.7)85Sharma et al. (2015)
MonocyteHypoxia128115 (89.8)109Sharma et al. (2015)
HEK293TAPOBEC3A overexpression2,4552,401 (97.8)1,902Sharma et al. (2017)
Cell TypeConditionNumber of Edited Sites in CDSNumber of Edited Sites in TC Motifs (%)Number of Edited GenesReference
MacrophageM1 polarization9389 (95.7)85Sharma et al. (2015)
MonocyteHypoxia128115 (89.8)109Sharma et al. (2015)
HEK293TAPOBEC3A overexpression2,4552,401 (97.8)1,902Sharma et al. (2017)

We consider a C-to-U editing in coding regions nonsynonymous if it creates a missense or nonsense change; otherwise it is regarded as synonymous. The adaptive hypothesis of C-to-U editing asserts that nonsynonymous editing is beneficial and hence predicts a higher frequency of nonsynonymous editing than synonymous editing, which is expected to be neutral. Among the 89 TC motifs in which C is edited to U in macrophages, editing is nonsynonymous at n = 26 sites and synonymous at s = 63 sites. In all human protein coding regions, potential C-to-U editing is nonsynonymous at N = 1,171,927 TC motifs and synonymous at S = 1,006,853 TC motifs. Thus, the frequency of nonsynonymous editing observed is fn = n/N = 2.22 × 10−5, whereas the frequency of synonymous editing observed is fs = s/S = 6.26 × 10−5. The difference between fn and fs is statistically significant (P = 5.53 × 10−6, χ2 test; fig. 1A). Using the same method, we calculated fn and fs in the monocyte and HEK293T data sets, respectively. Consistent with the finding from macrophages, fn is significantly lower than fs in both monocytes and HEK293T cells (P = 0.042 and 1.39 × 10−14, respectively; fig. 1A). These results suggest that genotypes with nonsynonymous C-to-U editing often have reduced fitness and are purged by natural selection such that the observed fn is lower than the neutral expectation. We noticed that the fraction of nonsynonymous editing that has been selectively removed (1−fn/fs) is much greater in macrophages (65%) than in monocytes (32%) and HEK293T (27%). This is probably because the condition (M1 activation phase) under which the macrophages were examined is much more physiological than the conditions (hypoxia and APOBEC3A overexpression) under which the other two cell types were respectively examined. Consequently, editing observed in the former has been subjected to natural selection more frequently than editing observed in the latter.

Fig. 1.

Frequencies and editing levels of synonymous and nonsynonymous C-to-U RNA editing in human coding sequences. (A) Frequency of nonsynonymous editing is lower than that of synonymous editing in macrophages, monocytes, and HEK293T cells. P values are from χ2 tests. (B) Nonsynonymous editing levels are lower than synonymous editing levels. In the boxplot, the lower edge and upper edge of a box represent the 25% quartile (q1) and 75% quartile (q3), respectively. The horizontal line inside a box indicates the median (md). The whiskers extend to the most extreme values inside inner fences, md ± 1.5(q3q1). P values are from two-tailed Wilcoxon rank sum tests.

We also examined the fn/fs ratio of individual genes in the three data sets. Because the number of edited sites per gene is very small, many genes show fn > fs simply by chance. However, we found no gene whose fn significantly exceeds fs at the nominal P value of 0.05.

Nonsynonymous Editing Levels Are Lower than Synonymous Editing Levels

It is possible that some nonsynonymous editing is deleterious and purged by natural selection but the remaining nonsynonymous editing which is observed is advantageous. To test this hypothesis, we examined the editing level of each edited site, which is the fraction of RNA molecules that are edited at the site. The adaptive hypothesis predicts that editing level is generally higher for observed nonsynonymous editing than synonymous editing. Contrary to the prediction, we found the median nonsynonymous editing level (Ln) lower than the median synonymous editing level (Ls) in all three cell types (fig. 1B). This difference is statistically significant in macrophages (P = 0.025, two-tailed Wilcoxon rank sum test) but not in the other two cell types, which again is likely owing to fewer opportunities for natural selection on the observed editing in the latter than the former. Regardless, our results indicate that even observed nonsynonymous editing is generally deleterious rather than beneficial such that its editing level is lower than the neutral expectation reflected by the level of synonymous editing.

Editing Frequency Reduces with Gene Importance and Evolutionary Constraint

The above analyses suggest that nonsynonymous C-to-U editing is generally harmful rather than beneficial. Hence, we predict that, relative to fs, fn should decrease as gene importance increases. To verify this prediction, we obtained a list of 183 human essential genes and 14,388 human nonessential genes (see Materials and Methods) and respectively estimated their fn and fs values. In part due to the small sample sizes of the macrophage and monocyte data sets, no nonsynonymous editing is identified in essential genes. We thus decided to focus on the HEK293T data set, from which we estimated fn/fs = 0.32 for essential genes (fig. 2A). To test whether this value is significantly lower than the corresponding value of nonessential genes, we randomly picked 183 nonessential genes to estimate its fn/fs and repeated this process 1,000 times. Only 3 of the 1,000 fn/fs values from nonessential genes are equal to or lower than the observed fn/fs from essential genes (fig. 2A). Hence, fn/fs is significantly lower for essential genes than nonessential genes (P = 0.003).

Fig. 2.

Nonsynonymous C-to-U RNA editing frequency (fn), relative to synonymous editing frequency (fs), reduces with gene importance and evolutionary constraint on the gene. (A) fn/fs is lower in essential genes than in nonessential genes. Open bars show the frequency distribution of fn/fs from 1,000 random sets of nonessential genes with the same number of genes as essential genes. The black arrow indicates the observed fn/fs in essential genes. (B) fn/fs increases with dN/dS, which is computed using human–mouse orthologs. We divided human genes into ten equal-sized bins by dN/dS, and each dot shows the fn/fs and median dN/dS of each bin. Spearman’s correlation coefficient (ρ) and its associated P value are shown.

The hypothesis that nonsynonymous editing is generally harmful further predicts a lower fn/fs ratio for genes that are under stronger evolutionary constraints. The evolutionary constraint on a gene is commonly measured by the ratio between the rate of nonsynonymous nucleotide substitution (dN) and that of synonymous substitution (dS). The lower the dN/dS ratio, the stronger the evolutionary constraint. In other words, we predict a positive correlation between fn/fs and dN/dS. We computed the dN/dS ratio of each human gene by comparing with its one-to-one mouse ortholog. We then divided all genes into ten equal-sized bins based on the dN/dS ratio and computed the median dN/dS ratio of each bin. Analyzing the HEK293T data set, we calculated the fn/fs ratio for each bin and found a significant positive correlation between fn/fs and median dN/dS across bins (Spearman’s correlation ρ = 0.70; P = 0.031; fig. 2B). We noticed that fn/fs >1 for three bins with intermediate dN/dS (fig. 2B), but fn/fs does not significantly exceed 1 in any of these bins (nominal P = 0.402, 0.226, and 0.075, respectively).

We also analyzed the macrophage and monocyte data. Because the number of edited genes is much smaller in each of these data sets, we divided each data set into five bins. Although positive correlations were observed between fn/fs and median dN/dS across bins for both macrophages (ρ = 0.70, P = 0.23) and monocytes (ρ = 0.69, P = 0.21), they were not significant, possibly due to the small sample sizes (table 1).

Evolutionarily Conserved Sites Tend to Have Lower Editing Levels

As mentioned, the adaptive hypothesis of nonsynonymous RNA editing proposes that nonsynonymous editing should preferentially occur at evolutionarily conserved or mutation-intolerant sites because editing is especially less pleiotropic and hence more beneficial than DNA-level changes at such sites (Reenan 2005; Gommans et al. 2009). This hypothesis predicts that 1) evolutionary conservation is greater for sites undergoing nonsynonymous editing than those undergoing synonymous editing; and 2) editing level increases with the evolutionary conservation of the site that undergoes nonsynonymous editing. To verify these predictions, we retrieved 573 protein-coding genes that have one-to-one orthologs in all 43 mammals examined (see Materials and Methods). We aligned these sequences and removed alignment gaps, and identified from the HEK293T data 32 synonymous and 53 nonsynonymous C-to-U editing sites (within TC motifs) in the human sequences of these alignments. Using the method based on Jensen–Shannon divergence (Capra and Singh 2007), we computed an amino acid conservation score for each codon that contains an edited site. As a control, we also identified unedited Cs that are in TC motifs in the human sequences of these alignments, and similarly computed conservation scores. We found no significant difference in conservation between codons with a synonymously edited C and codons with an unedited C that could potentially be synonymously edited (fig. 3A), suggesting that the occurrence of synonymous editing is unbiased with regard to the conservation of a codon. We similarly found no significant difference in conservation between codons with a nonsynonymously edited C and codons with an unedited C that could potentially be nonsynonymously edited (fig. 3B), suggesting that the occurrence of nonsynonymous editing is also unbiased with regard to the conservation of a codon. Hence, our findings provide no support to the adaptive hypothesis, although they may be attributable to a lack of statistical power.

Fig. 3.

Relationship between editing and evolutionary conservation. (A) Codons with a synonymously edited C and those with an unedited C that is potentially synonymously editable have similar evolutionary conservations at the amino acid level. (B) Codons with a nonsynonymously edited C and those with an unedited C that is potentially nonsynonymously editable have similar evolutionary conservations at the amino acid level. In the boxplot of (A) and (B), the lower edge and upper edge of a box represent the 25% quartile (q1) and 75% quartile (q3), respectively. The horizontal line inside a box indicates the median (md). The whiskers extend to the most extreme values inside inner fences, md ± 1.5(q3 q1). P values are from Wilcoxon rank sum tests. (C) Synonymous editing level at a site is not significantly correlated with the evolutionary conservation (at the amino acid level) of the codon encompassing the site. Each dot is an edited site. (D) Nonsynonymous editing level at a site is significantly negatively correlated with the evolutionary conservation (at the amino acid level) of the codon encompassing the site. In (C) and (D), Spearman’s correlation coefficient (ρ) and its associated P value are shown.

Next, we correlated the conservation score and editing level for codons undergoing synonymous editing, and observed no significant correlation (ρ = 0.13, P = 0.48; fig. 3C). By contrast, a significant negative correlation was observed among sites undergoing nonsynonymous editing (ρ = −0.28, P = 0.039; fig. 3D). Hence, conserved sites tend to have reduced levels of nonsynonymous but not synonymous editing, which is opposite to the prediction of the adaptive hypothesis, but is consistent with the hypothesis that most nonsynonymous editing is deleterious.

Discussion

In this work, we performed the first transcriptome-wide test of the adaptive hypothesis of C-to-U editing of mRNAs originating from animal nuclear genomes. We found the frequency of nonsynonymous editing lower than that of synonymous editing and the editing levels of nonsynonymous editing lower than those of synonymous editing. These results contrast predictions of the adaptive hypothesis. Instead, they suggest that most C-to-U RNA editing of coding sequences is nonadaptive, either neutral or deleterious. Consistent with the nonadaptive hypothesis, nonsynonymous editing, relative to synonymous editing, is rarer in essential genes and genes under relatively strong evolutionary constraints than in nonessential genes and genes under relatively weak constraints. Furthermore, we found no evidence for the conjecture that nonsynonymous editing is enriched at evolutionarily conserved sites. Instead, nonsynonymous editing levels tend to be lowered at evolutionarily conserved sites.

Because C-to-U coding RNA editing in HEK293T was detected only upon the artificial overexpression of the editing enzyme APOBEC3A, why did we observe signals of purifying selection against editing in this cell line? One possibility is that RNA editing is stochastic such that some cells have more nonsynonymous editing than other cells in a population of HEK293T cells. If cells having more nonsynonymous editing grows slower than cells having less nonsynonymous editing, fn/fs < 1 may be observed. Alternatively and more likely, what we observed in HEK293T reflects natural selection acting in tissues/cells where APOBEC3A is naturally highly expressed. Under this scenario, genes frequently coexpressing with APOBEC3A across human tissues should have been subjected to the strongest selection against editing. Hence, we predict that these genes are underrepresented among those edited in HEK293T upon APOBEC3A overexpression. To verify this prediction, we downloaded gene expression data of 53 human tissues from GTEx (www.gtexportal.org/). Using the R package WGCNA (Langfelder and Horvath 2008), we found from the GTEx data that 123 genes are located in the same coexpression module as APOBEC3A. Interestingly, none of these 123 genes are edited in the HEK293T data. By contrast, among the 19, 353 genes that do not share the coexpression module with APOBEC3A, 1872 are edited. Hence, genes that frequently coexpress with APOBEC3A across human tissues tend not to be edited upon APOBEC3A overexpression (P = 0.0005, χ2 test), further supporting that C-to-U RNA editing is selected against in humans.

The generally nonadaptive nature of C-to-U coding RNA editing in humans begs for an explanation of its existence and prevalence. It is notable that nearly all APOBEC family members appear to inhibit retroviruses by the deamination of cytosines in nascent retroviral cDNAs, which is a powerful mechanism of innate immunity (Harris and Liddament 2004). The mechanism by which APOBEC3G restricts retroviral infection is best understood (Harris et al. 2003; Lecossier et al. 2003; Mangeat et al. 2003; Zhang et al. 2003). Briefly, when retroviruses enter host cells and generate the minus strand of DNA from reverse transcription of their RNA genomes, APOBEC3G converts Cs to Us in the DNA. Us are not supposed to exist in DNA and thus are excised by host DNA repair enzymes. However, these Us cannot be completely removed, resulting in mutations in newly synthesized viral RNA genomes that will reduce the viral fitness (Harris and Liddament 2004). This consideration suggests the possibility that the natural substrate of APOBEC3G is viral cDNA and that C-to-U RNA editing originates from its off-target activity (Sharma et al. 2016). Similarly, APOBEC3A is implicated in the inhibition of retroviruses, including human immunodeficiency virus HIV-1, human T-lymphotropic virus HTLV-1, human papillomavirus HPV, parvoviruses, and hepatitis B (Chen et al. 2006; Narvaiza et al. 2009; Berger et al. 2011; Sharma et al. 2017); it is likely that the viral cDNA is the natural substrate of APOBEC3A. This hypothesis is consistent with the high expression of APOBEC3A in macrophages and monocytes, which play important immune functions such as inhibiting HIV (Koppensteiner et al. 2012). In other words, the observed C-to-U RNA editing probably results from promiscuous targeting by APOBECs and was at least initially a deleterious byproduct of the beneficial functions of their antiviral activities. Note that, in the presence of the C-to-U RNA editing activity, a genomic site in a coding region that otherwise accepts T only can now accept C, because C can be converted to U at the RNA level. This may lead to the fixation of T-to-C mutations at a number of such sites. As a result, the RNA editing activity can no longer be removed, because it is impossible to reverse simultaneously all these Cs to Ts in the genome through evolution. Although there is little evidence that C-to-U RNA editing is beneficial, the above scenario explains why C-to-U editing is evolutionarily maintained (Covello and Gray 1993) and suggests the possible existence of sites where the editing is required for normal cellular function. But such sites, even if they are found, should not be automatically regarded as adaptations, because, at least in the above model, the present-day fitness is not higher than that before the origin of C-to-U RNA editing. Bona fide adaptive C-to-U RNA editing, reserved for beneficial tissue- or time-specific regulation of protein function, may be a minority among the already small proportion of editing that potentially plays physiological roles.

Our results on human C-to-U RNA editing are highly similar to previous findings of human A-to-I coding RNA editing (Xu and Zhang 2014) in that both are generally nonadaptive and likely reflect the off-target activities of respective editing enzymes. In addition, we discovered recently that most events of m6A modification (methylation of the adenosine base at the nitrogen-6 position), a highly abundant internal modification of mRNA molecules, are nonadaptive (Liu and Zhang 2018). These similarities suggest a general trend of noisy posttranscriptional modification that requires us to rethink the functionality of such modifications. Specifically, it may be futile to assume that most events of any posttranscriptional modification are functional and search for their functions. Rather, searching for the function of a modification event only when there is indication that the modification is beneficial or evolutionarily conserved (Xu and Zhang 2015; Liu and Zhang 2018) may be a more productive approach.

Note, however, that our analysis focused on C-to-U RNA editing in coding sequences, whereas such editing is also known to occur in noncoding sequences, especially in 3′-untranslated regions (3′-UTR) (Blanc et al. 2014; Harjanto et al. 2016). It is possible that RNA editing at microRNA target sites in 3′-UTRs affects mRNA stability (Gu et al. 2012; Zhang et al. 2017), although direct evidence is currently lacking. In the future, it will be interesting to investigate the biological significance of C-to-U RNA editing in noncoding regions. Furthermore, because the C-to-U RNA editing investigated in our study is primarily due to the activity of APOBEC3A, it will be important to verify if our findings apply to editing by other cytidine deaminases.

Materials and Methods

C-to-U editing sites in human coding regions were obtained from three publicly available data sets (Sharma et al. 2015,, 2017). Specifically, edited sites in the first data set were identified by comparing whole transcriptomes of M1 and M2 macrophages that were generated in vitro from peripheral blood monocytes (Beyer et al. 2012; Sharma et al. 2015). Among different macrophage activation phases induced by IFNγ, C-to-U RNA editing is most prevalent in M1 and is significantly lowered in M2 (Sharma et al. 2015). A total of 93 sites in coding regions with a minimal editing level of 5% were considered to be edited. Edited sites in the second data set were identified from sequenced transcriptomes of monocytes under hypoxia (Sharma et al. 2015), and 128 sites in coding regions with a minimal editing level of 5% were considered to be edited. The third data set was derived from the transcriptome of HEK293T cells with APOBEC3A overexpressed, and 2,455 sites with a minimal editing level of 15% were considered to be edited (Sharma et al. 2017). All three data sets provided the chromosomal location, strand, flanking sequences, synonymous or nonsynonymous annotation, and editing level for each edited site. A total of 22, 013 human protein coding genes were downloaded from Ensembl (GRCh38.p10; Ensembl Genes 90). If a gene has several isoforms, only the coding sequence corresponding to the longest isoform was included in our analyses.

From each coding sequence, we first isolated all TC motifs using in-house Perl scripts and then examined the number of potential synonymous editing sites, which would lead to a synonymous change if edited from C to U, and the number of potential nonsynonymous editing sites, which would cause a nonsynonymous change if edited from C to U. All dN/dS values between human and mouse orthologous genes were retrieved from Ensembl using BioMart (http://www.ensembl.org/biomart/martview/; last accessed January 30, 2018). A total of 183 essential and 14, 388 nonessential genes from humans were obtained from the OGEE database (Chen et al. 2017). We discarded conditional essential genes, which are essential in some but not all conditions. To calculate the conservation score of each amino acid position, we retrieved 574 aligned one-to-one orthologous genes of 43 mammals from the database OrthoMaM (Douzery et al. 2014). We calculated the evolutionary conservation score of each amino acid position using the method of Jensen–Shannon divergence (Capra and Singh 2007).

Acknowledgments

We thank members of the Zhang lab and two anonymous reviewers for valuable comments. This work was supported in part by a research grant from the U.S. National Institutes of Health (R01GM120093) to J.Z. Z.L. was supported by China Scholarship Council (201604910442).

References

Alon
S
,
Garrett
SC
,
Levanon
EY
,
Olson
S
,
Graveley
BR
,
Rosenthal
JJC
,
Eisenberg
E.
2015
.
The majority of transcripts in the squid nervous system are extensively recoded by A-to-I RNA editing
.
eLife
4
:
e05198.

Baysal
BE
,
De Jong
K
,
Liu
B
,
Wang
J
,
Patnaik
SK
,
Wallace
PK
,
Taggart
RT.
2013
.
Hypoxia-inducible C-to-U coding RNA editing downregulates SDHB in monocytes
.
Peer J.
1
:
e152.

Bazak
L
,
Haviv
A
,
Barak
M
,
Jacob-Hirsch
J
,
Deng
P
,
Zhang
R
,
Isaacs
FJ
,
Rechavi
G
,
Li
JB
,
Eisenberg
E
et al. ,
2014
.
A-to-I RNA editing occurs at over a hundred million genomic sites, located in a majority of human genes
.
Genome Res.
24
3
:
365
376
.

Berger
G
,
Durand
S
,
Fargier
G
,
Nguyen
XN
,
Cordeil
S
,
Bouaziz
S
,
Muriaux
D
,
Darlix
JL
,
Cimarelli
A.
2011
.
APOBEC3A is a specific inhibitor of the early phases of HIV-1 infection in myeloid cells
.
PLoS Pathog.
7
9
:
e1002221.

Beyer
M
,
Mallmann
MR
,
Xue
J
,
Staratschek-Jox
A
,
Vorholt
D
,
Krebs
W
,
Sommer
D
,
Sander
J
,
Mertens
C
,
Nino-Castro
A
et al. ,
2012
.
High-resolution transcriptome of human macrophages
.
PLoS One
7
9
:
e45466
.

Blanc
V
,
Park
E
,
Schaefer
S
,
Miller
M
,
Lin
Y
,
Kennedy
S
,
Billing
AM
,
Ben Hamidane
H
,
Graumann
J
,
Mortazavi
A
et al. ,
2014
.
Genome-wide identification and functional analysis of Apobec-1-mediated C-to-U RNA editing in mouse small intestine and liver
.
Genome Biol.
15
6
:
R79
.

Burns
CM
,
Chu
H
,
Rueter
SM
,
Hutchinson
LK
,
Canton
H
,
Sanders-Bush
E
,
Emeson
RB.
1997
.
Regulation of serotonin-2C receptor G-protein coupling by RNA editing
.
Nature
387
6630
:
303
308
.

Capra
JA
,
Singh
M.
2007
.
Predicting functionally important residues from sequence conservation
.
Bioinformatics
23
15
:
1875
1882
.

Chen
H
,
Lilley
CE
,
Yu
Q
,
Lee
DV
,
Chou
J
,
Narvaiza
I
,
Landau
NR
,
Weitzman
MD.
2006
.
APOBEC3A is a potent inhibitor of adeno-associated virus and retrotransposons
.
Curr Biol.
16
5
:
480
485
.

Chen
SH
,
Habib
G
,
Yang
CY
,
Gu
ZW
,
Lee
BR
,
Weng
SA
,
Silberman
SR
,
Cai
SJ
,
Deslypere
JP
,
Rosseneu
M.
1987
.
Apolipoprotein B-48 is the product of a messenger-RNA with an organ-specific in-frame stop codon
.
Science
238
4825
:
363
366
.

Chen
WH
,
Lu
G
,
Chen
X
,
Zhao
XM
,
Bork
P.
2017
.
OGEE v2: an update of the online gene essentiality database with special focus on differentially essential genes in human cancer cell lines
.
Nucleic Acids Res.
45
(
D1
):
D940
D944
.

Covello
PS
,
Gray
MW.
1993
.
On the evolution of RNA editing
.
Trends Genet.
9
8
:
265
268
.

Douzery
EJ
,
Scornavacca
C
,
Romiguier
J
,
Belkhir
K
,
Galtier
N
,
Delsuc
F
,
Ranwez
V.
2014
.
OrthoMaM v8: a database of orthologous exons and coding sequences for comparative genomics in mammals
.
Mol Biol Evol.
31
7
:
1923
1928
.

Duan
Y
,
Dou
S
,
Luo
S
,
Zhang
H
,
Lu
J.
2017
.
Adaptation of A-to-I RNA editing in Drosophila
.
PLoS Genet.
13
3
:
e1006648.

Farajollahi
S
,
Maas
S.
2010
.
Molecular diversity through RNA editing: a balancing act
.
Trends Genet.
26
5
:
221
230
.

Garrett
S
,
Rosenthal
JJ.
2012
.
RNA editing underlies temperature adaptation in K+ channels from polar octopuses
.
Science
335
6070
:
848
851
.

Gommans
WM
,
Mullen
SP
,
Maas
S.
2009
.
RNA editing: a driving force for adaptive evolution?
BioEssays
31
10
:
1137
1145
.

Gu
T
,
Buaas
FW
,
Simons
AK
,
Ackert-Bicknell
CL
,
Braun
RE
,
Hibbs
MA.
2012
.
Canonical A-to-I and C-to-U RNA editing is enriched at 3′UTRs and microRNA target sites in multiple mouse tissues
.
PLoS One
7
3
:
e33720.

Harjanto
D
,
Papamarkou
T
,
Oates
CJ
,
Rayon-Estrada
V
,
Papavasiliou
FN
,
Papavasiliou
A.
2016
.
RNA editing generates cellular subsets with diverse sequence within populations
.
Nat Commun.
7
:
12145.

Harris
RS
,
Bishop
KN
,
Sheehy
AM
,
Craig
HM
,
Petersen-Mahrt
SK
,
Watt
IN
,
Neuberger
MS
,
Malim
MH.
2003
.
DNA deamination mediates innate immunity to retroviral infection
.
Cell
113
6
:
803
809
.

Harris
RS
,
Liddament
MT.
2004
.
Retroviral restriction by APOBEC proteins
.
Nat Rev Immunol.
4
11
:
868
877
.

Koppensteiner
H
,
Brack-Werner
R
,
Schindler
M.
2012
.
Macrophages and their relevance in human immunodeficiency virus type I infection
.
Retrovirology
9
:
82
.

Kouno
T
,
Silvas
TV
,
Hilbert
BJ
,
Shandilya
SMD
,
Bohn
MF
,
Kelch
BA
,
Royer
WE
,
Somasundaran
M
,
Kurt Yilmaz
N
,
Matsuo
H
et al. ,
2017
.
Crystal structure of APOBEC3A bound to single-stranded DNA reveals structural basis for cytidine deamination and specificity
.
Nat Commun.
8
:
15024.

Langfelder
P
,
Horvath
S.
2008
.
WGCNA: an R package for weighted correlation network Analysis
.
BMC Bioinformatics
9
:
559
.

Lecossier
D
,
Bouchonnet
F
,
Clavel
F
,
Hance
AJ.
2003
.
Hypermutation of HIV-1 DNA in the absence of the Vif protein
.
Science
300
5622
:
1112.

Li
JB
,
Levanon
EY
,
Yoon
JK
,
Aach
J
,
Xie
B
,
LeProust
E
,
Zhang
K
,
Gao
Y
,
Church
GM.
2009
.
Genome-wide identification of human RNA editing sites by parallel DNA capturing and sequencing
.
Science
324
5931
:
1210
1213
.

Liscovitch-Brauer
N
,
Alon
S
,
Porath
HT
,
Elstein
B
,
Unger
R
,
Ziv
T
,
Admon
A
,
Levanon
EY
,
Rosenthal
JJ
,
Eisenberg
E.
2017
.
Trade-off between transcriptome plasticity and genome evolution in cephalopods
.
Cell
169
2
:
191
202 e111
.

Liu
Z
,
Zhang
J.
2018
.
Most m6A RNA modifications in protein-coding regions are evolutionarily unconserved and likely nonfunctional
.
Mol Biol Evol
.
35
3
:
666
675
.

Lomeli
H
,
Mosbacher
J
,
Melcher
T
,
Hoger
T
,
Geiger
JR
,
Kuner
T
,
Monyer
H
,
Higuchi
M
,
Bach
A
,
Seeburg
PH.
1994
.
Control of kinetic properties of AMPA receptor channels by nuclear RNA editing
.
Science
266
5191
:
1709
1713
.

Mangeat
B
,
Turelli
P
,
Caron
G
,
Friedli
M
,
Perrin
L
,
Trono
D.
2003
.
Broad antiretroviral defence by human APOBEC3G through lethal editing of nascent reverse transcripts
.
Nature
424
6944
:
99
103
.

Narvaiza
I
,
Linfesty
DC
,
Greener
BN
,
Hakata
Y
,
Pintel
DJ
,
Logue
E
,
Landau
NR
,
Weitzman
MD.
2009
.
Deaminase-independent inhibition of parvoviruses by the APOBEC3A cytidine deaminase
.
PLoS Pathog.
5
5
:
e1000439.

Nishikura
K.
2006
.
Editor meets silencer: crosstalk between RNA editing and RNA interference
.
Nat Rev Mol Cell Biol.
7
12
:
919
931
.

Nishikura
K.
2016
.
A-to-I editing of coding and non-coding RNAs by ADARs
.
Nat Rev Mol Cell Biol
.
17
2
:
83
96
.

Peng
Z
,
Cheng
Y
,
Tan
BC
,
Kang
L
,
Tian
Z
,
Zhu
Y
,
Zhang
W
,
Liang
Y
,
Hu
X
,
Tan
X
et al. ,
2012
.
Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome
.
Nat Biotechnol.
30
3
:
253
260
.

Powell
LM
,
Wallis
SC
,
Pease
RJ
,
Edwards
YH
,
Knott
TJ
,
Scott
J.
1987
.
A novel form of tissue-specific RNA processing produces apolipoprotein-B48 in intestine
.
Cell
50
6
:
831
840
.

Reenan
RA.
2005
.
Molecular determinants and guided evolution of species-specific RNA editing
.
Nature
434
7031
:
409
413
.

Seeburg
PH
,
Hartner
J.
2003
.
Regulation of ion channel/neurotransmitter receptor function by RNA editing
.
Curr Opin Neurobiol.
13
3
:
279
283
.

Sharma
S
,
Patnaik
SK
,
Kemer
Z
,
Baysal
BE.
2017
.
Transient overexpression of exogenous APOBEC3A causes C-to-U RNA editing of thousands of genes
.
RNA Biol.
14
5
:
603
610
.

Sharma
S
,
Patnaik
SK
,
Taggart
RT
,
Baysal
BE.
2016
.
The double-domain cytidine deaminase APOBEC3G is a cellular site-specific RNA editing enzyme
.
Sci Rep.
6
:
39100.

Sharma
S
,
Patnaik
SK
,
Taggart
RT
,
Kannisto
ED
,
Enriquez
SM
,
Gollnick
P
,
Baysal
BE.
2015
.
APOBEC3A cytidine deaminase induces RNA editing in monocytes and macrophages
.
Nat Commun.
6
:
6881.

Shi
K
,
Carpenter
MA
,
Banerjee
S
,
Shaban
NM
,
Kurahashi
K
,
Salamango
DJ
,
McCann
JL
,
Starrett
GJ
,
Duffy
JV
,
Demir
O
et al. ,
2017
.
Structural basis for targeted DNA cytosine deamination and mutagenesis by APOBEC3A and APOBEC3B
.
Nat Struct Mol Biol.
24
2
:
131
139
.

Smith
HC
,
Bennett
RP
,
Kizilyer
A
,
McDougall
WM
,
Prohaska
KM.
2012
.
Functions and regulation of the APOBEC family of proteins
.
Semin Cell Dev Biol.
23
3
:
258
268
.

Vu
LT
,
Tsukahara
T.
2017
.
C-to-U editing and site-directed RNA editing for the correction of genetic mutations
.
Biosci Trends
11
3
:
243
253
.

Xu
G
,
Zhang
J.
2014
.
Human coding RNA editing is generally nonadaptive
.
Proc Natl Acad Sci U S A.
111
10
:
3769
3774
.

Xu
G
,
Zhang
J.
2015
.
In search of beneficial coding RNA editing
.
Mol Biol Evol.
32
2
:
536
541
.

Zhang
H
,
Yang
B
,
Pomerantz
RJ
,
Zhang
C
,
Arunachalam
SC
,
Gao
L.
2003
.
The cytidine deaminase CEM15 induces hypermutation in newly synthesized HIV-1 DNA
.
Nature
424
6944
:
94
98
.

Zhang
R
,
Deng
P
,
Jacobson
D
,
Li
JB.
2017
.
Evolutionary analysis reveals regulatory and functional landscape of coding and non-coding RNA editing
.
PLoS Genet.
13
2
:
e1006563.

Author notes

Associate editor: Meredith Yeager

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/about_us/legal/notices)