-
PDF
- Split View
-
Views
-
Cite
Cite
Emanuele Buratti, Marco Baralle, Francisco E. Baralle, Defective splicing, disease and therapy: searching for master checkpoints in exon definition, Nucleic Acids Research, Volume 34, Issue 12, 1 June 2006, Pages 3494–3510, https://doi.org/10.1093/nar/gkl498
Close - Share Icon Share
Abstract
The number of aberrant splicing processes causing human disease is growing exponentially and many recent studies have uncovered some aspects of the unexpectedly complex network of interactions involved in these dysfunctions. As a consequence, our knowledge of the various cis - and trans -acting factors playing a role on both normal and aberrant splicing pathways has been enhanced greatly. However, the resulting information explosion has also uncovered the fact that many splicing systems are not easy to model. In fact we are still unable, with certainty, to predict the outcome of a given genomic variation. Nonetheless, in the midst of all this complexity some hard won lessons have been learned and in this survey we will focus on the importance of the wide sequence context when trying to understand why apparently similar mutations can give rise to different effects. The examples discussed in this summary will highlight the fine ‘balance of power’ that is often present between all the various regulatory elements that define exon boundaries. In the final part, we shall then discuss possible therapeutic targets and strategies to rescue genetic defects of complex splicing systems.
INTRODUCTION
The study of the connections between defective splicing and occurrence of disease has become a central issue in the medical research field ( 1 – 7 ). This is a consequence of the growing list of pathological alterations that can be directly linked with aberrant splicing processes. In fact, for many genes, genomic variations which affect the splicing process may represent up to 50% of all mutations that lead to gene dysfunction ( 8 , 9 ). This awareness has also led to the critical reappraisal of many past observations, and it is now clear that substitutions which had for a long time been regarded as harmless synonymous changes in protein coding regions may have some very severe consequences on splicing processes, and thus on the appearance of disease ( 10 ).
The underlying biological reason for such an exuberant growth in the number of diseases derived from splicing derangements is due not only on the intrinsic properties of the exon itself (presence of regulatory elements within the exon, strength of the splice sites etc.) but also on the wider environment within which the exon is found, such as the presence of regulatory elements dispersed throughout the gene at much longer distances ( 11 – 14 ), the splicing events that have taken place before the exon of interest ( 15 , 16 ) or the type of promoter architecture and the consequent polymerase II processivity ( 17 ). This means that splicing spoilers can occur virtually everywhere within any given gene and many of the novel types of mutations identified have still poorly characterized functional mechanisms ( 18 ). Furthermore, alternative splicing processes ( 19 ) are responsible for increasing the functional diversity of the human genome ( 20 – 24 ). Therefore, it is very likely that variations anywhere in the genome may alter the delicate modulation of the splicing process and produce pathological alterations ( 3 ).
Numerous methodological developments have also aided researchers recently in the task of building connections between splicing and disease. For example, the refinement of minigene-based technologies for alternative splicing analysis initially described 20 years ago ( 25 ) has allowed a relatively fast approach to identify splicing spoilers and to study their underlying functional mechanisms ( 26 , 27 ). In addition, a great boost for the preliminary identification of general splicing regulatory sequences (SREs) has been the development of SELEX methodologies to characterize binding specificities of trans -acting factors involved in splicing regulation ( 28 , 29 ).
Last but not least, scientific interest in this subject has also been steadily growing as potential applications in the molecular therapy field have begun to be appreciated and have started to be tested, such as the recent advances in the use of bi-functional or inhibitory oligonucleotides in rescuing aberrant splicing events ( 3 , 30 – 35 ), in the engineering of specific compounds by chemical synthesis to target splicing factor protein families ( 36 – 38 ), or in the development of novel vectors based on U7snRNA to rescue protein functionality by exon skipping ( 39 ).
The fact that even single nucleotide changes in a pre-mRNA sequence away from the canonical AG/GU nucleotides at the splice junctions are ever more frequently been described as capable of affecting the splicing process ( 40 ) was rather unexpected. Indeed, computational analyses confirm that when most of the main determinants of exon definition (3′ss, 5′ ss, branch site and pY tract) are close to the consensus minimal variations may be possible without impairing recognition by the spliceosome ( 41 ). This conclusion is also consistent with the rather loose sequence-specificities of many essential splicing factors. For example, in vitro studies have shown that a minimal amount of all the potential base pairings between the 5′ splice site and U1snRNA is sufficient for its correct recognition by U1snRNP ( 42 ). Moreover, even in the absence of the 5′ tail of U1snRNA, the U1snRNP particle is capable of selecting a 5′ss-like sequence probably through the U1-C subunit ( 43 ). Finally, the observation that over-expression of SR proteins could compensate for absence of U1snRNP in vitro ( 44 , 45 ) provides support to the view that the splicing process can be precise even in the absence of key participants.
Reality, however, has painted a much more complex picture, with the micro-environment having an important role in the ability of a splice site to tolerate a nucleotide transition ( 46 ). Furthermore, microarray technology and comparative expressed sequence tags analyses are now vividly depicting a world of widespread differences in alternative splicing levels, both within normal tissues or following pathological alterations ( 47 – 49 ). For this reason, particular effort has been devoted recently to characterize splicing abnormalities in cell transformation and cancer ( 50 , 51 ) as they may provide an additional therapeutic target for this kind of diseases.
The fact that splicing regulation appears to be a linked network ( 52 – 56 ) means that researchers and clinicians have to consider as a potential splicing mutation virtually every substitution that is encountered within a pre-mRNA sequence ( 18 ). The recent evidences that many splicing factors may also be actively involved in other important cellular processes, such as genome stability ( 57 ) or mRNA transport and translation ( 58 – 61 ), have just begun to unravel all the connections of the RNA processing system ( 62 , 63 ). In fact, rather than becoming simplified, splicing regulation in the new millenium could be best characterized as right on the edge of chaos. Figure 1 shows a simplified view of how the number of accepted players and functions have grown in recent years. It is for this reason that the connotation of ‘difficult to characterize’ is increasingly associated with many new splicing mutations.
The following observations exemplify the problems that the molecular machinery has to resolve to obtain proper splicing:
Potential exon boundaries are particularly abundant in pre-mRNA molecules. For example, in the human hprt gene there are ∼10 times more consensus splice site than those effectively recognized by the splicing machinery ( 64 ),
The splicing process occurs in a background of repression, as random human genomic sequences display a higher than average splicing inhibitory activity when tested in minigene systems ( 65 – 67 ),
Positive or negative SREs are abundant in all pre-mRNAs, both within introns and exons regardless if they are constitutively or alternatively spliced ( 67 – 69 ).
All the processes that deal with an RNA life cycle including transcription, splicing, transport, translation and degradation are no longer considered as separated from each other but rather, in line with the literal translation of the Latin word ‘ complexus ’, appear to be functionally entwined/twisted together ( 54 , 70 , 71 ). Indeed, just how close relationships within the spliceosome can grow is clearly highlighted by the recent demonstration of extensive molecular tethering between the pre-mRNA being processed and the splicing machinery, allowing processing to occur correctly even when the nucleotide chain of the pre-mRNA molecule is interrupted ( 72 ).
As a consequence, the sequence and genomic ‘context’ in which mutations are observed to occur are essential to explain the sometimes widely different outcomes of similar substitutions in different genes ( 18 , 55 ). Several excellent articles have reviewed the individual connections between aberrant splicing and disease in recent years ( 1 – 7 ). Our purpose will be that of using selected examples to highlight the importance of genomic and local context when investigating splicing outcomes. For example, the inactivation of donor site (5′ss) sequences produce a great variety of splicing outcomes that vividly depicts the mind boggling variety of exon/intron definition mechanisms.
WHAT MAY HAPPEN WHEN A DONOR SITE IS INACTIVATED?
The recognition of the donor site is one of the key events of the splicing process ( 73 ), and many of its peculiarities are still under investigation ( 42 , 43 , 74 – 76 ). Mutations that destroy natural donor sequences are usually observed to cause the skipping of their associated exon ( 77 ). However, in a considerable number of cases additional events can also take place, which include cryptic splice site activation and full intron inclusion. Now, whether any of these events takes place or not is often dependent on the individual context in which the substitution occurs. Sometimes evaluating this context may be fairly straightforward; however, in some cases it may also involve long-range genomic context interactions or be dependent on some kind of ‘priming’ effect to the splicing machinery by events happening upstream of the exon in question (see below).
Cryptic splice site activation
Activation of cryptic splice sites is probably the most prominent outcome (after exon skipping) that follows the inactivation of the original donor site. In fact, cryptic splice site activation in the β-globin gene was one of the first medically relevant splicing defects described ( 78 , 79 ). The usage of alternative splice sites following the inactivation of the proper site often leads to a frameshift in the protein coding sequence. It should be noted, however, that cryptic splice site sequences do not simply represent a harmful background to normal splicing pathways. Rather, it is now increasingly clear that they have played a very important role in the still very much debated evolutionary mechanisms of intron gain/loss ( 80 ) and recent work has suggested that in some cases they act as preferential sites for intron insertion ( 81 , 82 ).
Following the disruption of the normal donor site it would be convenient to predict beforehand which sites (if any) become activated by simply looking at all the nearby sequences that match the consensus. In practice, however, there is not always a straight correlation between the chosen cryptic site and the calculated strength of the potential sites in the region ( 83 ). Contemporary studies on donor ( 83 ) and acceptor sites ( 84 ) have established that local context features may help the splicing machinery to actively discriminate against the usage of cryptic sites and in favor of natural 5′ and 3′ss. These local context features may play a role in cryptic splice site usage after disruption of the natural site.
For example, SR protein levels have been long known to affect relative cryptic splice site usage in the human β-globin gene both in vitro and in vivo ( 85 , 86 ). Now, two recent reports have demonstrated a critical role of SR-binding sequences on cryptic splice site selection: in the E1α Pyruvate Dehydrogenase pre-mRNA a single point mutation away from the natural ss strengthens a weak, pre-existing SC35-binding site and has the effect of shifting donor site usage to a previously silent cryptic site ( 87 ) ( Figure 2A ). This mutation has an important biological impact and is linked with mental retardation. In the second case, a strong SF2/ASF-binding region in fibrinogen exon 7 which is not normally needed for constitutive inclusion of this exon in the mature mRNA has the ability to determine an increase in the relative usage levels of a particular cryptic donor site when the natural donor site is inactivated by a IVS7 + 1G > U mutation ( 88 ). Interestingly, changing the local context by swapping the native SF2/ASF-binding sequence with a different SR-binding ESE sequence from the fibronectin gene ( fn ESE) has the ability to change the relative cryptic site usage ratios with respect to the natural fibrinogen sequence ( Figure 2B ).
Finally, an additional general factor that has been described to affect cryptic splice site usage is represented by RNA secondary structure ( 12 – 14 , 89 ). For example, in the early region 3 of adenovirus type 2 a stem–loop structure allows the correct and exclusive usage of its D1 natural donor site rather than a cryptic donor site (Dcr1) located 74 nt away ( 90 , 91 ) ( Figure 2C ). Disruption of this structure results in a shift towards the Dcr1 site.
Intron retention
An intron retention event is identified as the preservation of an entire intronic sequence within some forms of the final processed mRNA. Using this criteria, the frequency of normal intron retention events in the human genome has been estimated recently to be ∼15% in a set of >21 000 annotated genes ( 92 ). Most of these events naturally occur in the untranslated regions of the pre-mRNA where the rate of retention is observed to be >10-fold greater with respect to protein coding sequence ( 92 ). The biological role of most of these events is currently unknown; however, retained introns within the coding region are significantly shorter than non-retained introns ( 92 , 93 ). Furthermore, their sequences are under selective pressure for coding potential ( 92 ), suggesting a potentially regulatory role in protein functions. In recent years, the biological significance of some physiological intron retention events has also been clearly established in several cases such as the generation of the P element and Msl2 transcripts in Drosophila ( 94 , 95 ) or in the developmental regulation of the pro insulin messenger RNA in chicken embryos ( 96 ).
Most importantly, aberrant intron retention events have also been shown to be associated with several pathological splicing defects. For example, in the Ret tyrosine kinase gene ( 97 ) where it has been connected with the presence of pheocromocytomas, in the HERG gene where it has been identified as being responsible for the development of a long QT syndrome (LQT2 phenotype) in an individual which carried a +6T > C mutation in the donor site of intron 7 of this gene ( 98 ) and in a patient affected by Leigh syndrome where the retention of intron 3 in the SURF1 gene follows a single-nucleotide mutation in its donor site ( 99 ). As similar mutations in other contexts cause exon skipping it is highly probable that local context features either in the retained intron itself or in the flanking exonic sequences determine this particular outcome. At the moment, very little is known concerning this issue and it is to be hoped that future studies will target these particular issues.
An interesting phenomenon that also deals with intron retention/recognition has been observed by analyzing the last exon of the TPO gene. In 26% of the TPO transcripts a 116 n sequence located in the last exon is removed and hence treated as an ‘alternative intron’ ( Figure 3A , upper panel) ( 15 ). Strikingly, this short intron behaves as such only if there has been an upstream splicing event. In fact, if the cDNA of the TPO gene is transfected in Hep3B cells removal of the 116 nt intron is completely abolished ( Figure 3A , lower panel). The recognition of the small intron 5′ and 3′ss within TPO 's last exon seems to require spliceosome assembly on a previous intron, as the effect is obtained not only with the intact intron structure of the TPO gene but also by inserting any single intron in the cDNA sequence ( Figure 3B ). Although the mechanism responsible for these coordinated intron recognition events is currently unknown, it is clear that some kind of ‘priming’ event from the splicing machinery may be required for the processing of this intron. Intriguingly, a somewhat analogous mechanism, this time on exon recognition, has been proposed recently to exist in the coordination of splicing between two independent alternatively spliced regions within a fibronectin minigene ( 16 ) ( Figure 3C ). In this case, mutations that inhibit inclusion of the proximal exon (pEDA) correlate directly with inclusion levels of the distal exon (dEDA). Furthermore, existence of such a phenomenon has also been described in vivo in a mouse knock-in model ( 100 ) between two different alternatively spliced regions of the fibronectin gene (the EDA exon and the V region) ( 16 ).
These examples, together with the recent description of connections between chromatin factors ( 101 ), promoter usage and transcript elongation on the splicing process ( 17 , 102 ) clearly highlight how difficult it is to separate the functional effect of a given nucleotide variation from its wide genomic context, and further examples of connections between the upstream history of the transcription/splicing machinery and downstream exon definition will be discussed below.
Pseudoexon inclusion
The observation that consensus splice site sequences are also very abundant in intronic sequences indicates that introns have the potential ability to encode for ‘false’ exons (commonly referred to as pseudoexons) ( 64 , 66 ). Pseudoexons insertion in mature mRNA have been mainly reported in connection with human diseases and Table 1 shows a comprehensive list of genes in which this kind of events has been described. Nonetheless, and possibly more important from a biological point of view, pseudoexon insertion may well represent a substrate for the evolution of new exons. Indeed, it has been proposed recently that ‘exonization’ of a class of particularly abundant intronic sequences (known as Alu repeats) may well represent a major and frequent source of ‘new’ exons and thus play a very important and useful role in evolution ( 103 – 105 ).
Despite the abundance in the genome of potential pseudoexons (intronic sequences between 50 and 200 nt in length with apparently viable 5′ss and 3′ss sites at either end) their inclusion does not seem to be a frequent event during normal pre-mRNA processing. Several different reasons have been put forward to account for this, e.g. it was originally noted that many of these ‘non-spliced’ pseudoexons contain multiple defects in splicing controlling regions despite their apparently good agreement with real exonic sequences ( 64 ). In keeping with this, three putative splicing silencer motifs have been found recently to be enriched in these non-spliced pseudoexons compared with their surrounding intronic regions ( 66 ). This observation is consistent with previous work which showed that human DNA is particularly rich in sequences which can inhibit splicing ( 65 ). Furthermore, an extended computational analysis of 8mer sequences that either promote or inhibit splicing in internal non-coding exons versus non-spliced pseudoexons has revealed that enhancer sequences are highly enriched in real exons whilst silencer sequences tend to cluster within pseudoexons ( 67 ). Finally, computational studies have suggested recently that pseudoexons flanking regions have a distinct tendency to form double-stranded structures that include the pseudo exon itself, providing an additional layer that differentiates them from real exons, where flanking regions have the tendency to base pair between themselves ( 106 ).
Nonetheless, many intronic sequences seem poised on the brink of becoming exons because, as shown in Table 1 , most of these disease-connected pseudoexon inclusion events originate from a single activating mutation. This mutation generally involves the ex novo creation of a strong donor or acceptor site, followed by the subsequent selection of weaker ‘opportunistic’ acceptor or donor site sequences. However, other mechanisms that cause the inclusion of pseudoexon sequences may also take place, and pseudoexon-activating mutations may also involve the creation/deletion of SREs. There are in fact at least two cases, in the α-Gal A ( 107 ) ( Figure 4A ) and the ATM gene ( 108 ) ( Figure 4B ), in which the creation/deletion of SREs has been observed to occur. In the ATM gene the inclusion of a 65 nt long pseudoexon is the consequence of a 4 nt deletion in ATM intron 20 that disrupts a high-affinity U1snRNP-binding site located within the pseudoexon sequence itself. This region was called ISPE (intron splicing processing element) and its disruption resulted in the activation of an immediately upstream cryptic acceptor site and of a downstream cryptic donor site ( 108 ). The comparison of this intronic sequence with its homologous region in the mouse has shown the importance of local context also in this case. In fact, an ISPE sequence is also present in the mouse and only a critically missing donor site downstream of this element is the reason why the mouse sequence cannot give rise to an analogous pseudoexon insertion event ( 109 ).
It is interesting to note that in recent years a lot of effort has gone into deciphering the code that underlies alternatively spliced exon recognition ( 29 , 53 , 69 ). However, the recent increase in pseudoexon and intron retention events reported in clinical investigations indicates that we should focus also on deciphering splicing codes within introns. In fact, investigations on pseudoexons may well acquire a wider importance in the future, as it has also been proposed recently that many of the these sequences which are thought to be ignored by the splicing machinery also participate in splicing regulation processes ( 110 ). The interesting notion is that the functional usage of many of these putative pseudoexon sequences may be often overlooked owing to the fact that their inclusion can lead to premature insertion of a termination codon in the mature mRNA, and its consequently rapid degradation by the nonsense-mediated decay (NMD) pathway ( 111 ). This intriguing phenomenon has been described to occur in the rat α -tropomyosin gene in which a pseudoexon sequence localized downstream of the mutually exclusive exons 2 (that is included only in smooth muscle tissue) and 3 (that is included in most cell types) can give rise to different splicing outcomes when it is joined to either exon ( 110 ) ( Figure 4C ). In particular, when joined to exon 2 the formation of an efficient ZLE (zero length exon) sequence allows to resplice exon 2 to the acceptor site of exon 4 (thus giving rise to a normally processed mRNA molecule). On the other hand, when the pseudoexon sequence is joined with exon 3 the resulting ZLE is not sufficient for the resplicing pathway and the pseudoexon 5′ss is used to join with the acceptor site of exon 4 (thus giving rise to a mRNA molecule which is rapidly degraded by NMD.
GENOMIC CONTEXT IS IMPORTANT IN DEFINING EXONIC REGIONS
A practical example of the importance of genomic context on the splicing process is represented by the splicing regulation of the neurofibromatosis type-1 ( NF-1 ) exon 37 ( 112 ). This exon is localized in an exon-rich genomic cluster ( Figure 5A ) and mutations within its sequence have been shown to be involved in the development of neurofibromatosis ( 113 ). It has been shown recently that this exon is badly recognized by the splicing machinery if it is placed with limited flanking intronic sequences in a minigene with heterologous genomic context. However, it becomes efficiently recognized when located within its natural milieu in the region spanning NF-1 exons 34-38 ( Figure 5B ). Interestingly, mutations disrupting a SRE sequence within this exon (including some that have been described in neurofibromatosis patients such as C6792G) have genomic context-dependent effects ( 112 ). In fact, analysis of patient's lymphoblasts has revealed that this mutation causes both the skipping of exons 36–37 together and the skipping of exon 37 alone. Also in this case, the use of a minigene system to mimick these effects has shown that, after taking in consideration the contribution of the unmutated allele in the patient's lymphoblasts, increasing the genomic context to comprise exons 31–38 allows to duplicate almost exactly the patient's in vivo splicing pattern ( Figure 5C ).
To complicate matters further, it has been observed that whether an exon is recognized as such and to what an extent is also decided by the overall architecture of the pre-mRNA. Two models exist in mRNA processing, the exon definition model and the intron definition model where, as their names suggest, in the former the exon is defined by the acceptor and donor sites that flank it whilst in the latter it is the acceptor and donor sites on the same intron which define its recognition ( 114 , 115 ). Recent experimental evidence has allowed to determine that intron recognition ceases to occur when the intron size increases beyond the 200–250 nt range ( 116 ). As splice site recognition across introns of similar size to exons tends to be significantly more efficient than across the exons ( 116 ), it follows that exonic sequences with similar exon/intron junction sequences can be recognized with widely different efficiencies depending on the local pre-mRNA architecture where they happen to be present.
SREs ADD A LAYER OF REDUNDANCY FOR THE DEFINITION OF WEAK SPLICE SITES
The first observation that exonic sequences were modulating alternative splicing in a cellular gene was done in 1987 ( 117 ). At that time this observation appeared to most researchers as an exceptional case, difficult to interpret. However, in the last 10 years it has been shown that far from being exceptional the presence of SREs in exonic sequences is the rule and their importance in the modulation of pre mRNA processing is now widely recognized.
These SREs reinforce the limited information in the splice sites and, depending on their position of and their effect on splicing, have been generally classified as exonic splicing enhancer and silencer (ESE/ESS) or intronic splicing enhancer and silencer (ISE/ISS) sequences ( Figure 1 ). Although the number of reported proteins recognizing specific enhancer and silencer elements is steadily growing ( 118 ), most enhancer sequences have been found to bind members of the SR protein family ( 119 – 125 ) whilst many silencing elements have been found to bind members of the hnRNP protein family ( 55 , 126 – 128 ). There are, however, several difficulties when trying to evaluate the action and the presence of SREs in many splicing events.
Firstly, it should be noted that there are many exceptions to the rough division stated above. For example, classical SR proteins have also been described to be involved in splicing repression in a few number of cases ( 129 – 131 ) and some members of this protein family (SRp38) act as splicing repressors upon dephosphorylation ( 132 , 133 ). Analogously, some well-represented hnRNP proteins (hnRNP L) may also act as splicing enhancers ( 134 ) whilst others, such as hnRNP H, may be involved either in splicing repression or enhancement depending on context ( 46 , 135 , 136 ). Hence, the simple belonging to a particular family of splicing factors is not a guarantee of function conservation and experimental studies are often required to clarify the role played by even well known splicing factor found in new systems under study.
Secondly, the RNA-binding sequences of many of these splicing factors can be highly degenerate, making it difficult to spot them beforehand on the basis of the primary sequence alone. For example, SELEX analysis of SR protein-binding motifs showed that the major family members recognized fairly degenerate consensus sequences varying from 5 to 7 nt ( 137 ) and, although this finding is entirely consistent with the great diversity of ESE sequences mapped within exons ( 119 , 138 , 139 ), it obviously provides an additional hurdle when trying to identify likely binding sites. To these difficulties it has to be added that much of the same problems apply when investigating common splicing inhibitors, such as hnRNP A1 ( 140 ). Moreover, it has to be noted that local context factors, such as RNA secondary structure, may profoundly influence the way that optimal binding motifs are recognized by their specific binding factors ( 141 ). Therefore, the simple presence of a high-score-binding motif may not necessarily translate in a strong binding affinity.
In order to obviate to these difficulties several computational approaches have been proposed based on functional SELEX methodologies to identify potential SR protein-binding sites ( 142 ), on statistical analysis of genomic exonic sequences to identify splicing-enhancing sequences ( 67 , 68 , 143 , 144 ), or on the information content of the splice site sequences ( 145 ). The reason why so much effort is devoted to the development of faster and more efficient ways of predicting SRE presence/action is justified by the clear indication that disruption/creation of these elements has become a very prominent cause of splicing related alterations in disease ( 4 ). Indeed, it is probable that many mutations which were previously thought to be harmless polymorphisms may in reality be very powerful modifiers of normal splicing processes ( 10 , 146 – 148 ).
At present, the accuracy of these methods to successfully predict changes in binding of splicing factors has yielded in some cases encouraging results ( 143 , 149 , 150 ). However, in several genes such as dystrophin ( 151 ), CFTR ( 147 , 148 ) and ATR ( 152 ) the predictive power of these in silico approaches has failed to establish a clear connection between splicing and abolishment (or creation) of SREs. Thus, experimental validation of predictions is still a mandatory step in most splicing studies. Nonetheless, because of the realization that accurate in silico approaches will be very useful in the diagnostic and therapeutic field ( 34 , 35 ), studies of this kind are destined to increase. Indeed, statistical approaches have also been aimed recently at characterizing inhibitory sequences ( 66 , 69 , 153 ) or other kinds of repeated motifs such as UAGG or G-runs which are often found near or within alternatively spliced exons and which may contribute heavily to inclusion/skipping decisions by the splicing machinery ( 153 , 154 ).
Finally, the mechanisms through which splicing factors bind SRE sequences has been the subject of many studies, especially regarding the interplay between classic antagonistic factors such as ASF/SF2 (a SR protein family member that promotes inclusion) and hnRNP A1 (a classic hnRNP protein that has been historically associated with exon skipping) ( 155 , 156 ). In many instances, however, the number of ‘duelling partners’ increases considerably and a complex number of SRE sequences and binding factors is involved in regulating tissue- or developmentally specific alternatively spliced exons. A schematic diagram of four well-studied regulatory mechanisms is shown in Figure 6 . The example given are the c-src exon N1 ( 157 , 158 ) ( Figure 6A ), the CD44 exon v5 ( 159 – 161 ) ( Figure 6B ), chicken cTNT exon 5 ( 162 ) ( Figure 6C ) and HipK3 ‘T’ exon ( 163 ) ( Figure 6D ). In all four cases it should be noted that in order to achieve tissue- and/or temporal-precise accuracy a complex arrangement of cis -acting elements (with either positive or negative influence on exon recognition) is complemented by a correspondingly high number of splicing factors.
SPLICE SITE STRENGTHS CAN EITHER MASK OR UNCOVER ADDITIONAL SPLICING REGULATORY REGIONS PRESENT IN EXONS
A well-studied working example in which the interplay between splice sites and regulatory sequences has been analyzed in depth is represented by exon 7 in the SMN1/SMN2 genes. This exon plays a key role in a common autosomal recessive disorder, spinal muscular atrophy. In humans, there are two copies of the SMN gene, named SMN1 and SMN2 , and inactivation of the SMN1 gene leads to disease because the SMN2 gene cannot compensate for its absence. The reason for this is a critical C to U substitution in exon 7 of the SMN2 gene that does not change the codon but either disrupts an ASF/SF2-binding site ( 149 , 164 ), creates an hnRNP A1-binding site ( 165 ), or both (although recent evidence supports the ASF/SF2 model whilst confirming an indirect role for hnRNP A1 in splicing repression). As a result of this substitution, the SMN2 gene produces a transcript where exon 7 is predominantly skipped, leading to the production of a truncated protein and inability to compensate for SMN1 inactivation. In addition to this region, several other SRE sequences both within the exon and in the upstream intron have been identified. These elements include a Tra2-binding site in the region adjacent to the SF2/ASF interaction ( 166 ) and a highly structured ISE region in the IVS7 intron a short distance away from the donor site ( 167 ) ( Figure 7A ). Recently, Singh et al . ( 168 ) have used a functional selection technique with the aim of identifying additional controlling elements that regulate the splicing of an entire exon. As shown in Figure 7B , they have achieved this purpose by engineering a pre-mRNA template which contained the full sequence of SMN exon 6 and 8 with some intronic sequences and a partially randomized (30%) exon 7 sequences in order to substantially preserve the wild-type characteristics of this exon but at the same time allow mutability of each nucleotide. This pool of pre-mRNA molecules was then transfected in a human cervical carinoma cell line and the spliced products were purified to select the exon 7 sequences that displayed correct recognition by the splicing machinery ( Figure 7B ). After successive rounds of in vivo selection the different exon 7 sequences were subsequently analyzed to evaluate the range and type of nucleotide substitutions that had occurred with respect to the initial wild-type sequence. Beside confirming previous results with regards to the SF2/ASF and Tra2-binding regions the results of this analysis allowed the identification of two novel regulatory regions, one near the 3′ end of the exon (3′-Cluster) and a conserved CT tract in the middle. Most importantly, however, the weak 5′ss consensus sequence was also found to represent a critical factor in regulating SMN exon 7 recognition, as 82% of the spliced exon 7 clones identified following the functional SELEX contained a A54G substitution ( Figure 7B ). It is interesting to note that this substitution alone was quite capable of overcoming the negative effect of several mutations within other conserved elements such as the first position of the exon or single mutations within the Tra2-binding site ( 168 ).
It is also interesting to note that several splice site prediction programs such as MaxEnt, MDD and MM ( 169 ), NNSplice ( 170 ) and Spliceview ( 171 ) fail to detect a major improvement in donor site score following the A54G substitution ( Figure 7C ). As these programs are normally very accurate in detecting splice site strength variations the most likely conclusion is that only the slightly weakening of exon 7 splice site can ‘highlight’ several sequences that would normally not be able to influence its splicing process. This is consistent with the observation that alternatively spliced exons possess splice site sequences which are on average weaker than splice sites belonging to constitutively recognized exons and stresses the importance of the context ( 172 ). In addition, it has also to be noted that splice site prediction programs may not pick up additional protein–RNA interactions at the 5′ splice site that may substantially contribute in determining its relative strength. For example, in the NF-1 exon 3 donor site a G-four stretch in correspondence to the donor site has been identified as a binding site of hnRNP H that severely constrained the 5′ss sequences that could vary without disrupting splicing ( 46 ).
Nonetheless, the result of SELEX analysis suggests that many skipping events originating from various mutations within the exon may be rescued by even small improvements in splice site quality of the affected exon(s). However, changing the donor site in exons of patients affected by mutations within SRE elements would probably involve some too costly and uncertain investments in gene therapy schemes and, for this reason, alternative targets should be preferred.
INACTIVATION OF STRONG NEGATIVE ELEMENTS CAN RESCUE SPLICING DEFECTS ORIGINATING FROM OTHER NEARBY REGULATORY ELEMENTS
Therefore, in order to optimize therapeutic strategies an important question that needs to be addressed is the existence of hierarchical levels between these splicing regulatory elements, as this may decide which region should be preferentially targeted by the effector molecule of choice. Recently, the presence of a human-specific ISS near to the 5′ss of SMN1 / SMN2 exon 7 has been described to have analogous compensatory effects as the A54G substitution ( 173 ). Indeed, Figure 8A shows that the effects of mutating three well-characterized SRE elements in SMN1 exon 7 (the first nucleotide, the Tra2 binding sequence, and the conserved tract) can all be overcome by deleting the ISS-N1 sequence ( Figure 8B ). In keeping with this result, much the same effect can be obtained by the deletion of the ISS-N1 region in the SMN2 sequence that carries the C6U substitution that abrogates SF2/ASF binding.
Interestingly, an analogous situation has been described for exon 9 of the CFTR gene. Skipping of this exon is involved in the occurrence of several forms of cystic fibrosis depending on the presence of particular polymorphisms in a (UG)m(U)n repeated motif near to its 3′ splice site ( 174 ). Recently, several studies have confirmed that a nuclear protein, TDP-43, binds to the (UG)m sequence and can inhibit its inclusion in the final mRNA ( 175 – 177 ). However, in addition to this regulatory sequence several other exonic and intronic SREs have been described to take part in splicing regulation of exon 9: a composite exonic regulatory element of splicing (CERES) ( 147 ), a positive polypyrimidine rich element near to the 5′ss that binds the TIA-1 splicing factor (PCE) ( 178 ), and an ISS further down in IVS9 that binds several members of the SR protein family ( 129 ) ( Figure 8C ). Mutations in all these sequences have modulating effects in CFTR exon inclusion in the mRNA. In particular, the CERES element identified by Pagani et al. ( 129 ) lies in correspondence to a previously described C1496A missense mutation (A455E) which should be therefore also considered a splicing mutation. The recent use of siRNA technology to knockdown TDP-43 expression ( 179 ) has resulted in a generalized improvement in the inclusion of exon 9 even in the presence of many of these mutations ( Figure 8D ), demonstrating that removal of this key inhibitory factor may be a useful way of repairing negative mutations also in other controlling elements. In this respect, it is important to note that depletion of this factor has also been described to override the need for several ESEs and ISEs of Apo AII exon 3 ( 180 ). Obviously, knowledge of other critical functions of this or any other factor is essential before a general inhibitory approach is used in therapy.
CONCLUDING REMARKS
Taken together, recent results clearly highlight that any pre-mRNA sequence is embedded with regulatory elements, some of which may only be uncovered by disrupting mutations or by other facets of mRNA synthesis (such as transcript elongation rates or secondary structure). The practical consequences of such an arrangement is that many splicing events are the result of a fine ‘balance of power’ between a myriad of controlling factors which can vary from tissue to tissue, during development or in the presence of external stimuli. Moreover, the recent discovery that snoRNAs can also participate actively in the regulation of splicing ( 181 ) emphasizes that we are still ignorant of entire classes of splicing regulators. This complexity makes the job of predicting the splicing outcomes of given genomic variations exceedingly difficult for the researcher. Nonetheless, not all these factors possess the same weight when determining splicing outcomes, and the SMN1 / SMN2 exon 7 and CFTR exon 9 examples demonstrate that identifying master checkpoints may, if acted upon, compensate for negative effects in several other regulatory elements. This conclusion provides a considerable encouragement towards the search for novel therapeutic strategies aimed at covering as great an amount as possible of splicing mutations. Paradoxically, though, it is very probable that effective cures for many splicing mutations will be developed long before the functional mechanisms that underlie them may be satisfactorily explained.
The three schematic representations show the increase during the last decade of the number of interactions reported to occur during RNA processing. The upper diagram depicts a classic pre-mRNA molecule in which the exon (closed box) is defined by a small number of relatively conserved sequences that bind a well-defined set of basic splicing factors. The complexity of the system begins to increase, as shown in the middle diagram, when the vast and still growing array of enhancer and silencer factors are to be added to the basic picture. These factors can bind particular enhancer and silencer sequences either within the exon itself (ESE and ESS) or in the nearby flanking regions (ISE and ISS) and help increase/decrease exon recognition. As shown in the bottom diagram, in these recent years the complexity of the regulatory elements that participate in splicing control has taken an enormous leap forward by the discovery of the potential connections between splicing, the specific structural features of pre-RNAs and many of the processes that participate in a their life-cycle such as genome stability, RNA processing speed, transport and translation.
( A ) Schematic diagram showing how local context arrangements can influence cryptic splice site activation. For example, in intron 7 of the PDH gene a G26A mutation greatly enhances a preexisting binding site for the SC35 protein (upper and middle panels). The increased binding of this protein causes the shift of donor site usage towards a cryptic donor site in the downstream sequence (lower panel). ( B ) Another example is represented by a strong SF2/ASF-binding sequence that is not normally needed by the splicing machinery to recognize fibrinogen exon 7 (upper panel). However, if the natural donor site is inactivated by a +1G > U substitution the relative usage of the three cryptic sites which become activated (c1, c2 and c3) and the levels of exon skipping (es) are profoundly influenced by its presence, especially c1 (middle panel). In this case, the importance of local context has also been demonstrated by replacing the natural SF2/ASF-binding sequence with a well-known ESE sequence from the fibronectin gene ( fn ESE) and observing that the level of c1 usage are substantially increased with respect to the natural sequence (lower panel). ( C ) Finally, in the case of RNA secondary structure it has been shown that correct utilization of the natural donor site sequence (D1) can be achieved only in the presence of a highly conserved stem-loop structure within Adenovirus type 2 first exon (upper panel). Removal of this sequence has the consequence of activating usage of a cryptic donor site sequence (Dcr1) (lower panel).
( A ) This figure shows the effect on internal intron retention in exon 6 of the TPO gene. When the TPO gene is transfected in Hep3B cells only two processed mRNAs are produced, a major form lacking all the intervening introns (74%) and a minor form in which there is the removal of an additional 116 nt long sequence (26%) from the processed mRNA (upper panel, boxed in red). On the other hand, lower panel shows that when the TPO cDNA is transfected in Hep3B cells no processing of this internal intron can be observed. ( B ) The only way of recovering internal intron processing is that of reintroducing at least one of the normally processed introns of the TPO gene. An analogous situation in which the system has to be ‘primed’ before it can function efficiently is provided by the recent observation that coordinated splicing can occur in correspondence of distant alternatively spliced exons. ( C ) Shows such an experiment where two alternatively spliced EDA exons (pEDA and dEDA) separated by a 3400 nt long spacer containing three constitutively spliced exons are inserted in a minigene systems in either wild type or mutated form (lacking an important ESE sequence). The graph on the left shows that efficient inclusion of the proximal exon strictly correlates with inclusion efficiencies of the distal exon.
list of pseudoexon inclusion events connected with disease in humans
| Gene . | Pseudoexon length (bp) . | Activating mutation . | Reference . |
|---|---|---|---|
| DMD | 58 | 5′ss creation | 182 |
| DMD | 67 | 5′ss creation | 183 |
| DMD | 89 | 5′ss creation | 183 |
| DMD | 90 | 5′ss creation | 183 |
| DMD | 95 | 3′ss strengthening | 182 |
| DMD | 149 | 3′ss creation | 183 |
| DMD | 172/202 | 5′ss creation | 184 |
| CFTR | 49 | 5′ss creation | 185 |
| CFTR | 84 | 5′ss creation | 186 |
| CFTR | 184 | 3′ss deletion | 187 |
| CFTR | 214 | 5′ss creation | 188 |
| CYBB | 56 | 5′ss strengthening | 189 |
| CYBB | 94 | 3′ss creation | 190 |
| PHEX | 50/100/170 | 5′ss creation | 191 |
| α -Gal A | 57 | SRE creation | 107 |
| ATM | 65 | SRE deletion | 108 |
| BRCA1 | 66 | 3′ss creation | 192 |
| GHER | 69 | 5′ss creation | 193 |
| CHM | 98 | 3′ss creation | 194 |
| GHR | 102 | 5′ss creation | 195 |
| ATM | 137 | 5′ss creation | 196 |
| OA1/GPR143 | 165 | 3′ss creation | 197 |
| β -globin | 165 | 5′ss creation | 78 |
| NF-1 | 172 | 3′ss creation | 198 |
| Gene . | Pseudoexon length (bp) . | Activating mutation . | Reference . |
|---|---|---|---|
| DMD | 58 | 5′ss creation | 182 |
| DMD | 67 | 5′ss creation | 183 |
| DMD | 89 | 5′ss creation | 183 |
| DMD | 90 | 5′ss creation | 183 |
| DMD | 95 | 3′ss strengthening | 182 |
| DMD | 149 | 3′ss creation | 183 |
| DMD | 172/202 | 5′ss creation | 184 |
| CFTR | 49 | 5′ss creation | 185 |
| CFTR | 84 | 5′ss creation | 186 |
| CFTR | 184 | 3′ss deletion | 187 |
| CFTR | 214 | 5′ss creation | 188 |
| CYBB | 56 | 5′ss strengthening | 189 |
| CYBB | 94 | 3′ss creation | 190 |
| PHEX | 50/100/170 | 5′ss creation | 191 |
| α -Gal A | 57 | SRE creation | 107 |
| ATM | 65 | SRE deletion | 108 |
| BRCA1 | 66 | 3′ss creation | 192 |
| GHER | 69 | 5′ss creation | 193 |
| CHM | 98 | 3′ss creation | 194 |
| GHR | 102 | 5′ss creation | 195 |
| ATM | 137 | 5′ss creation | 196 |
| OA1/GPR143 | 165 | 3′ss creation | 197 |
| β -globin | 165 | 5′ss creation | 78 |
| NF-1 | 172 | 3′ss creation | 198 |
list of pseudoexon inclusion events connected with disease in humans
| Gene . | Pseudoexon length (bp) . | Activating mutation . | Reference . |
|---|---|---|---|
| DMD | 58 | 5′ss creation | 182 |
| DMD | 67 | 5′ss creation | 183 |
| DMD | 89 | 5′ss creation | 183 |
| DMD | 90 | 5′ss creation | 183 |
| DMD | 95 | 3′ss strengthening | 182 |
| DMD | 149 | 3′ss creation | 183 |
| DMD | 172/202 | 5′ss creation | 184 |
| CFTR | 49 | 5′ss creation | 185 |
| CFTR | 84 | 5′ss creation | 186 |
| CFTR | 184 | 3′ss deletion | 187 |
| CFTR | 214 | 5′ss creation | 188 |
| CYBB | 56 | 5′ss strengthening | 189 |
| CYBB | 94 | 3′ss creation | 190 |
| PHEX | 50/100/170 | 5′ss creation | 191 |
| α -Gal A | 57 | SRE creation | 107 |
| ATM | 65 | SRE deletion | 108 |
| BRCA1 | 66 | 3′ss creation | 192 |
| GHER | 69 | 5′ss creation | 193 |
| CHM | 98 | 3′ss creation | 194 |
| GHR | 102 | 5′ss creation | 195 |
| ATM | 137 | 5′ss creation | 196 |
| OA1/GPR143 | 165 | 3′ss creation | 197 |
| β -globin | 165 | 5′ss creation | 78 |
| NF-1 | 172 | 3′ss creation | 198 |
| Gene . | Pseudoexon length (bp) . | Activating mutation . | Reference . |
|---|---|---|---|
| DMD | 58 | 5′ss creation | 182 |
| DMD | 67 | 5′ss creation | 183 |
| DMD | 89 | 5′ss creation | 183 |
| DMD | 90 | 5′ss creation | 183 |
| DMD | 95 | 3′ss strengthening | 182 |
| DMD | 149 | 3′ss creation | 183 |
| DMD | 172/202 | 5′ss creation | 184 |
| CFTR | 49 | 5′ss creation | 185 |
| CFTR | 84 | 5′ss creation | 186 |
| CFTR | 184 | 3′ss deletion | 187 |
| CFTR | 214 | 5′ss creation | 188 |
| CYBB | 56 | 5′ss strengthening | 189 |
| CYBB | 94 | 3′ss creation | 190 |
| PHEX | 50/100/170 | 5′ss creation | 191 |
| α -Gal A | 57 | SRE creation | 107 |
| ATM | 65 | SRE deletion | 108 |
| BRCA1 | 66 | 3′ss creation | 192 |
| GHER | 69 | 5′ss creation | 193 |
| CHM | 98 | 3′ss creation | 194 |
| GHR | 102 | 5′ss creation | 195 |
| ATM | 137 | 5′ss creation | 196 |
| OA1/GPR143 | 165 | 3′ss creation | 197 |
| β -globin | 165 | 5′ss creation | 78 |
| NF-1 | 172 | 3′ss creation | 198 |
Shows several factors that determine pseudoexon inclusion aside the simple creation of ex novo functional splice sites. In the a -GalA gene ( A ), a single point mutation G > A observed to occur in position-4 with respect to a cryptic donor site has the effect of creating a novel A/C rich enhancer element that causes inclusion of a 57 bp. pseudoexon in the processed mRNA. In a second example, a 4 nt deletion (GUAA) in the intronic region of the ATM gene between exons 20 and 21 caused the insertion of a 65 nt long pseudoexon in the final mRNA ( B ). Functional analysis has demonstrated that this deletion abolished binding of an U1snRNP molecule in this position and activated a 3′ss lying 12 nt upstream of this element. Finally, ( C ) shows the effect of the inclusion of a pseudoexon sequence in the a -tropomyosin pre-mRNA with respect to two mutually exclusive exons 2 and 3. When the pseudoexon sequence (red box) is spliced to exon 2 a 5′ss ZLE sequence is regenerated, thus allowing splicing to occur again giving rise to a normal transcript bearing exon 2 joined to exon 4 (upper panel). On the other hand, when the pseudoexon is joined to exon 3 the ZLE sequence that is regenerated is insufficient to promote joining together of exon 3 to exon 4 and the resulting transcript is thus degraded by NMD (lower panel). This difference in resplicing activity has been proposed to be consistent with the match to consensus of the two ZLE elements (higher for exon 2 than exon 3) and for the higher exon spicing enhancer activity of exon 2.
( A ) Shows a schematic diagram (drawn to scale) of the exon-rich genomic region surrounding exon 37 of the neurofibromatosis ( NF-1 ) gene. This exon is normally included constitutively in the mature mRNA but in the presence of a pathological substitution in position 6792 (a C is replaced by a G) the analysis from patient lymphoblasts reveals both the double skipping of exons 36 and 37 and also skipping of only exon 37. The influence of the genomic context has been tested by engineering a series of minigene constructs that gradually included larger portions of this genomic region, and their functionality compared with the in vivo pattern. ( B ) Wild-type exon 37 alone with a small portion of flanking intronic regions is not sufficient to yield the observed >99% inclusion in normal lymphoblasts. However, this result can be readily achieved when larger portions of this genomic region are included in the genomic construct. In addition to the normal situation, an increased genomic context also provides a closer correspondence with regards to the exon skipping effects of the C6792G substitution ( C ). In fact, only the construct carrying most of the NF-1 exons 31 to 38 together with a mutated exon 37 yields a percentage of normal transcript, exon 37 skipping and exon 36–37 skipping that reflects the observed percentage in a patient's lymphoblasts that carry the C6792G substitution (the asterisk indicates the levels of normal mRNA from the C6792G-carrying allele after deducting the estimated contribution of the un mutated allele).
Modulation of tissue- and developmental-specific expression of four different exons. ( A ) Shows that in non-neuronal cells PTB binds to two CU elements which are located both upstream and downstream c-src exon N1 and form a silencing complex that interferes with the interaction between U1snRNP bound at the donor site of this exon and U2AF. This complex does not interfere with binding of U2AF with the U1snRNP present at the donor site of exon 3 and this allows splicing of exon 3 to exon 4, thereby excluding exon N1 from the mature mRNA. On the other hand, in neuronal cells PTB is removed from the CU elements by an unknown factor ‘X’ in the presence of ATP and exon inclusion is promoted by the SF2/ASF protein binding to this exon (which is also bound by its antagonistic molecule, hnRNP A1) and by the DCS complex (which contains hnRNP H/F (H/F), KSRP, plus a neuronal form of PTB known as nPTB or WERI PTB). ( B ) Shows the Ras signaling pathway-dependent regulation of splicing of the variable v5 exon in the CD44 gene that occurs during T cell activation. In normal situations the enhancer complex in correspondence to a GAA-rich motif near the donor site of this exon is repressed by a strong hnRNP A1-binding sequence. During T cell activation, however, the repression can be overcome by either inhibition or displacement of A1 by phosphorylation of Sam 68 (Src-associated in mitosis) and involvement of powerful splicing coactivators such as SRm160. ( C ) Represents a model of cTNT exon 5 alternative splicing in the chicken developing heart. In the embryonic cardiomyocite CUG-BP and ETR-3 are expressed at high levels in the nucleus and they bind to the regulatory regions downstream of exon 5 (MSE2 to MSE4) to promote its inclusion. On the other hand, in the adult cardiomyocite CUG-BP and ETR-3 expression is downregulated and skipping is favored by PTB and MBNL proteins that are now free to bind on either side of the exon. Finally, ( D ) contains a model of HipK3 ‘T’ (testis-specific) differential exon inclusion in both somatic cells and in germ lines. In somatic cells, hnRNP A1 (A1) is principally responsible for the inhibition of the T exon donor site, probably with some help by hnRNP H (H). This is favored by the fact that in these cells the Tra2β protein tends to be in low amounts and is mostly phosphorylated. On the other hand, in germ cells Tra2β expression is upregulated and tends to be in its hypo-phosphorylated form whilst hnRNP A1 is downregulated. This arrangement allows the Tra2β protein to bind in place of A1 and to promote T exon inclusion with the assistance of SF2/ASF and SRp40 which also bind within this exon.
( A ) Shows a schematic diagram of the splicing regulatory regions that have been described to affect recognition of SMN1 / SMN 2 exon 7. Recently, a functional SELEX analysis has been applied to this whole exonic sequence. ( B ) Shows the minigene system based on the human SMN gene used for this assay in which the entire exon 7 is partially randomized. These sequences can then be selected for inclusion in the final spliced products by subjecting them to rounds of selection in vivo . The borders of this randomized exon are dotted in (B) to indicate the fact that these sequences will not always be used as an exon in this technique. After each round of selection the spliced exon 7 sequences are recovered by RT–PCR and reinserted in the minigene splicing cassette. This selection process has allowed the precise mapping, at the single nucleotide level, of all the major determinants involved in the control of exon 7 splicing. Interestingly, 82% of all the final clones contained the same substitution in correspondence to the last position of this exon, and this change was quite capable of overcoming the effects of other negative regulatory mutations. ( C ) Shows a comparison of the two donor site strengths (the wild type and the one carrying the A54G substitution) using a panel of splice site prediction programs.
( A ) Shows a diagrammatic representation of several positive elements that regulate SMN1 / SMN 2 exon 7 inclusion. The elements analyzed in this experiment are the first nucleotide of exon 7 (1G), the Tra2β-binding site (Tra2), the conserved tract in the middle of the exon (ConsTract) and the intronic splicing enhancer (element-2) in IVS7. Also shown is the dominant ISS element (ISS-N1). The graphical representation underneath shows the levels of exon 7 inclusion following mutation of all of these elements in a suitable minigene construct. Also included are the basal exon 7 inclusion levels from SMN 2 exon 7 that carries a natural C6U substitution. ( B ) Shows the effect of the same mutations/substitutions when the ISS-N1 sequence has been deleted from the minigene constructs. An analogous situation which regards the CFTR exon 9 sequence is schematically represented in ( C ) in which the (UG)m(U)n repeated motif near the 3′ss of this exon has been shown to bind a powerful negative regulator of splicing, TDP-43. Several other cis -acting elements are also shown in this diagram: a CERES element within the exon, a positive polypyrimidine sequence near the 5′ss (PCE) that binds TIA-1, and a human-specific ISS in IVS9 that binds members of the SR protein family. The graph below shows that mutations within these elements are strongly inhibitory with respect to the basal inclusion level on a TG11T5 background (lane CFTR exon 9). Nonetheless, also in this case the removal of TDP-43 using siRNA technology rescues the effect of all these negative mutations ( D ).
F.E.B. is supported by the Telethon Onlus Foundation (Italy) (grants no. GGP02453 and nos. GGP030261 and GGP06147), by FIRB (RBNE01W9PM) and by a European community grant (EURASNET). The Open Access publication charges for this article were waived by Oxford University Press.
Conflict of interest statement. None declared.
REFERENCES
Nissim-Rafinia, M. and Kerem, B.
Caceres, J.F. and Kornblihtt, A.R.
Garcia-Blanco, M.A., Baraniak, A.P., Lasda, E.L.
Cartegni, L., Chew, S.L., Krainer, A.R.
Faustino, N.A. and Cooper, T.A.
Tazi, J., Durand, S., Jeanteur, P.
Stoilov, P., Meshorer, E., Gencheva, M., Glick, D., Soreq, H., Stamm, S.
Ars, E., Serra, E., Garcia, J., Kruyer, H., Gaona, A., Lazaro, C., Estivill, X.
Teraoka, S.N., Telatar, M., Becker-Catania, S., Liang, T., Onengut, S., Tolun, A., Chessa, L., Sanal, O., Bernatowska, E., Gatti, R.A., et al.
Chamary, J.V., Parmley, J.L., Hurst, L.D.
Lenasi, T., Peterlin, B.M., Dovc, P.
Graveley, B.R.
Kreahling, J.M. and Graveley, B.R.
Buratti, E. and Baralle, F.E.
Romano, M., Marcucci, R., Baralle, F.E.
Fededa, J.P., Petrillo, E., Gelfand, M.S., Neverov, A.D., Kadener, S., Nogues, G., Pelisch, F., Baralle, F.E., Muro, A.F., Kornblihtt, A.R.
Kornblihtt, A.R.
Pagani, F. and Baralle, F.E.
Black, D.L.
Maniatis, T. and Tasic, B.
Stetefeld, J. and Ruegg, M.A.
Graveley, B.R.
Stamm, S., Ben-Ari, S., Rafalska, I., Tang, Y., Zhang, Z., Toiber, D., Thanaraj, T.A., Soreq, H.
Vibe-Pedersen, K., Kornblihtt, A.R., Baralle, F.E.
Baralle, D. and Baralle, M.
Cooper, T.A.
Buratti, E. and Baralle, F.E.
Buratti, E., Baralle, F.E., Pagani, F.
Wilton, S.D. and Fletcher, S.
Sazani, P. and Kole, R.
Cartegni, L. and Krainer, A.R.
Skordis, L.A., Dunckley, M.G., Yue, B., Eperon, I.C., Muntoni, F.
Soret, J., Bakkour, N., Maire, S., Durand, S., Zekri, L., Gabut, M., Fic, W., Divita, G., Rivalle, C., Dauzonne, D., et al.
Hagiwara, M.
Goyenvalle, A., Vulin, A., Fougerousse, F., Leturcq, F., Kaplan, J.C., Garcia, L., Danos, O.
Zhang, M.Q.
Lund, M. and Kjems, J.
Du, H. and Rosbash, M.
Tarn, W.Y. and Steitz, J.A.
Crispino, J.D., Blencowe, B.J., Sharp, P.A.
Buratti, E., Baralle, M., De Conti, L., Baralle, D., Romano, M., Ayala, Y.M., Baralle, F.E.
Relogio, A., Ben-Dov, C., Baum, M., Ruggiu, M., Gemund, C., Benes, V., Darnell, R.B., Valcarcel, J.
Pan, Q., Shai, O., Misquitta, C., Zhang, W., Saltzman, A.L., Mohammad, N., Babak, T., Siu, H., Hughes, T.R., Morris, Q.D., et al.
Yeo, G., Holste, D., Kreiman, G., Burge, C.B.
Kalnina, Z., Zayakin, P., Silina, K., Line, A.
Venables, J.P.
Smith, C.W. and Valcarcel, J.
Matlin, A.J., Clark, F., Smith, C.W.
Maniatis, T. and Reed, R.
Singh, R. and Valcarcel, J.
Shin, C. and Manley, J.L.
Li, X. and Manley, J.L.
Nott, A., Le Hir, H., Moore, M.J.
Sanford, J.R., Gray, N.K., Beckmann, K., Caceres, J.F.
Huang, Y., Yario, T.A., Steitz, J.A.
Huang, Y., Gattoni, R., Stevenin, J., Steitz, J.A.
Graveley, B.R.
Blaustein, M., Pelisch, F., Tanos, T., Munoz, M.J., Wengier, D., Quadrana, L., Sanford, J.R., Muschietti, J.P., Kornblihtt, A.R., Caceres, J.F., et al.
Sun, H. and Chasin, L.A.
Fairbrother, W.G. and Chasin, L.A.
Sironi, M., Menozzi, G., Riva, L., Cagliani, R., Comi, G.P., Bresolin, N., Giorda, R., Pozzoli, U.
Zhang, X.H. and Chasin, L.A.
Zhang, X.H., Kangsamaksin, T., Chao, M.S., Banerjee, J.K., Chasin, L.A.
Wang, Z., Rolish, M.E., Yeo, G., Tung, V., Mawson, M., Burge, C.B.
Mendes Soares, L.M. and Valcarcel, J.
Proudfoot, N.J., Furger, A., Dye, M.J.
Dye, M.J., Gromak, N., Proudfoot, N.J.
Horowitz, D.S. and Krainer, A.R.
Freund, M., Hicks, M.J., Konermann, C., Otte, M., Hertel, K.J., Schaal, H.
Roca, X., Sachidanandam, R., Krainer, A.R.
Carmel, I., Tal, S., Vig, I., Ast, G.
Krawczak, M., Reiss, J., Cooper, D.N.
Treisman, R., Orkin, S.H., Maniatis, T.
Wieringa, B., Meyer, F., Reiser, J., Weissmann, C.
Roy, S.W. and Gilbert, W.
Sadusky, T., Newman, A.J., Dibb, N.J.
Roca, X., Sachidanandam, R., Krainer, A.R.
Kralovicova, J., Christensen, M.B., Vorechovsky, I.
Krainer, A.R., Conway, G.C., Kozak, D.
Caceres, J.F., Stamm, S., Helfman, D.M., Krainer, A.R.
Gabut, M., Mine, M., Marsac, C., Brivet, M., Tazi, J., Soret, J.
Spena, S., Tenchini, M.L., Buratti, E.
Domenjoud, L., Gallinaro, H., Kister, L., Meyer, S., Jacob, M.
Domenjoud, L., Kister, L., Gallinaro, H., Jacob, M.
Galante, P.A., Sakabe, N.J., Kirschbaum-Slager, N., de Souza, S.J.
Stamm, S., Zhu, J., Nakai, K., Stoilov, P., Stoss, O., Zhang, M.Q.
Laski, F.A. and Rubin, G.M.
Gebauer, F., Merendino, L., Hentze, M.W., Valcarcel, J.
Mansilla, A., Lopez-Sanchez, C., de la Rosa, E.J., Garcia-Martinez, V., Martinez-Salas, E., de Pablo, F., Hernandez-Sanchez, C.
Le Hir, H., Charlet-Berguerand, N., de Franciscis, V., Thermes, C.
Zhang, L., Vincent, G.M., Baralle, M., Baralle, F.E., Anson, B.D., Benson, D.W., Whiting, B., Timothy, K.W., Carlquist, J., January, C.T., et al.
Pequignot, M.O., Dey, R., Zeviani, M., Tiranti, V., Godinot, C., Poyau, A., Sue, C., Di Mauro, S., Abitbol, M., Marsac, C.
Muro, A.F., Chauhan, A.K., Gajovic, S., Iaconcig, A., Porro, F., Stanta, G., Baralle, F.E.
Batsche, E., Yaniv, M., Muchardt, C.
Kornblihtt, A.R.
Kreahling, J. and Graveley, B.R.
Sorek, R., Lev-Maor, G., Reznik, M., Dagan, T., Belinky, F., Graur, D., Ast, G.
Lei, H., Day, I.N., Vorechovsky, I.
Zhang, X.H., Leslie, C.S., Chasin, L.A.
Ishii, S., Nakao, S., Minamikawa-Tachino, R., Desnick, R.J., Fan, J.Q.
Pagani, F., Buratti, E., Stuani, C., Bendix, R., Dork, T., Baralle, F.E.
Lewandowska, M.A., Stuani, C., Parvizpur, A., Baralle, F.E., Pagani, F.
Grellscheid, S.N. and Smith, C.W.
Baralle, M., Skoko, N., Knezevich, A., De Conti, L., Motti, D., Baralle, D., Buratti, E., Baralle, F.E.
Vandenbroucke, I., Callens, T., De Paepe, A., Messiaen, L.
Fox-Walsh, K.L., Dou, Y., Lam, B.J., Hung, S.P., Baldi, P.F., Hertel, K.J.
Mardon, H.J., Sebastio, G., Baralle, F.E.
Jurica, M.S. and Moore, M.J.
Bourgeois, C.F., Lejeune, F., Stevenin, J.
Blencowe, B.J.
Sanford, J.R., Ellis, J., Caceres, J.F.
Zhang, Z. and Krainer, A.R.
Krecic, A.M. and Swanson, M.S.
Dreyfuss, G., Kim, V.N., Kataoka, N.
Spellman, R. and Smith, C.W.
Pagani, F., Buratti, E., Stuani, C., Romano, M., Zuccato, E., Niksic, M., Giglio, L., Faraguna, D., Baralle, F.E.
Wang, Y., Wang, J., Gao, L., Lafyatis, R., Stamm, S., Andreadis, A.
Kanopka, A., Muhlemann, O., Akusjarvi, G.
Shin, C., Feng, Y., Manley, J.L.
Hui, J., Stangl, K., Lane, W.S., Bindereif, A.
Martinez-Contreras, R., Fisette, J.F., Nasim, F.U., Madden, R., Cordeau, M., Chabot, B.
Chen, C.D., Kobayashi, R., Helfman, D.M.
Liu, H.X., Zhang, M., Krainer, A.R.
Ladd, A.N. and Cooper, T.A.
Zheng, Z.M.
Abdul-Manan, N. and Williams, K.R.
Buratti, E., Muro, A.F., Giombi, M., Gherbassi, D., Iaconcig, A., Baralle, F.E.
Cartegni, L., Wang, J., Zhu, Z., Zhang, M.Q., Krainer, A.R.
Fairbrother, W.G., Yeh, R.F., Sharp, P.A., Burge, C.B.
Fairbrother, W.G., Yeo, G.W., Yeh, R., Goldstein, P., Mawson, M., Sharp, P.A., Burge, C.B.
Nalla, V.K. and Rogan, P.K.
Pagani, F., Raponi, M., Baralle, F.E.
Pagani, F., Buratti, E., Stuani, C., Baralle, F.E.
Pagani, F., Stuani, C., Tzetis, M., Kanavakis, E., Efthymiadou, A., Doudounakis, S., Casals, T., Baralle, F.E.
Cartegni, L. and Krainer, A.R.
Liu, H.X., Cartegni, L., Zhang, M.Q., Krainer, A.R.
Sironi, M., Cagliani, R., Pozzoli, U., Bardoni, A., Comi, G.P., Giorda, R., Bresolin, N.
O'Driscoll, M., Ruiz-Perez, V.L., Woods, C.G., Jeggo, P.A., Goodship, J.A.
Han, K., Yeo, G., An, P., Burge, C.B., Grabowski, P.J.
Kralovicova, J. and Vorechovsky, I.
Zhu, J., Mayeda, A., Krainer, A.R.
Eperon, I.C., Makarova, O.V., Mayeda, A., Munroe, S.H., Caceres, J.F., Hayward, D.G., Krainer, A.R.
Sharma, S., Falick, A.M., Black, D.L.
Chou, M.Y., Underwood, J.G., Nikolic, J., Luu, M.H., Black, D.L.
Matter, N., Herrlich, P., Konig, H.
Matter, N., Marx, M., Weg-Remers, S., Ponta, H., Herrlich, P., Konig, H.
Cheng, C. and Sharp, P.A.
Ladd, A.N., Stenberg, M.G., Swanson, M.S., Cooper, T.A.
Venables, J.P., Bourgeois, C.F., Dalgliesh, C., Kister, L., Stevenin, J., Elliott, D.J.
Cartegni, L., Hastings, M.L., Calarco, J.A., de Stanchina, E., Krainer, A.R.
Kashima, T. and Manley, J.L.
Hofmann, Y., Lorson, C.L., Stamm, S., Androphy, E.J., Wirth, B.
Miyaso, H., Okumura, M., Kondo, S., Higashide, S., Miyajima, H., Imaizumi, K.
Singh, N.N., Androphy, E.J., Singh, R.N.
Yeo, G. and Burge, C.B.
Reese, M.G., Eeckman, F.H., Kulp, D., Haussler, D.
Rogozin, I.B. and Milanesi, L.
Zheng, C.L., Fu, X.D., Gribskov, M.
Singh, N.K., Singh, N.N., Androphy, E.J., Singh, R.N.
Groman, J.D., Hefferon, T.W., Casals, T., Bassas, L., Estivill, X., Des Georges, M., Guittard, C., Koudova, M., Fallin, M.D., Nemeth, K., et al.
Buratti, E., Dork, T., Zuccato, E., Pagani, F., Romano, M., Baralle, F.E.
Buratti, E., Brindisi, A., Giombi, M., Tisminetzky, S., Ayala, Y.M., Baralle, F.E.
Wang, H.Y., Wang, I.F., Bose, J., Shen, C.K.
Zuccato, E., Buratti, E., Stuani, C., Baralle, F.E., Pagani, F.
Ayala, Y.M., Pagani, F., Baralle, F.E.
Mercado, P.A., Ayala, Y.M., Romano, M., Buratti, E., Baralle, F.E.
Kishore, S. and Stamm, S.
Tuffery-Giraud, S., Saquet, C., Chambert, S., Claustres, M.
Beroud, C., Carrie, A., Beldjord, C., Deburgrave, N., Llense, S., Carelle, N., Peccate, C., Cuisset, J.M., Pandit, F., Carre-Pigeon, F., et al.
Ikezawa, M., Nishino, I., Goto, Y., Miike, T., Nonaka, I.
Chillon, M., Dork, T., Casals, T., Gimenez, J., Fonknechten, N., Will, K., Ramos, D., Nunes, V., Estivill, X.
Highsmith, W.E., Burch, L.H., Zhou, Z., Olsen, J.C., Boat, T.E., Spock, A., Gorvoy, J.D., Quittel, L., Friedman, K.J., Silverman, L.M., et al.
Will, K., Dork, T., Stuhrmann, M., Meitinger, T., Bertele-Harms, R., Tummler, B., Schmidtke, J.
Monnier, N., Gout, J.P., Pin, I., Gauthier, G., Lunardi, J.
Rump, A., Rosen-Wolff, A., Gahr, M., Seidenberg, J., Roos, C., Walter, L., Gunther, V., Roesler, J.
Noack, D., Heyworth, P.G., Newburger, P.E., Cross, A.R.
Christie, P.T., Harding, B., Nesbit, M.A., Whyte, M.P., Thakker, R.V.
Balz, V., Prisack, H.B., Bier, H., Bojar, H.
Wang, M., Dotzlaw, H., Fuqua, S.A., Murphy, L.C.
van den Hurk, J.A., van de Pol, D.J., Wissinger, B., van Driel, M.A., Hoefsloot, L.H., de Wijs, I.J., van den Born, L.I., Heckenlively, J.R., Brunner, H.G., Zrenner, E., et al.
Metherell, L.A., Akker, S.A., Munroe, P.B., Rose, S.J., Caulfield, M., Savage, M.O., Chew, S.L., Clark, A.J.
McConville, C.M., Stankovic, T., Byrd, P.J., McGuire, G.M., Yao, Q.Y., Lennox, G.G., Taylor, M.R.
Vetrini, F., Tammaro, R., Bondanza, S., Surace, E.M., Auricchio, A., De Luca, M., Ballabio, A., Marigo, V.








Comments