Discovery of novel microRNA mimic repressors of ribosome biogenesis

Abstract While microRNAs and other non-coding RNAs are the next frontier of novel regulators of mammalian ribosome biogenesis (RB), a systematic exploration of microRNA-mediated RB regulation has not yet been undertaken. We carried out a high-content screen in MCF10A cells for changes in nucleolar number using a library of 2603 mature human microRNA mimics. Following a secondary screen for nucleolar rRNA biogenesis inhibition, we identified 72 novel microRNA negative regulators of RB after stringent hit calling. Hits included 27 well-conserved microRNAs present in MirGeneDB, and were enriched for mRNA targets encoding proteins with nucleolar localization or functions in cell cycle regulation. Rigorous selection and validation of a subset of 15 microRNA hits unexpectedly revealed that most of them caused dysregulated pre-rRNA processing, elucidating a novel role for microRNAs in RB regulation. Almost all hits impaired global protein synthesis and upregulated CDKN1A (p21) levels, while causing diverse effects on RNA Polymerase 1 (RNAP1) transcription and TP53 protein levels. We provide evidence that the MIR-28 siblings, hsa-miR-28-5p and hsa-miR-708-5p, potently target the ribosomal protein mRNA RPS28 via tandem primate-specific 3′ UTR binding sites, causing a severe pre-18S pre-rRNA processing defect. Our work illuminates novel microRNA attenuators of RB, forging a promising new path for microRNA mimic chemotherapeutics.

MicroRNAs comprise a class of non-coding (nc)RNAs of approximately 22 nt which can base pair with messenger (m)RNAs to post-transcriptionally reduce transcript stability or translation efficiency, acting as 'sculptors of the transcriptome' to fine-tune gene expression (30)(31)(32).Like RB, mi-croRNAs play critical roles in mediating human development, health, and disease including cancer ( 33 ,34 ).How microR-NAs regulate RB has yet to be explored systematically at the experimental level, although some intriguing links between them have been uncovered.A handful of microRNAs have been experimentally demonstrated to affect RB subprocesses including RNAP1 transcription, 60S assembly, and RP gene transcription ( 35 ).Consistent with this, the microRNA biogenesis factors Drosha and Dicer are required for 28S and 5.8S maturation ( 36 ).AGO2, the microRNA-binding component of the active RISC complex ( 32 ), has been found in the nucleolus ( 37 ) along with several microRNAs (38)(39)(40), though their precise biological function there remains unclear.Computational analysis has implicated microRNA-mediated control of RPs as a key potentiator of RB activity and disease progression ( 41 ), thereby necessitating additional in vivo experiments.MicroRNA targeting biology, including binding sites with non-canonical forms or cooperativity, is complex and remains poorly understood ( 42 ).Software packages have made some inroads towards accurate prediction of microRNA targets ( 43 ,44 ) or functions ( 45 ) although abundant false positives limit their utility ( 41 ,46 ).The limited amount of direct experimental evidence that microRNAs are involved in RB constitutes a significant gap in our understanding of the layers of regulation of nucleolar function in human cells.
To date, no holistic, unbiased discovery campaign for mi-croRNAs functioning in RB regulation has been conducted, and the full complement of microRNAs affecting RB remains poorly understood.We previously hypothesized that microR-NAs may be a key underappreciated conduit linking biochemical RB defects to the pathogenesis of diseases like ri-bosomopathies and cancer ( 35 ).To discover novel microR-NAs negatively regulating RB, we conducted our previouslyestablished high-content screen for changes in nucleolar number following microRNA mimic overexpression in human MCF10A cells.High-throughput screens using microRNA mimics have previously revealed mechanistic insight into microRNA-mediated regulation dynamics during cellular proliferation and signaling ( 47 ,48 ), cardiac regeneration ( 49 ), viral infection ( 50 ), and cancer ( 51 ,52 ).
Here, we identify 72 high-confidence mature human mi-croRNA hits that disrupt RB, which are enriched for mRNA targets involved in cell cycle regulation, cellular proliferation, and localization within the nucleolus.Twenty-seven of these hits are validated, conserved microRNAs present in the Mir-GeneDB database ( 53 ).We validate the roles of a subset of 15 hits in RB subprocesses including pre-rRNA transcription, pre-rRNA processing, and global protein synthesis.For the first time, we define the abilities of 12 microRNA mimics to inhibit pre-rRNA processing.Our work reveals that the MIR-28 family members, hsa-miR-28-5p and hsa-miR-708-5p, are strong inhibitors of pre-18S pre-rRNA processing by way of potent downregulation of the levels of the ribosomal protein mRNA, RPS28 .Our screen's results underscore the broad potential of microRNAs to dysregulate RB, and raise new questions regarding the extent to which microRNAs may connect RB and disease.

Chemical reagents
BMH-21 (Sigma-Aldrich SML1183; CAS 896705-16-1) was diluted to a working stock concentration of 50 μM in DMSO for direct dosing of cells in 384-well plates.

CellProfiler pipeline and data analysis
Image analysis and data processing were conducted using a custom pipeline for CellProfiler 3.1.9as previously described ( 54 -57 ).Strictly-standardized mean difference (SSMD) values were calculated from plate-adjusted one-nucleolus or 5+ nucleoli percent effect values using the uniformly minimal variance unbiased estimate (UMVUE), equation A5 in ( 58 ).Data from the primary or secondary screen were averaged in JMP (version 17.0, SAS Institute, Cary, NC, USA) and graphed with JMP or GraphPad Prism 8 (GraphPad Software).

MCF10A RNA expression dataset
Deposited reads from four RNAseq experiments from MCF10A cells quantifying transcript levels without treatment (negative control conditions; see table below) were reanalyzed using Partek Flow.Reads were aligned to the hg38 genome with HISAT2 2.1.0and quantified using the Ensembl Transcripts version 99 annotation with the Partek E / M algorithm module.Normalized transcripts per million (zTPM) for genes in each experiment was calculated in R as described ( 59 ).For each dataset, a given gene was categorized as expressed if its zTPM was greater than −3, as described in ( 59 ) ( Supplementary Table S8 ).S9 ).Conversely, the number of microRNAs targeting each gene was also calculated with JMP, and all 262 genes targeted by five or more microRNA hits were analyzed for enrichment using Enrichr ( 64-66 ) ( Supplementary Table S10 ).For the subset of 24 evolutionarily-conserved MirGeneDB hits with validated targets from the primary screen, 135 genes targeted by five or more hits were analyzed for enrichment using Enrichr ( Supplementary Table S10 ).To analyze the evolutionary conservation of the MIR-28 binding sites in the RPS28 3 UTR, indicated species from the Multiz 100 Vertebrate Species Alignment and Conservation track in the UCSC Genome Browser were visualized ( 67 ,68 ).

RNA isolation following RNAi transfection
MCF10A cells or hTERT RPE-1 cells were seeded at 100 000 cells per well in 2 ml of medium in 6-well plates and incubated at 37 • C for 24 h.For the MIR-28 dose response, hTERT RPE-1 cells were seeded at 50 000 cells per well in 1 ml of medium in 12-well plates and incubated as above.Cells were transfected with siRNAs or microRNA mimics at 30 nM or as otherwise indicated using Lipofectamine RNAiMAX (Invitrogen 13778-150) and Opti-MEM (Gibco 31985070) per manufacturer's instructions for 72 h.Cells were washed with 1 × PBS, then collected with 1 ml of TRIzol reagent (Invitrogen 15596026).Total RNA was purified following the manufacturer's protocol.

Analysis of RNA transcript levels by RT-qPCR
MCF10A cells or hTERT RPE-1 cells were treated with control siRNAs or microRNA mimics, and RNA was isolated as above.cDNA was synthesized from 1 μg total input RNA using iScript™ gDNA Clear cDNA Synthesis Kit (Bio-Rad 1725035).In a qPCR plate (Bio-Rad MLL9601), 1 μl of cDNA was dispensed, followed by 19 μl of a qPCR master mix containing iTaq Universal SYBR Green Supermix (Bio-Rad 1725121), 500 nM forward primer, 500 nM reverse primer, and water.Technical duplicates were assayed for each biological replicate.The plate was briefly centrifuged and assayed using a Bio-Rad CFX96 Touch Real-Time PCR Detection System.Amplification parameters were: initial denaturation 95ºC for 30 s; 40 cycles 95ºC denaturation for 15 s, 60ºC annealing and extension for 30 s. Melt curve analysis parameters were: 60ºC to 94.8ºC in 0.3ºC increment.Data analysis was completed using the comparative C T method ( C T ) using 7SL RNA as an internal loading control.

Analysis of mature rRNAs
MCF10A cells, hTERT RPE-1 cells, or HEK 293 Flp-In cells were treated with control siRNAs or microRNA mimics, and RNA was isolated as above.One μg of total RNA was resuspended in nuclease-free H 2 O and submitted to the Yale Center for Genomic Analysis for electropherogram analysis.Each experiment was conducted with either a Bioanalyzer 2100 or a Fragment Analyzer 5300 (Agilent).Mature rRNA ratios and mature rRNA relative peak areas were taken from the output reports.The data were graphed and analyzed by ANOVA followed by Holm-Šídák post-hoc testing in GraphPad Prism.
Northern blot analysis of pre-rRNA processing MCF10A cells were treated with siRNAs or microRNA mimics, and RNA was isolated as above.Northern blots were performed using 3 μg of total RNA as published ( 54 ,55 ), and were performed in at least biological triplicate.Blots were quantified with Image Lab 6.0.Dual-luciferase assay for RNAP1 promoter activity MCF10A cells were seeded at 30 000 cells per well in 1 ml of medium in 12-well plates and incubated at 37  ( 72 ) were used to separate total protein at 110 V for 2 h.Total protein was imaged using the ChemiDoc stain-free imaging protocol (Bio-Rad) to ensure even loading at the gel stage.Gels were UV-activated for 5 min in the Chemi-Doc.Following membrane transfer with the Trans-Blot Turbo system (Bio-Rad), blots were imaged again for total protein; these images are presented and quantified as blot loading controls.Immunoblotting was carried out using 5% (w / v) Omniblok dry milk (American Bio AB10109) in 1 × PBST (1 × PBS containing 5% (v / v) Tween) with primary antibodies listed below followed by 1:5000 peroxidase-linked anti-mouse or anti-rabbit IgG (Amersham NXA931 or NA934) as appropriate.Blots were developed using low-or high-sensitivity ECL reagent (Millipore WBKLS0500, Thermo Scientific 34094) for 5 min, then dried and imaged with the ChemiDoc.Images from the ChemiDoc were quantified using Image Lab 6.0.

Identification of putative microRNA binding sites
Candidate microRNA binding sites in transcripts coding for nucleolar proteins were identified using databases including T arBase 8 ( 63 ), T argetScan 7.2 ( 74 ), miRW alk 3 ( 75 ), and DIANA microT-CDS 2023 ( 76 ).Binding sites were computationally tested using the bimolecular DuplexFold algorithm on the RNAStructure Web Server ( 77 ).The sequence of either hsa-miR-28-5p or hsa-miR-708-5p was tested for in silico binding to 70 bp regions of target transcripts containing putative microRNA binding sites.Transcript regions bearing a scrambled binding site were also tested for binding by either mature microRNA sequence.Computed binding energies from DuplexFold are reported in this manuscript.The BLAST algorithm ( 78 ) was used to search the 45 pre-rRNA transcript (NR_046235.3) for potential binding sites for the 72 microRNA hits.BLAST matches were filtered for antisense (+ / -) pairings starting on microRNA nucleotide 1 or 2 with 0 gaps to select potential canonical seed binding sites in the 45S pre-rRNA precursor ( Supplementary Table S13 ).
MicroRNA UTR assays testing for direct interaction of microRNA mimics with putative mRNA targets MCF10A cells were seeded at 40 000 cells per well in 1 ml of medium in 24-well plates and incubated at 37  S14 ).The FLAG-tagged RPS28 plasmid is available on Addgene (plasmid #203156).Polyclonal transformants were isolated following 200 μg / ml Hygromycin B selection (Gibco 10687010).Briefly, 75 000 HEK 293 Flp-In cells were seeded in 1 ml of medium in 12-well plates and incubated at 37 • C for 24 h.Cells were then transfected with siNT or MIR-28 mi-croRNA mimics at 30 nM using RNAiMAX in a total transfection volume of 50 μl and were simultaneously induced with 1 μg / ml tetracycline (MilliporeSigma T7660) as indicated.Treated cells were incubated for 3 days, after which RNA was harvested using TRIzol as above.Electropherograms were collected to quantify the 28S / 18S ratio using 1 μg of total RNA for each sample as above.
miR-eCLIP analysis for identifying direct targets of MIR-28 family members miR-eCLIP was performed as detailed in ( 83 ).Briefly, 3.5 million MCF10A cells were seeded into 15 cm tissue culture dishes and incubated for 48 h in 10 ml medium.Cells were transfected with either hsa-miR-28-5p or hsa-miR-708-5p microRNA mimics at 30 nM using RNAiMAX (see above).After 27 h, cells were washed with 15 ml 1 × PBS, covered with 5 ml 1 × PBS, UV crosslinked with 254 nm light at 400 mJ / cm 2 , and collected by scraping.Approximately 10 million crosslinked cells from each microRNA mimic treatment were pooled and processed for miR-eCLIP sequencing and data analysis (Eclipse Bioinnovations, San Diego, CA).Raw reads for the AGO2 immunoprecipitation (IP) and the size-matched input are available at NCBI under BioProject accession PRJNA923105.Processed data are available in GEO accession GSE242755 and Supplementary Table S15 .Cumulative empirical distribution plots of RNAseq differential expression data for MIR-28 targets or non-targets were graphed and analyzed in R.

Results
A high throughput phenotypic screen for altered nucleolar number identifies 71 novel microRNAs that negatively regulate ribosome biogenesis Following on the success of our previous nucleolar numberbased screens for novel RB regulators ( 54 ,55 ), we hypothesized that microRNAs could also be functioning as nodes of control for RB.To discover novel microRNA negative regulators of ribosome biogenesis, we screened an arrayed library of 2603 human mature microRNA mimics (Dharmacon / Horizon Discovery) for their ability to alter nucleolar number 72 h after transfection into human MCF10A cells (Figure 1 A).While most MCF10A cells treated with a negative control non-targeting siRNA (siNT) display between two and four nucleoli, cells depleted of the tUTP NOL11 (siNOL11) have an increased probability of having one nucleolus per nucleus ('one-nucleolus phenotype') ( 54 ,84 ), and those depleted of the mitotic kinesin KIF11 (siKIF11) have an increased probability of having five or more nucleoli per nucleus ('5+ nucleoli phenotype') ( 55 ) (Figure 1 B, siNT, siNOL11 or siKIF11 panels).Using these siRNAs as controls, we employed our established high-throughput nucleolar number screening platform to count nucleolar number after overexpression of microRNA mimics by using the CellProfiler software to segment and enumerate FBL-stained nucleoli on a per-nucleus basis from images captured by an automated microscope.
We took several steps to ensure our screen's reproducibility and minimize false positives.We conducted the primary screen in biological triplicate, enabling robust calculation of one-nucleolus percent effect and 5+ nucleoli percent effect for each microRNA mimic.To assist with hit selection, we also calculated the strictly standardized mean difference (SSMD) for each microRNA mimic.SSMD is a more robust estimator of hit effect size than percent effect alone, as it also incorporates information on sample size and reproducibility (variance) while simultaneously controlling the false positive and false negative rates ( 85 ).Using these two cutoffs in concert allowed us to pick the strongest, most replicable hits.The percent effect cutoff was set at the top 2.5% for the one-nucleolus effect and at the top 1.0% for the 5+ nucleoli effect; the SSMD cutoff was set at 1.645 for both phenotypes, which allows identification of hits that are at least fairly strong ( 58 ).The median signal-to-background (S / B) for the controls was 2.74 or 2.64 for the one-nucleolus screen or the 5+ nucleoli screen, respectively.The median Z' factor for the one-nucleolus screen was 0.41, although the median Z' factor for the 5+ nucleoli screen was unfavorable at −0.43.Despite poor separation of controls for the 5+ nucleoli screen, reflected in the low median Z' factor, we felt confident that imposing a stricter percent effect cutoff and maintaining the strong SSMD cutoff would enable us to identify hits with reproducible increases in nucleolar number.
Using stringent cutoffs for mean percent effect and SSMD, the nucleolar number primary screen identified 64 one-nucleolus hits and 9 5+ nucleoli hits (Figure 1 C-D, Supplementary Tables S2 -S3 ).Of these hits, we found that 24 / 64 one-nucleolus hits and 2 / 9 5+ nucleoli hits were present in the MirGeneDB database, which catalogs evolutionarilyconserved microRNAs (Figure 1 C-D) ( 53 ).The SSMD cutoff approach allowed us to ignore several less reproducible hits with otherwise high mean percent effect values, mostly from the 5+ nucleoli side of the screen (Figure 1 C-D, bottom right quadrant of each graph).This total of 73 hits equates to an overall 2.8% hit rate.Inspection of images from top hits including hsa-miR -548an, hsa-miR -708-5p and hsa-miR-629-3p (Figure 1 B) confirmed that the relevant phenotype for each hit was observed.We performed a validation screen with replicates for the 73 hits, where each hit was picked from the original microRNA mimic library and rescreened on a new plate for a change in nucleolar number.Overall, 71 / 73 hits (97%) passed validation ( Supplementary Figure S1 , Supplementary Table S3 ).Interestingly, our hits show a statistically-significant difference in chromosomal distribution than the natural distribution of 1918 microRNA genes in the human genome cataloged by miRBase (chi-squared test P < 0.031; Supplementary Tables S4 -S5 ), which may warrant additional future investigation.The standardized residuals indicate the 71 hits are most enriched for loci on chromosomes 20 and 9 ( Supplementary Table S5 ).

Novel microRNA negative regulators of ribosome biogenesis preferentially target transcripts encoding proteins in the nucleolus or involved in cell cycle regulation
We hypothesized that the microRNA mimic hits from the primary screen would be more likely to target genes whose product is nucleolar or is involved in processes affecting the nucleolus.We sought to combine bioinformatic databases containing microRNA:target RNA pairs, MCF10A RNA expression data, and catalogs of known nucleolar proteins to test this hypothesis.To investigate the hits' targets, we utilized Tar-Base 8, a catalog of over 670 000 experimentally-validated microRNA:target RNA interactions collected from the literature ( 63 ).We filtered TarBase 8 to only include 373 890 human microRNA:target interactions with a 'down' regulatory relationship (Figure 2 A).We assembled an MCF10A RNA expression dataset from 4 independent RNAseq experiments (BioProject accessions PRJNA290557, PRJNA384982, PRJNA530983, PRJNA647393) to determine which RNAs were expressed in this cell line.We discovered 20 345 genes bearing one or more transcripts with a normalized (zTPM)   expression value greater than −3 in at least one RNAseq experiment ( Supplementary Table S8 ), supporting expression for each of these genes ( 59 ).We used our RNA expression dataset to further filter out all target genes that were not expressed in MCF10A cells from TarBase 8, after which 1074 microRNAs and 351 983 microRNA:target interactions remained (Figure 2 A).Furthermore, we merged three nucleolar protein databases (60)(61)(62) to create a nucleolar protein reference metadatabase containing 3490 unique nucleolar proteins (Figure 2 A, Supplementary Table S9 ).Using these three datasets, we labeled all microRNA:target interactions potentially present in MCF10A cells that contain targets encoding nucleolar proteins.Only 32 / 71 (45.1%) of the mi-croRNA hits had at least one experimentally-validated tar-get in TarBase 8 (Figure 2 A); however, this figure is consistent with the fact that only 1074 / 2603 (41.3%) microR-NAs in the primary screening library are represented in TarBase 8.

Mean one-nucleolus percent effect
To investigate our hypothesis that hit microRNAs preferentially target transcripts encoding nucleolar proteins, we compared the 32 hit microRNAs to the remaining 1042 nonhit microRNAs in our filtered, annotated TarBase 8 database (Figure 2 B).We counted the number of transcripts coding nucleolar proteins that are targeted by each microRNA.We find that the median number of nucleolar targets for the hit mi-croRNAs is 54, compared to the median value of 40 for 1042 non-hit microRNAs, supporting our hypothesis (Figure 2 B).We also identified 262 genes expressed in MCF10As that are targeted by at least 5 of the novel microRNA hits, representing the top 3.7% of genes most frequently targeted by the hits (Figure 2 A, Supplementary Table S10 ).GO analysis of these genes revealed enrichment for encoded functions in cell cycle regulation, TP53 signaling, cellular proliferation, and for localization within the nucleolus (Figure 2 C).We also identified 135 genes expressed in MCF10As that are targeted by at least 5 of the 24 novel microRNA hits present in TarBase 8 and MirGeneDB ( 53 ); these genes were similarly enriched for nucleolar localization and regulators of the cell cycle and growth ( Supplementary Figure S2 , Supplementary Table S10 ).Altogether, these data support the hypothesis that the novel microRNA hits preferentially target transcripts encoding proteins localized to the nucleolus or involved in cell cycle regulation.

A majority of novel microRNA negative regulators of ribosome biogenesis strongly inhibit nucleolar rRNA biogenesis
To more directly determine the extent to which the novel mi-croRNA hits can abrogate nucleolar function, we harnessed our laboratory's high-throughput assay for nucleolar rRNA biogenesis inhibition ( 56 ).Briefly, we previously optimized and miniaturized a nascent rRNA assay using 5-ethynyl uridine (5-EU) metabolic labeling to quantify changes in nucleolar 5-EU signal by co-staining for the nucleolar protein FBL / fibrillarin (Figure 3 A).Our assay is sensitive to defects in pre-rRNA transcription, processing, or modification, which we collectively refer to as nucleolar rRNA biogenesis ( 56 ).We established an empirical cutoff for nucleolar rRNA biogenesis inhibition of 50%, as we discovered that depletion of almost all RB factors we tested during validation caused inhibition at or above this value ( 56 ).Furthermore, factors involved in both pre-rRNA transcription and processing typically caused the highest percent inhibition values, followed by factors only involved in pre-rRNA transcription, pre-rRNA processing, or pre-rRNA modification, respectively.
We conducted a secondary screen with five biological replicates for nucleolar rRNA biogenesis inhibition on all 71 mi-croRNA mimics that passed primary screen validation as well as one additional microRNA mimic in the original library as a positive control, hsa-miR-34a-5p.We chose to include both hsa-miR-34a strands, as the MIR34A locus has been implicated in Wnt-mediated control of RB ( 86 ), and hsa-miR-34a-5p targets the RMRP RNA which is critical for pre-rRNA processing ( 87 ,88 ).Following treatment, cells were fixed and stained for FBL and 5-EU incorporation (Figure 3 B), before image processing and quantification were completed with CellProfiler.As expected, MCF10A cells treated with siNT had robustly active nucleoli (Figure 3 B, siNT panel), while positive control cells depleted of the RNAP1 subunit POLR1A (siPOLR1A) showed strongly decreased nucleolar rRNA biogenesis (Figure 3 B, siPOLR1A panel) ( 56 ).Acute treatment with BMH-21, a known small molecule inhibitor of RNAP1 activity ( 89 ,90 ), at 1 μM for 1 h also eliminated nucleolar 5-EU signal as expected ( 56 ) (Figure 3 B, siNT + BMH panel).For the secondary screen replicates ( n = 5), the median Z' factor was 0.27 and the median S / B was 1.88, indicating acceptable separation of controls.In additional experiments ( n = 2), we assayed the ability of the 27 evolutionarilyconserved MirGeneDB hits to inhibit nucleolar rRNA biogen-esis in two other human cell lines, hTERT RPE-1 and HeLa ( Supplementary Figure S3 ).
Remarkably, the full secondary screen in MCF10A cells indicated that 51 / 72 (70.8%) microRNA mimic hits assayed caused at least a 50% inhibition of nucleolar rRNA biogenesis (Figure 3 C, Supplementary Table S3 ).Notably, all eight 5+ hits tested strongly inhibited nucleolar rRNA biogenesis, with a mean percent inhibition of 92.6%.Comparing our results in MCF10A cells to those in hTERT RPE-1 and HeLa cells, we found that 17 / 27 (63.0%)MirGeneDB hits tested caused at least a 50% inhibition of nucleolar rRNA biogenesis in two or more of these cell lines ( Supplementary Figure S3 ).These data support the hypothesis that most microRNA hits from the primary screen significantly disrupt human nucleolar rRNA biogenesis, the main function of the nucleolus, in multiple human cell lines.
A di ver se subset of 15 microRNA hits was chosen for mechanistic follow-up We chose a tractable subset of 15 microRNA hits to further study for their specific effects on crucial RB subprocesses, including pre-rRNA transcription, pre-rRNA processing, and global protein synthesis.We prioritized selecting a diverse group of microRNAs that were representative of differences in key variables observed in the screening campaign, but also were considered to be authentic, valid mi-croRNAs by sequencing and evolutionary analyses.To this end, we conducted principal component analysis (PCA) to visualize screening and bioinformatic data in a dimensionreduced format (Figure 4 A-B).A major cluster appeared containing most one-nucleolus hits with relatively few validated nucleolar targets, accompanied by outliers that either were 5+ nucleoli hits or had a high number of validated nucleolar targets (Figure 4 A).To minimize the potential for studying biological false positives, we also classified hits according to their evolutionary conservation, as cataloged by Mir-GeneDB ( 53 ), or their sequencing read quality consistency across 28 866 small RNAseq experiments ( 91 ).The latter analysis accounted for the compliance of small RNAseq reads with Dicer processing rules, particularly the minimal variability of microRNA sequences at the 5 terminus, and for coverage across the microRNA precursor transcriptome annotation.
For further mechanistic assay validation, we combined our PCA analysis, microRNA conservation data, and literature curation to manually select 12 one-nucleolus hits, two 5+ nucleoli hits, and hsa-miR-34a-5p, which was not a hit in the primary screen but did significantly inhibit nucleolar rRNA biogenesis ( Supplementary Tables S6 -S7 ).The median nucleolar rRNA biogenesis percent inhibition of these hits was 72.2% with a range of −26.3% to 130.3%.Of these hits, 10 were recorded in MirGeneDB and classified as 'High Confidence' microRNAs annotated in miRBase ( 91 ), while 5 were not present in MirGeneDB and were classified as 'Low Confidence' microRNAs in miRBase ( 91 ) ( Supplementary Tables S3 , S6 , and S7 ).Therefore, our curated hit selection process significantly enriched for conserved microRNAs present in MirGeneDB, as compared to the full microRNA mimic library and set of primary screen hits (Figure 4 C).Additionally, 14 / 15 hits passed a tripartite filter for sequencing read quality consistency ( 91 ).Two microRNA hits from the MirGeneDB  ( 56 ).siNT negative control is set to 0% inhibition, and siPOLR1A positive control is set to 100% inhibition.Hit names are bolded for MirGeneDB members, and bar graphs are colored according to their primary screen phenotype; hsa-miR-34a-5p was not a primary screen hit but was included as described in the text.Mean ± SEM are shown alongside individual data points ( n = 5 biological replicates).Data were graphed in JMP.MIR-28 family ( 53 ), hsa-miR-28-5p and hsa-miR-708-5p, were included in the subset.

Not in MirGeneDB In
A subset of microRNA hits dysregulates pre-rRNA transcript levels and rDNA promoter activity Since many microRNA hits caused strong inhibition of nucleolar rRNA biogenesis in the secondary 5-EU screen, we hypothesized that these hits might dysregulate pre-rRNA transcription directly by targeting the 47S primary transcript.We tested this hypothesis by comparing the number of predicted canonical seed binding sites on the 47S pre-rRNA transcript (RefSeq transcript NR_046235.3)for each of the 72 hits with its corresponding mean nucleolar rRNA biogenesis percent inhibition ( Supplementary Figure S4 A; Supplementary Table S13 ).Surprisingly, we did not observe a strong correlation, with several hits having a high nucleolar rRNA biogenesis percent inhibition and zero predicted 47S binding sites.
To experimentally test the 15 subset hits' effects on RNAP1 transcription, we measured the steady-state levels of the 45S pre-rRNA transcript as a proxy for transcription after treatment with the subset of microRNA mimics or control siRNAs by RT-qPCR.( Supplementary Figure S4 B).While depletion of the positive control transcription (t)UTP NOL11 led to a stark decrease in 45S levels versus siNT as expected ( 84 ), none of the microRNA hits statistically-significantly altered 45S levels ( Supplementary Figure S4 C).However, 4 microRNA hits (hsa-miR -34a-5p, hsa-miR -212-5p, hsa-miR -330-5p, hsa-miR-526b-5p) caused a mean decrease in 45S levels of at least 50% [less than −1 log 2 (45S transcript levels)].However, considerable experimental noise with this assay may mask the true effect of these microRNA mimics on altering 45S transcript levels.
To further interrogate pre-rRNA transcription, we conducted a dual-luciferase reporter assay for RNAP1 promoter activity ( 71 ,92 ) after microRNA mimic expression.The assay uses a firefly luciferase reporter under the control of the rDNA promoter and a Renilla luciferase reporter constitutively driven by a CMV promoter to normalize for transfection efficiency ( Supplementary Figure S4 D).Strikingly, we found that treatment with the 15 microRNA hits had diverse effects on RNAP1 promoter activity: 3 microRNA mimics (hsa-miR-147a, hsa-miR-526b, hsa-miR-548an) caused a decrease in RNAP1 promoter activity; 5 microRNA mimics (hsa-miR-34a-5p, hsa-miR -124-3p, hsa-miR -330-5p, hsa-miR -629-3p, hsa-miR-646) caused an increase in RNAP1 promoter activity; and the other 7 microRNA mimics did not cause a significant effect ( Supplementary Figure S4 E).Compared to mock and siNT negative controls, siNT treatment followed by positive control 1 μM BMH-21 dosage significantly decreased RNAP1 promoter activity, while positive control NOL11 depletion caused a modest but statistically-insignificant defect ( Supplementary Figure S4 E).Conversely, depletion of the cytoplasmic 60S ribosomal subunit protein RPL4 had no effect on RNAP1 promoter activity ( Supplementary Figure S4 E).These results indicate that the microRNA hits do not reliably affect pre-rRNA transcription as measured by 5-EU incorporation or 45S transcript levels, while they may upregulate, downregulate, or have no effect on RNAP1 promoter activity.Future experiments may be required to better understand the interplay between microRNA activity and pre-rRNA transcription.
A subset of microRNA hits dysregulates maturation of the 30S pre-rRNA precursor Given the inconclusive ability of the microRNA hit subset to consistently regulate pre-rRNA transcription, we also hypothesized that these hits could affect pre-rRNA processing, another step in nucleolar rRNA biogenesis ( 56 ).We carried out northern blotting for all 15 microRNA mimic hits in the subset, probing for pre-rRNA processing intermediate molecules containing probes for either internal transcribed spacer 1 (ITS1, P3 probe for pre-40S defects) or ITS2 (P4 probe for pre-60S defects) (Figure 5 A).Surprisingly, ITS1 blots broadly demonstrated that most microRNA hits dysregulated maturation of the 30S pre-rRNA precursor (Figure 5 B-C).Ratio Analysis of Multiple Precursors calculations [RAMP, ( 70 )] indicated 3 major clusters of pre-rRNA processing defects caused by the microRNA hits, namely, no change (n.c.), a 30S down cluster, and a 30S up / 21S down cluster (Figure 5 C-D).The last cluster contained two subclusters, one with hits causing a moderate 30S processing defect ('30S ↑ , 21S ↓ ' in Figure 5 D) and another with hsa-miR-28-5p and hsa-miR-708-5p which caused a severe 30S processing defect ('30S ↑ , 21S ↓↓ ' in Figure 5 D).We also examined the extent to which ITS2 processing was dysregulated by the subset of microRNA hits ( Supplementary Figure S5 A).We discovered that hsa-miR-708-5p and the two 5+ nucleoli hits, hsa-miR-212-5p and hsa-miR-629-3p, each caused a mild increase in 32S levels and a mild decrease in 12S levels; additionally, hsa-miR-330-5p mildly attenuated levels of the 32S and 12S precursors ( Supplementary Figure S5 B-C).The remaining microRNA hits did not cause a significant change in ITS2-containing precursor levels.
We conducted Bioanalyzer electrophoresis to define the ability of the microRNA hits to modulate steady-state levels of mature 28S and 18S rRNAs (Figure 5 A).Bioanalyzer quantification revealed an increase in the mature 28S / 18S rRNA ratios for 3 microRNA hits, hsa-miR -28-5p, hsa-miR -708-5p, and hsa-miR-4730 (Figure 5 E), portending a defect in 18S maturation.Indeed, calculation of the ratio of mature 18S rRNA to total RNA from the electropherogram data showed stark decreases in 18S levels following treatment with any of these three microRNA mimics (Figure 5 E).Methylene blue staining corroborated the increase in 28S / 18S ratio for hsa-miR -28-5p and hsa-miR -708-5p (Figure 5 F), although this technique has lower precision than Bioanalyzer quantification.All three microRNA mimics with deficient mature 18S rRNA levels were in the severe 30S processing defect cluster, consistent with the pre-rRNA processing pathway (Figure 5 A), and hsa-miR-4730 clustered at the interface between the moderate and severe 30S defect subclusters (Figure 5 D).The other 12 microRNA mimic hits did not cause a significant change in mature rRNA levels.Together these results reveal, for the first time, the dysregulatory potential of microRNAs to interfere with major steps in pre-rRNA processing.

A subset of microRNA hits decreases global translation
We also hypothesized that overexpression of the microRNA hits might repress global translation.We used the SUnSET puromycin labeling assay ( 73 ) to quantify the extent to which each microRNA hit could inhibit global protein synthesis.Following a 72 h transfection period, MCF10A cells were metabolically labeled for 1 h with 1 μM puromycin, which is incorporated into nascent peptides.Total puromycin was measured as a proxy for global translation rate by immunoblotting total protein with an α-puromycin primary antibody.Treatment with 14 / 15 (93.3%) of the microRNA hits significantly decreased global protein synthesis relative to the negative nontargeting siRNA control (Figure 6 A-B).A similar decrease was observed for the positive control, an siRNA depleting the 60S ribosomal protein RPL4 (Figure 6 A-B).In total, these results indicate that nearly all of the subset microRNA hits inhibit global protein synthesis, the ultimate endpoint of ribosome biogenesis.
A subset of microRNA hits alters levels of TP53 or CDKN1A ( p21 ) Since treatment with many microRNA mimic hits significantly inhibited nucleolar rRNA biogenesis and normal pre-rRNA processing, we hypothesized that treatment with the hits may activate the nucleolar stress response via TP53 stabilization and CDKN1A ( p21 ) upregulation.The nucleolar stress response is induced following disruption of RB subprocesses or the normal tripartite nucleolar structure ( 17 , 19 , 93 ).Mechanistically, 5S RNP proteins including the 60S ribosomal proteins RPL5 or RPL11 can bind and sequester MDM2, the E3 ubiquitin ligase targeting TP53 for constitutive degradation.De-repression of TP53 then upregulates CDKN1A ( p21 ) expression, which acts in concert with TP53 to arrest the cell cycle and initiate apoptosis.By harnessing the WT TP53 status of MCF10A cells, we tested the first part of our hypothesis by immunoblotting for steady-state TP53 levels to assess how microRNA hit overexpression affected nucleolar stress response induction (Figure 6 C).Six of the 15 microRNA hits stabilized TP53 levels, while surprisingly, hsa-miR-629-3p significantly decreased steady-state levels of TP53 (Figure 6 C-D).The remaining 8 / 15 microRNAs did not cause significant dysregulation of TP53 levels.Depletion of either of the positive controls, tUTP factor NOL11 or the 60S ribosomal protein RPL4, strongly stabilized TP53 as expected (Figure 6 C-D).
We also investigated how the microRNA hits affected CDKN1A (p21) mRNA transcript levels by RT-qPCR.Again, depletion of the positive control NOL11 robustly increased CDKN1A levels as compared to siNT, while 11 / 15 mi-croRNA mimics also upregulated CDKN1A to a statisticallysignificant degree (Figure 6 E).The TP53 and CDKN1A data largely concur, although there are two notable discrepancies.First, hsa-miR -28-5p and hsa-miR -708-5p caused strong TP53 stabilization yet did not elicit measurable CDKN1A upregulation, which we address in the next section.Second, hsa-miR-629-3p strongly decreased steady-state TP53 levels while simultaneously inducing CDKN1A .These results indicate that the hits have diverse abilities to induce the nucleolar stress response for cell cycle interruption via upregulation of TP53 or CDKN1A .
Two microRNA hits, hsa-miR-28-5p and hsa-miR-708-5p, are family members who each downregulate RPS28 During our mechanistic studies of the 15 microRNA hits, we hypothesized that the two included MIR-28 family members, hsa-miR -28-5p and hsa-miR -708-5p, would elicit similar results from each assay because of their identical seed sequences.These two microRNAs share the same 7 nt seed sequence (A GGA GCU) (Figure 7 A) ( 53 ).Indeed, we observed similar  cellular RB phenotypes following hsa-miR-28-5p or hsa-miR-708-5p treatment, including the same aberrant pre-rRNA processing signature and stark decreases in both mature 18S rRNA levels and global protein synthesis (Figure 7 A, Supplementary Tables S6 -S7 ).To uncover potential mechanisms for these phenotypic changes, we conducted RNAseq and differential expression analysis for MCF10A cells treated with hsa-miR-28-5p or hsa-miR-708-5p mimics versus non-targeting siRNA (siNT) to control for nonspecific effects of small RNA transfection (Figure 7 B, Supplementary Table S11 ).We hypothesized that differential gene expression should correlate strongly between the two microRNA siblings on a per-gene basis, given that the microRNAs should be largely sharing targets due to their identical seed sequences.Remarkably, when graphing per-gene log 2 expression changes for hsa-miR-708-5p (y-axis) versus hsa-miR-28-5p (x-axis) relative to the negative control, the line of best fit was close to y = 0 + 1 x with an R 2 value of 0.61 (Figure 7 B).Such a strong correlation indicates that treatment with either hsa-miR-28-5p or hsa-miR-708-5p has a very similar effect on expression across the transcriptome, with the expression of individual genes increasing or decreasing, on average, to the same degree following treatment with either microRNA.These data strongly support the conclusion that treatment with hsa-miR-28-5p or hsa-miR-708-5p have similar phenotypic effects on MCF10A cells, and also that both microRNA hits cause highly-similar changes to the transcriptome as expected for two microRNAs with the same seed sequence.We validated the MIR-28-induced downregulation of RPS28 by each mi-croRNA mimic sibling at the transcriptomic and protein levels in MCF10A cells (Figure 7 C, Supplementary Figure S8 ).
Based on the results in our differential expression analysis, we chose to follow up on RPS28 , which was strongly downregulated by each MIR-28 sibling microRNA (Figure 7 B).RPS28 is a ribosomal protein component of the 40S ribosomal subunit, and its depletion in human cells is reported to cause the same 30S up, 21S down aberrant pre-rRNA processing signature ( 94 ,95 ) that we observed following treatment with hsa-miR -28-5p or hsa-miR -708-5p.Indeed, in our own hands, siRNA-mediated RPS28 knockdown reduces RPS28 mRNA and RPS28 protein levels ( Supplementary Figure S6 A-B), decreasing nucleolar number and repressing nucleolar rRNA biogenesis in MCF10A cells ( Supplementary Figure S7 A-C).While RPS28 is not essential for maintaining normal 45S transcript levels, it may have a role in supporting RNAP1 promoter activity ( Supplementary Figure S7 D-E).We verified that RPS28 depletion causes the 30S pre-rRNA precursor to accumulate at the expense of 18S rRNA maturation ( Supplementary Figure S7 F-H), inhibiting global protein synthesis at a level commensurate with the depletion of the large ribosomal protein, RPL4 ( Supplementary Figure S7 I).
We tested whether hsa-miR-28-5p or hsa-miR-708-5p mimic treatment in a second near-normal human cell line would yield similar results.We observed that, in hTERT RPE-1 cells, the MIR-28 siblings, miR-28-5p or hsa-miR-708-5p, cause highly-similar transcriptomic changes and extremely potent downregulation of the RPS28 mRNA in a dosedependent manner (Figure 7 D-E, Supplementary Figure S9 , Supplementary Table S12 ).We note that, despite some sensitivity of RPS28 levels to the negative control siRNA, MIR-28 sibling transfection caused a further dose-dependent decrease in RPS28 levels of at least 10-fold compared to siNT ( Supplementary Figure S9 ).Furthermore, MIR-28 overexpres-sion in hTERT RPE-1 also caused a stark decrease in mature 18S rRNA levels (Figure 7 F), as observed in MCF10A cells (Figure 5 E).
Given that both hsa-miR-28-5p and hsa-miR-708-5p treatment caused such a significant decrease in levels of RPS28 , we hypothesized that this transcript may be targeted by the MIR-28 family.While the TargetScan ( 74 ), miRWalk ( 75 ), and miRDB ( 96 ) algorithms failed to predict such an interaction, DIANA microT-CDS ( 76 ) identified two tandem putative MIR-28 family binding sites in the RPS28 mRNA 3 UTR ( Supplementary Figure S10 ).These tandem MIR-28 sites in RPS28 are perfect 6mer seed matches conserved only within a subset of primates ( Supplementary Figure S10 ); we hypothesize that this relatively low conservation throughout the animal kingdom may be the reason for false negatives using other prediction packages.By conducting in silico binding experiments using the DuplexFold algorithm on the RNAstructure server ( 77 ), we confirmed that the seed regions of both MIR-28 family members were predicted to favorably interact in silico with the candidate binding sites in RPS28 ( Supplementary Figure S11 A-D).Scrambling the sequence of both binding sites in the candidate region strongly abrogated the predicted interaction with hsa-miR-28-5p or hsa-miR-708-5p ( Supplementary Figure S11 E-H).
We sought to test the extent to which the MIR-28 siblings could target these putative RPS28 mRNA binding sites by using luciferase 3 UTR reporter assays that are widely used in the field (Figure 8 A) ( 79 ,97 ).Treatment with hsa-miR-28-5p or hsa-miR-708-5p inhibited luciferase reporter activity when the Renilla reporter contained the RPS28 candidate region with WT putative MIR-28 binding sites (Figure 8 B, WT lanes).Crucially, reporter assays containing the RPS28 candidate region with scrambled putative binding sites, where binding cannot occur, completely rescued the microRNA-induced reporter downregulation (Figure 8 B, SCR lanes), supporting the conclusion that these MIR-28 family microRNAs target the RPS28 3 UTR.
We also investigated if expression of a MIR-28-resistant RPS28 could rescue normal 18S rRNA levels in the presence of the MIR-28 siblings.We genomically engineered HEK 293 Flp-In T-REx cells with a Tet-inducible FLAG-tagged RPS28 cassette lacking the tandem MIR-28 binding sites normally present in the WT 3 UTR (Figure 8 C).We first verified that expression of the FLAG-tagged RPS28 protein was inducible only in the engineered cell line following treatment with 1 μg / ml tetracycline for 48 h ( Supplementary Figure S12 ).As expected, empty vector (EV) or non-induced RPS28 HEK 293 Flp-In cells treated with either MIR-28 sibling showed an increase in the 28S / 18S ratio versus the siNT control.This is consistent with the defect in mature 18S rRNA production that we observed for these microRNA mimics in MCF10A cells (Figure 8 D and Figure 5 E-F).In contrast, tetracyclineinduced expression of the MIR-28-resistant RPS28 construct restored normal 28S / 18S ratios in each of the MIR-28 sibling overexpression backgrounds (Figure 8 D).Coupled with the 3 UTR reporter assays (Figure 8 A-B), this rescue experiment provides evidence that hsa-miR-28-5p and hsa-miR-708-5p, the MIR-28 siblings, target RPS28 to inhibit human pre-18S processing, a key step in RB (Figure 9 ).
To search for other possible targets of the MIR-28 family, we conducted miR-eCLIP analysis for MIR-28 mi-croRNA:mRNA target chimeras ( 83 ).Briefly, treated cells were UV crosslinked, pooled, and an AGO2 immuno- precipitation was performed to isolate microRNAs actively complexed to their targets.Despite limit of detection challenges from modest AGO2 enrichment, sequencing revealed 9243 high-confidence reads consisting of mi-croRNA:mRNA chimeras representing functional target pairs ( Supplementary Table S13 ).Each overexpressed MIR-28 sibling was loaded into AGO2 at levels commensurate with highly-expressed endogenous microRNAs ( Supplementary Figure S13 A), illustrating that the functional microRNAome was not expropriated by these transfected microRNA mimics.We uncovered 31 genes in our MIR-28 mimic RNAseq dataset bearing transcripts targeted by both MIR-28 siblings ( Supplementary Figure S13 B), including MYC and CDKN1A ( p21 ) ( Supplementary Figure S14 A-D).An addi-RPS28 mRNA (NM_001031.tional 113 genes in our MIR-28 mimic RNAseq dataset were targeted by either MIR-28 sibling ( Supplementary Figure S13 ).Overall, direct MIR-28 targets identified by miR-eCLIP had reduced transcript levels following MIR-28 overexpression, as determined by RNAseq ( Supplementary Figure S13 D-F).
Our data that the MIR-28 siblings likely directly reduce MYC levels, providing another avenue of RB attenuation since MYC is an RNAP1 transcription factor ( 98 ).Targeting of MYC by the MIR-28 family has not previously been demonstrated in TarBase 8 ( 63 ).Furthermore, CDKN1A was directly targeted by the MIR-28 family, providing a logical explana-tion for the lack of change of CDKN1A mRNA levels in the face of TP53 upregulation that we observed (Figure 6 D-E, hsa-miR-28-5p or hsa-miR-708-5p).While RPS28 was not observed to be a direct target of hsa-miR-28-5p or hsa-miR-708-5p by miR-eCLIP, this inconclusive result may be the product of subpar AGO2 immunoprecipitation.Together, our findings support the ability of the MIR-28 family to perturb the transcriptome with a high degree of similarity among human cell lines, and to engage in direct post-transcriptional downregulation of the central RB regulator MYC and of RPS28 , an RP critical for normal 18S rRNA maturation.

Discussion
Our work represents the first systematic venture into uncovering the complex roles of microRNAs as governors of ribosome biogenesis.Using our unbiased high-content screening platform for changes in nucleolar number, we have uncovered 72 novel microRNA negative regulators of RB.Strikingly, 51 / 72 hits strongly inhibited nucleolar rRNA biogenesis as measured by nucleolar 5-EU incorporation, supporting a role for the hits in antagonizing RB.We highlight 27 validated, conserved mi-croRNA hits present in MirGeneDB, including 17 hits that induce strong repression of nucleolar rRNA biogenesis in multiple human cell lines.Stringent selection and mechanistic validation of a subset of 15 novel microRNA hits unexpectedly revealed a major effect of hit overexpression to be dysregulation of 30S pre-rRNA processing.Prior to our work, no specific microRNAs had yet been observed to directly affect pre-rRNA processing ( 35 ).While hits in the subset did not appear to reliably alter RNAP1 transcription, almost all subset hits inhibited global protein synthesis and caused upregulation of CDKN1A ( p21 ), with nearly half increasing TP53 steady-state levels.We hypothesized that the microRNA hits were acting by targeting mRNAs required for nucleolar function and reducing their levels.Bioinformatics of all the microRNA hits revealed that they were enriched for mRNA targets encoding proteins localized within the nucleolus or bearing functions in cell cycle progression, proliferation, or TP53 signaling, supporting our mechanistic hypothesis.
To further probe the hypothesis that microRNA mimic hits preferentially target nucleolar proteins, we focused on two MIR -28 family members, hsa-miR -28-5p and hsa-miR -708-5p.We chose them because they share the same 7 nt AG-GAGCU seed sequence and their overexpression causes the same RB defects, including a severe pre-18S rRNA processing defect.Comparison of RNAseq results following overexpression of either mimic resulted in highly-similar transcriptomic profiles in two distinct non-cancerous human cell lines, MCF10A (breast epithelial) and hTERT RPE-1 (retinal pigment epithelium), leading us to focus on RPS28 as a putative target.Using established experimental methods to assess targeting of microRNAs, we found evidence that these two MIR-28 family members target the RPS28 3 UTR, providing a convincing explanation for the observed pre-18S rRNA processing defect ( 94 ,95 ) and supporting our proposed model (Figure 9 ).Critically, we showed that conditional co-expression of the MIR-28 siblings with an RPS28 mRNA lacking MIR-28 binding sites in engineered HEK 293 cells was able to restore normal proportions of the 28S and 18S rRNAs.This rescue experiment supports our proposed model in which the targeting of RPS28 by the MIR-28 family members is the mechanism that underlies the RB defect.We note that, despite inconclusive results from our miR-eCLIP experiment on the question of direct MIR-28: RPS28 binding, positive results from UTR reporter and rescue experiments which feature appropriate controls are supportive of our proposed mechanistic model.We hope to further define the extent to which this mi-croRNA:target interaction is direct in future work.Taken together, our microRNA mimic screen has revealed evidence that the MIR-28 family targets the mRNA encoding the ribosomal protein RPS28, leading to a reduction in its levels with concomitant defects in RB and a decrease in cell viability in human cells.
Our unexpected discovery that the RPS28 3 UTR harbors primate-specific MIR-28 binding sites underscores the importance of conducting unbiased functional experiments in human systems to advance basic and translational biology.MIR-28 targeting of RPS28 had not previously been predicted or observed [reviewed in ( 41 )], which is surprising given the extensive published literature on this microRNA family and the ubiquitous presence of the essential RPS28 protein in all growing cells.The 2023 version of DIANA microT-CDS predicts the MIR-28 family binding sites in the 3 UTR of human RPS28 , though other common microRNA target prediction algorithms including TargetScan, miRDB, and miRWalk fail to predict these interactions.Furthermore, the tandem MIR-28 binding sites in the RPS28 3 UTR can be found only in the most related of primates, including humans, old world monkeys, and apes.We hypothesize that this lack of conservation may be the reason that previous experiments in model systems failed to identify these interactions despite strong conservation of RB and the MIR-28 family in animals.Our findings make a strong argument for undertaking unbiased screens in order to discover novel microRNA targets in human cells, instead relying solely on in silico predictions or non-human model organisms.
While our results point to the novel microRNA hits canonically acting to inhibit RB by post-transcriptionally downregulating target genes with nucleolar localization or functions in the cell cycle, we cannot exclude the possibility that these microRNAs may also have a more immediate role inside the nucleolus itself.A number of studies have defined nucleolar subsets of microRNAs in mammalian cells ( 38-40 ,99-100 ), though the function of nucleolar microRNAs remains poorly understood.It has been suggested that the nucleolus may serve as a staging platform for microRNAs to complex with target mRNAs outside the competitive, mRNA-rich cytoplasm ( 100 ), or perhaps that efflux of microRNAs from the nucleolus may be part of a stress response to the invasion of foreign genetic material ( 40 ).Additionally, AGO2 has been observed to bind to regions of rDNA possibly via rRNA-mediated tethering ( 37 ), though more research is needed to fully understand the potential of microRNAs to directly downregulate the 45S transcript.Previous studies ( 38 ,40 ) have observed the nucleolus to contain a number of our hits' families, including miR-19b, miR -25, miR -34a, miR -182, miR -183, miR -192, miR -330 and miR-629.However, in most cases, the microRNA strand was not indicated in these studies.Future investigation of nucleolar microRNAs may shed additional light on the potential direct impacts of the hits inside the nucleolus beyond the present scope.
Based on the promiscuous nature of microRNA activity, screens with microRNA mimics have an increased scale and complexity of direct regulatory perturbation compared to previous RB screening campaigns ( 54-55 ,101-103 ).While prior screens used siRNA technology to surgically deplete expression of a single gene, the transfection of a microRNA mimic may directly deplete tens to thousands of mRNAs.Simultaneous manipulation of multiple gene regulatory networks with microRNAs may lead to complex, potentially-discordant cellular phenotypes.This is because assay results report an integration of many more heterogeneous expression changes than in simpler single-gene siRNA experiments.One key example from this work is that the MIR-28 siblings elevate TP53 protein levels without a concomitant increase in TP53's downstream target, CDKN1A ( p21 ), as expected ( 104 ).Our miR-eCLIP data indicated that both MIR-28 microRNAs in fact directly target CDKN1A in vivo , as previously shown for hsa-miR-28-5p ( 105 ), resolving this discrepancy.While we found a straightforward explanation for this case, other discrepancies in our data likely remain.These results invite additional probing to improve our understanding of the complex functional perturbations associated with each microRNA hit.
Looking forward, we highlight our discovery of novel mi-croRNA negative regulators of RB in the context of cancer.Differential regulation of many of the 72 hits has been observed in various tumors ( 87 ,106-138 ), with hits including hsa-miR-28-5p and hsa-miR-708-5p ( 139-148 ) often acting as tumor suppressors.For example, hsa-miR-28-5p overexpression impedes growth of colorectal cancer cell lines and mice tumor xenografts ( 144 ) as well as prostate cancer cell lines ( 148 ).We propose that targeting of RPS28 by members of the MIR-28 family may be one novel aspect of their effectiveness as tumor suppressors.We emphasize the enrichment of the hits' targets for involvement in the cell cycle, which is tightly intertwined with RB, nucleolar formation and tumorigenesis ( 25 , 55 , 149 ).Additionally, microRNA mimic therapeutics are a promising avenue in oncology ( 150 ), though roadblocks including delivery and dosage have impeded achieving success in the clinic ( 151 ).Combining mimics and small molecules may enable inhibition of multiple targets or decrease required microRNA dose ( 151 ), as seen in clinical trials for hepatitis C ( 152 ,153 ).Given that cancer cells often hyperactivate ribosome production ( 24 ), our study underscores the potential of harnessing conserved microRNAs for chemotherapy as standalone therapeutics or in concert with other potent small molecule inhibitors of RB like BMH-21 to simultaneously target pre-rRNA transcription and processing.
Statistical tests are outlined above and in the Figure Legends.Biological replicates are shown in the figures and sample sizes are noted in the Figure Legends.Statistical tests were conducted in JMP or GraphPad Prism 8. Unless otherwise stated in the Figure Legends, tests were conducted using siNT as the comparator, and p value magnitude is represented as * P < 0.05; ** P < 0.01; *** P < 0.001.

ConductFigure 1 .
Figure 1.A screen for changes in nucleolar number re v eals 71 no v el microRNA mimic negative regulators of RB. ( A ) Screening campaign pipeline.MCF10A cells were reverse-transfected into a library of 2603 mature human microRNA mimics, then fixed and stained for DNA and FBL after 72 h in biological triplicate.The number of nucleoli was calculated using CellProfiler.Hits were called based on a decrease or increase in nucleolar number, respectively termed the one-nucleolus or 5+ nucleoli phenotypes.The primary screen identified 73 high-confidence hits, 71 of which passed hitpick validation screening.While not a primary screen hit, hsa-miR-34a-5p was included for further validation as described in the text.A functional secondary screen found that 51 / 73 hits strongly inhibited nucleolar rRNA biogenesis via 5-EU incorporation.Additional mechanistic assays for pre-rRNA transcription or processing, global translation, and nucleolar stress were carried out for a subset of 15 rigorously-selected hits.Further mechanistic studies were performed on two MIR-28 family siblings, hsa-miR-28-5p and hsa-miR-708-5p.( B ) Representative images from the screen showing 3 top hits alongside the negative control (non-t argeting , siNT) and positive controls for the one-nucleolus (siNOL11) or 5+ nucleoli (siKIF11) phenotypes.hsa-miR-548-an and hsa-miR-708-5p are hits with the one-nucleolus phenotype while hsa-miR-629-3p is a hit with the 5+ nucleoli phenotype.The DNA stain (Hoechst) is shown in blue and fibrillarin (FBL) antibody detection is shown in red.Scale bars, 10 μm. ( C ).Double flashlight plot for one-nucleolus hit selection.A total of 64 one-nucleolus hits were called using cutoffs for the top 2.5% of mean one-nucleolus percent effect and at least fairly strong hits where SSMD ≥ 1.645 as described in the text.Hits in MirGeneDB are shown in blue, while hits not in MirGeneDB are shown in red.Data were graphed in GraphPad Prism 8. ( D ) Double flashlight plot for 5+ nucleoli hit selection.A total of nine 5+ nucleoli hits were called using cutoffs for the top 1.0% of mean 5+ nucleoli percent effect and the same SSMD cutoff as abo v e. Hits in MirGeneDB are shown in blue, while hits not in MirGeneDB are shown in red.Data were graphed in GraphPad Prism 8.

Figure 3 .
Figure 3.A secondary screen re v eals 51 / 72 hits strongly inhibit nucleolar rRNA biogenesis.( A ). Schematic for the nucleolar rRNA biogenesis inhibition assay ( 56 ).Following 72 h of RNAi transfection, MCF10A cells were labeled for 1 h with 1 mM 5-ethynyl uridine (5-EU).Cells were fixed and immunostained for the nucleolar protein FBL and for 5-EU, then imaged.CellProfiler was used to segment nucleoli and calculate the median 5-EU intensity for all nucleoli per well, enabling calculation of the nucleolar rRNA biogenesis percent inhibition.( B ) Representative images of control-and hit-treated MCF10A cells f ollo wing 5-EU incorporation.FBL immunostaining and 5-EU click labeling are shown as separate channels.Scale bars, 10 μm.siNT is the non-targeting negative control siRNA.siNT + BMH is siNT-transfected cells treated with 1 μM BMH-21 for 1 h before and during 5-EU incorporation.siPOLR1A is the POLR1A (RPA1 94) knoc kdo wn positiv e control.( C ). Nucleolar rRNA biogenesis percent inhibition v alues f or 72 microRNA mimic hits.A total of 51 / 72 hits caused at least 50% inhibition of nucleolar rRNA biogenesis, surpassing the assay's empirical cutoff( 56 ).siNT negative control is set to 0% inhibition, and siPOLR1A positive control is set to 100% inhibition.Hit names are bolded for MirGeneDB members, and bar graphs are colored according to their primary screen phenotype; hsa-miR-34a-5p was not a primary screen hit but was included as described in the text.Mean ± SEM are shown alongside individual data points ( n = 5 biological replicates).Data were graphed in JMP.

Figure 4 .
Figure 4.A subset of 15 microRNA hits were rigorously selected for additional mechanistic validation.( A ) Principal component analysis (PCA) of 72 hits.The 15 selected subset hits are indicated with their mature microRNA name and are highlighted on the plot.Membership in MirGeneDB ( 53 ) or classification as high-confidence or low-confidence miRBase ( 91 ) is labeled with each hit's marker shape or color, respectiv ely.Fiv e v ariables w ere used f or PCA including one-nucleolus and 5+ nucleoli percent effect, percent viability, nucleolar rRNA biogenesis (NRB) percent inhibition, and number of validated nucleolar targets for each hit in TarBase 8. Percentages in axis labels denote the proportion of variance explained by each PCA Component.( B ) Loading plot describing contribution of 5 quantitative variables to PCA Components 1 and 2 from abo v e. Percentages in axis labels denote the proportion of variance explained by each PCA Component.( C ) Stacked bar graphs of the percent of each listed group of microRNA mimics present in MirGeneDB.

Figure 5 .Figure 6 .
Figure 5.Most subset microRNA hits dysregulate pre-18S pre-rRNA processing.( A ) Simplified diagram of human pre-rRNA processing intermediates.Mat ure 18S , 5.8S , and 28S rRNA regions are shown as blue (pre-40S) or red (pre-60S) rectangles.Intermediate names are indicated on the left, and external and internal transcribed spacers (ETS or ITS, solid black lines) are labeled at the top.Cleavage sites are labeled with their name and represented with triangles.Dotted lines signify transcribed spacer regions digested by exonucleases.Northern blot probes P3 (teal, ITS1) or P4 (dark red, ITS2) are shown at each pre-rRNA intermediate that they bind.( B ) Representative ITS1 (probe P3) northern blot of 3 μg of total RNA isolated from control-or hit-treated (as indicated) MCF10A cells.Pre-rRNA processing intermediates are labeled on the left.Images were quantified using Bio-Rad Image Lab. siNT is the non-targeting negative control and siNOL11 is the positive control.Hit names are bolded for MirGeneDB members.( C ) Clustered heatmap sho wing log 2 -transf ormed Ratio Analy sis of Multiple P recursor [RAMP ( 70 )] calculations f or microRNA hits, normaliz ed to siNT negativ e control.Values represent mean log 2 -scale RAMP ratio for n = 3. Clusters: no change (n.c., red); 30S down (magenta); 30S up, 21S down (mild defect, blue and severe defect, green).RAMP ratios were calculated in Microsoft Excel.Four clusters were assigned using hierarchical Ward clustering in JMP, and data were graphed in GraphPad Prism 8. Hit names are bolded for MirGeneDB members.( D ) Linear discriminant analysis (LDA) of four 30S pre-rRNA processing defect clusters from ( C ) abo v e. Cluster colors are the same as in ( C ). Canonical component biplot ra y s are shown in the graph.Ellipses represent 50% (outer) and 95% (inner) confidence le v els.Hit names are bolded for MirGeneDB members.Data were graphed in JMP. ( E ) Bioanalyzer analysis for 1 μg of total RNA isolated from control-or hit-treated MCF10A cells.Top, the 28S / 1 8S mature rRNA ratio; bottom, 1 8S mature rRNA / total RNA ratio.Mean ± SEM are shown alongside individual data points, colored by replicate.Hit names are bolded for MirGeneDB members.Data were graphed and analyzed b y ordinary one-w a y ANO V A with multiple comparisons against siNT (non-targeting negative control) and Holm-Šídák correction in GraphPad Prism 8. * P < 0.05; ** P < 0.01; *** P < 0.001.( F ) Methylene blue (MB) analysis of the 28S / 18S mature rRNA ratio.Northern blots from (B, C) were stained with MB, imaged, and quantified using Bio-Rad Image Lab.Mean ± SEM are shown alongside individual data points, colored by replicate.Hit names are bolded for MirGeneDB members.Data were graphed and analyzed by ordinary one-way ANO V A with multiple comparisons against siNT (non-targeting negative control) and Holm-Šídák correction in GraphPad Prism 8. * P < 0.05; ** P < 0.01; *** P < 0.001.

FFigure 7 .
Figure 7. MIR-28 siblings, hsa-miR-28-5p and hsa-miR-708-5p, highly-similar transcriptomes in human MCF10A and hTERT RPE1 cells, including a potent reduction in RPS28 le v els.( A ).Comparison of our results with the MIR-28 family members hsa-miR-28-5p and hsa-miR-708-5p, in MCF10A cells.The full mature microRNA sequence is shown for each sibling, with the shared A GGA GCU seed sequence indicated in blue.The table compares the RB phenotypes observed in MCF10A cells following treatment with either microRNA mimic.( B ). Regression comparing log 2 -scale RNAseq differential expression profiles of MCF10A cells treated with either hsa-miR-28-5p or hsa-miR-708-5p, relative to siNT negative control.Each dot represents one mRNA, with RPS28 labeled.The line-of-best-fit shown in blue, with equation, R 2 value, and p value for non-zero slope indicated in top left.The data from 3 biological replicates were graphed and analyzed in JMP. ( C ). MIR-28-induced RPS28 downregulation in MCF10A cells, measured by normalized RNAseq read counts or RT-qPCR analysis.For RNAseq reads, the mean ± SEM are shown alongside individual data points, colored by replicate (3 replicates).For RT-qPCR, data were normalized to 7SL RNA abundance as an internal control, then to siNT for comparison using the C T method.Mean ± SEM are shown alongside individual data points, colored by replicate (5 replicates).( D ).Regression comparing log 2 -scale RNAseq differential expression profiles of hTERT RPE-1 treated with either or hsa-miR-708-5p, relative to siNT negative control.Details are as written in (B).( E ).MIR-28 sibling-induced RPS28 downregulation in hTERT RPE-1 cells, measured by normalized RNAseq read counts or RT-qPCR analysis.Details are as written in (C).( F ). Bioanalyzer analysis of total RNA isolated from control-or MIR-28 sibling-treated hTERT RPE-1 cells.Left, the 28S / 18S mature rRNA ratio; right, 18S mature rRNA / total RNA ratio.Mean ± SEM are shown alongside individual data points, colored by replicate.Data were graphed and analyzed by ordinary one-way ANO V A with multiple comparisons against siNT (non-targeting negative control) and Holm-Šídák correction in GraphPad Prism 8. * P < 0.05; ** P < 0.01; *** P < 0.001.

Figure 8 .
Figure8.The MIR-28 siblings target tandem binding sites in the RPS28 3 UTR to interrupt pre-18S processing.( A ). Schematic for 3 UTR luciferase reporter assay for testing the targeting of RPS28 by the MIR-28 siblings, miR-28-5p and hsa-miR-708-5p.Diagram for the RPS28 mRNA transcript, indicating untranslated regions (UTR; dark blue) or coding sequence (CDS; light blue).P utativ e MIR-28 binding sites 1 and 2 are shown in red, and their wild-type (WT) sequence is given, along with the design for scrambled MIR-28 sites (SCR) rescue construct.Antisense pairing of the 6mer MIR-28 seed is shown with the putative WT binding sites.Note the perfect complement arit y bet ween the seed sequence and the predicted target.The WT or SCR regions of the RPS28 3 UTR containing both putative MIR-28 sites were cloned into the miR test site in the 3 UTR of the Renilla luciferase expression cassette in the psiCHECK2 plasmid.( B ).Quantification of 3 UTR luciferase reporter assa y s in (A), testing targeting of RPS28 by the MIR-28 siblings.The vector construct is indicated as wild-type (WT) or scrambled (SCR) above the transfection treatment labels.Luciferase data were collected for each treatment and construct combination, then normalized to the siNT treatment on a per-construct basis.RLU stands for relative light units.Mean ± SEM are shown alongside individual data points, colored by replicate (5 replicates).The data were analyzed by unpaired two-sided Welch's t -tests between WT and SCR constructs in GraphPad Prism 8. * P < 0.05.( C ). Schematic for rescue experiment conditionally expressing an RPS28 mRNA lacking the MIR-28 family binding sites to recover normal 28S / 18S mature rRNA ratio in the presence of MIR-28 sibling overexpression.(Top) Diagram of the engineered HEK 293 Flp-In T-REx cells containing a tetracycline-inducible FLAG-tagged RPS28 casset te lac king the WT 3 UTR harboring tandem MIR-28 binding sites.(Bottom) Experimental outline indicating steps for cell seeding, simultaneous MIR-28 mimic transfection and engineered RPS28 tetracycline induction, incubation, and mature rRNA analysis by electropherogram.( D ).Bioanalyzer analysis of the 28S / 18S mature rRNA ratio from engineered HEK cells, treated as indicated.Treatments include siNT siRNA or each of the MIR-28 sibling mimics, hsa-miR-28-5p or hsa-miR-708-5p.Induction of RPS28 expression with 1 μg / ml tetracycline (Tet) is indicated with a +.Cell line type was either the parental non-engineered line (empty vector, EV) or engineered line containing FLAG-tagged RPS28 lacking WT 3 UTR MIR-28 binding sites (FLAG-RPS28).Mean ± SEM are shown alongside individual data points, colored by replicate ( n = 3 biological replicates).Data were graphed and analyzed by ordinary one-way ANO V A with multiple comparisons against siNT (non-targeting negative control) and Holm-Šídák correction in GraphPad Prism 8. ns, not significant; *** P < 0.001.
nucleolar proteins was retrieved, and HGNC gene names were updated based on NCBI Entrez Gene ID using BioMart.An archived copy of NOPdb 3.0 ( 62 ) containing 2242 unique nucleolar proteins was retrieved, and HGNC gene names were updated based on International Protein Index (IPI) ID using the latest IPI database release (v.3.87) and BioMart.The three databases were merged on updated HGNC name, resulting in a metadatabase of 3490 unique nucleolar proteins ( Supplementary Table Three nucleolar protein databases were merged to create a nucleolar protein metadatabase.The Human Protein Atlas subcellular localization database (v.20.0) ( 60 , proteinatlas.org)containing 1350 unique nucleolar proteins was retrieved, and HGNC gene names were updated based on Ensembl Gene ID using BioMart.Proteins labeled with the GO terms 'Nucleoli', 'Nucleoli fibrillar center', or 'Nucleoli rim' were considered to be nucleolar.The T-cell nucleolar proteome ( 61 ) contain-ing 880 unique West Haven, CT) for polyA+ library preparation and sequencing.All samples had an RNA Integrity Number (RIN) of at least 9.6.30-50 million 100 bp paired-end reads were collected per sample using a NovaSeq 6000 Sequencing System (Illumina).RNAseq was conducted in biological triplicate.Partek Flow was used to process, align, and quantify reads.Reads were trimmed of adapters, then aligned to the hg38 genome with HISAT2 2.1.0.
MCF10A cells or hTERT RPE-1 cells were treated with siNT, hsa-miR -28-5p, or hsa-miR -708-5p and RNA was isolated as above.One μg of total RNA was resuspended in nucleasefree H 2 O and submitted to the Yale Center for Genomic Analysis ( ( 70 ) [Ratio Analysis of Multiple Precursors,( 70 )] ratios were calculated in Microsoft Excel, and heatmaps were made using the mean log 2 RAMP ratio for each treatment relative to siNT in Graph-Pad Prism.The following DNA oligonucleotides were radiolabeled for blotting: