-
PDF
- Split View
-
Views
-
Cite
Cite
Smarti Reel, Parminder S Reel, Josie Van Kralingen, Casper K Larsen, Stacy Robertson, Scott M MacKenzie, Alexandra Riddell, John D McClure, Stelios Lamprou, John M C Connell, Laurence Amar, Alessio Pecori, Martina Tetti, Christina Pamporaki, Marek Kabat, Filippo Ceccato, Matthias Kroiss, Michael C Dennedy, Anthony Stell, Jaap Deinum, Paolo Mulatero, Martin Reincke, Anne-Paule Gimenez-Roqueplo, Guillaume Assié, Anne Blanchard, Felix Beuschlein, Gian Paolo Rossi, Graeme Eisenhofer, Maria-Christina Zennaro, Emily Jefferson, Eleanor Davies, Identification of hypertension subtypes using microRNA profiles and machine learning, European Journal of Endocrinology, Volume 192, Issue 4, April 2025, Pages 418–428, https://doi.org/10.1093/ejendo/lvaf052
- Share Icon Share
Abstract
Hypertension is a major cardiovascular risk factor affecting about 1 in 3 adults. Although the majority of hypertension cases (∼90%) are classified as “primary hypertension” (PHT), endocrine hypertension (EHT) accounts for ∼10% of cases and is caused by underlying conditions such as primary aldosteronism (PA), Cushing's syndrome (CS), pheochromocytoma or paraganglioma (PPGL). EHT is often misdiagnosed as PHT leading to delays in treatment for the underlying condition, reduced quality of life and costly, often ineffective, antihypertensive treatment. MicroRNA (miRNA) circulating in the plasma is emerging as an attractive potential biomarker for various clinical conditions due to its ease of sampling, the accuracy of its measurement and the correlation of particular disease states with circulating levels of specific miRNAs.
This study systematically presents the most discriminating circulating miRNA features responsible for classifying and distinguishing EHT and its subtypes (PA, PPGL, and CS) from PHT using 8 different supervised machine learning (ML) methods for the prediction.
The trained models successfully classified PPGL, CS, and EHT from PHT with area under the curve (AUC) of 0.9 and PA from PHT with AUC 0.8 from the test set. The most prominent circulating miRNA features for hypertension identification of different disease combinations were hsa-miR-15a-5p and hsa-miR-32-5p.
This study confirms the potential of circulating miRNAs to serve as diagnostic biomarkers for EHT and the viability of ML as a tool for identifying the most informative miRNA species.
Identification of secondary forms of hypertension is key to targeted management and prevention of complications but stratification of patients is a challenging diagnostic process that can delay clinical treatment. Here, we present the development of a customized machine learning pipeline that uses circulating microRNA expression profiles to identify patients with endocrine forms of hypertension, including primary aldosteronism, Cushing's syndrome, and pheochromocytoma/paraganglioma. The key outcome of this work includes trained classifiers that can predict hypertension subtypes with AUC of up to 0.9 on an unseen test dataset. It provides highly compelling evidence that machine learning can be used to identify the most discriminating diagnostic circulating miRNAs and, ultimately, translate this approach into future clinical diagnostics.
Introduction
Hypertension is a significant health problem affecting a third of adults.1,2 Approximately 85%-95%3 of cases are defined as primary hypertension (PHT) while the remainder has a multiplicity of underlying causes, including such endocrine disorders as primary aldosteronism (PA), Cushing's syndrome (CS), pheochromocytoma or paraganglioma (PPGL). These are difficult to distinguish satisfactorily and, consequently, to treat effectively.4 Therefore, the identification of novel diagnostic biomarkers is a worthwhile endeavor that might also lead to more effective and specific treatments for hypertension in its various forms.5
MicroRNAs (miRNAs) are small, endogenously expressed, noncoding RNAs that negatively regulate cellular gene expression at the post-transcriptional level. MiRNA action has significant regulatory effects in many tissues and biological processes, including tumorigenesis and regulation of endocrine systems. Our previous work demonstrated that specific miRNAs produced by adrenocortical cells influence their secretion of aldosterone and cortisol. Furthermore, we found miRNA expression profiles of aldosterone-producing adenoma samples differ significantly from those of normal adrenal tissue, indicating disruption of miRNA production as a feature of this condition.6,7 We hypothesize such changes will be reflected in the levels of specific miRNAs present in the circulatory system. Such circulating miRNAs are released into the bloodstream from various tissues of the body, either actively or passively, in a process that is still poorly understood. Nevertheless, changes in circulating miRNA levels have proven worth as specific disease biomarkers and have been investigated in cases of hypertension, identifying certain miRNAs as having prognostic, diagnostic, and therapeutic value.8,9 Specifically, hsa-miR-21 and hsa-miR-510 showed potential owing to their increased circulating levels in hypertensive patients.10,11 Considering the markedly distinct molecular pathogenesis of the various endocrine hypertension (EHT) subtypes, we hypothesize that patients with PA, PPGL, and CS are likely to have distinctive circulating miRNA profiles.
Recently, machine learning (ML) methods have gained popularity in next-generation biomarker discovery initiatives using high throughput next-generation sequencing (NGS) data.12-14 This has provided the motivation to use ML algorithms for miRNA-disease association prediction.15,16 Moreover, ML has also been successfully employed to classify lung adenocarcinoma,17 melanoma,18 and tumor origin19 using miRNA profiles. Some studies have focused on associating individual miRNAs with specific EHT subtypes; hsa-miR-483-5p, hsa-miR-101, and hsa-miR-183 were each reported to associate with pheochromocytoma20,21 and Castro-Vega et al. have discussed the genomic features underlying PPGL, including associated changes in miRNA expression.22 Vetrivel et al. reported changes in adrenal tissue23 and circulating miRNA24 levels associated with different forms of CS. However, no studies have explored the use of supervised ML approaches for a combined classifier solution.
Recently, a multiomics study of a retrospectively collected cohort of EHT and PHT patients was conducted under the ENS@T-HT project.25 This took a multiomic approach, combining numerous datasets, including steroids (plasma and urinary), catecholamines, small metabolites, and plasma miRNAs, together with a ML pipeline in order to identify optimal diagnostic signatures that distinguish PPGL, PA, and CS from PHT. This analysis showed circulating miRNAs to have particularly high balanced accuracies within the generated multiomic diagnostic signatures.25 This indicated that circulating miRNAs may have value as diagnostic biomarkers in their own right, separate from the other elements of the multiomics signature. To test this, the present study uses an ML approach to analyze the circulating miRNA dataset in isolation from the other ENS@T-HT omics. As with that multiomic analysis, the key aim here was to stratify hypertensive patients among themselves. EHT (PA + PPGL + CS) and its subtypes (PA, PPGL, and CS) were classified from PHT using various supervised ML methods. The algorithm was also trained to classify different hypertensives into PPGL, PA, CS, and PHT groups with the ultimate objective of developing a circulating miRNA biomarker clinical test. The classification performance was evaluated through overall classification accuracy, specificity, and sensitivity. The most prominent miRNA features responsible for distinguishing hypertension subtypes were also identified.
Materials and methods
Omic dataset
EDTA-plasma samples were collected from 330 male or female patients aged between 11 and 78 years who had been diagnosed with 1 of the underlying 4 hypertension subtypes (PA, PPGL, CS, and PHT). Samples were provided by ENS@T-HT Horizon 2020 project collaborators from previous study archives. No formal sample size calculation was conducted. Specific inclusion and exclusion criteria for each hypertension subtype, including diagnostic criteria and ethics approval details are provided in Supplemental Section 1.1 and 1.2, respectively. All research was conducted in compliance with the Declaration of Helsinki.
Total RNA was isolated from 200 µL EDTA-plasma using the miRNeasy Mini kit (QIAGEN, Manchester, UK) standard protocol. Samples were eluted in 30 µL RNase-free water. cDNA was reverse-transcribed from 4 µL of undiluted RNA in a 20 µL reaction volume according to the standard protocol of the Universal cDNA synthesis kit II (Exiqon, Vedbaek, Denmark). For quality control purposes, plasma samples were spiked with UniSp2, UniSp4, and UniSp5 RNAs before RNA isolation and RNA samples were spiked with cel-miR-39-3p and UniSp6 cDNA during reverse transcription using components from the miRCURY LNA™ Universal microRNA PCR System RNA Spike-in kit (Exiqon). Selected plasma miRNAs were quantified using Serum/Plasma Focus microRNA PCR Panels (384-well, V4.M, Exiqon) according to their standard protocol, using ExiLENT SYBR® Green master mix (Exiqon) and ROX solution (Thermo Fisher, Renfrew, UK) on a Quantstudio 12 K Flex Real-time PCR System (Thermo Fisher). A full list of the 179 miRNAs and 13 controls analyzed on the panel is provided in Table S1.
Raw Ct data generated by qRT-PCR using the QuantStudio System were analyzed using GenEx software (v.6, MultiD Analyses, Vedbaek, Denmark). Interplate calibration was performed using UniSp3 amplification results; this enabled cross-plate comparisons. Quality control checks for both RNA isolation and cDNA preparation were performed using the spike-in controls; where the amplification value (cycle threshold, Ct) exceeded predefined limits the sample was flagged. For RNA isolation spike-in assays, this limit was +/− 2 Ct from the mean across all samples, and for cDNA synthesis spike-in assays the limit was +/− 1 Ct from the mean. Next, if fewer than 90% of miRNAs amplified in a sample, that sample was flagged. If a patient sample was flagged in 2 or more categories, it was excluded from further analysis. If a miRNA had >50% missing data it was excluded from further analysis.
Disease combinations
Table 1 provides a breakdown of the study patients by subtype, age, and sex.
Patient data for all disease types namely Cushing's syndrome (CS), primary aldosteronism (PA), pheochromocytoma or paraganglioma (PPGL), and primary hypertension (PHT).
Disease . | Patient Count (n=) . | Gender . | Age distribution . | ||
---|---|---|---|---|---|
Male . | Female . | Patient age ≥ 50 . | Patient age < 50 . | ||
CS | 35 | 5 | 30 | 17 | 18 |
PA | 109 | 58 | 51 | 44 | 65 |
PPGL | 75 | 31 | 44 | 43 | 32 |
PHT | 111 | 48 | 63 | 71 | 40 |
Disease . | Patient Count (n=) . | Gender . | Age distribution . | ||
---|---|---|---|---|---|
Male . | Female . | Patient age ≥ 50 . | Patient age < 50 . | ||
CS | 35 | 5 | 30 | 17 | 18 |
PA | 109 | 58 | 51 | 44 | 65 |
PPGL | 75 | 31 | 44 | 43 | 32 |
PHT | 111 | 48 | 63 | 71 | 40 |
Patient data for all disease types namely Cushing's syndrome (CS), primary aldosteronism (PA), pheochromocytoma or paraganglioma (PPGL), and primary hypertension (PHT).
Disease . | Patient Count (n=) . | Gender . | Age distribution . | ||
---|---|---|---|---|---|
Male . | Female . | Patient age ≥ 50 . | Patient age < 50 . | ||
CS | 35 | 5 | 30 | 17 | 18 |
PA | 109 | 58 | 51 | 44 | 65 |
PPGL | 75 | 31 | 44 | 43 | 32 |
PHT | 111 | 48 | 63 | 71 | 40 |
Disease . | Patient Count (n=) . | Gender . | Age distribution . | ||
---|---|---|---|---|---|
Male . | Female . | Patient age ≥ 50 . | Patient age < 50 . | ||
CS | 35 | 5 | 30 | 17 | 18 |
PA | 109 | 58 | 51 | 44 | 65 |
PPGL | 75 | 31 | 44 | 43 | 32 |
PHT | 111 | 48 | 63 | 71 | 40 |
Results for 5 disease combinations were assessed:
1. ALL vs ALL (ie, PA vs PPGL vs CS vs PHT)
2. EHT vs PHT
3. PA vs PHT
4. CS vs PHT
5. PPGL vs PHT
ML analysis pipeline
The ML analysis pipeline consisted of 3 key steps (see Figure 1). The first step involved exclusion of extreme outliers followed by random splitting of the data into training (80%) and testing (20%) sets in a stratified manner, according to established practice26 (see Table S2). In the second step, the most suited feature selection method and classifiers were chosen for each disease combination using random subsampling validation (using training/validation set). The top selected features were then saved for each scenario. Lastly, in step 3, the final model training and testing was conducted. The top performing classifiers were trained using the training set (with only the reduced feature set). The trained classifiers were then used to predict the disease type of testing data. These results were then evaluated using various performance metrics. The details of outlier detection, features selection, classifiers, and evaluation scenarios are provided in Supplemental Methods (see Supplemental Section 1.3).

The ML analysis pipeline with 3 steps. AAll features selected, filter method (CFS: correlation-based feature selection) and wrapper method (Boruta). BJ48, Naïve Bayes (NB), IBk, logitboost (LB), logistic model tree (LMT), simple logistic (SL), random forest (RF), and sequential minimal optimization (SMO). CPA vs PPGL vs CS vs PHT (ie, All vs All), EHT vs PHT, PA vs PHT, CS vs PHT and PPGL vs PHT. DScenario 1: Set A (all features inc. age and sex) vs Set B (all features exc. age and sex), Scenario 2: Set C (male patients) vs Set D (female patients) and Scenario 3: Set E (patient age ≥ 50) vs Set F (patient age < 50).
Results
Data generation
Samples from 346 patients were analyzed. Following quality checking, as detailed above, data from 330 of these were subjected to further analysis. Mean non-normalized Ct figures and standard deviations across all 330 patients for each measured miRNA are shown in Table S1. Data normalization was performed to enable direct comparison of the sample results. Normalization used 5 miRNAs identified as being most stably expressed across the dataset by Normfinder software27: hsa-miR-106a-5p, hsa-miR-425-5p, hsa-miR-222-3p, hsa-let-7g-5p, and hsa-let-7i-5p. Normalization was performed for each sample by subtracting the mean Ct value of the 5 normalizer miRNAs from each of the feature miRNA Ct values, generating the deltaCT values used for subsequent analysis. Nondetected miRNA values were imputed by assigning them the value (max + 1), where max was the maximum Ct detected for that miRNA across all samples. (Note that Ct value is inversely proportional to transcript quantity.) Imputation accounted for 1.44% of all data points.
Of the 179 unique human miRNAs measured, 5 were used for normalization and 1 (hsa-miR-208a-3p) was excluded on quality grounds; 173 human miRNAs were therefore put forward for further analysis. A total of 175 features were extracted; these comprised measured circulating levels for the 173 individual human miRNA species (see Table S1) plus the age and sex of the patient. The final dataset was cataloged in RDMP Software28 for systematic access.
ML pipeline analysis
Results of ML analysis for different disease combinations are described in this subsection. First, the performance of including/excluding outliers on classifier performance is evaluated, followed by selection of the best classifiers and feature selection method. Using these parameters, the top discriminating features are then selected for different evaluation scenarios using random subsampling. Finally, these features are used to train and test final models.
Outliers, best classifiers, and feature selection methods
Figure 2A shows balanced accuracy (ie, the mean of specificity and sensitivity), sensitivity and specificity plots for 8 classifiers, comparing results for the ALL vs ALL disease combination using 2 datasets ie, including and excluding outliers. Performance improved when outliers were removed before classifying any given disease combination. The best-performing 4 classifiers across the 3 performance metrics were logitboost (LB), logistic model tree (LMT), simple logistic (SL), and sequential minimal optimization (SMO). Therefore, outliers were removed for all further analysis.

Box plots comparing classification performance for ALL vs ALL disease combination (A) either excluding or including extreme outliers and (B) excluding outliers dataset for the top 4 classifiers using all features, filter, and wrapper feature selection methods.
Next, for the ALL vs ALL disease combination, 2 feature selection methods, wrapper (Boruta) and filter (Correlation-based Feature Selection; CFS) were applied. The classification was also performed by using a complete feature list (ie, no feature reduction). The balanced accuracies, sensitivities and specificities were compared for all 3 feature selection methods using the 4 selected classifiers LB, LMT, SL, and SMO.
Figure 2B compares classification results for these feature selection methods using the top 4 classifiers for the ALL vs ALL disease combination. It was observed that using all features for classification provided best results, followed by the wrapper (Boruta) method, with the filter (CFS) method lowest. However, the wrapper method used only the top discriminating features rather than all 175 features, and mean accuracies were comparable. The wrapper-based feature selection method “Boruta” improved the overall accuracy compared to using the filter selection method and was therefore used for all 5 disease combinations.
Evaluation scenarios
Classification performance
Figure 3 shows the various performance metrics for each of the 5 disease combinations, 3 scenarios, and 4 classifiers over 100 random repeats. In Scenario 1 (comparison of Set A vs Set B), for almost all disease combinations, both Sets A and B provided balanced accuracies with ∼1% variation. The balanced accuracy was highest for PPGL vs PHT (∼78%) and lowest for ALL vs ALL (∼66%). Also, the sensitivities for CS vs PHT and ALL vs ALL were low (<60%) compared to the corresponding specificities (>84%). However, the opposite case applied for EHT vs PHT, where the sensitivities were higher (>85%) compared to the corresponding specificities (<57%). Similar trends were observed for F1, AUC, and Kappa score. There was marginal difference between Sets A and B, indicating that age and sex are not highly differentiating features in circulating miRNA-based EHT classification.

Heatmap comparing classification performance for sets (A–F) using 4 classifiers for 5 disease combinations. The count in each box is a mean of 100 runs (random repeats).
The nonuniform number of samples in the different Sets of Scenarios 2 and 3 does not validate their direct metric comparison. However, it was useful in evaluating the discriminating features. The performance metrics were higher for Set D (female subset) than for Set C (male subset) in all disease combinations for the majority of classifiers, except for CS vs PHT where the specificity was lower for Set D. The lowest balanced accuracy (50%) and zero sensitivity (but 100% specificity) was observed for the male subset (Set C) during CS vs PHT classification due to an extremely low number of male samples compared to female samples (in Set D). Overall, Set D showed the highest balanced accuracy for PA vs PHT (83%) using the SMO classifier. Similar results were observed for the remaining 3 metrics.
The disease combinations were also compared on the basis of patient age, ie, ≥50 years (Set E) or <50 years (Set F). Better metrics were observed for Set E (older patients) for all disease combinations than for Set F except in CS vs PHT. The highest balanced accuracy of 80% was observed in EHT vs PHT (Set E) for all classifiers, except LB.
Significant features
Figure 4A shows a list of the most significant features used during 100 random repeats (100RR) for the different disease classifications and scenarios. The list shows only the features used >50 times. The largest number of features were selected for the ALL vs ALL classification, while the fewest features were selected for CS vs PHT, several of which were exclusive to this classification (hsa-miR-495-3p, hsa-miR-485-3p, hsa-miR-186-5p, hsa-miR-1260a, and hsa-miR-195-5p). Similarly, 2 separate sets of features (hsa-miR-139-5p, hsa-miR-326 and hsa-miR-223-3p, and hsa-miR-133a-3p) were exclusively selected in PA vs PHT classification for Set D (the female subset) and Set E (elder patients), respectively.

Significant features selected during (A) 100RR for the different disease classifications and scenarios, and (B) final classifier training and testing. In Part (A), the count in each box represents the number of times a particular feature was selected during 100RR and only features used >50 times are selected.
Figure 4B shows the joint list of features in Set A of all 5 disease combinations. The hsa-miR-15a-5p and hsa-miR-32-5p were common to all 5 disease classifications, while hsa-miR-424-5p and hsa-miR-574-3p were important features only for CS vs PHT and PA vs PHT, respectively. Age was observed to be a noteworthy feature for ALL vs ALL and PA vs PHT classification.
Final model training and testing
In the final step of the ML pipeline, significant features of each disease combination were used to create a subset of the training dataset. Once trained, the chosen classifiers were tested on the testing set. Table 2 shows the classification results for the top performing classifiers on test data. EHT vs PHT achieved the best classification (balanced accuracy: 89%, sensitivity: 95%, AUC: 0.9) using LMT classifier on 67 test samples (see Figure S2 for confusion matrices). Similarly, for PPGL vs PHT, LB classifier provided 85% balanced accuracy with corresponding sensitivity and AUC of 87% and 0.9. However, performance metrics for ALL vs ALL, CS vs PHT and PA vs PHT were marginally lower, with balanced accuracy of 73%, 71%, and 73%, respectively.
Classification results on test set for 5 disease comparisons of top performing classifiers.
Disease combination . | Classifier . | Performance metrics (test set) . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Total instances . | Correctly classified . | Incorrectly classified . | Balanced accuracy (%) . | Sensitivity (%) . | Specificity (%) . | AUC . | F1 . | Kappa . | ||
ALL—ALL | LMT | 67 | 45 | 22 | 73 | 58 | 88 | 0.8 | 0.6 | 0.5 |
EHT—PHT | LMT | 67 | 61 | 6 | 89 | 95 | 83 | 0.9 | 0.9 | 0.8 |
CS—PHT | LB | 30 | 26 | 4 | 71 | 43 | 100 | 0.9 | 0.6 | 0.5 |
PA—PHT | LMT | 45 | 33 | 12 | 73 | 64 | 83 | 0.8 | 0.7 | 0.5 |
PPGL—PHT | LB | 38 | 32 | 6 | 85 | 87 | 83 | 0.9 | 0.8 | 0.7 |
Disease combination . | Classifier . | Performance metrics (test set) . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Total instances . | Correctly classified . | Incorrectly classified . | Balanced accuracy (%) . | Sensitivity (%) . | Specificity (%) . | AUC . | F1 . | Kappa . | ||
ALL—ALL | LMT | 67 | 45 | 22 | 73 | 58 | 88 | 0.8 | 0.6 | 0.5 |
EHT—PHT | LMT | 67 | 61 | 6 | 89 | 95 | 83 | 0.9 | 0.9 | 0.8 |
CS—PHT | LB | 30 | 26 | 4 | 71 | 43 | 100 | 0.9 | 0.6 | 0.5 |
PA—PHT | LMT | 45 | 33 | 12 | 73 | 64 | 83 | 0.8 | 0.7 | 0.5 |
PPGL—PHT | LB | 38 | 32 | 6 | 85 | 87 | 83 | 0.9 | 0.8 | 0.7 |
Abbreviations: EHT, endocrine hypertension; PHT, primary hypertension; CS, Cushing's syndrome; PA, primary aldosteronism; PPGL, pheochromocytoma or paraganglioma; LMT, logistic model tree; LB, logitboost.
Classification results on test set for 5 disease comparisons of top performing classifiers.
Disease combination . | Classifier . | Performance metrics (test set) . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Total instances . | Correctly classified . | Incorrectly classified . | Balanced accuracy (%) . | Sensitivity (%) . | Specificity (%) . | AUC . | F1 . | Kappa . | ||
ALL—ALL | LMT | 67 | 45 | 22 | 73 | 58 | 88 | 0.8 | 0.6 | 0.5 |
EHT—PHT | LMT | 67 | 61 | 6 | 89 | 95 | 83 | 0.9 | 0.9 | 0.8 |
CS—PHT | LB | 30 | 26 | 4 | 71 | 43 | 100 | 0.9 | 0.6 | 0.5 |
PA—PHT | LMT | 45 | 33 | 12 | 73 | 64 | 83 | 0.8 | 0.7 | 0.5 |
PPGL—PHT | LB | 38 | 32 | 6 | 85 | 87 | 83 | 0.9 | 0.8 | 0.7 |
Disease combination . | Classifier . | Performance metrics (test set) . | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Total instances . | Correctly classified . | Incorrectly classified . | Balanced accuracy (%) . | Sensitivity (%) . | Specificity (%) . | AUC . | F1 . | Kappa . | ||
ALL—ALL | LMT | 67 | 45 | 22 | 73 | 58 | 88 | 0.8 | 0.6 | 0.5 |
EHT—PHT | LMT | 67 | 61 | 6 | 89 | 95 | 83 | 0.9 | 0.9 | 0.8 |
CS—PHT | LB | 30 | 26 | 4 | 71 | 43 | 100 | 0.9 | 0.6 | 0.5 |
PA—PHT | LMT | 45 | 33 | 12 | 73 | 64 | 83 | 0.8 | 0.7 | 0.5 |
PPGL—PHT | LB | 38 | 32 | 6 | 85 | 87 | 83 | 0.9 | 0.8 | 0.7 |
Abbreviations: EHT, endocrine hypertension; PHT, primary hypertension; CS, Cushing's syndrome; PA, primary aldosteronism; PPGL, pheochromocytoma or paraganglioma; LMT, logistic model tree; LB, logitboost.
Discussion
In this study, a supervised ML pipeline was employed to classify and distinguish EHT and its subtypes (PA, PPGL, and CS) from PHT using circulating miRNA data previously analyzed in a multiomic context under the ENS@T-HT retrospective study.25 In this singleomic analysis, the top discriminating miRNA biomarkers were identified for different classification scenarios and the classification performance was evaluated through overall classification balanced accuracy, specificity, sensitivity, AUC, F1 score, and Kappa. The Boruta-based feature selection approach and exclusion of extreme outliers provided an overall better performance on miRNA data.
It was observed that circulating miRNA helped classify EHT and PPGL from PHT in the test set with balanced accuracy of 89% and 85%, respectively. Overall, hsa-miR-15a-5p and hsa-miR-32-5p were the most important biomarkers for classifying all disease combinations from each other. Also, 2 pairs of miRNAs (hsa-let7d-3p, hsa-miR-335-5p and hsa-miR-162-3p, and hsa-miR-15b-3p) were important in classifying all disease combinations except CS vs PHT and PPGL vs PHT. However, hsa-miR-424-5p, hsa-miR-495-3p, hsa-miR-186-5p, hsa-miR-1260a, hsa-miR-485-3p, hsa-miR-195-5p, and hsa-miR-301a-3p were uniquely selected for CS vs PHT classification, while hsa-miR-629-5p, hsa-miR-92a-3p, and hsa-miR-423-5p were used for both CS vs PHT and ALL vs ALL classification. Alongside other miRNAs, hsa-miR-30c-5p was one of the significant identifiers for ALL vs ALL and EHT vs PHT. Also, hsa-miR-148b-3p, hsa-miR-107, hsa-miR-27b-3p, hsa-miR-324-5p, and hsa-miR-199a-5p were observed to be essential biomarkers for EHT vs PHT (selected for all 100 random repeats). Age appeared as a significant feature only in 2 classifications: ALL vs ALL and PA vs PHT. In the evaluation scenarios (Step 2 of the ML pipeline), it was also observed that female patients (Set D) and older patients (Set E) were better classified with higher balanced accuracies than male patients (Set C) and younger patients (Set F), except in the case of CS vs PHT where younger patients had higher classification accuracy than older patients. However, these trends are likely to be sensitive to the class imbalance, with more female CS patients than males.
The rationale for investigating circulating miRNA levels in the context of EHT is 2-fold. First, circulating miRNAs have shown promise in the diagnosis of several conditions where significant and consistent changes have proved sufficiently informative to justify their use for diagnostic purposes, if certain technological and methodological limitations can be overcome.29,30 Second, we and others have demonstrated that miRNAs have significant roles in regulatory mechanisms that underpin endocrine homeostasis, such as corticosteroid secretion. Disruption of these contributes to PA and CS, with affected adrenal tissue or tumors showing marked changes in miRNA level relative to healthy tissue.7 Although the mechanisms by which miRNAs enter the circulation remain unclear, it is reasonable to hypothesize that such changes at the tissue level might be reflected in the circulation, either as a result of altered miRNA expression within the diseased tissue, or through its impact on the miRNA expression and secretion of other tissues.31
This study identifies several circulating miRNAs as potentially valuable discriminatory features in the diagnosis of EHT (Figure 4A). Few of these have previously been highlighted by studies of these or similar conditions and this lack of correlation is probably due to several factors. First, this work has focused on distinguishing various forms of EHT from PHT whereas previous studies attempted to discriminate between other combinations of diseased (and healthy) individuals,32,33 or else concentrated on pathological or prognostic features of a single condition and/or its subtypes.34-36 Methodological differences may also play a part, and it is probable that our investigation of free circulating miRNAs would have had different outcomes if we had chosen to examine serum or vesicle-associated miRNAs rather than plasma. Finally, the normalizing of raw expression data together with the novel ML approach to identifying feature miRNAs must also be acknowledged as key differences between this and previous studies; these processes have been carefully designed to improve consistency of analysis and to aid its adoption in a multicenter clinical setting as a common diagnostic tool.
Detailed investigation and description of possible tissue sources, genetic targets and pathophysiological roles of the feature miRNAs identified in this study lies beyond the scope of this particular investigation. However, it is worth noting that the key classifying biomarkers identified here have been flagged in previous studies of hypertension. For example, we observed a decrease in circulating levels of hsa-miR-15a-5p in primary hypertensive patients that was not seen in endocrine hypertensive patients. Variability in circulating levels of this miRNA with different forms of hypertension was previously noted by Nandakumar et al., who observed a rise in circulating hsa-miR-15a-5p in hypertensive patients relative to those with hypertension and chronic kidney disease.33
Previous studies examining adrenal effects on circulating miRNA levels have primarily been concerned with differentiating adrenocortical adenoma from adrenocortical carcinoma. These have consistently shown raised circulating levels of hsa-miR-483-5p to correlate with adrenocortical carcinoma37 and, perhaps unsurprisingly, this does not emerge as one of our classifying features. However, it is interesting to note that one such study observed raised hsa-miR-210-3p levels in cortisol-producing adrenocortical carcinoma,38 as this miRNA is one of the classifying features we identified for CS. We also found this miRNA to be one of the classifying features in our PPGL vs PHT comparison, consistent with previous studies associating hsa-miR-210-3p with PPGLs,39 and identifying its serum levels as a potential biomarker of PPGL malignancy.36 An investigation of phaeochromocytoma tissue identified 18 miRNAs as being differentially expressed between malignant and benign samples; in addition to the aforementioned hsa-miR-15a-5p,40 these included hsa-miR-574-3p and hsa-miR-451a which we find to be classifiers of PPGL vs PHT comparison, but only within the female subset (Set D). A study by Vetrivel et al. of circulating miRNAs was primarily concerned with CS diagnosis and classification rather than its differentiation from primary and/or other endocrine forms of hypertension.24 However, of the 8 miRNAs they identified by NGS as being differentially expressed between their 3 study groups (ACTH-dependent CS, ACTH-independent CS, and non-CS controls), we have measured 3 and these include hsa-miR-629-5p, which we find to be one of our strongest classifiers of CS vs PHT. Vetrivel et al. were subsequently unable to validate the differential expression of circulating hsa-miR-629-5p between the 2 forms of CS using RT-PCR, but it should be noted that their protocol involved normalization to a single “housekeeping” circulating miRNA, hsa-miR-16-5p that we find to be a significant classifier for PA vs PHT. The same investigators also used NGS to identify possibly one of our most discriminating miRNA (hsa-miR-486-5p) for CS vs PHT, as 1 of 5 miRNAs commonly downregulated in CS adrenal tissue relative to control but again they were unable to validate their NGS data using RT-PCR.23 Finally, we previously identified significant differences in specific miRNA levels between APAs and normal adrenal tissue,6,7 but none of these emerged here as classifying features of PA in their circulating form, except for hsa-miR-22-3p (only in subset C: male).
Balanced accuracy and AUC were used as the primary metrics to assess performance of the ML models throughout, as they are unaffected by disease prevalence, unlike standard or conventional accuracy metrics. Nevertheless, the estimates derived using these performance metrics will inevitably be more imprecise where sample size is small. Indeed, although this study provided crucial insight into a complex multiclass problem, one key shortcoming was fewer CS samples due to the rarity of the disease. The ENS@T-HT study is capturing data prospectively from a larger population, which should overcome this limitation in the future (ClinicalTrials.gov Identifier: NCT02772315).
Conclusion
This study predicted different subtypes of hypertension using circulating miRNA data. A ML approach using 5 disease combinations and 8 supervised ML classifiers was introduced and different scenarios were evaluated based on age and sex bifurcation. The ML approach provided promising classification results and a reduced set of features that have potential as formal biomarkers for detection of hypertension subtypes. Confirmation of our findings using the same methodology in a different study population is now required. To this end, we are currently conducting a separate prospective study of circulating miRNA levels in EHT patients specifically to test the reproducibility of our findings and confirm the diagnostic utility of these circulating miRNAs as accessible high-throughput biomarkers.
Acknowledgments
Data related to the results presented in this article can be obtained upon reasonable request by researchers who provide a methodologically sound proposal. This is subject to approval by the ENSAT-HT executive committee, which includes representatives of the biosamples collections and omics data generation centers. Researchers would be required to complete a Data Sharing Agreement. All requests should be directed by email to the corresponding author.
Supplementary material
Supplementary material is available at European Journal of Endocrinology online.
Funding
This project was funded by the European Union's Horizon 2020 Research and Innovation Programme under Grant Agreement No. 633983. This work was supported by the Clinical Research Priority Program of the University of Zurich for the CRPP HYRENE and by the Deutsche Forschungsgemeinschaft (DFG) within the CRC/Transregio 205/1 “The Adrenal: Central Relay in Health and Disease” to F.B. C.P. is funded by Deutsche Forschungsgemeinschaft (314061271-TRR/CRC 205- 1/2). S.L. is a PhD student funded by the Medical Research Council (MRC) Precision Medicine Doctoral Training Programme (MR/N013166/1). For the purpose of open access, the author(s) has applied a Creative Commons Attribution (CC BY) license to any Author Accepted Manuscript version arising from this submission.
Authors’ contributions
Smarti Reel (Data curation [equal], Formal analysis [lead], Investigation [equal], Methodology [equal], Software [equal], Validation [equal], Visualization [lead], Writing—original draft [lead], Writing—review & editing [equal]), Parminder Singh Reel (Conceptualization [equal], Data curation [equal], Formal analysis [lead], Investigation [equal], Methodology [equal], Software [equal], Validation [equal], Visualization [equal], Writing—original draft [supporting], Writing—review & editing [equal]), Josie Van Kralingen (Data curation [equal], Formal analysis [equal], Investigation [equal], Methodology [equal], Project administration [equal], Software [supporting], Validation [equal]), Casper K. Larsen (Data curation [equal], Formal analysis [equal], Investigation [equal], Methodology [equal], Project administration [equal], Software [supporting], Validation [equal]), Stacy Robertson (Data curation [equal], Formal analysis [equal], Investigation [supporting], Methodology [equal], Project administration [equal], Software [supporting], Validation [equal]), Scott M. MacKenzie (Data curation [equal], Formal analysis [equal], Funding acquisition [supporting], Investigation [equal], Methodology [equal], Project administration [equal], Software [supporting], Supervision [supporting], Validation [equal], Visualization [supporting], Writing—original draft [supporting],), Writing—review & editing [equal]), Alexandra Riddell (Data curation [equal], Formal analysis [equal], Investigation [supporting], Methodology [equal], Project administration [equal], Software [supporting], Validation [equal]), John D. McClure (Data curation [equal], Formal analysis [equal], Funding acquisition [supporting], Methodology [equal], Software [supporting], Validation [equal], Writing—review & editing [supporting]), Stelios Lamprou (Formal analysis [supporting], Validation [supporting], Writing—review & editing [supporting]), John M.C. Connell (Conceptualization [supporting], Investigation [supporting], Software [supporting], Writing—review & editing [supporting]), Laurence Amar (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Alessio Pecori (Data curation [supporting], Investigation [supporting], Resources [supporting], Validation [supporting], Writing—review & editing [supporting]), Martina Tetti (Data curation [supporting], Investigation [supporting], Resources [supporting], Validation [supporting], Writing—review & editing [supporting]), Christina Pamporaki (Data curation [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Marek Kabat (Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Filippo Ceccato (Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Matthias Kroiss (Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Michael C. Dennedy (Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Anthony Stell (Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Software [supporting]), Jaap Deinum (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Paolo Mulatero (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Supervision [supporting], Validation [supporting], Writing—review & editing [equal]), Martin Reincke (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [equal]), Anne-Paule Gimenez-Roqueplo (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Methodology [supporting], Resources [supporting], Writing—review & editing [supporting]), Guillaume Assié (Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting]), Anne Blanchard (Data curation [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [supporting]), Felix Beuschlein (Conceptualization [supporting], Data curation [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [equal]), Gian Paolo Rossi (Conceptualization [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [equal]), Graeme Eisenhofer (Conceptualization [supporting], Funding acquisition [supporting], Investigation [supporting], Resources [supporting], Writing—review & editing [equal]), Maria Christina Zennaro (Conceptualization [supporting], Funding acquisition [lead], Investigation [supporting], Methodology [supporting], Project administration [supporting], Resources [supporting], Supervision [supporting], Writing—original draft [supporting], Writing—review & editing [equal]), Emily Jefferson (Conceptualization [supporting], Data curation [equal], Formal analysis [lead], Investigation [supporting], Methodology [supporting], Project administration [supporting], Software [supporting], Supervision [supporting], Visualization [supporting]), and Eleanor Davies (Conceptualization [supporting], Data curation [equal], Formal analysis [equal], Funding acquisition [supporting], Investigation [lead], Methodology [equal], Project administration [lead], Resources [equal], Supervision [lead], Validation [equal], Visualization [supporting], Writing), and—original draft [equal], Writing—review & editing [equal])
References
Author notes
S.R., P.S.R., and J.C.v.K. contributed equally.
E.D. and E.R.J. contributed equally.
Conflict of interest: Co-authors Guillaume Assie and Felix Beuschlein are on the editorial board of EJE. They were not involved in the review or editorial process for this paper, on which they are listed as authors.