Abstract

Context

Polycystic ovary syndrome (PCOS) is among the most common endocrine disorders of premenopausal women, affecting 5% to15% of this population depending on the diagnostic criteria applied. It is characterized by hyperandrogenism, ovulatory dysfunction, and polycystic ovarian morphology. PCOS is highly heritable, but only a small proportion of this heritability can be accounted for by the common genetic susceptibility variants identified to date.

Objective

The objective of this study was to test whether rare genetic variants contribute to PCOS pathogenesis.

Design, Patients, and Methods

We performed whole-genome sequencing on DNA from 261 individuals from 62 families with one or more daughters with PCOS. We tested for associations of rare variants with PCOS and its concomitant hormonal traits using a quantitative trait meta-analysis.

Results

We found rare variants in DENND1A (P = 5.31 × 10−5, adjusted P = 0.039) that were significantly associated with reproductive and metabolic traits in PCOS families.

Conclusions

Common variants in DENND1A have previously been associated with PCOS diagnosis in genome-wide association studies. Subsequent studies indicated that DENND1A is an important regulator of human ovarian androgen biosynthesis. Our findings provide additional evidence that DENND1A plays a central role in PCOS and suggest that rare noncoding variants contribute to disease pathogenesis.

Polycystic ovary syndrome (PCOS) is a common endocrine disorder affecting 5% to 15% of premenopausal women worldwide (1), depending on the diagnostic criteria applied. PCOS is diagnosed by two or more of its reproductive features of hyperandrogenism, ovulatory dysfunction, and polycystic ovarian morphology. It is frequently associated with insulin resistance and pancreatic β-cell dysfunction, making it a leading risk factor for type 2 diabetes in young women (2).

PCOS is a highly heritable complex genetic disorder. Analogous to other complex traits (3), common susceptibility loci identified in genome-wide association studies (GWASs) (4–8) account for only a small proportion of the estimated genetic heritability of PCOS (9). As GWASs were designed to assess common allelic variants, usually with minor allele frequencies (MAFs) ≥2% to 5%, it has been proposed that less frequently occurring variants with greater effect sizes account for the observed deficit in heritability (10). Next-generation sequencing approaches have identified rare variants that contribute to complex disease pathogenesis (11–18).

We tested the hypothesis that rare variants contribute to PCOS by conducting family-based association analyses using whole-genome sequencing data. We filtered and weighted rare variants (MAF ≤2%) according to their predicted levels of deleteriousness and grouped them regionally and by genes. We not only tested for associations with PCOS diagnosis, but we also tested for association with its quantitative reproductive and metabolic trait levels: testosterone (T), dehydroepiandrosterone sulfate (DHEAS), LH, FSH, SHBG, and fasting insulin. The quantitative trait phenotypes were combined using a meta-analysis approach to identify sets of rare variants that associate with altered hormonal levels in PCOS.

Materials and Methods

Subjects

This study included 261 individuals from 62 families with PCOS who were whites of European ancestry. Families were ascertained by an index case who fulfilled the National Institutes of Health criteria for PCOS (19). Each family consisted of at least a proband–parent trio. Brothers were not included in this study. Phenotypic data and some genetic analyses on these subjects have been previously reported (20–22). The study was approved by the Institutional Review Boards of Northwestern University Feinberg School of Medicine, Penn State Health Milton S. Hershey Medical Center, and Brigham and Women’s Hospital. Written informed consent was obtained from all subjects prior to the study.

Women were ages 14 to 63 years, in good health, and had both ovaries and a uterus. They were not taking medications known to alter reproductive or metabolic hormone levels for at least 1 month prior to the study. Exogenous gonadal steroid administration was discontinued at least 3 months prior to the study. Thyroid, pituitary, and adrenal disorders were excluded by appropriate tests (23). Women were considered to be of reproductive age when they were between the ages of at least 2 years postmenarche and 45 years old, and had FSH levels ≤40 mIU/mL. Hyperandrogenemia was defined by elevated levels of T (>58 ng/dL), non-SHBG bound T (>15 ng/dL), and/or DHEAS (>2683 ng/mL), as established in Legro et al. (23). Ovarian dysfunction was defined as six or fewer menses per year. Ovarian morphology was not assessed because it does not correlate with the endocrine phenotype (6, 24).

Phenotypes in reproductive-age women were assigned as previously reported (23). Reproductive-age women with hyperandrogenemia and ovarian dysfunction were assigned a PCOS phenotype. Reproductive-age women with normal androgen levels and regular menses (every 27 to 35 days) were assigned an unaffected phenotype. Reproductive-age women with hyperandrogenemia and regular menses were assigned a hyperandrogenemic (HA) phenotype. Because androgen levels do not decrease during the menopausal transition (25, 26), women between 46 and 63 years of age with hyperandrogenemia were also assigned the HA phenotype, regardless of menstrual cycle pattern. One index case fulfilled the criteria for PCOS when she was 45 years of age but she was 46 years of age when enrolled in the study. She was confirmed to have persistent hyperandrogenemia and ovarian dysfunction. Women with either the PCOS or HA phenotype were considered affected, as we have done in our previous linkage (24) and family-based association testing (27) studies. In the current study, we also included HA women between 46 and 63 years of age as affected. Women with normal androgen levels who were not of reproductive age and all fathers in the study were not assigned a phenotype.

The quantitative trait analysis examined associations between rare variants and T, DHEAS, SHBG, LH, FSH, and insulin levels. In addition to the women included in the dichotomous trait analysis, women were included in the quantitative trait association testing as follows. Women 46 to 72 years old were included in the analyses for T and DHEAS because androgen levels do not change during the menopausal transition (25, 26). These women were also included in the analysis of SHBG and insulin. Women with bilateral oophorectomy (n = 10) not receiving postmenopausal hormone therapy were included in the analysis of the adrenal androgen, DHEAS, and insulin (28, 29). We compared hormonal traits in women receiving postmenopausal hormone therapy (n = 15) to women from the cohort of comparable age who were not receiving hormonal therapy (n = 10). Only SHBG levels differed significantly. Therefore, women receiving postmenopausal hormone therapy were included in the analyses of T and DHEAS. They were not included in the insulin analysis because of the effect of estrogen on circulating insulin levels (30). No additional women were included in the LH or FSH analyses.

Fathers were included in the insulin level association test because there are not sex differences in this parameter (31). Subjects receiving glucocorticoids (men, 0; women, 2) were excluded from all quantitative trait association tests. Subjects receiving antidiabetic medications (men, 6; women, 7) were excluded from the T, SHBG, and insulin analyses but were included in the DHEAS analysis (32). Subjects with type 2 diabetes not receiving medications (men, 3; women, 7) were excluded from the insulin analysis, but the women were included in the T, DHEAS, and SHBG analyses.

Reference ranges for hormonal parameters from concurrently studied reproductively normal control subjects of comparable age, sex, body mass index (BMI), and ancestry (6, 33, 34) are included in Table 1. All control subjects had normal glucose tolerance with a 75 g oral glucose tolerance test (35). Reproductive-age control women were 18 to 45 years old with regular menses, FSH <40 mIU/mL, and normal androgen levels. Older control women were 46 to 65 years of age with a history of regular menses and normal androgen levels. Control men were 46 to 65 years of age. The hormonal data from control subjects in Table 1 were provided to depict normal reference ranges and were not used for any statistical comparisons.

Table 1.

Clinical and Biochemical Characteristics of Study Population

Subject TypeNAge (y)aNBMI (kg/m2)NTestosterone (ng/dL)NSHBG (nmol/L)NDHEAS (ng/mL)NInsulin (μU/mL)NLH (mIU/mL)NFSH (mIU/mL)
Index cases6229 (19–48)6235.6 (27.9–42.7)6274 (65–90)5956 (36–92)612095 (1638–2710)6221 (14–29)5914 (7–20)599 (8–11)
Fathers6258 (43–85)6128.9 (27.0–32.2)4917 (11–26)
Mothers
 HA651 (40–63)634.5 (25.2–38.2)551 (49–72)5117 (105–135)62630 (1264–2694)319 (15–22)
 Unaffected244 (42–45)224.0 (20.6–27.4)232 (27–37)2280 (218–342)21081 (901–1260)212 (8–15)27 (6–7)28 (6–10)
 Age >45 y5457 (46–72)5328.6 (26.6–35.5)3626 (16–34)2593 (74–166)50645 (501–993)2819 (11–25)
Sisters
 PCOS1029 (19–36)1032.3 (29.0–37.3)1063 (49–74)1055 (45–104)101665 (807–2774)1028 (23–32)1010 (5–15)1011 (10–12)
 HA532 (15–40)522.3 (22.3–28.5)567 (54–68)5126 (69–177)52047 (1509–3775)515 (11–17)44 (3–18)410 (7–14)
 Unaffected5732 (14–45)5725.1 (22.2–29.3)5730 (23–35)55128 (90–181)571408 (1028–1814)5512 (9–15)555 (3–10)5510 (7–12)
 Age >45 y347 (46–49)326.4 (24.5–32.1)330 (20–48)3137 (88–150)3772 (765–1588)313 (12–15)
Reference ranges
 Reproductive–aged women34630 (25–35)34628.5 (23.0–35.4)22729 (21–37)188100 (70–144)2261357 (1018–1756)18512 (10–16)1734 (3–8)1739 (7–12)
 Men5553 (50–57)5528.0 (25.9–31.0)4614 (10–17)
 Older women (46 to 65 y)6956 (52–60)6926.4 (23.2–29.6)6923 (15–30)60102 (75–156)66645 (487–1016)6913 (10–17)4850 (35–69)4980 (57–100)
Subject TypeNAge (y)aNBMI (kg/m2)NTestosterone (ng/dL)NSHBG (nmol/L)NDHEAS (ng/mL)NInsulin (μU/mL)NLH (mIU/mL)NFSH (mIU/mL)
Index cases6229 (19–48)6235.6 (27.9–42.7)6274 (65–90)5956 (36–92)612095 (1638–2710)6221 (14–29)5914 (7–20)599 (8–11)
Fathers6258 (43–85)6128.9 (27.0–32.2)4917 (11–26)
Mothers
 HA651 (40–63)634.5 (25.2–38.2)551 (49–72)5117 (105–135)62630 (1264–2694)319 (15–22)
 Unaffected244 (42–45)224.0 (20.6–27.4)232 (27–37)2280 (218–342)21081 (901–1260)212 (8–15)27 (6–7)28 (6–10)
 Age >45 y5457 (46–72)5328.6 (26.6–35.5)3626 (16–34)2593 (74–166)50645 (501–993)2819 (11–25)
Sisters
 PCOS1029 (19–36)1032.3 (29.0–37.3)1063 (49–74)1055 (45–104)101665 (807–2774)1028 (23–32)1010 (5–15)1011 (10–12)
 HA532 (15–40)522.3 (22.3–28.5)567 (54–68)5126 (69–177)52047 (1509–3775)515 (11–17)44 (3–18)410 (7–14)
 Unaffected5732 (14–45)5725.1 (22.2–29.3)5730 (23–35)55128 (90–181)571408 (1028–1814)5512 (9–15)555 (3–10)5510 (7–12)
 Age >45 y347 (46–49)326.4 (24.5–32.1)330 (20–48)3137 (88–150)3772 (765–1588)313 (12–15)
Reference ranges
 Reproductive–aged women34630 (25–35)34628.5 (23.0–35.4)22729 (21–37)188100 (70–144)2261357 (1018–1756)18512 (10–16)1734 (3–8)1739 (7–12)
 Men5553 (50–57)5528.0 (25.9–31.0)4614 (10–17)
 Older women (46 to 65 y)6956 (52–60)6926.4 (23.2–29.6)6923 (15–30)60102 (75–156)66645 (487–1016)6913 (10–17)4850 (35–69)4980 (57–100)

Traits reported as count, median (25th to 75th percentiles). Values reported here are unadjusted. Trait values were adjusted and normalized for variant association testing.

a

Age reported as count, median (minimum to maximum).

Table 1.

Clinical and Biochemical Characteristics of Study Population

Subject TypeNAge (y)aNBMI (kg/m2)NTestosterone (ng/dL)NSHBG (nmol/L)NDHEAS (ng/mL)NInsulin (μU/mL)NLH (mIU/mL)NFSH (mIU/mL)
Index cases6229 (19–48)6235.6 (27.9–42.7)6274 (65–90)5956 (36–92)612095 (1638–2710)6221 (14–29)5914 (7–20)599 (8–11)
Fathers6258 (43–85)6128.9 (27.0–32.2)4917 (11–26)
Mothers
 HA651 (40–63)634.5 (25.2–38.2)551 (49–72)5117 (105–135)62630 (1264–2694)319 (15–22)
 Unaffected244 (42–45)224.0 (20.6–27.4)232 (27–37)2280 (218–342)21081 (901–1260)212 (8–15)27 (6–7)28 (6–10)
 Age >45 y5457 (46–72)5328.6 (26.6–35.5)3626 (16–34)2593 (74–166)50645 (501–993)2819 (11–25)
Sisters
 PCOS1029 (19–36)1032.3 (29.0–37.3)1063 (49–74)1055 (45–104)101665 (807–2774)1028 (23–32)1010 (5–15)1011 (10–12)
 HA532 (15–40)522.3 (22.3–28.5)567 (54–68)5126 (69–177)52047 (1509–3775)515 (11–17)44 (3–18)410 (7–14)
 Unaffected5732 (14–45)5725.1 (22.2–29.3)5730 (23–35)55128 (90–181)571408 (1028–1814)5512 (9–15)555 (3–10)5510 (7–12)
 Age >45 y347 (46–49)326.4 (24.5–32.1)330 (20–48)3137 (88–150)3772 (765–1588)313 (12–15)
Reference ranges
 Reproductive–aged women34630 (25–35)34628.5 (23.0–35.4)22729 (21–37)188100 (70–144)2261357 (1018–1756)18512 (10–16)1734 (3–8)1739 (7–12)
 Men5553 (50–57)5528.0 (25.9–31.0)4614 (10–17)
 Older women (46 to 65 y)6956 (52–60)6926.4 (23.2–29.6)6923 (15–30)60102 (75–156)66645 (487–1016)6913 (10–17)4850 (35–69)4980 (57–100)
Subject TypeNAge (y)aNBMI (kg/m2)NTestosterone (ng/dL)NSHBG (nmol/L)NDHEAS (ng/mL)NInsulin (μU/mL)NLH (mIU/mL)NFSH (mIU/mL)
Index cases6229 (19–48)6235.6 (27.9–42.7)6274 (65–90)5956 (36–92)612095 (1638–2710)6221 (14–29)5914 (7–20)599 (8–11)
Fathers6258 (43–85)6128.9 (27.0–32.2)4917 (11–26)
Mothers
 HA651 (40–63)634.5 (25.2–38.2)551 (49–72)5117 (105–135)62630 (1264–2694)319 (15–22)
 Unaffected244 (42–45)224.0 (20.6–27.4)232 (27–37)2280 (218–342)21081 (901–1260)212 (8–15)27 (6–7)28 (6–10)
 Age >45 y5457 (46–72)5328.6 (26.6–35.5)3626 (16–34)2593 (74–166)50645 (501–993)2819 (11–25)
Sisters
 PCOS1029 (19–36)1032.3 (29.0–37.3)1063 (49–74)1055 (45–104)101665 (807–2774)1028 (23–32)1010 (5–15)1011 (10–12)
 HA532 (15–40)522.3 (22.3–28.5)567 (54–68)5126 (69–177)52047 (1509–3775)515 (11–17)44 (3–18)410 (7–14)
 Unaffected5732 (14–45)5725.1 (22.2–29.3)5730 (23–35)55128 (90–181)571408 (1028–1814)5512 (9–15)555 (3–10)5510 (7–12)
 Age >45 y347 (46–49)326.4 (24.5–32.1)330 (20–48)3137 (88–150)3772 (765–1588)313 (12–15)
Reference ranges
 Reproductive–aged women34630 (25–35)34628.5 (23.0–35.4)22729 (21–37)188100 (70–144)2261357 (1018–1756)18512 (10–16)1734 (3–8)1739 (7–12)
 Men5553 (50–57)5528.0 (25.9–31.0)4614 (10–17)
 Older women (46 to 65 y)6956 (52–60)6926.4 (23.2–29.6)6923 (15–30)60102 (75–156)66645 (487–1016)6913 (10–17)4850 (35–69)4980 (57–100)

Traits reported as count, median (25th to 75th percentiles). Values reported here are unadjusted. Trait values were adjusted and normalized for variant association testing.

a

Age reported as count, median (minimum to maximum).

Hormone assays

T, DHEAS, SHBG, LH, FSH, and insulin levels were measured as previously reported (6).

Whole-genome sequencing

Genomic DNA was isolated from whole blood samples using the Gentra Puregene Blood Kit (Qiagen, Valencia, CA). Whole-genome sequencing was performed by Complete Genomics, Inc. (CGI) using their proprietary sequencing technology. Their sequencing platform used high-density nanoarrays populated with amplified DNA clusters called DNA nanoballs. DNA was read using a novel, iterative hybridization and ligation approach, which produces paired-end reads up to 70 bases in length (36). CGI’s sequencing service included sample quality control, library construction, whole-genome DNA sequencing, and variant calling. Reads were mapped to the National Center for Biotechnology Information Build 37.2 reference genome (GRCh37).

Variant calling

Variant calling was performed using CGI’s Assembly Pipeline version 2.0. Although raw read data were provided, because of the unique gapped read structure produced by CGI sequencing, the use of other mapping or variant-call software was not recommended by CGI. Results were provided in CGI’s unique variant-call format. Variants included single nucleotide variants (SNVs), as well as insertions, deletions, and substitutions ≤50 bases in length. Calls were assigned confidence scores assuming equal allele fractions for the diploid genome (varScoreEAF). A Bayesian probability model was used to evaluate potential locus calls. The model accounted for read depth, base call quality scores, mapping/alignment probabilities, and empirical priors on gap sizes and discordance rates (37). Based on the relative allele likelihoods a quality score was assigned for the chosen call:
(1)
Scores were therefore reported in decibels, where a score of 10 dB represents a likelihood ratio of 10:1, 20 dB means a 100:1 likelihood, 30 dB means a 1000:1 likelihood, and so forth. Quality thresholds for reporting variants were minimized (≥10 dB for homozygous and ≥20 dB for heterozygous variant calls) to maximize sensitivity. Variants were assigned a basic high/low quality flag (VQHIGH or VQLOW) based on a quality score threshold of 20 dB for homozygous and 40 dB for heterozygous variant calls. CGI variant calls were validated previously using Sanger sequencing (38).

Called variant filtering

Variants were considered rare when they appeared with a frequency ≤2% in the Scripps Wellderly Genome Resource, which consisted of 597 unrelated participants of European ancestry from the Scripps Wellderly Study (39). The Wellderly study population is composed entirely of elderly individuals ≥80 years of age with no history of chronic disease. Within each sequenced family, reported variants that were inconsistent with Mendelian patterns of inheritance were removed from consideration. A variant was considered consistent with Mendelian inheritance when it was called (VQHIGH) in one or more of the offspring and in at least one parent. The vast majority of DNA sequencing errors can be eliminated using Mendelian inheritance analysis (40). As previously described (41–44), an additional set of filters was applied to called variants for each sample to minimize the number of false-positive calls: (i) variants with VQLOW allele tags were removed; (ii) variants in microsatellite regions were removed; (iii) variants within simple tandem repeat regions were removed; (iv) three or more single-nucleotide polymorphisms (SNPs) clustered within a distance of 10 bp were removed; (v) SNPs located within 10 bp of an insertion or deletion (indel) were removed; (vi) calls located within known regions of segmental duplication were removed; and (vii) calls with an observed read depth greater than threefold the average read depth (>168) were removed.

Selection of optimal read depth and quality score thresholds

After systematically applying the filter matrix outlined above, optimal read depth and quality score thresholds were determined for each variant type by comparing calls between replicated samples for a particular family that was sequenced twice by CGI. Variants that were concordant between offspring sample pairs—above a given coverage depth and quality score threshold—were considered as true positive (TP) calls, whereas those that were discordant between replicates were considered as false positive (FP). Accordingly, concordant variant calls that fell below a given depth and quality threshold were classified as false negative (FN), and discordant calls that fell below a given depth and quality threshold were classified as true negative (TN). Optimal depth and quality score thresholds were determined by selecting the thresholds that yielded the greatest Matthews correlation coefficient (MCC) values across each variant type (41):
(2)
The thresholds chosen for association testing corresponded with the greatest MCC values for each variant type (45).

Assessing deleteriousness

After filtering for rare variants called with high confidence, as described above, variants were further characterized according to their predicted effects. Only variants that were predicted to have a deleterious effect were included in the association testing. Deleteriousness was primarily assessed using the Combined Annotation Dependent Depletion (CADD) tool (46), which is trained on output from numerous annotation programs to predict deleteriousness based on conservation relative to our ancestral genome. Variants were retained when they produced a CADD score greater than the gene-specific mutation significance cutoff (MSC; 95% CI) suggested by Itan et al. (47). For variants outside of gene regions or in inconsistently annotated genes, the mean MSC (MSCμ = 12.889) was used as the cutoff. In a further effort to reduce false positives, only coding variants classified as at least possibly damaging by the PolyPhen2 (48) or SIFT (49) variant effect prediction tools and noncoding variants with LINSIGHT (50) scores ≥0.8 were included in our analysis, as integrating individual methods can improve variant effect prediction (47, 51–53). CADD and LINSIGHT are both primarily based on evolutionary conservation, and they therefore carry certain limitations (54). However, the applicability of prediction tools based on functional annotations from specific cell types (55, 56) is extremely limited for PCOS because its pathophysiology involves numerous cell types across multiple organs (57), and annotations for relevant cell types are largely unavailable (58–61).

Genome-wide rare variant association testing

Variants were then grouped for association testing using both gene-based and sliding window approaches. Gene regions included all coding and noncoding DNA from the 3′ untranslated region to 7.5 kb upstream of the 5′ transcriptional start site for all RefSeq annotated genes. The sliding window approach included all noncoding variation contained within sequential windows across the genome using three different window sizes: 10-kb windows with no overlap, 25-kb windows with 12.5-kb overlap, and 100-kb windows with 75-kb overlap.

In rare variant association testing, genes are often filtered according to a minimum number of rare variants detected per gene (62–65) or a minimum cumulative variant frequency (CVF) per gene (66–69) to increase power to detect disease associations. In this study, genes and windows were removed from consideration when they did not harbor deleterious variants in at least 10% of affected subjects (70, 71). For the gene-based test, to reduce the resultant bias toward larger genes, the observed CVFs were adjusted for gene length. Coding and noncoding CVFs were modeled separately using linear regressions against the coding sequence length (72, 73) and the square root of the noncoding sequence length (74, 75), respectively. The models also accounted for gene-level guanine-cytosine content. The CVFs observed in affected subjects for each gene were then adjusted accordingly prior to applying the 10% threshold. Adjusting for gene length in this manner reduced the risk for erroneous associations, as genes with longer open reading frames are more likely to be reported falsely as disease-associated (76).

Sequence kernel association test statistics were computed using the methods described in Schaid et al. (77), which account for pedigree information in calculating trait associations. Custom variant weights were applied for each variant according to their CADD Phred scores:
(3)
where Cv is the CADD Phred score for variant v, g is the gene or window, MSCg is the mutation significance threshold for g, and G is the set of all variants within the genome. In this way, variants that are more likely to be deleterious were weighted more heavily than variants where the functional consequences are less likely to be damaging. For the dichotomous trait analysis, the binary outcome was adjusted for age and BMI using a logistic regression.

Six quantitative traits, that is, T, DHEAS, LH, FSH, SHBG, and insulin levels, were tested for association. The trait distributions were each positively skewed. Testing against skewed trait distributions can result in heavily inflated type I error rates in rare variant association testing (78). Therefore, each trait was modeled against a gamma distribution when adjusting for age and BMI, and nonnormal residuals (Shapiro–Wilk <0.05) were then further normalized using a rank-based inverse normal transformation (INT). The INT has been found to be the optimal method for maintaining type I error control without sacrificing power in rare variant association testing on nonnormally distributed traits (79).

Quantitative trait meta-analysis

For complex diseases with multivariate phenotypes, combining multiple related phenotypic traits into one analysis can increase power in finding disease associations (80, 81). We combined the six aforementioned quantitative trait associations into one meta-statistic using Fisher’s combination function modified to account for correlated traits (82). Intertrait correlations were determined using the Spearman correlation coefficient (45). P values were adjusted using a conservative Bonferroni correction according to the number of variant groupings that were tested that contained at least one variant across both the dichotomous and quantitative trait associations, as well as by the genomic inflation factor, λ. Nominal P values are reported herein as Pn, and corrected P values are reported as Pc. Correlation between meta-analysis association results and dichotomous trait association results was calculated using a Spearman coefficient and tested using a two-tailed t test.

The characteristic disturbance of gonadotropin secretion associated with PCOS is increased LH relative to FSH release (83). For genes with significant meta-analysis associations, LH/FSH ratios were compared between variant carriers and noncarriers using a Wilcoxon rank sum test, adjusted for multiple testing (Bonferroni). Differences in LH/FSH ratios between variant carriers and noncarriers would indicate that the gene variants alter gonadotropin signaling.

Linkage disequilibrium between rare variants and GWAS risk alleles

To evaluate whether rare variants identified in the association tests could account for GWAS association signals in the same gene, we calculated linkage disequilibrium (LD) between each of the rare variants and each of the corresponding GWAS risk alleles. Alleles were phased utilizing pedigree information. Instances in which the allelic phasing could not be resolved between the rare variants and GWAS variants were removed from consideration.

In silico binding effect prediction

To assess the potential functional effects of noncoding variants identified in the quantitative trait meta-analysis, we predicted the corresponding impacts to transcription factor (TF) and RNA-binding protein (RBP) binding in silico. For each noncoding SNV identified in the quantitative trait meta-analysis, TF binding affinities were calculated for all subsequences overlapping the SNV position on each strand within a ±20-bp window. Binding affinity scores were calculated using position weight matrices derived from ENCODE chromatin immunoprecipitation sequencing experiments (76). Scores were determined by summing the logged frequencies for a given sequence across a motif position weight matrix. Binding P values were defined as the probability that a sequence sampled from a genomic background distribution had an affinity score greater than or equal to the largest affinity score produced from one of the tested subsequences. Genomic background sequences were generated using a first-order Markov model (84). The significance of a given change in binding affinity scores between reference and SNV alleles was assessed by determining whether the differences in relative binding affinity rank between the two alleles was significantly different than what would be expected by chance (85, 86). P values were conservatively adjusted to account for multiple testing using the Benjamini–Hochberg procedure (87).

Once binding affinities were calculated for each TF at each SNV, filters were applied to identify the most likely candidates for TF binding site disruption. Instances in which the predicted TF binding affinity score was <80% of the maximum affinity score for the given motif were excluded. SNVs in which the reference allele was not predicted to bind a particular TF with statistical significance (Benjamini–Hochberg P < 0.05) were also removed from consideration, as were variants in which both the reference and SNV alleles were predicted to bind a TF with statistical significance. Only TFs expressed in the ovary were analyzed. Tissue-specific gene expression was determined using GTEx data (88) (median reads per kilobase of transcript per million mapped reads ≥0.1).

The identified SNVs were likewise analyzed for potential alteration to RBP sites following the same procedure but for a few modifications. RBPs and their binding affinity scores were determined using the ATtTRACT database (89). Only sequences on the coding strand were evaluated as to reflect the mRNA sequences. Additionally, significant changes in RBP binding were considered regardless of the direction of effect, such that instances in which the alternate allele was predicted to induce RBP binding were also included.

Supplemental methods

For additional details regarding methodological considerations and rationale, please refer to the online repository (45).

Results

Characteristics of study population

The characteristics of the study population, including counts and trait distributions by familial relationship and the numbers of subjects included in each association test, are summarized in Table 1.

Whole-genome sequencing and variant calling

Genome sequencing yielded average genome coverage of 96.2% per sample and an overall mean sequencing depth of 56× (45). On average, 90.4% and 66.4% of the genome, including 95.2% and 78.8% of the exome, was covered with at least 20× and 40× sequencing depth, respectively. Approximately 4.04 million high-confidence small variant calls were reported per genome. By applying optimal read depth and quality thresholds as well as a series of genomic filters (41), we reduced the discrepancy rate between replicate samples from 0.23% to 0.04% for rare variants (45).

Association testing and quantitative trait meta-analysis

We found 339 genes that had rare, deleterious variants in at least 10% of cases after adjusting for gene length and percentage guanine-cytosine content. No set of rare variants reached genome-wide significance for association with PCOS/HA disease status in the dichotomous trait analysis. We found 32 rare variants (2 coding, 30 noncoding) in the DENND1A gene that were collectively significantly associated with quantitative trait levels (Pn = 5.31×10−5, Pc = 0.039; Table 2), after adjusting for multiple testing and for observed genomic inflation (45). Women with one or more of these DENND1A variants had significantly higher LH/FSH ratios (Pc = 0.0036). PCOS/HA phenotype women with one or more DENND1A variants had significantly higher LH/FSH ratios than did PCOS/HA phenotype women without DENND1A variants (Pc = 0.0180). Unaffected women with one or more DENND1A variants also trended toward higher LH/FSH ratios than unaffected women without DENND1A variants (Pc = 0.1758; Fig. 1). No other gene-based set of rare variants reached genome-wide significance for association with quantitative trait levels. The correlation between the meta-analysis gene associations and the dichotomous trait gene associations was 0.24 (Spearman; P = 5.12 × 10−6).

Table 2.

Top Five Rare Variant Associations From Quantitative Trait Meta-Analysis

ChromosomeGeneLengthVariantsFamiliesaORbPnPc
9DENND1A550 kb3250%1.20 (0.69–2.14)5.31 × 10−50.039
11PKNOX2269 kb1839%1.55 (0.87–2.94)7.28 × 10−40.53
6BMP6155 kb1135%1.41 (0.65–3.27)3.98 × 10−31.00
9C9orf3421 kb1021%2.79 (1.08–8.24)6.14 × 10−31.00
1PRDM2125 kb819%3.26 (0.77–22.51)6.92 × 10−31.00
ChromosomeGeneLengthVariantsFamiliesaORbPnPc
9DENND1A550 kb3250%1.20 (0.69–2.14)5.31 × 10−50.039
11PKNOX2269 kb1839%1.55 (0.87–2.94)7.28 × 10−40.53
6BMP6155 kb1135%1.41 (0.65–3.27)3.98 × 10−31.00
9C9orf3421 kb1021%2.79 (1.08–8.24)6.14 × 10−31.00
1PRDM2125 kb819%3.26 (0.77–22.51)6.92 × 10−31.00

Rare variants exceed thresholds of predicted deleteriousness.

a

Families include all that have daughters with at least one of the rare variants.

b

ORs shown with 95% CIs, estimated from logistic regression of gene variant count against disease status, with equal variant weighting, adjusted for BMI, not considering relatedness (individual variants may have larger effects).

Table 2.

Top Five Rare Variant Associations From Quantitative Trait Meta-Analysis

ChromosomeGeneLengthVariantsFamiliesaORbPnPc
9DENND1A550 kb3250%1.20 (0.69–2.14)5.31 × 10−50.039
11PKNOX2269 kb1839%1.55 (0.87–2.94)7.28 × 10−40.53
6BMP6155 kb1135%1.41 (0.65–3.27)3.98 × 10−31.00
9C9orf3421 kb1021%2.79 (1.08–8.24)6.14 × 10−31.00
1PRDM2125 kb819%3.26 (0.77–22.51)6.92 × 10−31.00
ChromosomeGeneLengthVariantsFamiliesaORbPnPc
9DENND1A550 kb3250%1.20 (0.69–2.14)5.31 × 10−50.039
11PKNOX2269 kb1839%1.55 (0.87–2.94)7.28 × 10−40.53
6BMP6155 kb1135%1.41 (0.65–3.27)3.98 × 10−31.00
9C9orf3421 kb1021%2.79 (1.08–8.24)6.14 × 10−31.00
1PRDM2125 kb819%3.26 (0.77–22.51)6.92 × 10−31.00

Rare variants exceed thresholds of predicted deleteriousness.

a

Families include all that have daughters with at least one of the rare variants.

b

ORs shown with 95% CIs, estimated from logistic regression of gene variant count against disease status, with equal variant weighting, adjusted for BMI, not considering relatedness (individual variants may have larger effects).

LH/FSH ratios in DENND1A variant carriers. LH/FSH ratios in DENND1A rare variant carriers (+) and noncarriers (−) in unaffected women and in women with PCOS/HA. Differences in group means were analyzed using a Wilcoxon rank sum test (*Pc < 0.05). The significance bars illustrate significant differences in LH/FSH ratios between (i) all DENND1A variant carriers and all noncarriers and (ii) DENND1A variant carriers and noncarriers in affected subjects only. Note that PCOS includes HA subjects. Unaff, unaffected.
Figure 1.

LH/FSH ratios in DENND1A variant carriers. LH/FSH ratios in DENND1A rare variant carriers (+) and noncarriers (−) in unaffected women and in women with PCOS/HA. Differences in group means were analyzed using a Wilcoxon rank sum test (*Pc < 0.05). The significance bars illustrate significant differences in LH/FSH ratios between (i) all DENND1A variant carriers and all noncarriers and (ii) DENND1A variant carriers and noncarriers in affected subjects only. Note that PCOS includes HA subjects. Unaff, unaffected.

Using the sliding windows approach, we found a subset of noncoding variants within a 25-kb region of DENND1A (chr9:126,537,500 to 126,562,500) that were significantly associated with altered quantitative trait levels (Pn = 1.92 × 10−5, Pc = 0.019) (45). The region included three noncoding variants that were collectively present in eight subjects with PCOS/HA but in no unaffected subjects. Notably, one of these rare variants, rs117893097 (MAFWellderly = 0.013), was homozygous in one of the subjects. This 25-kb region encompasses one of the DENND1A GWAS risk variants [rs10986105; MAFWellderly = 0.034; ORMeta=1.39 (90)], although all carriers of the rare variants in the region lacked the rs10986105 risk allele. The relative positions of all of the rare variants found in DENND1A are shown in Fig. 2 (91–93). No other windows across the genome were found to have significant associations with disease state or hormonal levels.

Rare variants in DENND1A. The locations of deleterious rare variants and previously reported GWAS SNPs within the DENND1A gene, including the two primary isoforms DENND1A.V1 and DENND1A.V2, are shown. The 25-kb region significantly associated with altered hormone levels is highlighted in light blue. The conservation track was measured on multiple alignments of 100 vertebrate species by phyloP (91). The H3K4Me1 track shows enrichment of monomethylation of lysine 4 of the H3 histone protein, which is associated with enhancers and DNA regions downstream of transcription starts, as determined by chromatin immunoprecipitation sequencing assay and layered by different cell types (58). The DNase clusters track shows regions of DNase hypersensitivity, an indicator of regulatory activity, with darkness proportional to maximum signal strength (58). The GTEx eQTL track displays gene expression quantitative trait loci for DENND1A, as identified from GTEx RNA sequencing and genotype data, with red and blue indicating positive and negative effects on gene expression, respectively (92). The linkage disequilibrium heat map was generated using phase 1 CEU data from the 1000 Genomes Project (93).
Figure 2.

Rare variants in DENND1A. The locations of deleterious rare variants and previously reported GWAS SNPs within the DENND1A gene, including the two primary isoforms DENND1A.V1 and DENND1A.V2, are shown. The 25-kb region significantly associated with altered hormone levels is highlighted in light blue. The conservation track was measured on multiple alignments of 100 vertebrate species by phyloP (91). The H3K4Me1 track shows enrichment of monomethylation of lysine 4 of the H3 histone protein, which is associated with enhancers and DNA regions downstream of transcription starts, as determined by chromatin immunoprecipitation sequencing assay and layered by different cell types (58). The DNase clusters track shows regions of DNase hypersensitivity, an indicator of regulatory activity, with darkness proportional to maximum signal strength (58). The GTEx eQTL track displays gene expression quantitative trait loci for DENND1A, as identified from GTEx RNA sequencing and genotype data, with red and blue indicating positive and negative effects on gene expression, respectively (92). The linkage disequilibrium heat map was generated using phase 1 CEU data from the 1000 Genomes Project (93).

Several other PCOS GWAS candidate genes appeared in our filtered set of genes, including C9orf3 (5, 6, 8) (Pn = 6.14 × 10−3), HMGA2 (5) (Pn = 0.062), ZBTB16 (8) (Pn = 0.20), TOX3 (5, 8) (Pn = 0.22), and THADA (4, 5, 7, 8) (Pn = 0.74). C9orf3 had the fourth strongest association overall (45), but it failed to reach genome-wide significance after correction for multiple testing. The relative quantitative trait associations for these genes are illustrated in Fig. 3.

Trait associations for PCOS GWAS genes. The relative quantitative trait associations are shown for PCOS GWAS susceptibility loci included in meta-analysis results, with meta-analysis association ranking. I, fasting insulin.
Figure 3.

Trait associations for PCOS GWAS genes. The relative quantitative trait associations are shown for PCOS GWAS susceptibility loci included in meta-analysis results, with meta-analysis association ranking. I, fasting insulin.

Linkage disequilibrium with GWAS variants

On average, the 32 rare variants identified in DENND1A were in low LD with each of the previously identified DENND1A GWAS risk alleles: rs2479106 (MAF = 0.25, Dμ = 0.28, r2μ = 0.0077), rs10818854 (MAF = 0.06, Dμ = 0.035, r2μ = 0.0024), and rs10986105 (MAF = 0.05, Dμ = 0.031, r2μ = 0.0026), although individually, some were in high LD with rs2479106 (D′ = 1). Pairwise LD values are provided in Table 3.

Table 3.

Deleterious, Rare Variants in DENND1A

PositionIDVariantAllele FrequencyCADDLINSIGHTTF − (Disrupted)RBP − (Disrupted)RBP + (Enhanced)GWAS Allelic LD (D′)
AffaUnrelbPopcrs2479106rs10818854rs10986105
9:126,144,390drs189947178G → T0.0180.0200.00311.91N/AN/A<0.010.13<0.01
9:126,154,100rs147370674C → T0.0060.0040.00421.100.97XPO5<0.01<0.01<0.01
9:126,154,582rs529224231T → G0.0060.0040.00021.300.97MAFK, MAFB, NRLIFIH1, XPO5N/AN/AN/A
9:126,169,258rs561100869C → T0.0000.0040.00016.940.97IRF1NOVA1<0.01<0.01<0.01
9:126,180,758rs750425892G → C0.0000.0040.00019.310.96STAT6, MZF1, SPI1PCBP2<0.01<0.01<0.01
9:126,203,546T → C0.0060.0040.00020.500.93FOXM1RBMX<0.01<0.01<0.01
9:126,225,586rs538451690T → A0.0060.0040.00216.070.94RC3H1, XPO51<0.01<0.01
9:126,231,902C → T0.0060.0040.00020.200.97<0.01<0.01<0.01
9:126,247,645rs543947590C → A0.0060.0040.00321.000.97STAT5AOAS11<0.01<0.01
9:126,267,980rs558809288C → T0.0060.0040.00019.970.84ARNT<0.01<0.01<0.01
9:126,284,213rs149244424C → T0.0120.0040.00812.440.92ESRRA, ELF1<0.01<0.01<0.01
9:126,312,990rs748274474A → G0.0060.0040.00321.100.96YBX1<0.01<0.01<0.01
9:126,313,234C → A0.0060.0040.00017.840.97SRSF2<0.01<0.01<0.01
9:126,326,081rs138249397C → T0.0180.0160.0029.320.88OAS1SRP680.73<0.01<0.01
9:126,331,427rs184609118A → C0.0000.0040.00218.480.97ELAVL1, SSBCELF2, NOVA11<0.01<0.01
9:126,370,689rs564042790C → T0.0060.0040.00316.840.89SMAD3IFIH1YBX1<0.01<0.01<0.01
9:126,402,348rs182167487C → T0.0000.0040.00014.180.93<0.01<0.01<0.01
9:126,404,312G → A0.0060.0040.00015.720.91SRP54, SRP68PTBP11<0.01<0.01
9:126,414,365drs141759269T → G0.0000.0040.00126.00N/AN/AN/A<0.01<0.01
9:126,440,857rs147844210T → C0.0000.0040.00821.600.95GATA6CMTR1, FUS, SRSF3, YBX1<0.01<0.01<0.01
9:126,441,904rs75342773T → C0.0060.0040.00816.640.95HNRNPH1–2TRA2B<0.01<0.01<0.01
9:126,447,680rs117984673T → A0.0180.0160.00817.700.94SOX10, SOX17NOVA20.73<0.01<0.01
9:126,458,124rs112188193G → C0.0120.0080.01314.430.92STAT4, STAT5A, TEAD1RC3H1NUDT21<0.01<0.01<0.01
9:126,480,236rs543924878A → C0.0060.0040.00418.310.81SREBF1HNRNPH1–3, HNRNPF, KHSRP, SRP14<0.01<0.01<0.01
9:126,513,154rs147058034A → G0.0060.0040.00319.810.97FOXG1, FOXP3, FOXO4, FOXO6, FOXC1, FOXK1CELF1, XPO5NOVA1111
9:126,549,983T → C0.0180.0040.00019.450.811<0.01<0.01
9:126,555,003rs117893097C → G0.0300.0200.01321.400.99HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,557,679rs552299287A → T0.0060.0040.00018.000.84STAT5A, STAT6CELF1–2, RC3H1, XPO510.841
9:126,597,096T → C0.0060.0040.00016.640.97SOX5HNRNPA0, HNRNPA1, HNRNPD, ELAVL1, ZFP36XPO5<0.01<0.01<0.01
9:126,603,402rs78012023TTA → ATG0.0180.0120.0008.990.91N/A<0.01<0.01
9:126,625,643rs116887221C → A0.0060.0040.00816.240.8HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,691,321rs79740971G → A0.0060.0080.01013.520.84ESRRA<0.01<0.01<0.01
PositionIDVariantAllele FrequencyCADDLINSIGHTTF − (Disrupted)RBP − (Disrupted)RBP + (Enhanced)GWAS Allelic LD (D′)
AffaUnrelbPopcrs2479106rs10818854rs10986105
9:126,144,390drs189947178G → T0.0180.0200.00311.91N/AN/A<0.010.13<0.01
9:126,154,100rs147370674C → T0.0060.0040.00421.100.97XPO5<0.01<0.01<0.01
9:126,154,582rs529224231T → G0.0060.0040.00021.300.97MAFK, MAFB, NRLIFIH1, XPO5N/AN/AN/A
9:126,169,258rs561100869C → T0.0000.0040.00016.940.97IRF1NOVA1<0.01<0.01<0.01
9:126,180,758rs750425892G → C0.0000.0040.00019.310.96STAT6, MZF1, SPI1PCBP2<0.01<0.01<0.01
9:126,203,546T → C0.0060.0040.00020.500.93FOXM1RBMX<0.01<0.01<0.01
9:126,225,586rs538451690T → A0.0060.0040.00216.070.94RC3H1, XPO51<0.01<0.01
9:126,231,902C → T0.0060.0040.00020.200.97<0.01<0.01<0.01
9:126,247,645rs543947590C → A0.0060.0040.00321.000.97STAT5AOAS11<0.01<0.01
9:126,267,980rs558809288C → T0.0060.0040.00019.970.84ARNT<0.01<0.01<0.01
9:126,284,213rs149244424C → T0.0120.0040.00812.440.92ESRRA, ELF1<0.01<0.01<0.01
9:126,312,990rs748274474A → G0.0060.0040.00321.100.96YBX1<0.01<0.01<0.01
9:126,313,234C → A0.0060.0040.00017.840.97SRSF2<0.01<0.01<0.01
9:126,326,081rs138249397C → T0.0180.0160.0029.320.88OAS1SRP680.73<0.01<0.01
9:126,331,427rs184609118A → C0.0000.0040.00218.480.97ELAVL1, SSBCELF2, NOVA11<0.01<0.01
9:126,370,689rs564042790C → T0.0060.0040.00316.840.89SMAD3IFIH1YBX1<0.01<0.01<0.01
9:126,402,348rs182167487C → T0.0000.0040.00014.180.93<0.01<0.01<0.01
9:126,404,312G → A0.0060.0040.00015.720.91SRP54, SRP68PTBP11<0.01<0.01
9:126,414,365drs141759269T → G0.0000.0040.00126.00N/AN/AN/A<0.01<0.01
9:126,440,857rs147844210T → C0.0000.0040.00821.600.95GATA6CMTR1, FUS, SRSF3, YBX1<0.01<0.01<0.01
9:126,441,904rs75342773T → C0.0060.0040.00816.640.95HNRNPH1–2TRA2B<0.01<0.01<0.01
9:126,447,680rs117984673T → A0.0180.0160.00817.700.94SOX10, SOX17NOVA20.73<0.01<0.01
9:126,458,124rs112188193G → C0.0120.0080.01314.430.92STAT4, STAT5A, TEAD1RC3H1NUDT21<0.01<0.01<0.01
9:126,480,236rs543924878A → C0.0060.0040.00418.310.81SREBF1HNRNPH1–3, HNRNPF, KHSRP, SRP14<0.01<0.01<0.01
9:126,513,154rs147058034A → G0.0060.0040.00319.810.97FOXG1, FOXP3, FOXO4, FOXO6, FOXC1, FOXK1CELF1, XPO5NOVA1111
9:126,549,983T → C0.0180.0040.00019.450.811<0.01<0.01
9:126,555,003rs117893097C → G0.0300.0200.01321.400.99HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,557,679rs552299287A → T0.0060.0040.00018.000.84STAT5A, STAT6CELF1–2, RC3H1, XPO510.841
9:126,597,096T → C0.0060.0040.00016.640.97SOX5HNRNPA0, HNRNPA1, HNRNPD, ELAVL1, ZFP36XPO5<0.01<0.01<0.01
9:126,603,402rs78012023TTA → ATG0.0180.0120.0008.990.91N/A<0.01<0.01
9:126,625,643rs116887221C → A0.0060.0040.00816.240.8HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,691,321rs79740971G → A0.0060.0080.01013.520.84ESRRA<0.01<0.01<0.01

Positions correspond to GRCh37. Variant queries were facilitated by Kaviar (94). CADD and LINSIGHT columns show CADD and LINSIGHT deleteriousness prediction scores, accordingly. TF− shows TFs for which binding is predicted to be negatively impacted. RBP− and RBP+ show RBPs for which binding is predicted to be negatively impacted or enhanced, respectively. GWAS allelic LDs show linkage disequilibrium D′ values between the rare variants and each of the previously identified DENND1A GWAS risk alleles.

Abbreviation: N/A, not applicable for coding variant.

a

Affected (aff) cohort allele frequencies represent proportion of alleles with variant in PCOS/HA subjects.

b

Unrelated (unrel) cohort allele frequencies represent proportion of variant alleles in parents.

c

Population (pop) allele frequencies correspond to Wellderly cohort.

d

Coding variant (rs189947178 and rs141759269).

Table 3.

Deleterious, Rare Variants in DENND1A

PositionIDVariantAllele FrequencyCADDLINSIGHTTF − (Disrupted)RBP − (Disrupted)RBP + (Enhanced)GWAS Allelic LD (D′)
AffaUnrelbPopcrs2479106rs10818854rs10986105
9:126,144,390drs189947178G → T0.0180.0200.00311.91N/AN/A<0.010.13<0.01
9:126,154,100rs147370674C → T0.0060.0040.00421.100.97XPO5<0.01<0.01<0.01
9:126,154,582rs529224231T → G0.0060.0040.00021.300.97MAFK, MAFB, NRLIFIH1, XPO5N/AN/AN/A
9:126,169,258rs561100869C → T0.0000.0040.00016.940.97IRF1NOVA1<0.01<0.01<0.01
9:126,180,758rs750425892G → C0.0000.0040.00019.310.96STAT6, MZF1, SPI1PCBP2<0.01<0.01<0.01
9:126,203,546T → C0.0060.0040.00020.500.93FOXM1RBMX<0.01<0.01<0.01
9:126,225,586rs538451690T → A0.0060.0040.00216.070.94RC3H1, XPO51<0.01<0.01
9:126,231,902C → T0.0060.0040.00020.200.97<0.01<0.01<0.01
9:126,247,645rs543947590C → A0.0060.0040.00321.000.97STAT5AOAS11<0.01<0.01
9:126,267,980rs558809288C → T0.0060.0040.00019.970.84ARNT<0.01<0.01<0.01
9:126,284,213rs149244424C → T0.0120.0040.00812.440.92ESRRA, ELF1<0.01<0.01<0.01
9:126,312,990rs748274474A → G0.0060.0040.00321.100.96YBX1<0.01<0.01<0.01
9:126,313,234C → A0.0060.0040.00017.840.97SRSF2<0.01<0.01<0.01
9:126,326,081rs138249397C → T0.0180.0160.0029.320.88OAS1SRP680.73<0.01<0.01
9:126,331,427rs184609118A → C0.0000.0040.00218.480.97ELAVL1, SSBCELF2, NOVA11<0.01<0.01
9:126,370,689rs564042790C → T0.0060.0040.00316.840.89SMAD3IFIH1YBX1<0.01<0.01<0.01
9:126,402,348rs182167487C → T0.0000.0040.00014.180.93<0.01<0.01<0.01
9:126,404,312G → A0.0060.0040.00015.720.91SRP54, SRP68PTBP11<0.01<0.01
9:126,414,365drs141759269T → G0.0000.0040.00126.00N/AN/AN/A<0.01<0.01
9:126,440,857rs147844210T → C0.0000.0040.00821.600.95GATA6CMTR1, FUS, SRSF3, YBX1<0.01<0.01<0.01
9:126,441,904rs75342773T → C0.0060.0040.00816.640.95HNRNPH1–2TRA2B<0.01<0.01<0.01
9:126,447,680rs117984673T → A0.0180.0160.00817.700.94SOX10, SOX17NOVA20.73<0.01<0.01
9:126,458,124rs112188193G → C0.0120.0080.01314.430.92STAT4, STAT5A, TEAD1RC3H1NUDT21<0.01<0.01<0.01
9:126,480,236rs543924878A → C0.0060.0040.00418.310.81SREBF1HNRNPH1–3, HNRNPF, KHSRP, SRP14<0.01<0.01<0.01
9:126,513,154rs147058034A → G0.0060.0040.00319.810.97FOXG1, FOXP3, FOXO4, FOXO6, FOXC1, FOXK1CELF1, XPO5NOVA1111
9:126,549,983T → C0.0180.0040.00019.450.811<0.01<0.01
9:126,555,003rs117893097C → G0.0300.0200.01321.400.99HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,557,679rs552299287A → T0.0060.0040.00018.000.84STAT5A, STAT6CELF1–2, RC3H1, XPO510.841
9:126,597,096T → C0.0060.0040.00016.640.97SOX5HNRNPA0, HNRNPA1, HNRNPD, ELAVL1, ZFP36XPO5<0.01<0.01<0.01
9:126,603,402rs78012023TTA → ATG0.0180.0120.0008.990.91N/A<0.01<0.01
9:126,625,643rs116887221C → A0.0060.0040.00816.240.8HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,691,321rs79740971G → A0.0060.0080.01013.520.84ESRRA<0.01<0.01<0.01
PositionIDVariantAllele FrequencyCADDLINSIGHTTF − (Disrupted)RBP − (Disrupted)RBP + (Enhanced)GWAS Allelic LD (D′)
AffaUnrelbPopcrs2479106rs10818854rs10986105
9:126,144,390drs189947178G → T0.0180.0200.00311.91N/AN/A<0.010.13<0.01
9:126,154,100rs147370674C → T0.0060.0040.00421.100.97XPO5<0.01<0.01<0.01
9:126,154,582rs529224231T → G0.0060.0040.00021.300.97MAFK, MAFB, NRLIFIH1, XPO5N/AN/AN/A
9:126,169,258rs561100869C → T0.0000.0040.00016.940.97IRF1NOVA1<0.01<0.01<0.01
9:126,180,758rs750425892G → C0.0000.0040.00019.310.96STAT6, MZF1, SPI1PCBP2<0.01<0.01<0.01
9:126,203,546T → C0.0060.0040.00020.500.93FOXM1RBMX<0.01<0.01<0.01
9:126,225,586rs538451690T → A0.0060.0040.00216.070.94RC3H1, XPO51<0.01<0.01
9:126,231,902C → T0.0060.0040.00020.200.97<0.01<0.01<0.01
9:126,247,645rs543947590C → A0.0060.0040.00321.000.97STAT5AOAS11<0.01<0.01
9:126,267,980rs558809288C → T0.0060.0040.00019.970.84ARNT<0.01<0.01<0.01
9:126,284,213rs149244424C → T0.0120.0040.00812.440.92ESRRA, ELF1<0.01<0.01<0.01
9:126,312,990rs748274474A → G0.0060.0040.00321.100.96YBX1<0.01<0.01<0.01
9:126,313,234C → A0.0060.0040.00017.840.97SRSF2<0.01<0.01<0.01
9:126,326,081rs138249397C → T0.0180.0160.0029.320.88OAS1SRP680.73<0.01<0.01
9:126,331,427rs184609118A → C0.0000.0040.00218.480.97ELAVL1, SSBCELF2, NOVA11<0.01<0.01
9:126,370,689rs564042790C → T0.0060.0040.00316.840.89SMAD3IFIH1YBX1<0.01<0.01<0.01
9:126,402,348rs182167487C → T0.0000.0040.00014.180.93<0.01<0.01<0.01
9:126,404,312G → A0.0060.0040.00015.720.91SRP54, SRP68PTBP11<0.01<0.01
9:126,414,365drs141759269T → G0.0000.0040.00126.00N/AN/AN/A<0.01<0.01
9:126,440,857rs147844210T → C0.0000.0040.00821.600.95GATA6CMTR1, FUS, SRSF3, YBX1<0.01<0.01<0.01
9:126,441,904rs75342773T → C0.0060.0040.00816.640.95HNRNPH1–2TRA2B<0.01<0.01<0.01
9:126,447,680rs117984673T → A0.0180.0160.00817.700.94SOX10, SOX17NOVA20.73<0.01<0.01
9:126,458,124rs112188193G → C0.0120.0080.01314.430.92STAT4, STAT5A, TEAD1RC3H1NUDT21<0.01<0.01<0.01
9:126,480,236rs543924878A → C0.0060.0040.00418.310.81SREBF1HNRNPH1–3, HNRNPF, KHSRP, SRP14<0.01<0.01<0.01
9:126,513,154rs147058034A → G0.0060.0040.00319.810.97FOXG1, FOXP3, FOXO4, FOXO6, FOXC1, FOXK1CELF1, XPO5NOVA1111
9:126,549,983T → C0.0180.0040.00019.450.811<0.01<0.01
9:126,555,003rs117893097C → G0.0300.0200.01321.400.99HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,557,679rs552299287A → T0.0060.0040.00018.000.84STAT5A, STAT6CELF1–2, RC3H1, XPO510.841
9:126,597,096T → C0.0060.0040.00016.640.97SOX5HNRNPA0, HNRNPA1, HNRNPD, ELAVL1, ZFP36XPO5<0.01<0.01<0.01
9:126,603,402rs78012023TTA → ATG0.0180.0120.0008.990.91N/A<0.01<0.01
9:126,625,643rs116887221C → A0.0060.0040.00816.240.8HNRNPH1–3, HNRNPF, KHSRP<0.01<0.01<0.01
9:126,691,321rs79740971G → A0.0060.0080.01013.520.84ESRRA<0.01<0.01<0.01

Positions correspond to GRCh37. Variant queries were facilitated by Kaviar (94). CADD and LINSIGHT columns show CADD and LINSIGHT deleteriousness prediction scores, accordingly. TF− shows TFs for which binding is predicted to be negatively impacted. RBP− and RBP+ show RBPs for which binding is predicted to be negatively impacted or enhanced, respectively. GWAS allelic LDs show linkage disequilibrium D′ values between the rare variants and each of the previously identified DENND1A GWAS risk alleles.

Abbreviation: N/A, not applicable for coding variant.

a

Affected (aff) cohort allele frequencies represent proportion of alleles with variant in PCOS/HA subjects.

b

Unrelated (unrel) cohort allele frequencies represent proportion of variant alleles in parents.

c

Population (pop) allele frequencies correspond to Wellderly cohort.

d

Coding variant (rs189947178 and rs141759269).

Protein binding effect prediction

Nine of the DENND1A variants were predicted to significantly impact TF binding motifs, whereas most variants were predicted to significantly alter RBP binding motifs (17 disrupted, 14 induced). The specific TF and RBP motifs associated with each noncoding variant are listed in Table 3. Binding by the heterogeneous nuclear ribonucleoprotein family of RBPs appeared to be the most commonly impacted.

Discussion

We identified rare variants in DENND1A that were significantly associated with altered reproductive and metabolic hormone levels in PCOS. Fifty percent of the PCOS families included in this study carried at least one of these rare DENND1A variants. These findings are of considerable interest because common variants in DENND1A were associated with PCOS diagnosis in a GWAS of Han Chinese women (4); these associations were replicated in women of European ancestry (95). Subsequently, McAllister and colleagues (61, 96) discovered that DENND1A plays a key role in androgen biosynthesis in human theca cells and is upregulated in PCOS theca cells. Our study, using an independent family-based WGS analytical approach, further implicates DENND1A in the pathogenesis of PCOS. Additionally, our results provide genetic evidence to support our hypothesis that hyperandrogenemia is a core biologic pathway in PCOS (22, 97). The increased LH/FSH ratios that we observed in rare variant carriers (Fig. 1) also suggest that DENND1A contributes to the disruption of gonadotropin secretion that is a characteristic feature of PCOS (98).

DENND1A encodes two transcripts as the result of alternative splicing, DENND1A.V1 and DENND1A.V2 (61). The encoded V2 protein was found in ovarian theca cells and its abundance was correlated with increased androgen production (96). Forced expression of the V2 transcript produced a PCOS phenotype in normal theca cells, whereas knockdown of V2 in PCOS theca cells reduced thecal androgen biosynthesis (96). The three DENND1A risk variants identified by GWASs (rs2479106, rs10818854, rs10986105) were located in introns, and the functional consequences of these variants are unknown (61). There have been no reported large-scale sequencing studies to fine-map causal variants in DENND1A (90). Targeted (61, 96, 99) and whole-exome sequencing (100) in small PCOS case-control cohorts have failed to identify any variants in DENND1A that were associated with PCOS or with V2 isoform expression. The GWAS risk variants and most of the rare DENND1A variants in the current study lie well upstream (100 to 400 kb) of this splicing region (Fig. 2). Nevertheless, based on the LD we observed between the rare variants and the GWAS risk variants, some of the rare variants (particularly those with D′ = 1) may contribute to the GWAS signal at rs2479106, but a larger cohort would be needed to conduct conditional association analyses.

A number of our rare DENND1A variants were predicted to disrupt conserved TF binding motifs, which could affect gene expression, but most of the variants were predicted to alter affinities of RBPs to the mRNA transcript (Table 3). It is plausible, therefore, that these rare variants were selectively driving V2 expression via posttranscriptional regulation. Collectively, the rare DENND1A variants were found in 50% of families, but each individual variant was typically found in only one or two families. Our findings, therefore, support a model of PCOS in which causal variants are individually uncommon but collectively occur in key genes regulating core biologic pathways contributing to disease pathogenesis, such as androgen biosynthesis (23). Our recent findings (18, 38) of multiple rare variants in the AMH and AMHR2 genes that reduce their biologic activity in ∼7% of women with PCOS is consistent with this model.

Several PCOS GWAS candidate genes besides DENND1A appeared among the top gene associations, but they failed to reach genome-wide significance. These genes included C9orf3 (5, 6, 8), HMGA2 (5), ZBTB16 (8), TOX3 (5, 8), and THADA (4, 5, 7, 8) (Fig. 3). Two additional genes with strong, but not genome-wide significant, associations with PCOS quantitative traits were highly plausible PCOS candidate genes. BMP6 had the third strongest association in our meta-analysis (P = 4.00 × 10−3, Table 2). It is a member of the bone morphogenetic protein family, which are growth factors involved in folliculogenesis. BMP6 expression was previously found to be significantly higher in granulosa cells from women with PCOS compared with reproductively normal control women (101). Moreover, BMP6 was found to increase expression of the FSH receptor, inhibin/activin β subunits, and AMH genes in human granulosa cells (102). PRDM2 had the fifth strongest association in our meta-analysis (P = 6.95×10−3, Table 2). PRDM2 is an estrogen receptor coactivator (103) that is highly expressed in the ovary and pituitary gland (88). Ligand-bound estrogen receptor α binds with PRDM2 to open chromatin at estrogen receptor α target genes (103, 104). PRDM2 also binds with the retinoblastoma protein (105), which has been shown to play an important role in follicular development in granulosa cells (106, 107).

Our results align with the emerging evidence that rare coding variants with large effect sizes do not play a major role in complex disease (108). The average estimated effect size of the rare DENND1A variants—assuming additive, equal variant weighting—was relatively modest (OR, 1.20; 95% CI, 0.69 to 2.14; Table 2), although individual variants may have larger effects. Of the 32 rare variants we identified in DENND1A, 30 were noncoding. Indeed, it appears that complex traits are primarily driven by noncoding variation (109, 110), both common and rare (110, 111). Paired WGS and transcriptome sequencing analysis from one extensive kindred demonstrated that rare noncoding variants had strong effects on individual gene-expression profiles (113). In a study designed similarly to ours, Ament et al. (114) found rare variants, the vast majority of which were noncoding, associated with increased risk for bipolar disorder. Owing to the relative cost-effectiveness of whole-exome sequencing (109), the limited availability of computational tools designed to predict the effects of noncoding variants on phenotypes (115), and our relatively poor understanding of regulatory mechanisms in the genome (116), noncoding variants have been noticeably understudied in complex trait genetics. As larger WGS data sets are accumulated and the focus of complex trait studies shifts more toward understanding regulatory mechanisms, the contribution of rare noncoding variants in various complex diseases will become clearer.

The central statistical challenge in studying rare variants is achieving adequate power to detect significant associations while controlling for type I error. The family-based structure of our cohort provided an enrichment of individual rare variants and enabled modeling of familial segregation (117). To mitigate variant calling errors, we used replicate samples from one family to determine optimal read depth and quality thresholds. To remove irrelevant variants from consideration, we applied a LINSIGHT score threshold and gene-specific CADD score thresholds (77) to filter for deleteriousness. We further prioritized variants by weighting them by their relative CADD scores. To group rare variants effectively, we applied several windows-based binning methods, in addition to the gene-based approach, to ensure that different kinds of functionally correlated genomic regions were tested, both of fixed length and of variable length. To limit our search to genes that were more likely to have specific roles in PCOS etiology, we only considered genes with rare deleterious variants in at least 10% of cases, as causal rare variants are more likely to accumulate in core disease genes (109). We greatly increased our power to detect relevant disease genes and account for pleiotropic effects by consolidating quantitative trait association results into a meta-analysis (80, 81). To reduce type I error, in addition to the variant calling quality control measures, we modeled the quantitative traits against skewed distributions and further normalized trait residuals using an INT.

Given the size of our cohort, it was necessary to apply relatively strict filters based on predicted variant effects and cumulative allele frequencies to detect rare variant associations. Very large sample sizes are otherwise required for WGS studies of rare variants (118, 119). By only including genes with rare, likely-deleterious variants in at least 10% of cases, we greatly reduced the multiple-testing burden of our analysis, thereby increasing our power to detect core PCOS genes (120, 121). Applying a priori hypotheses regarding which variants and genes may be relevant to disease, however, is analogous to a candidate gene approach (118). Any sets of rare variants that contribute to PCOS in smaller subpopulations of women with PCOS, as we found for AMH (38), were likely removed from consideration. Likewise, any causal rare variants that were not predicted bioinformatically to be deleterious based on existing annotations and evolutionary conservation (46, 50) would not have been detected. Furthermore, because variants were filtered for consistency with Mendelian inheritance, de novo mutations were not considered.

Because the number of genetic variants is directly correlated with the size of the gene (76), the CVF threshold introduced a bias toward larger genes in some of the analyses. However, this bias was mitigated by adjusting for gene length. Furthermore, any such bias was not applicable to our fixed-length windows-based approach, which replicated our DENND1A findings. It is also possible that causal rare variants with larger effect sizes were omitted from the meta-analysis because we tested against normalized trait residuals in an effort to reduce type I errors. Using normalized trait residuals may have excluded variants with large effects that produced outliers. However, as mentioned above, recent evidence has demonstrated that complex traits are primarily driven by noncoding variation with modest effect sizes (109, 110).

Despite the numerous steps taken to increase power, our study ultimately remained underpowered to detect rare variant associations with PCOS diagnosis in the dichotomous trait analysis. Although an association with PCOS quantitative traits implicates genetic variants in disease pathogenesis, it does not necessarily mean that a gene is associated with PCOS itself. The noncoding variants identified in this study, despite being rare and predicted to be deleterious, may be in LD with the actual pathogenic variant on the same alleles that were removed by the applied allele frequency or predicted effect thresholds. These noncoding variants may also affect the expression of one or more distant genes outside of the gene in which the variants are located. Importantly, the results of this study have not been replicated in an independent cohort and therefore, despite the established association of DENND1A variation with PCOS, should be regarded as hypothesis generating. Replication and functional studies are needed to confirm individual variant functionality and disease associations. Finally, because the women with PCOS studied herein were diagnosed according to National Institutes of Health criteria, our results may not be generalizable to other PCOS phenotypes defined by different sets of diagnostic criteria.

In summary, by applying family-based sequence kernel association tests on filtered whole-genome variant call data from a cohort of PCOS families, we were able to identify rare variants in the DENND1A gene that were associated with quantitative hormonal traits of PCOS. Because DENND1A is an important regulator of androgen biosynthesis, these findings provide additional genetic evidence to support our hypothesis that hyperandrogenemia is a core biologic pathway in PCOS (23, 97). Our results also suggest that rare noncoding variants contribute to the distinctive hormonal profile of PCOS. This study further demonstrates that using a quantitative trait meta-analysis can be a powerful approach in rare variant association testing, particularly for complex diseases with pleiotropic etiologies.

Acknowledgments

We thank Drs. Terry Farrah and Gustavo Glusman, from the Institute for Systems Biology, for facilitating variant queries via Kaviar (http://db.systemsbiology.net/kaviar). We thank the Northwestern University Research Computing Services team for supporting the computational needs of this research. We are very grateful to all of the families who took part in the study.

Financial Support: This study was supported by National Institutes of Health Grants P50 HD044405 (to A.D.) and R01 HD085227 (to A.D.). M.D. was supported by National Institutes of Health National Research Service Award T32 DK007169. This study uses data from the Scripps Wellderly Genome Resource, which is funded under National Institutes of Health Grant UL1 TR001114 and a Scripps Translational Science Institute Clinical and Translational Science Award.

Disclosure Summary: The authors have nothing to disclose.

Abbreviations:

    Abbreviations:
     
  • BMI

    body mass index

  •  
  • CADD

    Combined Annotation Dependent Depletion

  •  
  • CGI

    Complete Genomics, Inc.

  •  
  • CVF

    cumulative variant frequency

  •  
  • DHEAS

    dehydroepiandrosterone sulfate

  •  
  • GWAS

    genome-wide association study

  •  
  • HA

    hyperandrogenemic

  •  
  • INT

    inverse normal transformation

  •  
  • LD

    linkage disequilibrium

  •  
  • MAF

    minor allele frequency

  •  
  • MSC

    mutation significance cutoff

  •  
  • Pc

    corrected P value

  •  
  • Pn

    nominal P value

  •  
  • PCOS

    polycystic ovary syndrome

  •  
  • RBP

    RNA-binding protein

  •  
  • SNP

    single-nucleotide polymorphism

  •  
  • SNV

    single nucleotide variant

  •  
  • T

    testosterone

  •  
  • TF

    transcription factor

  •  
  • WGS

    whole-genome sequencing

  •  
  • μ (as subscript)

    mean

References and Notes

1.

Azziz
R
.
New insights into the genetics of polycystic ovary syndrome (published correction appears in Nat Rev Endocrinol. 2016;12(3):183)
.
Nat Rev Endocrinol
.
2016
;
12
(
2
):
74
75
.

2.

Gambineri
A
,
Patton
L
,
Altieri
P
,
Pagotto
U
,
Pizzi
C
,
Manzoli
L
,
Pasquali
R
.
Polycystic ovary syndrome is a risk factor for type 2 diabetes: results from a long-term prospective study
.
Diabetes
.
2012
;
61
(
9
):
2369
2374
.

3.

Shi
H
,
Kichaev
G
,
Pasaniuc
B
.
Contrasting the genetic architecture of 30 complex traits from summary association data
.
Am J Hum Genet
.
2016
;
99
(
1
):
139
153
.

4.

Chen
ZJ
,
Zhao
H
,
He
L
,
Shi
Y
,
Qin
Y
,
Shi
Y
,
Li
Z
,
You
L
,
Zhao
J
,
Liu
J
,
Liang
X
,
Zhao
X
,
Zhao
J
,
Sun
Y
,
Zhang
B
,
Jiang
H
,
Zhao
D
,
Bian
Y
,
Gao
X
,
Geng
L
,
Li
Y
,
Zhu
D
,
Sun
X
,
Xu
JE
,
Hao
C
,
Ren
CE
,
Zhang
Y
,
Chen
S
,
Zhang
W
,
Yang
A
,
Yan
J
,
Li
Y
,
Ma
J
,
Zhao
Y
.
Genome-wide association study identifies susceptibility loci for polycystic ovary syndrome on chromosome 2p16.3, 2p21 and 9q33.3
.
Nat Genet
.
2011
;
43
(
1
):
55
59
.

5.

Shi
Y
,
Zhao
H
,
Shi
Y
,
Cao
Y
,
Yang
D
,
Li
Z
,
Zhang
B
,
Liang
X
,
Li
T
,
Chen
J
,
Shen
J
,
Zhao
J
,
You
L
,
Gao
X
,
Zhu
D
,
Zhao
X
,
Yan
Y
,
Qin
Y
,
Li
W
,
Yan
J
,
Wang
Q
,
Zhao
J
,
Geng
L
,
Ma
J
,
Zhao
Y
,
He
G
,
Zhang
A
,
Zou
S
,
Yang
A
,
Liu
J
,
Li
W
,
Li
B
,
Wan
C
,
Qin
Y
,
Shi
J
,
Yang
J
,
Jiang
H
,
Xu
JE
,
Qi
X
,
Sun
Y
,
Zhang
Y
,
Hao
C
,
Ju
X
,
Zhao
D
,
Ren
CE
,
Li
X
,
Zhang
W
,
Zhang
Y
,
Zhang
J
,
Wu
D
,
Zhang
C
,
He
L
,
Chen
ZJ
.
Genome-wide association study identifies eight new risk loci for polycystic ovary syndrome
.
Nat Genet
.
2012
;
44
(
9
):
1020
1025
.

6.

Hayes
MG
,
Urbanek
M
,
Ehrmann
DA
,
Armstrong
LL
,
Lee
JY
,
Sisk
R
,
Karaderi
T
,
Barber
TM
,
McCarthy
MI
,
Franks
S
,
Lindgren
CM
,
Welt
CK
,
Diamanti-Kandarakis
E
,
Panidis
D
,
Goodarzi
MO
,
Azziz
R
,
Zhang
Y
,
James
RG
,
Olivier
M
,
Kissebah
AH
,
Stener-Victorin
E
,
Legro
RS
,
Dunaif
A
;
Reproductive Medicine Network
.
Genome-wide association of polycystic ovary syndrome implicates alterations in gonadotropin secretion in European ancestry populations (published correction appears in Nat Commun.
2016;7:10762)
.
Nat Commun
.
2015
;
6
(
1
):
7502
.

7.

Day
FR
,
Hinds
DA
,
Tung
JY
,
Stolk
L
,
Styrkarsdottir
U
,
Saxena
R
,
Bjonnes
A
,
Broer
L
,
Dunger
DB
,
Halldorsson
BV
,
Lawlor
DA
,
Laval
G
,
Mathieson
I
,
McCardle
WL
,
Louwers
Y
,
Meun
C
,
Ring
S
,
Scott
RA
,
Sulem
P
,
Uitterlinden
AG
,
Wareham
NJ
,
Thorsteinsdottir
U
,
Welt
C
,
Stefansson
K
,
Laven
JS
,
Ong
KK
,
Perry
JR
.
Causal mechanisms and balancing selection inferred from genetic associations with polycystic ovary syndrome
.
Nat Commun
.
2015
;
6
(
1
):
8464
.

8.

Day
F
,
Karaderi
T
,
Jones
MR
,
Meun
C
,
He
C
,
Drong
A
,
Kraft
P
,
Lin
N
,
Huang
H
,
Broer
L
,
Magi
R
,
Saxena
R
,
Laisk
T
,
Urbanek
M
,
Hayes
MG
,
Thorleifsson
G
,
Fernandez-Tajes
J
,
Mahajan
A
,
Mullin
BH
,
Stuckey
BGA
,
Spector
TD
,
Wilson
SG
,
Goodarzi
MO
,
Davis
L
,
Obermayer-Pietsch
B
,
Uitterlinden
AG
,
Anttila
V
,
Neale
BM
,
Jarvelin
M-R
,
Fauser
B
,
Kowalska
I
,
Visser
JA
,
Andersen
M
,
Ong
K
,
Stener-Victorin
E
,
Ehrmann
D
,
Legro
RS
,
Salumets
A
,
McCarthy
MI
,
Morin-Papunen
L
,
Thorsteinsdottir
U
,
Stefansson
K
,
Styrkarsdottir
U
,
Perry
JR
,
Dunaif
A
,
Laven
J
,
Franks
S
,
Lindgren
CM
,
Welt
CK
;
23andMe Research Team
.
Large-scale genome-wide meta-analysis of polycystic ovary syndrome suggests shared genetic architecture for different diagnosis criteria
.
PLoS Genet
.
2018
;
14
(
12
):
e1007813
.

9.

Vink
JM
,
Sadrzadeh
S
,
Lambalk
CB
,
Boomsma
DI
.
Heritability of polycystic ovary syndrome in a Dutch twin-family study
.
J Clin Endocrinol Metab
.
2006
;
91
(
6
):
2100
2104
.

10.

Manolio
TA
,
Collins
FS
,
Cox
NJ
,
Goldstein
DB
,
Hindorff
LA
,
Hunter
DJ
,
McCarthy
MI
,
Ramos
EM
,
Cardon
LR
,
Chakravarti
A
,
Cho
JH
,
Guttmacher
AE
,
Kong
A
,
Kruglyak
L
,
Mardis
E
,
Rotimi
CN
,
Slatkin
M
,
Valle
D
,
Whittemore
AS
,
Boehnke
M
,
Clark
AG
,
Eichler
EE
,
Gibson
G
,
Haines
JL
,
Mackay
TF
,
McCarroll
SA
,
Visscher
PM
.
Finding the missing heritability of complex diseases
.
Nature
.
2009
;
461
(
7265
):
747
753
.

11.

Li
X
,
Kim
Y
,
Tsang
EK
,
Davis
JR
,
Damani
FN
,
Chiang
C
,
Hess
GT
,
Zappala
Z
,
Strober
BJ
,
Scott
AJ
,
Li
A
,
Ganna
A
,
Bassik
MC
,
Merker
JD
,
Hall
IM
,
Battle
A
,
Montgomery
SB
;
GTEx Consortium
;
Laboratory, Data Analysis and Coordinating Center (LDACC)—Analysis Working Group
;
Statistical Methods groups—Analysis Working Group
;
Enhancing GTEx (eGTEx) groups
;
NIH Common Fund
;
NIH/NCI
;
NIH/NHGRI
;
NIH/NIMH
;
NIH/NIDA
;
Biospecimen Collection Source Site—NDRI
;
Biospecimen Collection Source Site—RPCI
;
Biospecimen Core Resource—VARI
;
Brain Bank Repository—University of Miami Brain Endowment Bank
;
Leidos Biomedical—Project Management
;
ELSI Study
;
Genome Browser Data Integration and Visualization—EBI
;
Genome Browser Data Integration and Visualization—UCSC Genomics Institute, University of California Santa Cruz
.
The impact of rare variation on gene expression across tissues
.
Nature
.
2017
;
550
(
7675
):
239
243
.

12.

Walter
K
,
Min
JL
,
Huang
J
,
Crooks
L
,
Memari
Y
,
McCarthy
S
,
Perry
JR
,
Xu
C
,
Futema
M
,
Lawson
D
,
Iotchkova
V
,
Schiffels
S
,
Hendricks
AE
,
Danecek
P
,
Li
R
,
Floyd
J
,
Wain
LV
,
Barroso
I
,
Humphries
SE
,
Hurles
ME
,
Zeggini
E
,
Barrett
JC
,
Plagnol
V
,
Richards
JB
,
Greenwood
CM
,
Timpson
NJ
,
Durbin
R
,
Soranzo
N
;
UK10K Consortium
.
The UK10K project identifies rare variants in health and disease
.
Nature
.
2015
;
526
(
7571
):
82
90
.

13.

Surakka
I
,
Horikoshi
M
,
Mägi
R
,
Sarin
AP
,
Mahajan
A
,
Lagou
V
,
Marullo
L
,
Ferreira
T
,
Miraglio
B
,
Timonen
S
,
Kettunen
J
,
Pirinen
M
,
Karjalainen
J
,
Thorleifsson
G
,
Hägg
S
,
Hottenga
JJ
,
Isaacs
A
,
Ladenvall
C
,
Beekman
M
,
Esko
T
,
Ried
JS
,
Nelson
CP
,
Willenborg
C
,
Gustafsson
S
,
Westra
HJ
,
Blades
M
,
de Craen
AJ
,
de Geus
EJ
,
Deelen
J
,
Grallert
H
,
Hamsten
A
,
Havulinna
AS
,
Hengstenberg
C
,
Houwing-Duistermaat
JJ
,
Hyppönen
E
,
Karssen
LC
,
Lehtimäki
T
,
Lyssenko
V
,
Magnusson
PK
,
Mihailov
E
,
Müller-Nurasyid
M
,
Mpindi
JP
,
Pedersen
NL
,
Penninx
BW
,
Perola
M
,
Pers
TH
,
Peters
A
,
Rung
J
,
Smit
JH
,
Steinthorsdottir
V
,
Tobin
MD
,
Tsernikova
N
,
van Leeuwen
EM
,
Viikari
JS
,
Willems
SM
,
Willemsen
G
,
Schunkert
H
,
Erdmann
J
,
Samani
NJ
,
Kaprio
J
,
Lind
L
,
Gieger
C
,
Metspalu
A
,
Slagboom
PE
,
Groop
L
,
van Duijn
CM
,
Eriksson
JG
,
Jula
A
,
Salomaa
V
,
Boomsma
DI
,
Power
C
,
Raitakari
OT
,
Ingelsson
E
,
Järvelin
MR
,
Thorsteinsdottir
U
,
Franke
L
,
Ikonen
E
,
Kallioniemi
O
,
Pietiäinen
V
,
Lindgren
CM
,
Stefansson
K
,
Palotie
A
,
McCarthy
MI
,
Morris
AP
,
Prokopenko
I
,
Ripatti
S
;
ENGAGE Consortium
.
The impact of low-frequency and rare variants on lipid levels
.
Nat Genet
.
2015
;
47
(
6
):
589
597
.

14.

Zou
Y
,
Lu
P
,
Shi
J
,
Liu
W
,
Yang
M
,
Zhao
S
,
Chen
N
,
Chen
M
,
Sun
Y
,
Gao
A
,
Chen
Q
,
Zhang
Z
,
Ma
Q
,
Ning
T
,
Ying
X
,
Jin
J
,
Deng
X
,
Shen
B
,
Zhang
Y
,
Yuan
B
,
Kauderer
S
,
Liu
S
,
Hong
J
,
Liu
R
,
Ning
G
,
Wang
W
,
Gu
W
,
Wang
J
.
IRX3 promotes the browning of white adipocytes and its rare variants are associated with human obesity risk
.
EBioMedicine
.
2017
;
24
:
64
75
.

15.

Turcot
V
,
Lu
Y
,
Highland
HM
,
Schurmann
C
,
Justice
AE
,
Fine
RS
,
Bradfield
JP
,
Esko
T
,
Giri
A
,
Graff
M
,
Guo
X
,
Hendricks
AE
,
Karaderi
T
,
Lempradl
A
,
Locke
AE
,
Mahajan
A
,
Marouli
E
,
Sivapalaratnam
S
,
Young
KL
,
Alfred
T
,
Feitosa
MF
,
Masca
NGD
,
Manning
AK
,
Medina-Gomez
C
,
Mudgal
P
,
Ng
MC
,
Reiner
AP
,
Vedantam
S
,
Willems
SM
,
Winkler
TW
,
Abecasis
G
,
Aben
KK
,
Alam
DS
,
Alharthi
SE
,
Allison
M
,
Amouyel
P
,
Asselbergs
FW
,
Auer
PL
,
Balkau
B
,
Bang
LE
,
Barroso
I
,
Bastarache
L
,
Benn
M
,
Bergmann
S
,
Bielak
LF
,
Blüher
M
,
Boehnke
M
,
Boeing
H
,
Boerwinkle
E
,
Böger
CA
,
Bork-Jensen
J
,
Bots
ML
,
Bottinger
EP
,
Bowden
DW
,
Brandslund
I
,
Breen
G
,
Brilliant
MH
,
Broer
L
,
Brumat
M
,
Burt
AA
,
Butterworth
AS
,
Campbell
PT
,
Cappellani
S
,
Carey
DJ
,
Catamo
E
,
Caulfield
MJ
,
Chambers
JC
,
Chasman
DI
,
Chen
YI
,
Chowdhury
R
,
Christensen
C
,
Chu
AY
,
Cocca
M
,
Collins
FS
,
Cook
JP
,
Corley
J
,
Corominas Galbany
J
,
Cox
AJ
,
Crosslin
DS
,
Cuellar-Partida
G
,
D’Eustacchio
A
,
Danesh
J
,
Davies
G
,
Bakker
PIW
,
Groot
MC
,
Mutsert
R
,
Deary
IJ
,
Dedoussis
G
,
Demerath
EW
,
Heijer
M
,
Hollander
AI
,
Ruijter
HM
,
Dennis
JG
,
Denny
JC
,
Di Angelantonio
E
,
Drenos
F
,
Du
M
,
Dubé
MP
,
Dunning
AM
,
Easton
DF
,
Edwards
TL
,
Ellinghaus
D
,
Ellinor
PT
,
Elliott
P
,
Evangelou
E
,
Farmaki
AE
,
Farooqi
IS
,
Faul
JD
,
Fauser
S
,
Feng
S
,
Ferrannini
E
,
Ferrieres
J
,
Florez
JC
,
Ford
I
,
Fornage
M
,
Franco
OH
,
Franke
A
,
Franks
PW
,
Friedrich
N
,
Frikke-Schmidt
R
,
Galesloot
TE
,
Gan
W
,
Gandin
I
,
Gasparini
P
,
Gibson
J
,
Giedraitis
V
,
Gjesing
AP
,
Gordon-Larsen
P
,
Gorski
M
,
Grabe
HJ
,
Grant
SF
,
Grarup
N
,
Griffiths
HL
,
Grove
ML
,
Gudnason
V
,
Gustafsson
S
,
Haessler
J
,
Hakonarson
H
,
Hammerschlag
AR
,
Hansen
T
,
Harris
KM
,
Harris
TB
,
Hattersley
AT
,
Have
CT
,
Hayward
C
,
He
L
,
Heard-Costa
NL
,
Heath
AC
,
Heid
IM
,
Helgeland
Ø
,
Hernesniemi
J
,
Hewitt
AW
,
Holmen
OL
,
Hovingh
GK
,
Howson
JM
,
Hu
Y
,
Huang
PL
,
Huffman
JE
,
Ikram
MA
,
Ingelsson
E
,
Jackson
AU
,
Jansson
JH
,
Jarvik
GP
,
Jensen
GB
,
Jia
Y
,
Johansson
S
,
Jørgensen
ME
,
Jørgensen
T
,
Jukema
JW
,
Kahali
B
,
Kahn
RS
,
Kähönen
M
,
Kamstrup
PR
,
Kanoni
S
,
Kaprio
J
,
Karaleftheri
M
,
Kardia
SLR
,
Karpe
F
,
Kathiresan
S
,
Kee
F
,
Kiemeney
LA
,
Kim
E
,
Kitajima
H
,
Komulainen
P
,
Kooner
JS
,
Kooperberg
C
,
Korhonen
T
,
Kovacs
P
,
Kuivaniemi
H
,
Kutalik
Z
,
Kuulasmaa
K
,
Kuusisto
J
,
Laakso
M
,
Lakka
TA
,
Lamparter
D
,
Lange
EM
,
Lange
LA
,
Langenberg
C
,
Larson
EB
,
Lee
NR
,
Lehtimäki
T
,
Lewis
CE
,
Li
H
,
Li
J
,
Li-Gao
R
,
Lin
H
,
Lin
KH
,
Lin
LA
,
Lin
X
,
Lind
L
,
Lindström
J
,
Linneberg
A
,
Liu
CT
,
Liu
DJ
,
Liu
Y
,
Lo
KS
,
Lophatananon
A
,
Lotery
AJ
,
Loukola
A
,
Luan
J
,
Lubitz
SA
,
Lyytikäinen
LP
,
Männistö
S
,
Marenne
G
,
Mazul
AL
,
McCarthy
MI
,
McKean-Cowdin
R
,
Medland
SE
,
Meidtner
K
,
Milani
L
,
Mistry
V
,
Mitchell
P
,
Mohlke
KL
,
Moilanen
L
,
Moitry
M
,
Montgomery
GW
,
Mook-Kanamori
DO
,
Moore
C
,
Mori
TA
,
Morris
AD
,
Morris
AP
,
Müller-Nurasyid
M
,
Munroe
PB
,
Nalls
MA
,
Narisu
N
,
Nelson
CP
,
Neville
M
,
Nielsen
SF
,
Nikus
K
,
Njølstad
PR
,
Nordestgaard
BG
,
Nyholt
DR
,
O’Connel
JR
,
O’Donoghue
ML
,
Olde Loohuis
LM
,
Ophoff
RA
,
Owen
KR
,
Packard
CJ
,
Padmanabhan
S
,
Palmer
CNA
,
Palmer
ND
,
Pasterkamp
G
,
Patel
AP
,
Pattie
A
,
Pedersen
O
,
Peissig
PL
,
Peloso
GM
,
Pennell
CE
,
Perola
M
,
Perry
JA
,
Perry
JRB
,
Pers
TH
,
Person
TN
,
Peters
A
,
Petersen
ER
,
Peyser
PA
,
Pirie
A
,
Polasek
O
,
Polderman
TJ
,
Puolijoki
H
,
Raitakari
OT
,
Rasheed
A
,
Rauramaa
R
,
Reilly
DF
,
Renström
F
,
Rheinberger
M
,
Ridker
PM
,
Rioux
JD
,
Rivas
MA
,
Roberts
DJ
,
Robertson
NR
,
Robino
A
,
Rolandsson
O
,
Rudan
I
,
Ruth
KS
,
Saleheen
D
,
Salomaa
V
,
Samani
NJ
,
Sapkota
Y
,
Sattar
N
,
Schoen
RE
,
Schreiner
PJ
,
Schulze
MB
,
Scott
RA
,
Segura-Lepe
MP
,
Shah
SH
,
Sheu
WH
,
Sim
X
,
Slater
AJ
,
Small
KS
,
Smith
AV
,
Southam
L
,
Spector
TD
,
Speliotes
EK
,
Starr
JM
,
Stefansson
K
,
Steinthorsdottir
V
,
Stirrups
KE
,
Strauch
K
,
Stringham
HM
,
Stumvoll
M
,
Sun
L
,
Surendran
P
,
Swift
AJ
,
Tada
H
,
Tansey
KE
,
Tardif
JC
,
Taylor
KD
,
Teumer
A
,
Thompson
DJ
,
Thorleifsson
G
,
Thorsteinsdottir
U
,
Thuesen
BH
,
Tönjes
A
,
Tromp
G
,
Trompet
S
,
Tsafantakis
E
,
Tuomilehto
J
,
Tybjaerg-Hansen
A
,
Tyrer
JP
,
Uher
R
,
Uitterlinden
AG
,
Uusitupa
M
,
Laan
SW
,
Duijn
CM
,
Leeuwen
N
,
van Setten
J
,
Vanhala
M
,
Varbo
A
,
Varga
TV
,
Varma
R
,
Velez Edwards
DR
,
Vermeulen
SH
,
Veronesi
G
,
Vestergaard
H
,
Vitart
V
,
Vogt
TF
,
Völker
U
,
Vuckovic
D
,
Wagenknecht
LE
,
Walker
M
,
Wallentin
L
,
Wang
F
,
Wang
CA
,
Wang
S
,
Wang
Y
,
Ware
EB
,
Wareham
NJ
,
Warren
HR
,
Waterworth
DM
,
Wessel
J
,
White
HD
,
Willer
CJ
,
Wilson
JG
,
Witte
DR
,
Wood
AR
,
Wu
Y
,
Yaghootkar
H
,
Yao
J
,
Yao
P
,
Yerges-Armstrong
LM
,
Young
R
,
Zeggini
E
,
Zhan
X
,
Zhang
W
,
Zhao
JH
,
Zhao
W
,
Zhao
W
,
Zhou
W
,
Zondervan
KT
,
Rotter
JI
,
Pospisilik
JA
,
Rivadeneira
F
,
Borecki
IB
,
Deloukas
P
,
Frayling
TM
,
Lettre
G
,
North
KE
,
Lindgren
CM
,
Hirschhorn
JN
,
Loos
RJ;
CHD Exome+ Consortium
;
EPIC-CVD Consortium
;
ExomeBP Consortium
;
Global Lipids Genetic Consortium
;
GoT2D Genes Consortium
;
EPIC InterAct Consortium
;
INTERVAL StudyReproGen Consortium
;
T2D-Genes Consortium
;
MAGIC Investigators
;
Understanding Society Scientific Group
.
Protein-altering variants associated with body mass index implicate pathways that control energy intake and expenditure in obesity (published corrctions appear in Nat Genet. 2018;50(5):765–766 and Nat Genet. 2018;50(5):766–767)
.
Nat Genet
.
2018
;
50
(
1
):
26
41
.

16.

Russo
A
,
Di Gaetano
C
,
Cugliari
G
,
Matullo
G
.
Advances in the genetics of hypertension: the effect of rare variants
.
Int J Mol Sci
.
2018
;
19
(
3
):
E688
.

17.

Gorsik
LK
,
Kosova
G
,
Werstein
B
,
Sisk
R
,
Legro
RS
,
Hayes
MG
,
Teixeira
JM
,
Dunaif
A
,
Urbanek
M
.
Pathogenic anti-Müllerian hormone variants in polycystic ovary syndrome
.
J Clin Endocrinol Metab.
2017
;
102
(
8
):
2862
2872
.

18.

Gorsik
LK
,
Dapas
M
,
Legro
RS
,
Hayes
MG
,
Urbanek
M
.
Functional genetic variation in the anti-Müllerian hormone pathway in women with polycystic ovary syndrome
.
J Clin Endocrinol Metab.
2019
;
104
(
7
):
2855
2874
.

19.

Zawadski
JK
,
Dunaif
A
.
Diagnostic criteria for polycystic ovary syndrome: towards a rational approach. In: Dunaif A, Givens JR, Haseltine FP, Merriam GR, eds. Polycystic Ovary Syndrome. Boston, MA: Blackwell Scientific; 1992:377–384
.

20.

Diamanti-Kandarakis
E
,
Dunaif
A
.
Insulin resistance and the polycystic ovary syndrome revisited: an update on mechanisms and implications
.
Endocr Rev
.
2012
;
33
(
6
):
981
1030
.

21.

Kosova
G
,
Urbanek
M
.
Genetics of the polycystic ovary syndrome
.
Mol Cell Endocrinol
.
2013
;
373
(
1–2
):
29
38
.

22.

Dunaif
A
.
Perspectives in polycystic ovary syndrome: from hair to eternity
.
J Clin Endocrinol Metab
.
2016
;
101
(
3
):
759
768
.

23.

Legro
RS
,
Spielman
R
,
Urbanek
M
,
Driscoll
D
,
Strauss
JF
III
,
Dunaif
A
.
Phenotype and genotype in polycystic ovary syndrome
.
Recent Prog Horm Res
.
1998
;
53
:
217
256
.

24.

Urbanek
M
,
Legro
RS
,
Driscoll
DA
,
Azziz
R
,
Ehrmann
DA
,
Norman
RJ
,
Strauss
JF
III
,
Spielman
RS
,
Dunaif
A
.
Thirty-seven candidate genes for polycystic ovary syndrome: strongest evidence for linkage is with follistatin
.
Proc Natl Acad Sci USA
.
1999
;
96
(
15
):
8573
8578
.

25.

Janssen
I
,
Powell
LH
,
Kazlauskaite
R
,
Dugan
SA
.
Testosterone and visceral fat in midlife women: the Study of Women’s Health Across the Nation (SWAN) fat patterning study
.
Obesity (Silver Spring)
.
2010
;
18
(
3
):
604
610
.

26.

Kim
C
,
Harlow
SD
,
Zheng
H
,
McConnell
DS
,
Randolph
JF
Jr
.
Changes in androstenedione, dehydroepiandrosterone, testosterone, estradiol, and estrone over the menopausal transition
.
Womens Midlife Health
.
2017
;
3
(
1
):
9
.

27.

Urbanek
M
,
Woodroffe
A
,
Ewens
KG
,
Diamanti-Kandarakis
E
,
Legro
RS
,
Strauss
JF
III
,
Dunaif
A
,
Spielman
RS
.
Candidate gene region for polycystic ovary syndrome on chromosome 19p13.2
.
J Clin Endocrinol Metab
.
2005
;
90
(
12
):
6623
6629
.

28.

Kotsopoulos
J
,
Shafrir
AL
,
Rice
M
,
Hankinson
SE
,
Eliassen
AH
,
Tworoger
SS
,
Narod
SA
.
The relationship between bilateral oophorectomy and plasma hormone levels in postmenopausal women
.
Horm Cancer
.
2015
;
6
(
1
):
54
63
.

29.

Yazdani
S
,
Sharbatdaran
M
,
Abedi Samakoosh
M
,
Bouzari
Z
,
Masoudi
Z
.
Glucose tolerance and lipid profile changes after surgical menopause
.
Caspian J Intern Med
.
2014
;
5
(
2
):
114
117
.

30.

Espeland
MA
,
Hogan
PE
,
Fineberg
SE
,
Howard
G
,
Schrott
H
,
Waclawiw
MA
,
Bush
TL
.
Effect of postmenopausal hormone therapy on glucose and insulin concentrations
.
Diabetes Care
.
1998
;
21
(
10
):
1589
1595
.

31.

Magkos
F
,
Wang
X
,
Mittendorfer
B
.
Metabolic actions of insulin in men and women
.
Nutrition
.
2010
;
26
(
7–8
):
686
693
.

32.

Sohrevardi
SM
,
Nosouhi
F
,
Hossein Khalilzade
S
,
Kafaie
P
,
Karimi-Zarchi
M
,
Halvaei
I
,
Mohsenzadeh
M
.
Evaluating the effect of insulin sensitizers metformin and pioglitazone alone and in combination on women with polycystic ovary syndrome: an RCT
.
Int J Reprod Biomed (Yazd)
.
2016
;
14
(
12
):
743
754
.

33.

Torchen
LC
,
Kumar
A
,
Kalra
B
,
Savjani
G
,
Sisk
R
,
Legro
RS
,
Dunaif
A
.
Increased antimüllerian hormone levels and other reproductive endocrine changes in adult male relatives of women with polycystic ovary syndrome
.
Fertil Steril
.
2016
;
106
(
1
):
50
55
.

34.

Sam
S
,
Legro
RS
,
Essah
PA
,
Apridonidze
T
,
Dunaif
A
.
Evidence for metabolic and reproductive phenotypes in mothers of women with polycystic ovary syndrome
.
Proc Natl Acad Sci USA
.
2006
;
103
(
18
):
7030
7035
.

35.

World Health Organization. Definition and diagnosis of diabetes mellitus and intermediate hyperglycaemia: report of a WHO/IDF consultation. Available at: https://apps.who.int/iris/handle/10665/43588. Accessed 16 August 2018
.

36.

Drmanac
R
,
Sparks
AB
,
Callow
MJ
,
Halpern
AL
,
Burns
NL
,
Kermani
BG
,
Carnevali
P
,
Nazarenko
I
,
Nilsen
GB
,
Yeung
G
,
Dahl
F
,
Fernandez
A
,
Staker
B
,
Pant
KP
,
Baccash
J
,
Borcherding
AP
,
Brownley
A
,
Cedeno
R
,
Chen
L
,
Chernikoff
D
,
Cheung
A
,
Chirita
R
,
Curson
B
,
Ebert
JC
,
Hacker
CR
,
Hartlage
R
,
Hauser
B
,
Huang
S
,
Jiang
Y
,
Karpinchyk
V
,
Koenig
M
,
Kong
C
,
Landers
T
,
Le
C
,
Liu
J
,
McBride
CE
,
Morenzoni
M
,
Morey
RE
,
Mutch
K
,
Perazich
H
,
Perry
K
,
Peters
BA
,
Peterson
J
,
Pethiyagoda
CL
,
Pothuraju
K
,
Richter
C
,
Rosenbaum
AM
,
Roy
S
,
Shafto
J
,
Sharanhovich
U
,
Shannon
KW
,
Sheppy
CG
,
Sun
M
,
Thakuria
JV
,
Tran
A
,
Vu
D
,
Zaranek
AW
,
Wu
X
,
Drmanac
S
,
Oliphant
AR
,
Banyai
WC
,
Martin
B
,
Ballinger
DG
,
Church
GM
,
Reid
CA
.
Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays
.
Science
.
2010
;
327
(
5961
):
78
81
.

37.

Baccash
J
,
Halpern
A
,
Tian
C
,
Pant
K
,
Carnevali
P
,
inventors; Complete Genomics, Inc., assignee. Determining variants in genome of a heterogeneous sample. US patent application 2013/0110407-A1. 2 May 2013
.

38.

Gorsic
LK
,
Kosova
G
,
Werstein
B
,
Sisk
R
,
Legro
RS
,
Hayes
MG
,
Teixeira
JM
,
Dunaif
A
,
Urbanek
M
.
Pathogenic anti-Müllerian hormone variants in polycystic ovary syndrome
.
J Clin Endocrinol Metab
.
2017
;
102
(
8
):
2862
2872
.

39.

Erikson Galina
A
,
Bodian Dale
L
,
Rueda
M
,
Molparia
B
,
Scott Erick
R
,
Scott-Van Zeeland Ashley
A
,
Topol Sarah
E
,
Wineinger Nathan
E
,
Niederhuber John
E
,
Topol Eric
J
,
Torkamani
A.
Whole-Genome Sequencing of a Healthy Aging Cohort
.
Cell
.
2016
;
165
(
4
):
1002
1011
.

40.

Roach
JC
,
Glusman
G
,
Smit
AF
,
Huff
CD
,
Hubley
R
,
Shannon
PT
,
Rowen
L
,
Pant
KP
,
Goodman
N
,
Bamshad
M
,
Shendure
J
,
Drmanac
R
,
Jorde
LB
,
Hood
L
,
Galas
DJ
.
Analysis of genetic inheritance in a family quartet by whole-genome sequencing
.
Science
.
2010
;
328
(
5978
):
636
639
.

41.

Reumers
J
,
De Rijk
P
,
Zhao
H
,
Liekens
A
,
Smeets
D
,
Cleary
J
,
Van Loo
P
,
Van Den Bossche
M
,
Catthoor
K
,
Sabbe
B
,
Despierre
E
,
Vergote
I
,
Hilbush
B
,
Lambrechts
D
,
Del-Favero
J
.
Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing
.
Nat Biotechnol
.
2011
;
30
(
1
):
61
68
.

42.

Yuen
RK
,
Thiruvahindrapuram
B
,
Merico
D
,
Walker
S
,
Tammimies
K
,
Hoang
N
,
Chrysler
C
,
Nalpathamkalam
T
,
Pellecchia
G
,
Liu
Y
,
Gazzellone
MJ
,
D’Abate
L
,
Deneault
E
,
Howe
JL
,
Liu
RS
,
Thompson
A
,
Zarrei
M
,
Uddin
M
,
Marshall
CR
,
Ring
RH
,
Zwaigenbaum
L
,
Ray
PN
,
Weksberg
R
,
Carter
MT
,
Fernandez
BA
,
Roberts
W
,
Szatmari
P
,
Scherer
SW
.
Whole-genome sequencing of quartet families with autism spectrum disorder
.
Nat Med
.
2015
;
21
(
2
):
185
191
.

43.

Abecasis
GR
,
Altshuler
D
,
Auton
A
,
Brooks
LD
,
Durbin
RM
,
Gibbs
RA
,
Hurles
ME
,
McVean
GA
;
1000 Genomes Project Consortium
.
A map of human genome variation from population-scale sequencing (published correction appears in Nature. 2011;473(7348):544)
.
Nature
.
2010
;
467
(
7319
):
1061
1073
.

44.

Jiang
YH
,
Yuen
RK
,
Jin
X
,
Wang
M
,
Chen
N
,
Wu
X
,
Ju
J
,
Mei
J
,
Shi
Y
,
He
M
,
Wang
G
,
Liang
J
,
Wang
Z
,
Cao
D
,
Carter
MT
,
Chrysler
C
,
Drmic
IE
,
Howe
JL
,
Lau
L
,
Marshall
CR
,
Merico
D
,
Nalpathamkalam
T
,
Thiruvahindrapuram
B
,
Thompson
A
,
Uddin
M
,
Walker
S
,
Luo
J
,
Anagnostou
E
,
Zwaigenbaum
L
,
Ring
RH
,
Wang
J
,
Lajonchere
C
,
Wang
J
,
Shih
A
,
Szatmari
P
,
Yang
H
,
Dawson
G
,
Li
Y
,
Scherer
SW
.
Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing
.
Am J Hum Genet
.
2013
;
93
(
2
):
249
263
.

45.

Dapas
M
,
Sisk
R
,
Legro
RS
,
Urbanek
M
,
Dunaif
A
,
Hayes
MG.
Data from: Family-based quantitative trait meta-analysis implicates rare noncoding variants in DENND1A in polycystic ovary syndrome. figshare 2019. Deposited 12 March 2019. https://dx.doi.org/10.6084/m9.figshare.7834658
.

46.

Kircher
M
,
Witten
DM
,
Jain
P
,
O’Roak
BJ
,
Cooper
GM
,
Shendure
J
.
A general framework for estimating the relative pathogenicity of human genetic variants
.
Nat Genet
.
2014
;
46
(
3
):
310
315
.

47.

Itan
Y
,
Shang
L
,
Boisson
B
,
Ciancanelli
MJ
,
Markle
JG
,
Martinez-Barricarte
R
,
Scott
E
,
Shah
I
,
Stenson
PD
,
Gleeson
J
,
Cooper
DN
,
Quintana-Murci
L
,
Zhang
SY
,
Abel
L
,
Casanova
JL
.
The mutation significance cutoff: gene-level thresholds for variant predictions
.
Nat Methods
.
2016
;
13
(
2
):
109
110
.

48.

Adzhubei
IA
,
Schmidt
S
,
Peshkin
L
,
Ramensky
VE
,
Gerasimova
A
,
Bork
P
,
Kondrashov
AS
,
Sunyaev
SR
.
A method and server for predicting damaging missense mutations
.
Nat Methods
.
2010
;
7
(
4
):
248
249
.

49.

Kumar
P
,
Henikoff
S
,
Ng
PC
.
Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm
.
Nat Protoc
.
2009
;
4
(
7
):
1073
1081
.

50.

Huang
YF
,
Gulko
B
,
Siepel
A
.
Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data
.
Nat Genet
.
2017
;
49
(
4
):
618
624
.

51.

Dong
C
,
Wei
P
,
Jian
X
,
Gibbs
R
,
Boerwinkle
E
,
Wang
K
,
Liu
X
.
Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies
.
Hum Mol Genet
.
2015
;
24
(
8
):
2125
2137
.

52.

Gnad
F
,
Baucom
A
,
Mukhyala
K
,
Manning
G
,
Zhang
Z
.
Assessment of computational methods for predicting the effects of missense mutations in human cancers
.
BMC Genomics
.
2013
;
14
(
Suppl 3
):
S7
.

53.

González-Pérez
A
,
López-Bigas
N
.
Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel
.
Am J Hum Genet
.
2011
;
88
(
4
):
440
449
.

54.

Nishizaki
SS
,
Boyle
AP
.
Mining the unknown: assigning function to noncoding single nucleotide polymorphisms
.
Trends Genet
.
2017
;
33
(
1
):
34
45
.

55.

Ernst
J
,
Kellis
M
.
Chromatin-state discovery and genome annotation with ChromHMM
.
Nat Protoc
.
2017
;
12
(
12
):
2478
2492
.

56.

Zhang
Y
,
Hardison
RC
.
Accurate and reproducible functional maps in 127 human cell types via 2D genome segmentation
.
Nucleic Acids Res
.
2017
;
45
(
17
):
9823
9836
.

57.

Rosenfield
RL
,
Ehrmann
DA
.
The pathogenesis of polycystic ovary syndrome (PCOS): the hypothesis of PCOS as functional ovarian hyperandrogenism revisited
.
Endocr Rev
.
2016
;
37
(
5
):
467
520
.

58.

Consortium
EP
,
Bernstein
BE
,
Birney
E
,
Dunham
I
,
Green
ED
,
Gunter
C
,
Snyder
M
;
ENCODE Project Consortium
.
An integrated encyclopedia of DNA elements in the human genome
.
Nature
.
2012
;
489
(
7414
):
57
74
.

59.

Forgacs
AL
,
Ding
Q
,
Jaremba
RG
,
Huhtaniemi
IT
,
Rahman
NA
,
Zacharewski
TR
.
BLTK1 murine Leydig cells: a novel steroidogenic model for evaluating the effects of reproductive and developmental toxicants
.
Toxicol Sci
.
2012
;
127
(
2
):
391
402
.

60.

Huang-Doran
I
,
Franks
S
.
Genetic rodent models of obesity-associated ovarian dysfunction and subfertility: insights into polycystic ovary syndrome
.
Front Endocrinol (Lausanne)
.
2016
;
7
:
53
.

61.

Tee
MK
,
Speek
M
,
Legeza
B
,
Modi
B
,
Teves
ME
,
McAllister
JM
,
Strauss
JF
III
,
Miller
WL
.
Alternative splicing of DENND1A, a PCOS candidate gene, generates variant 2
.
Mol Cell Endocrinol
.
2016
;
434
:
25
35
.

62.

Erikson
GA
,
Bodian
DL
,
Rueda
M
,
Molparia
B
,
Scott
ER
,
Scott-Van Zeeland
AA
,
Topol
SE
,
Wineinger
NE
,
Niederhuber
JE
,
Topol
EJ
,
Torkamani
A
.
Whole-genome sequencing of a healthy aging cohort
.
Cell
.
2016
;
165
(
4
):
1002
1011
.

63.

Tombácz
D
,
Maróti
Z
,
Kalmár
T
,
Csabai
Z
,
Balázs
Z
,
Takahashi
S
,
Palkovits
M
,
Snyder
M
,
Boldogkői
Z
.
High-coverage whole-exome sequencing identifies candidate genes for suicide in victims with major depressive disorder
.
Sci Rep
.
2017
;
7
(
1
):
7106
.

64.

Wong
JK
,
Gui
H
,
Kwok
M
,
Ng
PW
,
Lui
CH
,
Baum
L
,
Sham
PC
,
Kwan
P
,
Cherny
SS
.
Rare variants and de novo variants in mesial temporal lobe epilepsy with hippocampal sclerosis
.
Neurol Genet
.
2018
;
4
(
3
):
e245
.

65.

He
Z
,
Zhang
D
,
Renton
AE
,
Li
B
,
Zhao
L
,
Wang
GT
,
Goate
AM
,
Mayeux
R
,
Leal
SM
.
The rare-variant generalized disequilibrium test for association analysis of nuclear and extended pedigrees with application to Alzheimer disease WGS data (published correction appears in
Am J Hum Genet.
2017;100(2):371)
.
Am J Hum Genet
.
2017
;
100
(
2
):
193
204
.

66.

Igartua
C
,
Myers
RA
,
Mathias
RA
,
Pino-Yanes
M
,
Eng
C
,
Graves
PE
,
Levin
AM
,
Del-Rio-Navarro
BE
,
Jackson
DJ
,
Livne
OE
,
Rafaels
N
,
Edlund
CK
,
Yang
JJ
,
Huntsman
S
,
Salam
MT
,
Romieu
I
,
Mourad
R
,
Gern
JE
,
Lemanske
RF
,
Wyss
A
,
Hoppin
JA
,
Barnes
KC
,
Burchard
EG
,
Gauderman
WJ
,
Martinez
FD
,
Raby
BA
,
Weiss
ST
,
Williams
LK
,
London
SJ
,
Gilliland
FD
,
Nicolae
DL
,
Ober
C
.
Ethnic-specific associations of rare and low-frequency DNA sequence variants with asthma
.
Nat Commun
.
2015
;
6
(
1
):
5965
.

67.

Lubitz
SA
,
Brody
JA
,
Bihlmeyer
NA
,
Roselli
C
,
Weng
LC
,
Christophersen
IE
,
Alonso
A
,
Boerwinkle
E
,
Gibbs
RA
,
Bis
JC
,
Cupples
LA
,
Mohler
PJ
,
Nickerson
DA
,
Muzny
D
,
Perez
MV
,
Psaty
BM
,
Soliman
EZ
,
Sotoodehnia
N
,
Lunetta
KL
,
Benjamin
EJ
,
Heckbert
SR
,
Arking
DE
,
Ellinor
PT
,
Lin
H
;
NHLBI GO Exome Sequencing Project
.
Whole exome sequencing in atrial fibrillation
.
PLoS Genet
.
2016
;
12
(
9
):
e1006284
.

68.

Li
M
,
Li
Y
,
Weeks
O
,
Mijatovic
V
,
Teumer
A
,
Huffman
JE
,
Tromp
G
,
Fuchsberger
C
,
Gorski
M
,
Lyytikäinen
LP
,
Nutile
T
,
Sedaghat
S
,
Sorice
R
,
Tin
A
,
Yang
Q
,
Ahluwalia
TS
,
Arking
DE
,
Bihlmeyer
NA
,
Böger
CA
,
Carroll
RJ
,
Chasman
DI
,
Cornelis
MC
,
Dehghan
A
,
Faul
JD
,
Feitosa
MF
,
Gambaro
G
,
Gasparini
P
,
Giulianini
F
,
Heid
I
,
Huang
J
,
Imboden
M
,
Jackson
AU
,
Jeff
J
,
Jhun
MA
,
Katz
R
,
Kifley
A
,
Kilpeläinen
TO
,
Kumar
A
,
Laakso
M
,
Li-Gao
R
,
Lohman
K
,
Lu
Y
,
Mägi
R
,
Malerba
G
,
Mihailov
E
,
Mohlke
KL
,
Mook-Kanamori
DO
,
Robino
A
,
Ruderfer
D
,
Salvi
E
,
Schick
UM
,
Schulz
CA
,
Smith
AV
,
Smith
JA
,
Traglia
M
,
Yerges-Armstrong
LM
,
Zhao
W
,
Goodarzi
MO
,
Kraja
AT
,
Liu
C
,
Wessel
J
,
Boerwinkle
E
,
Borecki
IB
,
Bork-Jensen
J
,
Bottinger
EP
,
Braga
D
,
Brandslund
I
,
Brody
JA
,
Campbell
A
,
Carey
DJ
,
Christensen
C
,
Coresh
J
,
Crook
E
,
Curhan
GC
,
Cusi
D
,
de Boer
IH
,
de Vries
AP
,
Denny
JC
,
Devuyst
O
,
Dreisbach
AW
,
Endlich
K
,
Esko
T
,
Franco
OH
,
Fulop
T
,
Gerhard
GS
,
Glümer
C
,
Gottesman
O
,
Grarup
N
,
Gudnason
V
,
Hansen
T
,
Harris
TB
,
Hayward
C
,
Hocking
L
,
Hofman
A
,
Hu
FB
,
Husemoen
LL
,
Jackson
RD
,
Jørgensen
T
,
Jørgensen
ME
,
Kähönen
M
,
Kardia
SL
,
König
W
,
Kooperberg
C
,
Kriebel
J
,
Launer
LJ
,
Lauritzen
T
,
Lehtimäki
T
,
Levy
D
,
Linksted
P
,
Linneberg
A
,
Liu
Y
,
Loos
RJ
,
Lupo
A
,
Meisinger
C
,
Melander
O
,
Metspalu
A
,
Mitchell
P
,
Nauck
M
,
Nürnberg
P
,
Orho-Melander
M
,
Parsa
A
,
Pedersen
O
,
Peters
A
,
Peters
U
,
Polasek
O
,
Porteous
D
,
Probst-Hensch
NM
,
Psaty
BM
,
Qi
L
,
Raitakari
OT
,
Reiner
AP
,
Rettig
R
,
Ridker
PM
,
Rivadeneira
F
,
Rossouw
JE
,
Schmidt
F
,
Siscovick
D
,
Soranzo
N
,
Strauch
K
,
Toniolo
D
,
Turner
ST
,
Uitterlinden
AG
,
Ulivi
S
,
Velayutham
D
,
Völker
U
,
Völzke
H
,
Waldenberger
M
,
Wang
JJ
,
Weir
DR
,
Witte
D
,
Kuivaniemi
H
,
Fox
CS
,
Franceschini
N
,
Goessling
W
,
Köttgen
A
,
Chu
AY
;
CHARGE Glycemic-T2D Working Group
;
CHARGE Blood Pressure Working Group
.
SOS2 and ACP1 loci identified through large-scale exome chip analysis regulate kidney development and function
.
J Am Soc Nephrol
.
2017
;
28
(
3
):
981
994
.

69.

Crosby
J
,
Peloso
GM
,
Auer
PL
,
Crosslin
DR
,
Stitziel
NO
,
Lange
LA
,
Lu
Y
,
Tang
ZZ
,
Zhang
H
,
Hindy
G
,
Masca
N
,
Stirrups
K
,
Kanoni
S
,
Do
R
,
Jun
G
,
Hu
Y
,
Kang
HM
,
Xue
C
,
Goel
A
,
Farrall
M
,
Duga
S
,
Merlini
PA
,
Asselta
R
,
Girelli
D
,
Olivieri
O
,
Martinelli
N
,
Yin
W
,
Reilly
D
,
Speliotes
E
,
Fox
CS
,
Hveem
K
,
Holmen
OL
,
Nikpay
M
,
Farlow
DN
,
Assimes
TL
,
Franceschini
N
,
Robinson
J
,
North
KE
,
Martin
LW
,
DePristo
M
,
Gupta
N
,
Escher
SA
,
Jansson
JH
,
Van Zuydam
N
,
Palmer
CN
,
Wareham
N
,
Koch
W
,
Meitinger
T
,
Peters
A
,
Lieb
W
,
Erbel
R
,
Konig
IR
,
Kruppa
J
,
Degenhardt
F
,
Gottesman
O
,
Bottinger
EP
,
O’Donnell
CJ
,
Psaty
BM
,
Ballantyne
CM
,
Abecasis
G
,
Ordovas
JM
,
Melander
O
,
Watkins
H
,
Orho-Melander
M
,
Ardissino
D
,
Loos
RJ
,
McPherson
R
,
Willer
CJ
,
Erdmann
J
,
Hall
AS
,
Samani
NJ
,
Deloukas
P
,
Schunkert
H
,
Wilson
JG
,
Kooperberg
C
,
Rich
SS
,
Tracy
RP
,
Lin
DY
,
Altshuler
D
,
Gabriel
S
,
Nickerson
DA
,
Jarvik
GP
,
Cupples
LA
,
Reiner
AP
,
Boerwinkle
E
,
Kathiresan
S
;
TG and HDL Working Group of the Exome Sequencing Project, National Heart, Lung, and Blood Institute
.
Loss-of-function mutations in APOC3, triglycerides, and coronary disease
.
N Engl J Med
.
2014
;
371
(
1
):
22
31
.

70.

Dai
W
,
Zheng
H
,
Cheung
AK
,
Tang
CS
,
Ko
JM
,
Wong
BW
,
Leong
MM
,
Sham
PC
,
Cheung
F
,
Kwong
DL
,
Ngan
RK
,
Ng
WT
,
Yau
CC
,
Pan
J
,
Peng
X
,
Tung
S
,
Zhang
Z
,
Ji
M
,
Chiang
AK
,
Lee
AW
,
Lee
VH
,
Lam
KO
,
Au
KH
,
Cheng
HC
,
Yiu
HH
,
Lung
ML
.
Whole-exome sequencing identifies MST1R as a genetic susceptibility gene in nasopharyngeal carcinoma
.
Proc Natl Acad Sci USA
.
2016
;
113
(
12
):
3317
3322
.

71.

Ferre-Fernández
JJ
,
Aroca-Aguilar
JD
,
Medina-Trillo
C
,
Bonet-Fernández
JM
,
Méndez-Hernández
CD
,
Morales-Fernández
L
,
Corton
M
,
Cabañero-Valera
MJ
,
Gut
M
,
Tonda
R
,
Ayuso
C
,
Coca-Prados
M
,
García-Feijoo
J
,
Escribano
J
.
Whole-exome sequencing of congenital glaucoma patients reveals hypermorphic variants in GPATCH3, a new gene involved in ocular and craniofacial development
.
Sci Rep
.
2017
;
7
(
1
):
46175
.

72.

Strom
SP
,
Gorin
MB
.
Evaluation of autosomal dominant retinal dystrophy genes in an unaffected cohort suggests rare or private missense variants may often be benign
.
Mol Vis
.
2013
;
19
:
980
985
.

73.

Schroeder
JW
,
Hirst
WG
,
Szewczyk
GA
,
Simmons
LA
.
The effect of local sequence context on mutational bias of genes encoded on the leading and lagging strands
.
Curr Biol
.
2016
;
26
(
5
):
692
697
.

74.

Iossifov
I
,
Levy
D
,
Allen
J
,
Ye
K
,
Ronemus
M
,
Lee
YH
,
Yamrom
B
,
Wigler
M
.
Low load for disruptive mutations in autism genes and their biased transmission
.
Proc Natl Acad Sci USA
.
2015
;
112
(
41
):
E5600
E5607
.

75.

Gao
L
,
Fang
Z
,
Zhang
K
,
Zhi
D
,
Cui
X
.
Length bias correction for RNA-seq data in gene set analyses
.
Bioinformatics
.
2011
;
27
(
5
):
662
669
.

76.

Shyr
C
,
Tarailo-Graovac
M
,
Gottlieb
M
,
Lee
JJ
,
van Karnebeek
C
,
Wasserman
WW
.
FLAGS, frequently mutated genes in public exomes [published correction appears in
BMC Med Genomics
.
2017;10(1):69]
.
BMC Med Genomics
.
2014
;
7
(
1
):
64
.

77.

Schaid
DJ
,
McDonnell
SK
,
Sinnwell
JP
,
Thibodeau
SN
.
Multiple genetic variant association testing by collapsing and kernel methods with pedigree or population structured data
.
Genet Epidemiol
.
2013
;
37
(
5
):
409
418
.

78.

Wei
P
,
Cao
Y
,
Zhang
Y
,
Xu
Z
,
Kwak
IY
,
Boerwinkle
E
,
Pan
W
.
On robust association testing for quantitative traits and rare variants
.
G3 (Bethesda)
.
2016
;
6
(
12
):
3941
3950
.

79.

Auer
PL
,
Reiner
AP
,
Leal
SM
.
The effect of phenotypic outliers and non-normality on rare-variant association testing
.
Eur J Hum Genet
.
2016
;
24
(
8
):
1188
1194
.

80.

Solovieff
N
,
Cotsapas
C
,
Lee
PH
,
Purcell
SM
,
Smoller
JW
.
Pleiotropy in complex traits: challenges and strategies
.
Nat Rev Genet
.
2013
;
14
(
7
):
483
495
.

81.

Wu
B
,
Pankow
JS
.
Fast and accurate genome-wide association test of multiple quantitative traits
.
Comput Math Methods Med
.
2018
;
2018
:
2564531
.

82.

Yang
JJ
,
Li
J
,
Williams
LK
,
Buu
A
.
An efficient genome-wide association test for multivariate phenotypes based on the Fisher combination function
.
BMC Bioinformatics
.
2016
;
17
(
1
):
19
.

83.

Lal
L
,
Bharti
A
,
Perween
A.
To study the status of LH: FSH ratio in obese and non-obese patients of polycystic ovarian syndrome
.
J Dent Med Sci
.
2017
;
16
(
1
):
20
23
.

84.

Chan
HP
,
Zhang
NR
,
Chen
LH
.
Importance sampling of word patterns in DNA and protein sequences
.
J Comput Biol
.
2010
;
17
(
12
):
1697
1709
.

85.

Macintyre
G
,
Bailey
J
,
Haviv
I
,
Kowalczyk
A
.
is-rSNP: a novel technique for in silico regulatory SNP detection
.
Bioinformatics
.
2010
;
26
(
18
):
i524
i530
.

86.

Zuo
C
,
Shin
S
,
Keleş
S
.
atSNP: transcription factor binding affinity testing for regulatory SNP detection
.
Bioinformatics
.
2015
;
31
(
20
):
3353
3355
.

87.

Benjamini
Y
,
Hochberg
Y
.
Controlling the false discovery rate—a practical and powerful approach to multiple testing
.
J R Stat Soc B
.
1995
;
57
(
1
):
289
300
.

88.

GTEx Consortium
.
The Genotype-Tissue Expression (GTEx) project
.
Nat Genet
.
2013
;
45
(
6
):
580
585
.

89.

Giudice
G
,
Sánchez-Cabo
F
,
Torroja
C
,
Lara-Pezzi
E.
ATtRACT—a database of RNA-binding proteins and associated motifs. Database (Oxford). 2016;2016;baw035
.

90.

Gao
J
,
Xue
JD
,
Li
ZC
,
Zhou
L
,
Chen
C
.
The association of DENND1A gene polymorphisms and polycystic ovary syndrome risk: a systematic review and meta-analysis
.
Arch Gynecol Obstet
.
2016
;
294
(
5
):
1073
1080
.

91.

Pollard
KS
,
Hubisz
MJ
,
Rosenbloom
KR
,
Siepel
A
.
Detection of nonneutral substitution rates on mammalian phylogenies
.
Genome Res
.
2010
;
20
(
1
):
110
121
.

92.

Battle
A
,
Brown
CD
,
Engelhardt
BE
,
Montgomery
SB;
GTEx Consortium; Laboratory, Data Analysis and Coordinating Center (LDACC)—Analysis Working Group
;
Statistical Methods groups—Analysis Working Group
;
Enhancing GTEx (eGTEx) groups
;
NIH Common Fund
;
NIH/NCI
;
NIH/NHGRI
;
NIH/NIMH
;
NIH/NIDA
;
Biospecimen Collection Source Site—NDRI
;
Biospecimen Collection Source Site—RPCI
;
Biospecimen Core Resource—VARI
;
Brain Bank Repository—University of Miami Brain Endowment Bank
;
Leidos Biomedical—Project Management
;
ELSI Study
;
Genome Browser Data Integration andVisualization—EBI
;
Genome Browser Data Integration and Visualization—UCSC Genomics Institute, University of California Santa Cruz
;
Lead analysts
;
Laboratory, Data Analysis and Coordinating Center (LDACC)
;
NIH program management
;
Biospecimen collection
;
Pathologye
;
QTL manuscript working group
.
Genetic effects on gene expression across human tissues [published correction appears in Nature. 2018;553(7689):530]
.
Nature
.
2017
;
550
(
7675
):
204
213
.

93.

Auton
A
,
Brooks
LD
,
Durbin
RM
,
Garrison
EP
,
Kang
HM
,
Korbel
JO
,
Marchini
JL
,
McCarthy
S
,
McVean
GA
,
Abecasis
GR
;
1000 Genomes Project Consortium
.
A global reference for human genetic variation
.
Nature
.
2015
;
526
(
7571
):
68
74
.

94.

Glusman
G
,
Caballero
J
,
Mauldin
DE
,
Hood
L
,
Roach
JC
.
Kaviar: an accessible system for testing SNV novelty
.
Bioinformatics
.
2011
;
27
(
22
):
3216
3217
.

95.

Welt
CK
,
Styrkarsdottir
U
,
Ehrmann
DA
,
Thorleifsson
G
,
Arason
G
,
Gudmundsson
JA
,
Ober
C
,
Rosenfield
RL
,
Saxena
R
,
Thorsteinsdottir
U
,
Crowley
WF
,
Stefansson
K
.
Variants in DENND1A are associated with polycystic ovary syndrome in women of European ancestry
.
J Clin Endocrinol Metab
.
2012
;
97
(
7
):
E1342
E1347
.

96.

McAllister
JM
,
Modi
B
,
Miller
BA
,
Biegler
J
,
Bruggeman
R
,
Legro
RS
,
Strauss
JF
III
.
Overexpression of a DENND1A isoform produces a polycystic ovary syndrome theca phenotype
.
Proc Natl Acad Sci USA
.
2014
;
111
(
15
):
E1519
E1527
.

97.

Torchen
LC
,
Idkowiak
J
,
Fogel
NR
,
O’Neil
DM
,
Shackleton
CH
,
Arlt
W
,
Dunaif
A
.
Evidence for increased 5α-reductase activity during early childhood in daughters of women with polycystic ovary syndrome
.
J Clin Endocrinol Metab
.
2016
;
101
(
5
):
2069
2075
.

98.

McCartney
CR
,
Eagleson
CA
,
Marshall
JC
.
Regulation of gonadotropin secretion: implications for polycystic ovary syndrome
.
Semin Reprod Med
.
2002
;
20
(
4
):
317
326
.

99.

Eriksen
MB
,
Nielsen
MF
,
Brusgaard
K
,
Tan
Q
,
Andersen
MS
,
Glintborg
D
,
Gaster
M
.
Genetic alterations within the DENND1A gene in patients with polycystic ovary syndrome (PCOS)
.
PLoS One
.
2013
;
8
(
9
):
e77186
.

100.

Khan
MJ
,
Nazli
R
,
Ahmed
J
,
Basit
S
.
Whole genome sequencing instead of whole exome sequencing is required to identify the genetic causes of polycystic ovary syndrome in Pakistani families
.
Pak J Med Sci
.
2018
;
34
(
3
):
540
545
.

101.

Khalaf
M
,
Morera
J
,
Bourret
A
,
Reznik
Y
,
Denoual
C
,
Herlicoviez
M
,
Mittre
H
,
Benhaim
A
.
BMP system expression in GCs from polycystic ovary syndrome women and the in vitro effects of BMP4, BMP6, and BMP7 on GC steroidogenesis
.
Eur J Endocrinol
.
2013
;
168
(
3
):
437
444
.

102.

Shi
J
,
Yoshino
O
,
Osuga
Y
,
Koga
K
,
Hirota
Y
,
Hirata
T
,
Yano
T
,
Nishii
O
,
Taketani
Y
.
Bone morphogenetic protein-6 stimulates gene expression of follicle-stimulating hormone receptor, inhibin/activin β subunits, and anti-Müllerian hormone in human granulosa cells
.
Fertil Steril
.
2009
;
92
(
5
):
1794
1798
.

103.

Di Zazzo
E
,
De Rosa
C
,
Abbondanza
C
,
Moncharmont
B
.
PRDM proteins: molecular mechanisms in signal transduction and transcriptional regulation
.
Biology (Basel)
.
2013
;
2
(
1
):
107
141
.

104.

Carling
T
,
Kim
KC
,
Yang
XH
,
Gu
J
,
Zhang
XK
,
Huang
S
.
A histone methyltransferase is required for maximal response to female sex hormones
.
Mol Cell Biol
.
2004
;
24
(
16
):
7032
7042
.

105.

Liu
L
,
Shao
G
,
Steele-Perkins
G
,
Huang
S
.
The retinoblastoma interacting zinc finger gene RIZ produces a PR domain-lacking product through an internal promoter
.
J Biol Chem
.
1997
;
272
(
5
):
2984
2991
.

106.

Andreu-Vieyra
C
,
Chen
R
,
Matzuk
MM
.
Conditional deletion of the retinoblastoma (Rb) gene in ovarian granulosa cells leads to premature ovarian failure
.
Mol Endocrinol
.
2008
;
22
(
9
):
2141
2161
.

107.

Yang
QE
,
Nagaoka
SI
,
Gwost
I
,
Hunt
PA
,
Oatley
JM
.
Inactivation of retinoblastoma protein (Rb1) in the oocyte: evidence that dysregulated follicle growth drives ovarian teratoma formation in mice
.
PLoS Genet
.
2015
;
11
(
7
):
e1005355
.

108.

Wray
NR
,
Wijmenga
C
,
Sullivan
PF
,
Yang
J
,
Visscher
PM
.
Common disease is more complex than implied by the core gene omnigenic model
.
Cell
.
2018
;
173
(
7
):
1573
1580
.

109.

Boyle
EA
,
Li
YI
,
Pritchard
JK
.
An expanded view of complex traits: from polygenic to omnigenic
.
Cell
.
2017
;
169
(
7
):
1177
1186
.

110.

Ma
M
,
Ru
Y
,
Chuang
LS
,
Hsu
NY
,
Shi
LS
,
Hakenberg
J
,
Cheng
WY
,
Uzilov
A
,
Ding
W
,
Glicksberg
BS
,
Chen
R
.
Disease-associated variants in different categories of disease located in distinct regulatory elements
.
BMC Genomics
.
2015
;
16
(
Suppl 8
):
S3
.

111.

Igartua
C
,
Mozaffari
SV
,
Nicolae
DL
,
Ober
C
.
Rare non-coding variants are associated with plasma lipid traits in a founder population
.
Sci Rep
.
2017
;
7
(
1
):
16415
.

112.

Zhao
J
,
Akinsanmi
I
,
Arafat
D
,
Cradick
TJ
,
Lee
CM
,
Banskota
S
,
Marigorta
UM
,
Bao
G
,
Gibson
G
.
A burden of rare variants associated with extremes of gene expression in human peripheral blood
.
Am J Hum Genet
.
2016
;
98
(
2
):
299
309
.

113.

Li
X
,
Battle
A
,
Karczewski
KJ
,
Zappala
Z
,
Knowles
DA
,
Smith
KS
,
Kukurba
KR
,
Wu
E
,
Simon
N
,
Montgomery
SB
.
Transcriptome sequencing of a large human family identifies the impact of rare noncoding variants
.
Am J Hum Genet
.
2014
;
95
(
3
):
245
256
.

114.

Ament
SA
,
Szelinger
S
,
Glusman
G
,
Ashworth
J
,
Hou
L
,
Akula
N
,
Shekhtman
T
,
Badner
JA
,
Brunkow
ME
,
Mauldin
DE
,
Stittrich
AB
,
Rouleau
K
,
Detera-Wadleigh
SD
,
Nurnberger
JI
Jr
,
Edenberg
HJ
,
Gershon
ES
,
Schork
N
,
Price
ND
,
Gelinas
R
,
Hood
L
,
Craig
D
,
McMahon
FJ
,
Kelsoe
JR
,
Roach
JC
;
Bipolar Genome Study
.
Rare variants in neuronal excitability genes influence risk for bipolar disorder
.
Proc Natl Acad Sci USA
.
2015
;
112
(
11
):
3576
3581
.

115.

Liu
Q
,
Chen
C
,
Gao
A
,
Tong
HH
,
Xie
L
;
Alzheimer’s Disease Neuroimaging Initiative
.
VariFunNet, an integrated multiscale modeling framework to study the effects of rare non-coding variants in genome-wide association studies: applied to Alzheimer’s Disease
.
Proceedings IEEE Int Conf Bioinformatics Biomed
.
2017
;
2017
:
2177
2182
.

116.

Zhou
L
,
Zhao
F
.
Prioritization and functional assessment of noncoding variants associated with complex diseases
.
Genome Med
.
2018
;
10
(
1
):
53
.

117.

Cirulli
ET
,
Goldstein
DB
.
Uncovering the roles of rare variants in common disease through whole-genome sequencing
.
Nat Rev Genet
.
2010
;
11
(
6
):
415
425
.

118.

Wray
NR
,
Gratten
J
.
Sizing up whole-genome sequencing studies of common diseases
.
Nat Genet
.
2018
;
50
(
5
):
635
637
.

119.

Visscher
PM
,
Wray
NR
,
Zhang
Q
,
Sklar
P
,
McCarthy
MI
,
Brown
MA
,
Yang
J
.
10 Years of GWAS discovery: biology, function, and translation
.
Am J Hum Genet
.
2017
;
101
(
1
):
5
22
.

120.

Bourgon
R
,
Gentleman
R
,
Huber
W
.
Independent filtering increases detection power for high-throughput experiments
.
Proc Natl Acad Sci USA
.
2010
;
107
(
21
):
9546
9551
.

121.

Kim
S
,
Schliekelman
P
.
Prioritizing hypothesis tests for high throughput data
.
Bioinformatics
.
2016
;
32
(
6
):
850
858
.

Author notes

A.D. and M.G.H. jointly supervised this work.