Abstract

Yellowstone National Park is home to the only plains bison population that has continually existed as wildlife, on the same landscape, through the population bottleneck of the late 19th century. Nevertheless, by the early 1900s, only 23 wild bison were known to have survived poaching. Salvation efforts included the addition of 18 females from Montana and 3 bulls from Texas to augment this population. A century later, nuclear microsatellite-based population-level assessment revealed two genetically distinct bison subpopulations. However, in 2016, an analysis of mitochondrial haplotypes showed the two founding lineages were distributed throughout the park. This study is designed to delineate any current substructure in the Yellowstone bison population by strategically sampling the two major summer breeding herds and the two major winter ranges. Population-level metrics were derived using the same microsatellite loci as the original study along with a newly developed set of highly informative bison-specific single nucleotide polymorphisms. Our analyses reveal that the modern bison in Yellowstone National Park currently consists of one interbreeding population, composed of two subunits.

Introduction

Yellowstone National Park is home to arguably the most iconic bison population. Since its establishment in 1872 as the first National Park in the United States and the world, significant and unwavering efforts have been dedicated to the preservation of the bison population residing there. Not only has this population played a part in bison becoming a national icon but they are also a true testament to the resilience, hope, and strength bison symbolize. Yellowstone bison are the largest free-roaming bison population in the United States and the only plains bison to have continually existed in the wild and on same the landscape since prehistoric times.

The history of this population is a tale of near tragedy turned conservation success story. During the population bottleneck of the late 19th century, the Greater Yellowstone Area bison population narrowly avoided extinction, with an estimated 23 individuals remaining in 1902 (American Bison Society 1908). To preserve this population, additional bison were brought in from private ranches. Eighteen females from the Pablo-Allard herd in Montana, three males from the Goodnight herd in Texas, and four calves from the indigenous herd were used to establish a secondary “introduced” population (Meagher, 1973; Coder 1975). The introduced herd was moved to the Lamar Valley in 1907 and closely day herded or corralled through at least 1915 (Meagher 1973). Meanwhile the indigenous herd was isolated, wintering in the Pelican Valley and summering in the high-elevation grasslands of the Upper Lamar River.

The subsequent recovery of Yellowstone bison is well documented (Meagher 1973; White et al. 2015). Intermingling of the native and introduced herds increased after 1921, when managers fenced the introduced herd out of the Lamar Valley and moved them to areas used by the indigenous herd during summer. Managers also preferentially removed adult males from the introduced herd to increase the indigenous lineage. The annual report of Superintendent Toll in 1929 indicated “[t]here seems to be a gradual intermingling of the wild [Pelican] and tame [Lamar] herd. It has reached a point where it is difficult to distinguish the buffalo of the wild herd from those of the Lamar Valley herd” (Toll 1929). Therefore, it is likely the bison formed a single “Northern Herd” by the 1930s summering together and separating into two wintering units called the “Lamar” and “Pelican” bison (Meagher 1973). In 1936, managers relocated 71 bison from the Lamar bison to the Firehole and Hayden Valleys (Skinner and Alcorn 1942). The animals formed the “Mary Mountain Bison” or “Central Herd.” Population reductions and subsequent recovery likely kept the Northern and Central Herds separated through the 1970s. Movements between the herds were believed to increase through the 1980s when Northern Herd animals wintering in the Pelican Valley began moving to the Hayden Valley and integrating into the Central Herd (Meagher 1993, 1998). By the 1990s, the animals from the Central Herd began moving to wintering areas of the Northern Herd (Meagher 1989, 1993, 1998). Today, all Yellowstone bison roam relatively freely within Yellowstone National Park, with limitations on their distribution into areas outside the park boundaries. During the past decade, the population has fluctuated between 4,000 and 6,000 animals. Modern GPS technology confirms that bison congregate into two main herds (Northern, Central) during the breeding season, some animals from each herd share wintering areas, and some individuals switch breeding areas over time.

In a study by Halbert et al. (2012a), evaluating 661 Yellowstone bison sampled between 1997 and 2003, Yellowstone bison appeared to split into two genetically distinct subpopulations defined by microsatellite genotype diversity and allelic distributions. At that time, a concern raised by Halbert et al. (2012a) involved unequal culling across these two bison populations and the impact this could have on long-term genetic diversity. Subsequently, White and Wallen (2012) suggested “managers should be promoting the conservation of wildness and natural selection to retain adaptive capabilities, rather than preconceived notions of ‘natural’ genetic or population structures that were likely created or exacerbated by human actions.” As this discussion continued, (Halbert et al. 2012a) pointed out that because all modern bison populations are due to anthropogenic activities, it may be best to error on the side of caution and focus Yellowstone bison management to preserve genetic diversity in its, at the time, current state (Halbert et al. 2012a) . Although no definitive conclusion was reached at the time, it was agreed that there should be continued monitoring of genetic variation in Yellowstone bison and use the best available scientific data as a foundation for future bison management.

Following the study by Halbert et al. (2012a), a mitochondrial genome study was conducted to analyze haplotype diversity among Yellowstone bison (Forgacs et al. 2016). In this study, 25 mitochondrial genomes of Yellowstone bison were evaluated and compared to an additional 20 bison from diverse populations. This study revealed two distinct mitochondrial haplotype clades found within Yellowstone and the overall population of bison. This evidence was consistent with historical documentation of a second maternal lineage brought in from the Pablo-Allard herd in the early 1900s (Meagher 1973; Coder 1975). Unlike the previous study (Halbert et al. 2012b), there was no evidence, based on mtDNA haplotypes, to support population subdivision among the Yellowstone bison. In fact, this study was able to identify that bison of both mitochondrial haplotype clades, and therefore maternal lineages, were present in both present-day breeding areas (Forgacs et al. 2016).

However, both studies had limitations to their scope. Namely, neither study included Yellowstone bison sampled from both breeding populations during the same breeding season. In the study conducted by Halbert et al. (2012a), although a portion of the samples were collected within the park boundaries, over 90% of the samples were collected during the winter migration when bison were leaving the park. Considering the northern and central groups have an overlapping winter range in the Gardiner Basin, it was not possible to confidently assign home ranges without associated individual movement data such as radio telemetry (Meagher 1989; Halbert et al. 2012b). Therefore, no definitive conclusions could be drawn as to whether the identified genetic population substructure extended to the breeding groups. In the work of Forgacs et al. (2016), the mitochondrial DNA only portrays a portion of the genetic story. Although able to assign bison to maternal lineages and compare life histories, it failed to characterize levels of admixture within these individuals.

The current study was designed to evaluate population substructure in Yellowstone bison by strategically sampling the two major summer breeding populations, where one would expect to see genetic differentiation in ongoing population subdivision, and the two major winter ranges, where migration patterns of both breeding herds can overlap. Metrics used to evaluate population dynamics were determined using a set of 24 microsatellite loci (Halbert 2003), previously characterized in Yellowstone bison (Halbert et al. 2012), as well as a newly developed set of highly informative bison-specific single nucleotide polymorphisms (SNPs) (Stroupe and Derr 2024). Identifying current structure of the Yellowstone bison population is important to understanding the history of this iconic population, as well as developing strategies for its future conservation.

Methods

Biological material

Tissue biopsy samples were collected from 282 Yellowstone National Park bison. These samples were collected during the summer of 2019 and winter of 2021 in two major ranges within the park, Central and Northern (Fig. 1). The summer collection included 154 samples, 62 from the Central breeding area and 92 from the Northern breeding area. The winter collection included 128 samples, 44 from the Central wintering area and 84 from the Northern wintering area. Within each geographic area, Northern or Central, bison groups were opportunistically located during sample collection periods with up to 30% of the bison within a group randomly sampled. Tissue samples were collected using a Type P Pneu-Dart Biopsy RDD. Tissue samples were extracted from the dart using forceps cleaned in ethanol. Tissue samples were suspended in an ethanol solution and refrigerated prior to lab analysis. Biopsy tissue samples were collected according to the NPS IACUC IMR_YELL_White_Ungulates_2022.A3.

Map of sample collection sites.
Fig. 1.

Map of sample collection sites.

Tissue samples were cut into smaller portions in an isolation hood and divided for microsatellite or SNP genotyping. Of the 282 samples, eight were identified as duplicates with SNP data using Sequoia v2.5.3 (Huisman 2017) and confirmed with microsatellite data. Duplicates were removed from final data analysis.

Microsatellites

DNA was extracted from a portion of each biopsy tissue sample using the Gentra Puregene kit (Qiagen) according to manufacture protocols. A set of 24 autosomal microsatellite markers were used in this study including the core set of loci used for parentage determination (Supplementary Table 1) (Schnabel et al. 2000; Halbert 2003). Microsatellite samples were genotyped according to established lab protocol (Halbert 2003) using an ABI 3730 Genetic Analyzer and STRand software (Toonen and Hughes 2001; https://vgl.ucdavis.edu/STRand). Three samples had a call rate below 90% and were removed from further analysis resulting in a final microsatellite data set of 271 samples and 24 microsatellite markers. Of those, 148 samples were collected during the summer breeding season.

Cervus v3.0.7 (Kalinowski et al. 2007) was used for initial overview of population genetic diversity and formatting data set for downstream analysis. A series of principal component analysis (PCA) were used to establish patterns in the data using collection location and time of year to group samples. Ade4 v1.7-22 (Dray and Dufour 2007) was used to calculate eigenvalues and eigenvectors on the final data set and plotted with 95% confidence level ellipses using ggplot (Wickham 2016) in R v4.1.2 (R Core Team 2022). Observed heterozygosity (HO), expected heterozygosity within populations (HS), allelic richness (AR), FST, and FIS with 95% confidence intervals were calculated with the Hierfstat R package (Goudet 2005). FST was calculated using the Weir and Cockerham (1984) equation. Values calculated per individual or per loci were averaged across each population. These analyses were carried out in R v4.1.2 (R Core Team 2022).

Population parameters were evaluated according to the following methods. Evidence of population structure was assessed using STRUCTURE (Pritchard et al. 2000). The data set was evaluated across 20 iterations at each K from 1 to 6. Notable changes in default program setting include a burn-in period of 40,000 replicates and 80,000 Markov chain Monte Carlo replicates. Structure Harvester was used to compile and format the resulting data (Earl and vonHoldt 2012). Results were visualized with all iterations at each K aligned and merged using the R package pophelper v2.3.1 (Francis 2017).

Single nucleotide polymorphisms

Portions of the biopsy tissue samples were sent directly to NeoGen Canada for processing, DNA extraction, and genotyping. Samples were genotyped according to standard Illumnia Infinium HD Ultra Assay protocol guideline on the GGP Equine-Bison chip (NeoGen). Initial data quality control was done by NeoGen using Illumina’s GenomeStudio with a call cutoff of 0.95. Details regarding the development of the SNP panel used in this study are described in Stroupe and Derr (2024). Briefly, a larger set of SNPs was filtered to a panel of 798 autosomal and 13 mitochondrial SNPs that was then evaluated in 995 bison across ten populations. This panel was found to be highly informative for individual identification and population distinction.

The Illumnia SNP Chip final report data was converted to plink lgen format using the script “illumina_to_lgen.R,” originally written by Ryan Schubert (github@RyanSchu), with adjustments made to include metadata specific to our data set. In addition, a plink fam file was created from the sample list. Data were then converted into VCF format using Plink v1.9 (Purcell et al. 2007). Nuclear and mitochondrial SNP genotypes were analyzed separately in downstream analysis.

The nuclear SNP data set was further thinned to remove monomorphic SNPs and SNPs with a genotyping call rate below 90% among the Yellowstone samples, using VCFtools v0.1.16 (Danecek et al. 2011). Two samples had a genotyping call rate below 90% and were therefore removed resulting in a final SNP data set of 272 samples and 725 SNPs. Of those, 151 samples were collected during the summer breeding season.

A series of PCA were used to establish patterns in the data. Plink v1.9 (Purcell et al. 2007; Chang et al. 2015) was used to calculate eigenvalues and eigenvectors on the final data set and plotted with 95% confidence level ellipses using ggplot (Wickham 2016) in R v4.1.2 (R Core Team 2022). Observed heterozygosity (HO), expected heterozygosity within populations (HS), allelic richness (AR), FST, and FIS with 95% confidence intervals were calculated with the Hierfstat R package (Goudet 2005). FST was calculated using the Weir and Cockerham (1984) equation. Values calculated per individual or per loci were averaged across each population. These analyses were carried out in R v4.1.2 (R Core Team 2022).

fastStructure 1.0 (Raj et al. 2014) was then used to determine population structure among the samples with values of K = 1 to K = 6 and visualized using the R package pophelper v2.3.1 (Francis 2017). fastStructure chooseK.py was used to test for the most likely number of subpopulations.

Mitochondrial haplotype assignments were made according to Stroupe and Derr (2024). Briefly, mitochondrial SNPs originally identified by Forgacs et al. (2016) were used to assign interspecific haplotypes (bison or domestic cattle) and intraspecific clade haplotypes (Clade I or Clade II).

Results

Biopsy tissue samples were collected from 282 bison from Yellowstone National Park during the summer breeding season of 2019 and the winter of 2021. Collection location was used to group bison into either the Central or Northern Herd according to previous studies, migration patterns, population history, and observations (Meagher 1973, 1989; Halbert et al. 2012; Geremia et al. 2014) (Fig. 1). Collection time of year was used to distinguish samples during summer breeding season and winter migration. Of these samples, eight were identified as duplicates and removed from final analyses. All duplicate bison were sampled on the same range; however, two bison were sampled during both the summer and winter on the same range. In those cases, the sample collected during the summer breeding season was kept and the winter sample was removed from the data set. Additionally, two samples from the SNP data and three samples from the microsatellite data were removed due to a genotyping call rate below 90%. The summer breeding season samples included 151 bison in the SNP data set and 148 bison in microsatellite data set.

Samples were genotyped at 24 microsatellites (Schnabel et al. 2000; Halbert 2003), 798 autosomal SNPs, and 13 mitochondrial SNPs (Stroupe and Derr 2024). Microsatellite marker frequencies for individual loci can be found in Supplementary Table 1. The autosomal SNP data were further filtered to remove monomorphic variants and variants with a genotyping call rate below 90% resulting in a final data set of 725 autosomal SNPs. All microsatellites and mitochondrial SNPs passed the 90% call rate threshold. Genotyping call rates for the final data sets were 99.94% for microsatellites and 99.67% for SNPs.

Evidence of population structure and differentiation was evaluated using PCA, admixture analysis, and calculations of FST (Figs 2 and 3, Table 1). In the PCA of only samples collected during the summer breeding season, there is not a definitive separation between the breeding herds (Fig. 2). In the PCA based on 725 SNPs, the central herd seems to only contain a subset of the genetic diversity found within Yellowstone, whereas the northern herd has a wider distribution (Fig. 2a). In the PCA based on 24 microsatellites, there is more overlap between the two breeding herds sample distribution (Fig. 2b). The inclusion of samples collected during the winter migration did not substantially change the distribution of samples in the PCA for either herd (Supplementary Fig. 1). In fact, there are minimal differences between the PCAs of summer samples only and all samples besides the density of sample points. Additionally, admixture analyses, fastStructure, with 725 SNPs, and STRUCTURE, with 24 microsatellites, did not separate the geographically defined herds into genetically distinct subpopulations (Fig. 3, Supplementary Fig. 2). In the fastStructure analysis ran from K = 1 to K = 6 for only the samples collected during the breeding season, both the model complexity that maximizes marginal likelihood and model components used to explain the data structure were revealed to be equal, meaning the best fit for this data is one population (Fig. 3a). This was the same when including winter samples (Supplementary Fig. 2a). In the microsatellite-based STRUCTURE analysis, there was again no distinction between the two summer breeding herds (Fig. 3b). Evaluation of multiple runs revealed the best fit as K = 3 according to the delta K (Evanno et al. 2005), meaning there is evidence for three genetically defined clusters. However, this did not extend to geographically defined breeding herds. In fact, all samples seem to have an even representation of all clusters at all instances of K. Inclusion of samples collected during the winter migration was consistent with the summer only analysis in admixture of individuals; however, the best fit was K = 2 instead (Supplementary Fig. 2b).

Table 1.

Measures of population differentiation (FST) between each grouping of Yellowstone bison defined by sample collection location and time of year. Estimates of FST are based on 725 SNPs or 24 microsatellite markers.

SNPs
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00070.00490.0018
Central Herd—Winter−0.00070.00500.0011
Northern Herd—Summer0.00490.00500.0013
Northern Herd—Winter0.00180.00110.0013
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0019,0.0004][0.0039, 0.0058][0.0004, 0.0028]
Central Herd—Winter[−0.0019,0.0004][0.0037, 0.0067][−0.0001, 0.0020]
Northern Herd—Summer[0.0039, 0.0058][0.0037, 0.0067][0.0005, 0.0020]
Northern Herd—Winter[0.0004, 0.0028][−0.0001, 0.0020][0.0005, 0.0020]
Microsatellites
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00430.00440.0031
Central Herd—Winter−0.00430.00080.0000
Northern Herd—Summer0.00440.00080.0007
Northern Herd—Winter0.00310.00000.0007
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0059, −0.0015][0.0017, 0.0077][−0.003, 0.0075]
Central Herd—Winter[−0.0059, −0.0015][−0.0014, 0.0031][−0.0036, 0.0043]
Northern Herd—Summer[0.0017, 0.0077][−0.0014, 0.0031][−0.0005, 0.0022]
Northern Herd—Winter[−0.003, 0.0075][−0.0036, 0.0043][−0.0005, 0.0022]
SNPs
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00070.00490.0018
Central Herd—Winter−0.00070.00500.0011
Northern Herd—Summer0.00490.00500.0013
Northern Herd—Winter0.00180.00110.0013
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0019,0.0004][0.0039, 0.0058][0.0004, 0.0028]
Central Herd—Winter[−0.0019,0.0004][0.0037, 0.0067][−0.0001, 0.0020]
Northern Herd—Summer[0.0039, 0.0058][0.0037, 0.0067][0.0005, 0.0020]
Northern Herd—Winter[0.0004, 0.0028][−0.0001, 0.0020][0.0005, 0.0020]
Microsatellites
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00430.00440.0031
Central Herd—Winter−0.00430.00080.0000
Northern Herd—Summer0.00440.00080.0007
Northern Herd—Winter0.00310.00000.0007
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0059, −0.0015][0.0017, 0.0077][−0.003, 0.0075]
Central Herd—Winter[−0.0059, −0.0015][−0.0014, 0.0031][−0.0036, 0.0043]
Northern Herd—Summer[0.0017, 0.0077][−0.0014, 0.0031][−0.0005, 0.0022]
Northern Herd—Winter[−0.003, 0.0075][−0.0036, 0.0043][−0.0005, 0.0022]
Table 1.

Measures of population differentiation (FST) between each grouping of Yellowstone bison defined by sample collection location and time of year. Estimates of FST are based on 725 SNPs or 24 microsatellite markers.

SNPs
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00070.00490.0018
Central Herd—Winter−0.00070.00500.0011
Northern Herd—Summer0.00490.00500.0013
Northern Herd—Winter0.00180.00110.0013
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0019,0.0004][0.0039, 0.0058][0.0004, 0.0028]
Central Herd—Winter[−0.0019,0.0004][0.0037, 0.0067][−0.0001, 0.0020]
Northern Herd—Summer[0.0039, 0.0058][0.0037, 0.0067][0.0005, 0.0020]
Northern Herd—Winter[0.0004, 0.0028][−0.0001, 0.0020][0.0005, 0.0020]
Microsatellites
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00430.00440.0031
Central Herd—Winter−0.00430.00080.0000
Northern Herd—Summer0.00440.00080.0007
Northern Herd—Winter0.00310.00000.0007
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0059, −0.0015][0.0017, 0.0077][−0.003, 0.0075]
Central Herd—Winter[−0.0059, −0.0015][−0.0014, 0.0031][−0.0036, 0.0043]
Northern Herd—Summer[0.0017, 0.0077][−0.0014, 0.0031][−0.0005, 0.0022]
Northern Herd—Winter[−0.003, 0.0075][−0.0036, 0.0043][−0.0005, 0.0022]
SNPs
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00070.00490.0018
Central Herd—Winter−0.00070.00500.0011
Northern Herd—Summer0.00490.00500.0013
Northern Herd—Winter0.00180.00110.0013
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0019,0.0004][0.0039, 0.0058][0.0004, 0.0028]
Central Herd—Winter[−0.0019,0.0004][0.0037, 0.0067][−0.0001, 0.0020]
Northern Herd—Summer[0.0039, 0.0058][0.0037, 0.0067][0.0005, 0.0020]
Northern Herd—Winter[0.0004, 0.0028][−0.0001, 0.0020][0.0005, 0.0020]
Microsatellites
FSTCentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer−0.00430.00440.0031
Central Herd—Winter−0.00430.00080.0000
Northern Herd—Summer0.00440.00080.0007
Northern Herd—Winter0.00310.00000.0007
FST 95% CICentral Herd—SummerCentral Herd—WinterNorthern Herd—SummerNorthern Herd—Winter
Central Herd—Summer[−0.0059, −0.0015][0.0017, 0.0077][−0.003, 0.0075]
Central Herd—Winter[−0.0059, −0.0015][−0.0014, 0.0031][−0.0036, 0.0043]
Northern Herd—Summer[0.0017, 0.0077][−0.0014, 0.0031][−0.0005, 0.0022]
Northern Herd—Winter[−0.003, 0.0075][−0.0036, 0.0043][−0.0005, 0.0022]
Principal component analysis (PCA) plot of the summer breeding herds of Yellowstone bison using 725 SNPs (a) or 24 microsatellite markers (b). Breeding herds are identified by color with ellipses of 95% confidence intervals.
Fig. 2.

Principal component analysis (PCA) plot of the summer breeding herds of Yellowstone bison using 725 SNPs (a) or 24 microsatellite markers (b). Breeding herds are identified by color with ellipses of 95% confidence intervals.

Clustering plots of the Yellowstone bison breeding herds for K = 1 to K = 6 using fastStructure with 725 SNPs (a) or Structure with 24 microsatellite markers (b). Each vertical line represents an individual sample and horizontal bars represent a run for each K. Plots were generated using pophelper v2.3.1 (Francis 2017).
Fig. 3.

Clustering plots of the Yellowstone bison breeding herds for K = 1 to K = 6 using fastStructure with 725 SNPs (a) or Structure with 24 microsatellite markers (b). Each vertical line represents an individual sample and horizontal bars represent a run for each K. Plots were generated using pophelper v2.3.1 (Francis 2017).

Estimates of FST were used to measure genetic differentiation among each collection group using both the SNP and microsatellite data sets (Table 1). All FST estimates were less than or equal to 0.0050, with the confidence intervals for most comparisons overlapping and including zero. The central herd summer and winter samples had the lowest measures of differentiation in both the SNP and microsatellite comparisons, FST = −0.007 and −0.0043 respectively. In the SNP comparison, the highest level of differentiation was between the central winter and northern summer samples (FST = 0.0050) though the 95% confidence interval overlapped with the central and northern summer comparison (FST = 0.0049). However, in the microsatellite comparison, the highest differentiation was between the central and northern summer samples (FST = 0.0044) though the 95% confidence interval overlapped with all comparisons except for the central herd summer and winter comparison.

To characterize the genetic diversity of each collection group (central herd—summer, central herd—winter, northern herd—summer, and northern herd—winter) estimates of mean observed heterozygosity (HO), mean gene diversity within populations (HS), mean FIS, and mean allelic richness (AR) were calculated across each group for 725 autosomal SNPs and 24 microsatellites (Supplementary Table 2). Estimates of genetic diversity had a larger range when calculated with the microsatellite data set compared to the SNP data set. In the SNP data set, the central herd had on average higher observed heterozygosity than the northern herd. Both central groups were above the overall mean (HO = 0.4239), whereas both northern groups were below the overall mean. Though in the microsatellite data set, the central summer was above the overall mean (HO = 0.5906) and the other groups were below the overall mean with the central winter group estimate the lowest. All 95% confidence level range estimates of mean FIS calculations overlapped with each other and the overall mean 95% confidence interval in both data sets.

All Yellowstone animals had bison-derived mitochondrial haplotypes based on ten interspecific mtDNA SNPs (Ward et al. 1999; Forgacs et al. 2016; Forgacs 2019). No sample had evidence of domestic cattle mitochondrial DNA. Additionally, three SNPs distinguished the major intraspecific mitochondrial clades found in bison (Forgacs et al. 2016). All 272 samples in the final SNP data set were assigned a mitochondrial haplotype based on the criteria outlined in Stroupe and Derr (2024). Of the two bison-specific mitochondrial haplotype clades, Clade II had the highest frequency of 65% in overall assignment (Fig. 4). Within each group, Clade II was also the predominant mitochondrial haplotype. However, the central herd winter samples had the highest frequency of Clade II followed by central summer, northern winter, and northern summer with frequencies of 78%, 70%, 65%, and 57%, respectively.

Percentages of samples that were assigned to each mitochondrial haplotype, Clade I or II, for each grouping of Yellowstone bison, as defined by collection location and time of year. Bison mitochondrial clades are described by Forgacs et al. (2016).
Fig. 4.

Percentages of samples that were assigned to each mitochondrial haplotype, Clade I or II, for each grouping of Yellowstone bison, as defined by collection location and time of year. Bison mitochondrial clades are described by Forgacs et al. (2016).

Discussion

No evidence supports the hypothesis that the bison in Yellowstone National Park are currently composed of genetically distinct and independently breeding subpopulations. The presented genetic analyses using both microsatellite and SNP markers did not reveal substantial differentiation between bison sampled in the northern and central ranges during the summer breeding season. Moreover, analyses showed clear support for considering the bison in Yellowstone as one interbreeding population, composed of two subunits. However, there is undeniable evidence of multiple genetic lineages contributing to the current genetic diversity as can be seen from historic documentation and the presence of multiple maternal lineages.

The sets of microsatellites and SNPs used in this study have both been used in previous population genetic studies and have revealed genetic distinctions between related populations (Halbert and Derr 2008; Stroupe and Derr 2024). Therefore, it is unlikely the lack of population subdivision is due to a lack of sensitivity in the selected genetic markers. Both previous studies revealed an observational difference in multiple measures of genetic diversity between Yellowstone National Park and other federal bison populations. Based on estimates using the same set of SNP markers, the genetic differentiation between the Yellowstone National Park breeding herds is lower than between closely related federal bison populations such as Theodore Roosevelt National Park’s North and South units (FST = 0.0049 vs 0.0884). Furthermore, the level of genetic differentiation in the Yellowstone breeding herds is more comparable to two Turner Enterprise Inc. bison populations, Snowcrest Ranch and Vermejo Park Ranch, that are derived from the same source and have been separated less than 8 yr (FST = 0.0049 vs 0.0030) (Stroupe and Derr 2024).

Although the SNP and microsatellite analyses both revealed the lack of subdivision among Yellowstone bison, there were some observed differences. PCA and admixture analysis using SNPs seemed to reveal a higher sensitivity of subtle differences. PCA of SNPs exposed the central herd as a subset of the whole, whereas the microsatellite PCA showed no observable differences in the herds. Admixture analysis of the SNPs showed variable levels of admixture among samples, whereas the microsatellite data was congruent across samples. Differences could be due to the SNP data set being more sensitive to individual signals of admixture derived from multiple genetic lineages compared to microsatellites.

Due to anthropogenic movements of bison since the population bottleneck in the late 19th century, all modern bison populations are derived from multiple historic lineages, and all have evidence of domestic cattle introgression (Coder 1975; Stroupe et al. 2022). Therefore, when engaging in tasks such as gauging population divergence and migration rates, a comprehensive understanding of a population’s historical trajectory is imperative for accurate interpretation of findings.

During the population bottleneck in the late 19th century, bison in the Greater Yellowstone Area consisted of a dwindling population of indigenous bison that had persisted since prehistoric times. However, when the population reached an estimated low of 23 individuals in 1902, genetically distinct animals were brought in from Montana (Pablo-Allard herd) and Texas (Goodnight herd) to form an introduced population in the northern range (Meagher 1973). Therefore, the bison in Yellowstone would be better described as initially two subpopulations of genetically distinct lineages that have become a single interbreeding population through gene flow, population growth, range expansion, response to environmental pressures, and migration instead of a historic divergence then convergence of a single source population (Meagher 1989, 2002; White and Wallen 2012).

Although previous studies provided evidence of genetic subdivision in Yellowstone bison, there is no evidence this persists in the modern population (Halbert et al. 2012b). No conclusions at the time could be drawn as to whether the identified genetic population substructure extended to the breeding groups due to limited location data during the summer breeding season. Additionally, two decades (1997–2003 vs 2019–2021) separated when the population was sampled and showed subdivision compared to the presented study, which did not. It is not clear if the observed differences represent a change in population dynamics over time, management actions, increase in apex predators, sampling strategy and study design, or a combination of these factors.

Our estimates of genetic differentiation between the central and northern breeding herds were much lower (Microsatellite FST = 0.0044, SNP FST = 0.0049) compared to previous estimates (FST = 0.0321) (Halbert et al. 2012b). This difference could represent a change in population structure over time or could be due to grouping samples by genetic cluster rather than geographic distribution or a combination thereof. Moreover, the previously estimated migration rate of two bison per generation between the two identified clusters is likely underestimated due to the separation of samples based on genetic clustering rather than location and lack of samples collected during the breeding season (Halbert et al. 2012b).

Throughout the years, there have been many changes in conditions and management that could alter the behavior and movement patterns of bison (White et al. 2015). Yellowstone National Park is one of the only places where bison population size and behavior freely respond to environmental changes such as predators, resource limitations, and climate. Furthermore, the Yellowstone bison population also fluctuates due to culling bison during the winter at the parks boundaries to reduce numbers and the risk of brucellosis transmission to domestic cattle (White et al. 2015; Geremia 2022). In some years, this management practice has resulted in the removal of over 1,000 animals from this population, with disproportionate removals from the central and northern herds (White et al. 2011; Geremia 2022).

In addition, we were able to evaluate maternal lineages and mitochondrial haplotype distribution within the Yellowstone bison population. Similar to the findings of Forgacs et al. (2016), mitochondrial haplotype clades I and II were distributed across the central and northern ranges in both the summer (breeding) and winter. These two mitochondrial haplotype clades likely reflect the mixed lineage history of indigenous and introduced animals that comprise the current Yellowstone National Park bison herd. Overall mitochondrial haplotypes from Clade II were found in higher frequency than Clade I haplotypes (65% for Clade II and 35% for Clade I) and Clade II was also found in higher frequency in both the Central and Northern populations in summer and winter collections (Fig. 4).

This reevaluation of the Yellowstone bison population to delineate the current population structure was able to improve upon previous studies by evaluating the two major summer breeding populations with previously used methods and new SNP-based technologies. The SNP panel has proved effective in population differentiation among closely related bison populations and estimating measures of genetic diversity (Stroupe and Derr 2024). The SNP-based evaluation of this important population provides a point-in-time observation of the Yellowstone bison. With the increased efficiency of SNP-based platforms due to large number of loci, consistency, easier automation (Anderson and Garza 2006), lower mutation rates (Amorim and Pereira 2005), reduced influence of inbreeding (Fernández et al. 2013), ease of data sharing (Forcina and Leonard 2020), and an increase in precision and power for population genetics analyses over microsatellites (Zimmerman et al. 2020), there is more opportunity for utilization in long-term management. Moreover, samples collected during the breeding season are imperative for establishing the current structure since that is when genetic exchange between populations occurs and migration patterns can overlap in the winter ranges. Though the addition of samples collected in the winter in our analyses showed similar results.

Here we present evidence, developed from multiple analyses, that indicate bison within Yellowstone National Park represent a single interbreeding population. Even though there are multiple breeding herds and clear evidence of historical bison lineages, it appears substantial gene flow is occurring throughout the population. Thus, there is no way to confidently assign individuals outside of the summer breeding season to their respective breeding herd without tracking individual movement histories. The bison at Yellowstone National Park are an important biological resource that is essential to the long-term conservation of this species. Continued genetic monitoring is imperative to track genetic diversity indices of both nuclear and mitochondrial DNA in order to maintain stewardship of this iconic bison resource.

Supplementary Material

Supplementary material can be found at http://www.jhered.oxfordjournals.org/.

Acknowledgments

Portions of this research were conducted with the advanced computing resources provided by Texas A&M High Performance Research Computing.

Funding

This work was supported by the Department of Interior, National Park Service [grant M1802772]. S.S. support was provided by a Throlson American Bison Foundation Scholarship, The Houston Safari Club, Jim Womack Endowed Fund in Animal Genomics, and the School of Veterinary Medicine and Biomedical Sciences, Texas A&M University.

Data Avaliability

The data underlying this article are available in the article and in its online supplementary material. Additional data will be shared on reasonable request to the corresponding author.

References

American Bison Society
.
Annual report of the American Bison Society 1905-1907
;
1908
.

Amorim
A
,
Pereira
L.
Pros and cons in the use of SNPs in forensic kinship investigation: a comparative analysis with STRs
.
Forensic Sci Int
.
2005
:
150
:
17
21
. doi: https://doi.org/

Anderson
EC
,
Garza
JC.
The power of single-nucleotide polymorphisms for large-scale parentage inference
.
Genetics
.
2006
:
172
:
2567
2582
. doi: https://doi.org/

Chang
CC
,
Chow
CC
,
Tellier
LCAM
,
Vattikuti
S
,
Purcell
SM
,
Lee
JJ.
Second-generation PLINK: rising to the challenge of larger and richer datasets
.
GigaScience
.
2015
:
4
:
1
16
. doi: https://doi.org/

Coder
GD.
The national movement to preserve the American buffalo in the United States and Canada between 1880 and 1920
[ProQuest Dissertations and Theses].
1975
. http://proxy.library.tamu.edu/login?url= https://www.proquest.com/dissertations-theses/national-movement-preserve-american-buffalo/docview/302745670/se-2?accountid=7082

Danecek
P
,
Auton
A
,
Abecasis
G
,
Albers
CA
,
Banks
E
,
DePristo
MA
,
Handsaker
RE
,
Lunter
G
,
Marth
GT
,
Sherry
ST
, et al. ;
1000 Genomes Project Analysis Group
.
The variant call format and VCFtools
.
Bioinformatics
.
2011
:
27
:
2156
2158
. doi: https://doi.org/

Dray
S
,
Dufour
A-B.
The ade4 package: implementing the duality diagram for ecologists
.
J Stat Softw
.
2007
:
22
:
1
20
.

Earl
DA
,
vonHoldt
BM.
STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method
.
Conserv Genet Resour
.
2012
:
4
:
359
361
. doi: https://doi.org/

Evanno
G
,
Regnaut
S
,
Goudet
J.
Detecting the number of clusters of individuals using the software structure: a simulation study
.
Mol Ecol
.
2005
:
14
:
2611
2620
. doi: https://doi.org/

Fernández
ME
,
Goszczynski
DE
,
Lirón
JP
,
Villegas-Castagnasso
EE
,
Carino
MH
,
Ripoli
M
,
Rogberg-Muñoz
A
,
Posik
DM
,
Peral-García
P
,
Giovambattista
G.
Comparison of the effectiveness of microsatellites and SNP panels for genetic identification, traceability and assessment of parentage in an inbred Angus herd
.
Genet Mol Biol
.
2013
:
36
:
185
191
. doi: https://doi.org/

Forcina
G
,
Leonard
JA.
Tools for monitoring genetic diversity in mammals: past, present, and future BT—conservation genetics in mammals: integrative research using novel approaches
.
In:
Ortega
J
,
Maldonado
JE
, editors.
Conservation Genetics in Mammals
,
Cham, Switzerland
:
Springer International Publishing
;
2020
. p.
13
27
. doi: https://doi.org/

Forgacs
D.
Uncovering the genomic signatures of population structure and introgression in American bison
;
2019
.

Forgacs
D
,
Wallen
RL
,
Dobson
LK
,
Derr
JN.
Mitochondrial genome analysis reveals historical lineages in Yellowstone bison
.
PLoS One
.
2016
:
11
:
e0166081
. doi: https://doi.org/

Francis
RM.
pophelper: an R package and web app to analyse and visualize population structure
.
Mol Ecol Resour
.
2017
:
17
:
27
32
.

Geremia
C.
Status report on the Yellowstone bison population to the superintendent bison program
;
2022
.

Geremia
C
,
White
PJ
,
Hoeting
JA
,
Wallen
RL
,
Watson
FGR
,
Blanton
D
,
Hobbs
NT.
Integrating population- and individual-level information in a movement model of Yellowstone bison
.
Ecol Appl
.
2014
:
24
:
346
362
. doi: https://doi.org/

Goudet
J.
Hierfstat, a package for R to compute and test hierarchical F-statistics
.
Mol Ecol Notes
.
2005
:
5
:
184
186
.

Halbert
ND.
The utilization of genetic markers to resolve modern management issues in historic bison populations: implications for species conservation
.
College Station, TX, USA
:
Texas A&M University
;
2003
.

Halbert
ND
,
Derr
JN.
Patterns of genetic variation in US federal bison herds
.
Mol Ecol
.
2008
:
17
:
4963
4977
. doi: https://doi.org/

Halbert
ND
,
Gogan
PJP
,
Hedrick
PW
,
Wahl
JM
,
Derr
JN.
Genetic population substructure in bison at Yellowstone National Park
.
J Hered
.
2012a
:
103
:
360
370
. doi: https://doi.org/

Halbert
ND
,
Gogan
PJP
,
Hedrick
PW
,
Wahl
JM
,
Derr
JN
.
Yellowstone Bison Genetics: Let Us Move Forward
.
J Hered
.
2012b
:
103
:
754
755
. doi: https://doi.org/

Huisman
J.
Pedigree reconstruction from SNP data: parentage assignment, sibship clustering and beyond
.
Mol Ecol Resour
.
2017
:
17
:
1009
1024
.

Kalinowski
ST
,
Taper
ML
,
Marshall
TC.
Revising how the computer program CERVUS accommodates genotyping error increases success in paternity assignment
.
Mol Ecol
.
2007
:
16
:
1099
1106
.

Meagher
MM.
The bison of Yellowstone National Park (issue 1)
.
Washington D.C., USA
:
US Government Printing Office
;
1973
.

Meagher
MM.
Range expansion by bison of Yellowstone National Park
.
J Mammal
.
1989
:
70
:
670
675
. doi: https://doi.org/

Meagher
MM.
Winter recreation-induced changes in bison numbers and distribution in Yellowstone National Park
.
Mary Meagher
;
1993
.

Meagher
MM.
Recent changes in Yellowstone bison numbers and distribution
. In:
Irby
L
,
Knight
J
, editors.
International Symposium on Bison Ecology and Management
.
Bozeman, MT
:
Montana State University Press
;
1998
. p.
107
112
.

Pritchard
JK
,
Stephens
M
,
Donnelly
P.
Inference of population structure using multilocus genotype data
.
Genetics
.
2000
:
155
:
945
959
.

Purcell
S
,
Neale
B
,
Todd-Brown
K
,
Thomas
L
,
Ferreira
MAR
,
Bender
D
,
Maller
J
,
Sklar
P
,
de Bakker
PIW
,
Daly
MJ
, et al.
PLINK: a tool set for whole-genome association and population-based linkage analyses
.
Am J Hum Genet
.
2007
:
81
:
559
575
. doi: https://doi.org/

R Core Team
.
R: a language and environment for statistical computing. Vienna, Austria: Foundation for Statistical Computing
;
2022
.

Raj
A
,
Stephens
M
,
Pritchard
JK.
fastStructure: variational inference of population structure in large SNP data sets
.
Genetics
.
2014
:
197
:
573
589
. doi: https://doi.org/

Schnabel
RD
,
Ward
TJ
,
Derr
JN.
Validation of 15 microsatellites for parentage testing in North American bison, Bison bison and domestic cattle
.
Anim Genet
.
2000
:
31
:
360
366
. doi: https://doi.org/

Skinner
CK
,
Alcorn
WB.
History of the bison in Yellowstone National Park
. In:
Project report to the chief ranger
.
WY
:
Mammoth Hot Springs, Yellowstone National Park
;
1942
. [Updated in 1952 by WL Evans].

Stroupe
S
,
Derr
JN.
Development and evaluation of a novel single nucleotide polymorphism panel for North American bison
.
Evol Appl
.
2024
:
17
:
1
12
. doi: https://doi.org/

Stroupe
S
,
Forgacs
D
,
Harris
A
,
Derr
JN
,
Davis
BW.
Genomic evaluation of hybridization in historic and modern North American Bison (Bison bison)
.
Sci Rep
.
2022
:
12
:
1
11
. doi: https://doi.org/

Toll
RW.
Annual report for Yellowstone National Park
.
Wyoming
:
Typewritten report on file in YNP, Yellowstone National Park
;
1929
.

Toonen
RJ
,
Hughes
S.
Increased throughput for fragment analysis on an ABI Prism®377 automated sequencer using a membrane comb and STRand software
.
Biotechniques
.
2001
:
31
:
1320
1324
.

Ward
TJ
,
Bielawski
JP
,
Davis
SK
,
Templeton
JW
,
Derr
JN.
Identification of domestic cattle hybrids in wild cattle and bison species: a general approach using mtDNA markers and the parametric bootstrap
.
Anim Conserv
.
1999
:
2
:
51
57
. doi: https://doi.org/

Weir
BS
,
Cockerham
CC.
Estimating F-statistics for the analysis of population structure
.
Evolution
.
1984
:
38
:
1358
1370
.

White
PJ
,
Wallen
RL
,
Geremia
C
,
Treanor
JJ
,
Blanton
DW.
Management of Yellowstone bison and brucellosis transmission risk—implications for conservation and restoration
.
Biol Conserv
.
2011
:
144
:
1322
1334
.

White
PJ
,
Wallen
RL
,
Hallac
DE.
Yellowstone bison: conserving an American icon in modern society
.
Yellowstone National Park, USA
:
Yellowstone Association
;
2015
.

White
PJ
,
Wallen
RL
.
Yellowstone Bison—Should We Preserve Artificial Population Substructure or Rely on Ecological Processes
?.
Journal of Heredity
2012
:
103
:
751
753
. doi: https://doi.org/

Wickham
H.
Programming with ggplot2
. In:
ggplot2: Elegant graphics for data analysis
.
Springer International Publishing
;
2016
. p.
241
253
. doi: https://doi.org/

Zimmerman
SJ
,
Aldridge
CL
,
Oyler-McCance
SJ.
An empirical comparison of population genetic analyses using microsatellite and SNP data for a species of conservation concern
.
BMC Genomics
.
2020
:
21
:
1
16
.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact [email protected].
Corresponding Editor: Alfred Roca
Alfred Roca
Corresponding Editor
Search for other works by this author on: