Abstract

Comparing divergences across multiple sister population pairs has been a focus in phylogeography since its inception. Initial approaches used organelle genetic data and involved qualitative comparisons of phylogenetic patterns to evaluate hypotheses of shared and variable evolutionary responses. This endeavor has progressed with coalescent model-based statistical techniques and advances in next-generation sequencing, yet there remains a need for methods that can exploit aggregated genomic-scale data within a unified analytical framework. To this end, we introduce the aggregate joint site frequency spectrum (ajSFS) by validating its use within a hierarchical Bayesian framework through several in silico experiments. Subsequently, we applied our method against two published restriction site–associated DNA marker datasets consisting of eight local replicates of a lamprey species pair and six co-distributed passerine taxon pairs, respectively, with the aim of inferring variability in co-divergence and co-migration histories. We found that the lamprey population pairs exhibited temporal synchrony in both co-divergence and collective secondary contact times, yet an idiosyncratic pattern in secondary migration intensities. In contrast, the bird population pairs displayed thoroughly asynchronous co-divergence histories. Our results demonstrate that the ajSFS can be exploited for complex and flexible co-demographic inference, opening up new possibilities for comparative phylogeography and population genomic studies.

A long-standing focus in comparative phylogeography is to infer the degree of shared response among replicate sister taxon pairs to emerging or existing landscape barriers for the sake of elucidating processes underlying allopatric or parapatric divergence (Avise et al. 1987; Bermingham and Moritz 1998; Bernatchez and Wilson 1998; Avise 2000; Arbogast and Kenagy 2001; Soltis et al. 2006; Hickerson et al. 2010). It is expected that geographic and climatic changes of major impact could result in similar demographic histories across co-distributed taxa, whereas trait differences (Papadopoulou and Knowles 2016) or even random chance (Rosindell et al. 2012) may cause idiosyncratic demographic trajectories. The interplay between these deterministic and stochastic forces leads to alternative predictions of community assemblage with respect to emergent histories of co-diversification, co-expansion, or co-dispersal. Hence, uncovering the distribution of divergence times across a set of population pairs is of interest to many researchers investigating broader evolutionary and ecological questions (Avise et al. 2016). This endeavor has traditionally relied on qualitatively observing collective patterns of pairwise divergences to circumstantially make inferences about multi-taxa histories (Hoffmann and Baker 2003; Lessios et al. 2003; Alvarado Bremer et al. 2005; Lourie et al. 2005; Goldstien et al. 2006; Richards et al. 2007). Likewise, it is commonplace to approximate whether geographic boundaries that divide structured populations coincide among multiple species or taxa, with the assumption that general similarities suggest concordant demography in joint historical response to environmental features (Taberlet et al. 1998; Zink 2002; Pastorini et al. 2003; Barber et al. 2006; Maliouchenko et al. 2007).

However, the advancement of inferential methods has allowed researchers to investigate historical processes using probabilistic models that offer parameter estimation. For comparative inference, such analyses can be conducted independently per constituent dataset, with results compared post hoc to qualitatively assess historical congruence (Carstens et al. 2004; Fedorov et al. 2008; Bacon et al. 2015; Edwards et al. 2016). Alternatively, multiple demographic histories can be modeled together simultaneously within a single unified analysis while accounting for individual variance across taxa in nonfocal nuisance parameters (Edwards and Beerli 2000; Carstens et al. 2016; Satler and Carstens 2016). One such approach involves hierarchical co-demographic modeling, which allows explicit hypothesis testing among taxa (Hickerson et al. 2007; Chan et al. 2014; Xue and Hickerson 2015, 2017) and increases accuracy due to the statistical pooling power or borrowing strength derived from globally modeling exchangeable datasets (Congdon 2001; Gelman et al. 2003; Beaumont 2010). Consequently, uncertainty measurements from this tactic consider the stochasticity for each constituent dataset in conjunction, in contrast to post hoc qualitative comparisons on individual results.

Hierarchical co-demographic modeling for sister taxon pairs is available in the software programs msBayes (Hickerson et al. 2007, 2014; Huang et al. 2011) and dpp-msbayes (Oaks 2014), which use an approximate Bayesian computation (ABC) inferential framework. In brief, ABC employs Monte Carlo simulations to construct a reference table of random parameter draws from prior distributions coupled with associated summary statistic vectors, which is subsequently applied to empirical data via a rejection algorithm that minimizes Euclidean distance of summary statistics to approximate posterior distributions of tested models and/or parameters (Beaumont et al. 2002; Bertorelle et al. 2010; Csilléry et al. 2010). The variant of ABC deployed by msBayes and dpp-msbayes assumes a hierarchical co-demographic model (hABC), and exploits distribution summaries of standard population genetic summary statistics calculated individually per population pair, to infer hyperparameters and parameter summaries that describe the variation in divergence times among taxon pairs. However, these methods have restricted modeling capabilities with respect to specifying historical events (e.g., complex co-migration trajectories) as well as an inflexible hyperparameterization strategy (e.g., cannot estimate proportion of taxa membership within a hypothesized single synchronous co-divergence pulse, or congruence in parameters besides co-divergence timing). More importantly, this approach is computationally constricted to single-locus data such as mitochondrial barcodes, or a small number of nuclear loci (Carnaval et al. 2009; Stone et al. 2012; Smith et al. 2014), such that statistical power and accuracy are strongly compromised (Overcast et al. 2017).

With the use of next-generation sequencing becoming more commonplace as a result of reducing costs, there is a growing need and utility for extending hierarchical co-demographic modeling to genomic-scale applications (Xue and Hickerson 2015; Oaks 2018). Such implementation could also improve the ability to explore more complex models, including incorporation of various migration and size change parameters as well as quantifying degree of congruence among multiple demographic parameters. The recently developed aggregate site frequency spectrum (aSFS), which permits comparative size change models of independent single populations coupled with genome-wide data and can be used in the R package Multi-DICE (Xue and Hickerson 2015, 2017; Prates et al. 2016; Mastretta-Yanes et al. 2018; Potter et al. 2018), opens the possibility of a large-scale data summarization that contains signal of co-demography across multiple population pairs. Notably, previous implementations of Multi-DICE supplemented hABC results with random forest, a machine learning algorithm that is experiencing increased attention in population genetics due to its ability to predict model assignment for empirical data from training on Monte Carlo simulations (Pudlo et al. 2016; Schrider and Kern 2018).

To this end, we describe, evaluate, and deploy a hierarchical co-demographic model of population pairs, as well as a generalized approach of comprehensively modeling individual population pair divergence and migration scenarios that can accompany our hierarchical model. We also introduce the data summarization associated with our co-demographic model, the aggregate joint site frequency spectrum (ajSFS). The ajSFS is similar in design to the aSFS with respect to its efficient aggregation of independent genomic-scale datasets, which can be effectively exploited by Monte Carlo simulations to achieve statistical inference of hyperparameters that quantify the degree of congruence in co-demographic histories. However, an important distinction from the aSFS is that the ajSFS is specifically intended for hierarchical models of aggregate population pairs rather than single populations, similar to msBayes and dpp-msbayes yet with more flexibility. To establish proof-of-concept, we conduct simulation experiments to assess performance using both random forest regression given our hierarchical model (hRF) as well as hABC, and then reanalyze two previously published restriction site–associated DNA marker (RAD) datasets (Oswald et al. 2017; Rougemont et al. 2017).

Both studies associated with the empirical data employed model selection of various divergence with migration scenarios to address a central inquiry of a comparative nature. The first comparative dataset consists of a species pair between river lamprey (Lampetra fluviatilis), which are anadromous and parasitic, and its freshwater-exclusive and nonparasitic counterpart brook lamprey (Lampetra planeri), replicated in eight sampling localities throughout Western France (Rougemont et al. 2017). For the original investigation, Rougemont et al. (2017) focused on how environmental connectivity affected the lamprey species pair's demographic history, particularly with respect to migration, across these putatively natural replicates. Specifically, four population pairs considered to be connected (CPPs) all had secondary contact as the model with highest likelihood, whereas ancient migration was mostly inferred for the remaining four population pairs deemed disconnected (DPPs). The coarse trends of several relative estimates (i.e., proportion of two parameters within a population pair) likewise contrasted CPPs from DPPs. The second comparative dataset to which we apply our method includes six passerine taxon pairs co-distributed along a barrier separating the Marañón Valley and Tumbesian region in Peru (Oswald et al. 2017). In the initial paper, Oswald et al. (2017) focused on how a shared phylogeographic break broadly affected bird pairs co-distributed in seasonally dry forests. Their results suggested asymmetric gene flow for every taxon pair following an overall asynchronous co-divergence. For our approach, we are interested in extending the findings for these two datasets by directly evaluating temporal synchrony among co-divergence times as well as variability within other demographic parameters related to postdivergence migration and population size change.

Materials and Methods

HIERARCHICAL CO-DEMOGRAPHIC MODEL OF POPULATION PAIRS

We present a hierarchical co-demographic model extending that which is described in Xue and Hickerson (2017) to accommodate population pairs instead of single populations, multiple historical events per individual taxonomic unit, and various co-demographic hyperparameters (Fig. 1; Table S1). As with Xue and Hickerson (2017), one level of the hierarchical model consists of n individual population pairs that are each treated as an independent iteration with its own demographic parameters (therefore no assumptions about shared ancestry are enforced), whereas on another level are co-demographic hyperparameters that explain variability among analogous demographic parameters from the constituent population pairs. Population indices correspond along all n number of incorporated population pairs (e.g., population “0” in one population pair is associated with population “0” from all other population pairs). Demographic parameter symbols notate the corresponding set of taxon-specific demographic parameters across population pairs, with all of the constituent elements, which allude arbitrarily to one of the n individual taxa, identified by numeric subscripts, for example, τ1={τ11,τ21,,τn1}. To estimate variability in taxon-specific responses, we defer to the hyperparameters defined in Xue and Hickerson (2017), specifically ζT, ST, and ψ. In general, these could be applied to any set of analogous demographic parameter values among taxa, thus parameter symbols are subscripted to distinguish co-demographic hyperparameters (e.g., ζT,τ1 and ψτ1 for τ1). Accordingly, a demographic parameter symbol with an s subscript denotes a vector of values that coincide with ψ shared pulses of co-demography, with the vector elements identified by numeric subscripts that reference the specific pulse to which they are assigned (e.g., τs1={τs,11,τs,21,,τs,ψ1,} for synchronous co-divergence timings). Likewise, an i subscript represents the vector of idiosyncratic draws that are each unique in value, with numeric subscripts arbitrarily pertaining to individual population pairs (e.g., τi1={τi,nST+11,τi,nST+21,,τi,n1} ).

Hierarchical co-demographic model. An example model with eight population pairs and half of these pairs within a single temporally synchronous co-divergence pulse. The pulse of synchronous co-divergence occurred at τs1, whereas the remaining four population pairs experienced idiosyncratic co-divergence at independent times of τi,51, τi,61, τi,71, and τi,81, respectively. For visual simplicity, the individual model used here does not have migration nor size change.
Figure 1

Hierarchical co-demographic model. An example model with eight population pairs and half of these pairs within a single temporally synchronous co-divergence pulse. The pulse of synchronous co-divergence occurred at τs1, whereas the remaining four population pairs experienced idiosyncratic co-divergence at independent times of τi,51, τi,61, τi,71, and τi,81, respectively. For visual simplicity, the individual model used here does not have migration nor size change.

For this study, we focused on evaluating hyperparameters, in particular the total proportion of the whole set of taxon-specific demographic parameters that are congruent in value, ζT (Table S1). The specific demographic parameter values per population pair (Table 1) were therefore regarded as nuisance parameters, which are not of direct interest yet important to account for noise and uncertainty stemming from population differences. In particular, we emphasized ζT with respect to co-divergence time, which is formally designated as ζT,τ1 but hereby referred to as ζ for convenience. Within our simulation-based inferential approach, we treated ζ as a freely varying hyperparameter that governs variation in τ1 values across population pairs a priori. Conversely, to efficiently derive values of ζT for other demographic parameters, we deployed a novel post hoc protocol that instead calculates these hyperparameter values from freely varying taxon-specific parameter values (e.g., ζT,τ2 is determined from random draws of τ2 per population pair). This was accomplished by counting how many individual values were within a predefined distance from the median for the total set of draws per simulation, with zero counts assigned to ζT = 1/n because this value represents complete idiosyncrasy and thus is inclusive of cases whereby ζT = 0.0. Notably, the predefined distance from the median approximates our use of β to condition the τi1 prior distribution, for example, assuming this value is set to 0.05 natural log units, then the degree of clumping up to 0.05 natural log units around the central tendency within the range of taxon-specific draws determines the value of post hoc ζT, similar to β = 0.05 natural log units. Values for a post hoc ζT hyperparameter then can be combined across simulations to construct hABC hyperprior and hyperposterior distributions. This simple post hoc conversion process demonstrates that hyperparameters, which provide a convenient interpretation of parameter variability across population pairs, can still be extracted when not incorporated into the simulation design. Such a possibility allows further exploration of the hierarchical co-demographic model in a quick and efficient manner, potentially eliminating the need for heavier initialization of hyperparameters, which must be accommodated by more simulations and thus increasingly expensive computation.

Table 1

Index of parameters for individual population pair model

SymbolDescription
τ1Time of divergence
τ2Time of instantaneous change in migration rates and effective population sizes
M  1-0:1Migration rate from tip population “0” to “1” forward-in-time during recent phase
M  1-1:0Migration rate from tip population “1” to “0” forward-in-time during recent phase
M  2-0:1Migration rate from tip population “0” to “1” forward-in-time during ancient phase
M  2-1:0Migration rate from tip population “1” to “0” forward-in-time during ancient phase
ε0Size change magnitude for tip population “0”
ε1Size change magnitude for tip population “0”
N  0Present-day effective population size for tip population “0”
N  1Present-day effective population size for tip population “1”
uMutation rate for entire population pair
SymbolDescription
τ1Time of divergence
τ2Time of instantaneous change in migration rates and effective population sizes
M  1-0:1Migration rate from tip population “0” to “1” forward-in-time during recent phase
M  1-1:0Migration rate from tip population “1” to “0” forward-in-time during recent phase
M  2-0:1Migration rate from tip population “0” to “1” forward-in-time during ancient phase
M  2-1:0Migration rate from tip population “1” to “0” forward-in-time during ancient phase
ε0Size change magnitude for tip population “0”
ε1Size change magnitude for tip population “0”
N  0Present-day effective population size for tip population “0”
N  1Present-day effective population size for tip population “1”
uMutation rate for entire population pair
Table 1

Index of parameters for individual population pair model

SymbolDescription
τ1Time of divergence
τ2Time of instantaneous change in migration rates and effective population sizes
M  1-0:1Migration rate from tip population “0” to “1” forward-in-time during recent phase
M  1-1:0Migration rate from tip population “1” to “0” forward-in-time during recent phase
M  2-0:1Migration rate from tip population “0” to “1” forward-in-time during ancient phase
M  2-1:0Migration rate from tip population “1” to “0” forward-in-time during ancient phase
ε0Size change magnitude for tip population “0”
ε1Size change magnitude for tip population “0”
N  0Present-day effective population size for tip population “0”
N  1Present-day effective population size for tip population “1”
uMutation rate for entire population pair
SymbolDescription
τ1Time of divergence
τ2Time of instantaneous change in migration rates and effective population sizes
M  1-0:1Migration rate from tip population “0” to “1” forward-in-time during recent phase
M  1-1:0Migration rate from tip population “1” to “0” forward-in-time during recent phase
M  2-0:1Migration rate from tip population “0” to “1” forward-in-time during ancient phase
M  2-1:0Migration rate from tip population “1” to “0” forward-in-time during ancient phase
ε0Size change magnitude for tip population “0”
ε1Size change magnitude for tip population “0”
N  0Present-day effective population size for tip population “0”
N  1Present-day effective population size for tip population “1”
uMutation rate for entire population pair

Additionally, we focused on estimation of the following parameter summaries, which describe the distribution of parameter values across population pairs and can supplement interpretation of inferred hyperparameter values: (1) timing for a single pulse of synchronous co-divergence (formally designated as τs,11, but hereby referred to as τs1 for convenience since τs1={τs,11} for only a single pulse); (2) mean absolute deviation from the median (Δ; e.g., Δ(τ1)=1nj=1n|τj1median(τ1)| ); (3) dispersion index (Ω; i.e., variance/arithmetic mean); and (4) arithmetic mean (Table S1). Notably, although a priori governing hyperparameters are prescriptive determinants for the full distribution of parameter values, these are distribution summaries that are descriptive in nature, and in fact derived similarly from the same set of simulated values as our post hoc ζT hyperparameters. Importantly, our usage of Δ is a new implementation within this context of co-demographic modeling. As a deviation statistic, it provides a standard measure of statistical dispersion that benefits from conveying information in an intuitive manner, specifically the average difference from the central tendency within a set of values such that a range of 2 × Δ contains approximately half of the population pairs, as well as being more robust to low sampling than Ω due to relying less on normality assumptions (Herrey 1965; Pham-Gia and Hung 2001; Leys et al. 2013).

Importantly, we elaborate upon two caveats of our hierarchical co-demographic model with respect to biological interpretation and our assumption of population pair independence within the Supporting Information.

DEMOGRAPHIC MODEL OF INDIVIDUAL POPULATION PAIRS

To build our hierarchical co-demographic model, we used a generalized parameterization for individual population pairs that can efficiently explore the data independently for each population pair and can be further informed to construct the individual taxon level of our hierarchical scheme. Several commonly tested divergence scenarios are nested within our comprehensive individual-level model (Fig. 2A), including isolation with continuous migration through to the present (IM; Fig. 2B), isolation with continuous migration that ceases before present (i.e., ancient migration [AM], often associated with parapatric divergence; Fig. 2C), strict isolation with a period of no migration followed by continuous migration through to the present (i.e., secondary contact [SC]; Fig. 2D), and strict isolation (SI) with absolutely no migration (often associated with allopatric divergence; Fig. 2E). Such isolation/migration syndromes have traditionally been treated as competing discretized models, which are tested by performing model selection through comparing relative likelihoods or model posterior probabilities (Pelletier and Carstens 2014; Roux et al. 2016; Jackson et al. 2017; Oswald et al. 2017; Rougemont et al. 2017). Our global model instead captures these syndromes under different regions of parameter space (Table 1), an approach we favor because a wide span of overall divergence scenarios can be quickly assessed independently across multiple taxa under a single analysis. Importantly, our generalized population pair model can discover trajectories that are intermediate between the aforementioned traditional population pair scenarios (e.g., mostly consistent with SC but with a small yet detectable amount of early phase migration, which would not be captured if selecting among discrete models). Furthermore, in cases where similar patterns are elucidated throughout the individual taxa, the flexible parameterizations can be easily modified to increase model specificity, thus increasing identifiability, while still allowing population pair datasets and associated individual parameterizations to be treated as exchangeable within our downstream hierarchical co-demographic model construction (Schlüter et al. 1997; Gelman et al. 2003; Teh et al. 2005). As an example, preliminary analyses with our generalized parameterizations could reveal that several individual datasets resemble IM while the remaining are indicative of AM, in which case it would be desirable to restrict the migration intensity parameters such that only the model space inclusive of either IM or AM is permitted within a corresponding hierarchical co-demographic approach.

Individual population pair model. (A) Generalized model with two phases of asymmetric migration, as well as instantaneous size changes occurring independently between tip populations and simultaneously with shift in migration rates. This model subsumes other commonly used divergence models, such as (B), (C), (D), and (E), under specific parameter combinations. Demographic parameter symbols are annotated onto this model here and described in Table 1. (B) IM model, which the generalized model converges toward when approximating M1-0:1 = M2-0:1 and M1-1:0 = M2-1:0 (or alternatively τ1 = τ2). (C) AM model, which the generalized model converges toward when approaching M1-0:1 = 0.0 and M1-1:0 = 0.0. (D) SC model, which the generalized model converges toward when near M2-0:1 = 0.0 and M2-1:0 = 0.0. (E) SI model, which the generalized model converges toward when resembling M1-0:1 = 0.0, M1-1:0 = 0.0, M2-0:1 = 0.0, and M2-1:0 = 0.0. Moreover, this model does not contain size change to further illustrate that the generalized model becomes consistent with stable size if ε parameters have values close to 1.0 (or alternatively τ1 = τ2).
Figure 2

Individual population pair model. (A) Generalized model with two phases of asymmetric migration, as well as instantaneous size changes occurring independently between tip populations and simultaneously with shift in migration rates. This model subsumes other commonly used divergence models, such as (B), (C), (D), and (E), under specific parameter combinations. Demographic parameter symbols are annotated onto this model here and described in Table 1. (B) IM model, which the generalized model converges toward when approximating M1-0:1 = M2-0:1 and M1-1:0 = M2-1:0 (or alternatively τ1 = τ2). (C) AM model, which the generalized model converges toward when approaching M1-0:1 = 0.0 and M1-1:0 = 0.0. (D) SC model, which the generalized model converges toward when near M2-0:1 = 0.0 and M2-1:0 = 0.0. (E) SI model, which the generalized model converges toward when resembling M1-0:1 = 0.0, M1-1:0 = 0.0, M2-0:1 = 0.0, and M2-1:0 = 0.0. Moreover, this model does not contain size change to further illustrate that the generalized model becomes consistent with stable size if ε parameters have values close to 1.0 (or alternatively τ1 = τ2).

To this end, our generalized population pair model (Fig. 2A; Table 1) treats the two “tip” lineages as sister panmictic populations derived from their most recent common ancestral population. Deep population structure for the ancestral or either of the descendent populations is assumed to be absent, with tip populations indexed arbitrarily as “0” and “1” to distinguish parameters. Specifically, population “0” and population “1” are subject to respective present-day effective population sizes (NE), N0 and N1, in number of haploids. Divergence of the ancestral population occurs at time τ1 and the descendent tip populations experience two phases of asymmetric continuous migration as well as instantaneous size changes. Both shifts in migration rates as well as size changes are simultaneously associated with time τ2. These two time parameters are scaled by number of generations. The four different asymmetric migration intensity parameters (M) are notated as Mx-y:z, with x symbolizing recent (1) or ancient (2) phases and y:z designating source and sink population indices, respectively, from a forward-in-time perspective (i.e., M1-0:1, M1-1:0, M2-0:1, and M2-1:0). Migration intensities are rates of effective haploid migrants per generation, that is, scaled by NE × m, where m is the generational probability of per haploid individual migration. The two distinct population size change magnitude parameters (ε) are affixed with the aforementioned population indices (i.e., ε0 and ε1), with population-specific size change magnitudes scaled by the ratio of NE before size change to NE after size change from a forward-in-time context (i.e., past:future), such that values <1 denote expansion and >1 signify contraction. Importantly, mutation rate (μ), in number of substitutions per site per generation, can also be incorporated to exploit sequence length information (l) for calibration of NE parameters (via the population genetics parameter θ = 4NEμl), which then inherently calibrates time parameters as assumed by coalescent theory. If μ is omitted, monomorphic sites are implicitly ignored and thus parameter estimates are conditioned on given NE, which may be informed by a priori information. In such case, NE ought to still be governed by a prior distribution to account for uncertainty as well as allow for heterogeneity in NE among population pairs in the downstream hierarchical co-demographic model.

AGGREGATE JOINT SITE FREQUENCY SPECTRUM

To enable inference under supervised machine learning or ABC, we introduce a summarization appropriate for genome-wide single nucleotide polymorphism (SNP) data that extends the aSFS summary statistic vector described in Xue and Hickerson (2015), the ajSFS (Fig. 3). To construct the ajSFS, a joint site frequency spectrum (joint-SFS) with relative SNP frequencies (i.e., proportion of total SNPs rather than SNP count) is calculated for every population pair. As aforementioned, monomorphic sites can also be optionally included to better calibrate absolute parameter estimates as long as the μ parameter (Table 1) is used during simulation, although this count needs to be converted into the proportion to SNPs within the joint-SFS due to the use of relative frequencies. Importantly, all constituent joint-SFS are projected to equivalent sampling for number of individuals, after which they are ideally comparable in sampling of genomic sites as well. With respect to number of sites, although slight deviations are negligible, major differences in sampling among joint-SFS can be addressed by conservatively assuming the lowest amount for the entire aggregate dataset during simulation. After being independently built, the multiple joint-SFS are collated and bins are rearranged in descending order per joint allele frequency class, thus forming a new summary statistic vector, the ajSFS. As with the aSFS, the intuition behind this construction is to efficiently combine independent datasets such that important aspects of the aggregate data are directly extracted, in this case the relative distribution of each joint allele frequency class across population pairs, whereby similarities and deviations between constituent datasets are captured by hyperparameters and associated parameter summaries. To clarify, this exchangeability assumption is the reason that relative SNP frequencies and equivalent sampling for the multiple joint-SFS are necessary to derive the ajSFS. Importantly, although information about the identity of individual population pairs is discarded, the ajSFS heavily increases computational tractability for simulation-based approaches while exploiting valuable multi-taxa insight, allowing assemblage-level hypotheses to be broadly tested while complementary methods could be used to ascertain specific individual responses. To better illustrate our ajSFS protocol, we provide an explicit example within the Supporting Information.

Construction protocol for ajSFS. (A) Example of eight independent joint-SFS with relative SNP frequency values, visualized here as heatmaps, projected to the same haploid sampling. (B) The first joint allele frequency class across the eight individual datasets is combined, such that the eight separate cells are now brought together. (C) The eight bins are reordered in descending fashion. (D) The process repeats for all joint allele frequency classes. (E) The final ajSFS, consisting of values from all eight of the original joint-SFS and visualized here as a one-dimensional vector, is constructed.
Figure 3

Construction protocol for ajSFS. (A) Example of eight independent joint-SFS with relative SNP frequency values, visualized here as heatmaps, projected to the same haploid sampling. (B) The first joint allele frequency class across the eight individual datasets is combined, such that the eight separate cells are now brought together. (C) The eight bins are reordered in descending fashion. (D) The process repeats for all joint allele frequency classes. (E) The final ajSFS, consisting of values from all eight of the original joint-SFS and visualized here as a one-dimensional vector, is constructed.

SIMULATING REFERENCE TABLES AND TEST DATA FROM PRIOR AND HYPERPRIOR DISTRIBUTIONS

To demonstrate our hierarchical co-demographic modeling approach with the ajSFS, we conducted numerous simulations (Excoffier et al. 2013; Xue and Hickerson 2017) to be exploited by in silico experiments as well as empirical analyses of two publicly available datasets, specifically reduced representation genomic-level SNPs from eight replicates of a lamprey species pair and six co-distributed taxon pairs of passerine birds, respectively, within a Bayesian inferential framework. Specifically, simulations were organized into reference tables, which consist of independent entries of a summary statistic vector generated from random parameter draws from user-specified prior distributions, and can be employed for machine learning or rejection sampling to discover the most likely parameter values that fit an observed dataset. Our first group of simulations was produced to test the performance of our generalized population pair modeling scheme within the hierarchical framework. To this end, we constructed three ajSFS reference tables that were governed by this generalized model, an IM scenario, and an SI scenario, respectively. Accordingly, pseudo-observed datasets (PODs) were simulated under nine different co-demographic scenarios, including those three applied for the reference tables, to explore model misspecification (Table S2). For our other batch of ajSFS simulations, we produced reference tables that matched the sampling specifications of the two empirical datasets, allowing both in silico experiments that further acted as power analyses relevant to the empirical hyperparameter and parameter summary inferences, as well as statistical inference on the empirical data. Furthermore, to better inform the underlying hierarchical models, we created joint-SFS reference tables under the generalized population pair model to conduct preliminary empirical analyses independently across the individual taxa (Tables S3 and S4).

For the ajSFS simulations associated with the empirical analyses, we employed alternative hierarchical modeling strategies (Tables S3 and S4): (1) weakly informed parameterization—wide taxon-specific prior distribution ranges that matched those in the preliminary analyses on the individual population pairs; (2) SC parameterization (lamprey analyses only)—enforced SC scenario with more constraint in other taxon-specific prior distributions, as indicated by the preliminary results; (3) fixed τs1 parameterization (bird analyses only)—τs1 fixed in value by leveraging a previous hABC result, thus inference was conditional on a hypothesized synchronous time of co-divergence while still allowing τi1 to freely vary; and (4) SC and fixed τs1 parameterization—SC parameterization, which was supported by the preliminary results among the birds as well, with simultaneous fixing of τs1 .

To explore certain properties of the ajSFS, we subsequently transformed the initial reference tables into alternative sampling specifications. For the lamprey simulations, the ajSFS with 20 haploids per tip population constituted a large composite vector, therefore we down-projected these data to 10 haploids per tip population to discern effects from this sampling difference between 10 versus 20 individuals. The down-projection followed the same procedure applied by the Python module δaδi, whereby the number of individuals within the joint-SFS can be reduced through the hypergeometric distribution, which integrates over every subsampling combination (Gutenkunst et al. 2009). For the bird simulations, we deployed three variants of the ajSFS to assess the effects of incorporating sequence length information: (1) ajSFS configuration omitting monomorphic sites, such that the ajSFS was scaled to relative proportions of SNPs only, as with the lamprey ajSFS; (2) ajSFS configuration including monomorphic sites, with the monomorphic frequency class scaled to the proportion of invariants to SNPs per population pair (remaining ajSFS bins still in relative SNP proportions), as well as subsequent normalization of all bins to account for the orders of magnitude difference in scale and sampling variance between monomorphic and polymorphic bins; and (3) ajSFS configuration involving the same normalization procedure as (2) but with monomorphic bins removed to isolate any effect from this normalization. Normalizing ajSFS simulations entailed a standard protocol of subtracting the mean value and then dividing by the variance for each individual element, with mean and variance calculated per respective element from the whole reference table.

Every ajSFS reference table across in silico experiments consisted of 1.2 million ajSFS simulations, and collectively totaled to 18 separate sets of simulations from: three modeling schemes, which were individually tested against nine sets of 100 ajSFS PODs with varying parameterizations, for the first experiment (Table S2); three parameterizations (Table S3) under two sampling projections for number of individuals (i.e., six reference tables) in the case of the lamprey experiment; and three parameterizations (Table S4) with three data constructions involving monomorphic sites and normalization (i.e., nine reference tables) under the bird experiment. Additionally, each of the two individual population pair reference tables for our preliminary empirical ABC analyses, that is, one for the lampreys (Table S3) and one for the birds (Table S4), were built with 500,000 joint-SFS vectors according to the same simulation specifications (i.e., simulation model, number of individuals, and amount of genetic data) as the corresponding ajSFS reference tables for the lamprey and bird data, respectively. To clarify, ABC reference tables were constructed only given 20 haploid samples for the lamprey data, and only without monomorphic sites nor normalization for the bird data, because these both ostensibly offered the best inferential resolution as determined by the results of the simulation experiments.

We discuss in greater detail the data specifications and model parameterizations of our simulations within the Supporting Information.

EXPERIMENTS ON CO-DEMOGRAPHIC INFERENCE USING SIMULATION REFERENCE TABLES

To perform in silico experiments for our hierarchical co-demographic methodology, we explored bias and accuracy under hRF hyperparameter inference of ζ (Liaw and Wiener 2002) as well as hABC hyperparameter and parameter summary estimation (Table S1) under simple rejection sampling with mean, median, and mode point estimates of the posterior. Specifically, we tested the application of our generalized modeling scheme for population pairs, parameterization options that sequentially increased the degree of a priori information, and ajSFS construction schemes that varied in sampling of individuals, inclusion of monomorphic sites, and normalization of ajSFS bins. To accomplish this, in our first in silico experiment that tested our generalized population pair scenario against other models, every single POD underwent separate inferences against each of the three simulated reference tables. Here, hABC inference included hyperparameter estimation of ζ and parameter summary estimation of Δ(τ1), Ω(τ1), and x¯1). For the other ajSFS reference tables that were based on the two empirical specifications, we conducted “leave-one-out” cross-validation, which entails iteratively treating a single random simulation as a POD and performing inference with the remaining reference table (Csilléry et al. 2012). In this case, hABC inference included estimates of ζ, other ζT hyperparameters derived post hoc to the taxon-specific draws, τs1, and parameter summaries (i.e., Δ, Ω, and x¯ ) for τ2 and several migration parameters. Additionally, we examined robustness to error in assuming SC or the fixed value of τs1 within the context of our empirical applications, by selecting a single set of PODs from the second reference table in sequence (i.e., generated with the second-in-sequence parameterization option) but using the third and final reference table in sequence (i.e., simulated with the SC and fixed τs1 parameterization for both the lampreys and the birds) for hRF and hABC inference of ζ. Notably, the difference in the second parameterization option between the lamprey and bird analyses allowed us to isolate the effect of using solely an SC parameterization and fixed τs1 parameterization, respectively, therefore allowing an independent assessment of each tactic. Across in silico experiments, estimated values were compared against true values by calculating Pearson's r correlation and root mean squared error (RMSE). Additionally, hRF confusion matrices, which required rounding estimates to the nearest discrete interval, and the mean of the hABC hyperposterior distributions per true value were produced for ζ hyperparameter estimation. We further describe this inferential procedure for the simulation experiments within the Supporting Information.

COMPARATIVE INFERENCE ON EMPIRICAL AGGREGATED DATASETS OF LAMPREY AND BIRD POPULATION PAIRS USING SIMULATION REFERENCE TABLES

To empirically demonstrate our hierarchical modeling approach with the ajSFS, we applied our method against two publicly available genome-wide SNP datasets to detect co-demographic pulses of divergence with migration. We selected these two comparative datasets because they starkly contrast in terms of underlying taxonomic relationship, biogeographic question, and sampling, thus span a broad spectrum of empirical applications. The first was collected from a lamprey species pair, consisting of the anadromous, parasitic river lamprey (Lampetra fluviatilis) and freshwater brook, nonparasitic lamprey (Lampetra planeri), across eight sampling localities throughout western France (Rougemont et al. 2017). We adhered to the population and pair assignments as well as groupings of localities based on environmental connectivity of Rougemont et al. (2017), specifically L. fluviatilis and L. planeri populations were designated population “0” and population “1,” respectively, following the structure of the data files, and the connected population pairs (CPPs) included the population pairs AA, BET, OIR, and RIS (from sampling locations Aa, Bethune, Oir, and Risle River, respectively), whereas BRE, CEN, ODO, and SAU (from sampling locations Bresle, Loire-Cens, Odon River, and Garonne-Saucats, respectively) were part of the disconnected population pairs (DPPs). The second dataset was derived from ddRAD sequences for six taxon pairs of dry forest birds from Peru: Arremon abeillei nigricepsA. a. abeillei, Campylorhynchus fasciatus fasciatusC. f. pallescens, Melanopareia maranonicaM. elegans, Mimus longicaudatus maranonicusM. l. longicaudatus, Saltator striatipectus peruvianusS. s. immaculatus, and Thamnophilus bernardi shumbaeT. b. bernardi (Oswald et al. 2017). The first taxon listed for each of these bird population pairs was sampled within the Marañón Valley, whereas the second listed taxon was located within the Tumbesian region, and these taxa were respectively allocated to population “0” and population “1” following the structure of the data files from Oswald et al. (2017).

We first conducted preliminary analyses on the individual population pair datasets independently to explore appropriate parameterization for co-demographic modeling. This was accomplished via ABC under the simple rejection algorithm (Csilléry et al. 2012), which allowed uncertainty measures to be easily obtained. Moreover, a single reference table could be repeatedly applied per joint-SFS across an entire comparative dataset; hence, we have only one set of simulations each for the lampreys and birds yet separate posteriors for every respective population pair. This exploratory step then can be efficiently reiterated many times over to discover broadly applicable prior distributions among constituent population pairs. Subsequently, hierarchical co-demographic estimates were performed (Xue and Hickerson 2017) with both hRF (Liaw and Wiener 2002) and hABC (Csilléry et al. 2012), following the inferential protocol for the corresponding simulation experiments. Notably, within this context of empirical analysis, our manner of informing prior distributions from a previous analysis on the data constitutes an empirical Bayes approach (Morris 1983; Chaudhuri et al. 2018), and our succession of parameterization options that update prior distributions resembles a simplified sequential ABC method (Sisson et al. 2007). To heuristically and visually assess goodness-of-fit of our hierarchical model, we used principal component analysis (PCA; Cornuet et al. 2010; Berlin et al. 2011; Sjödin et al. 2012) to execute both a predictive check sampled from the prior and a predictive check sampled from the posterior (Rubin and Stern 1994).

We present additional details of our empirical analyses within the Supporting Information.

Results

SIMULATION EXPERIMENTS OF HIERARCHICAL CO-DEMOGRAPHIC INFERENCE

Our first simulation experiment, which investigated using our generalized population pair approach against other migration scenarios within our hierarchical co-demographic model (Tables 2 and S5–S7), found that inference under our generalized model has substantially greater accuracy as the true model underlying the PODS increases in migration complexity. Conversely, employing an SI scenario for inference was better in cases whereby migration was minimal for the true model, whereas applying an IM scenario for inference was seemingly an intermediate between these two extremes. Unsurprisingly then, performance was optimized when deploying the correct model for inference (i.e., model generating the reference table was the same as that simulating the PODS; r = 0.652–0.781 for hRF inference of ζ). Similarly, slight model misspecifications were well tolerated, for example, estimation on PODs simulated under AM given the SI reference table (r = 0.684 for hRF inference of ζ), or estimation on PODs generated under strict SC versus SC with minor ancient phase migration (r = 0.623 and 0.685, respectively, for hRF inference of ζ given the reference table under the generalized model). However, major model misspecifications drastically reduced accuracy, such as the SI reference table unable to handle the six POD sets that were not simulated under SI or the two AM scenarios, or the IM reference table behaving more consistently but still suffering against PODs generated with more dynamic migration trajectories as well as SI PODs.

Table 2

Hierarchical co-demographic model misspecification of three reference tables against nine pseudo-observed datasets testing inference of ζ with random forest regression

GM reference tableIM reference tableSI reference table
rRMSErRMSErRMSE
GM0.6520.2130.6120.2310.4650.247
IM0.4900.2620.7080.2130.3440.285
SI0.5310.2610.7030.2260.7810.188
AM0.5810.2490.5210.2620.6840.225
AX0.5170.2490.4340.2680.5030.252
SC0.6230.2390.5570.2610.3880.277
SX0.6850.2120.5530.2430.3140.281
XM0.6400.2260.5260.2490.4150.260
XX0.5110.2410.4530.2490.3990.263
GM reference tableIM reference tableSI reference table
rRMSErRMSErRMSE
GM0.6520.2130.6120.2310.4650.247
IM0.4900.2620.7080.2130.3440.285
SI0.5310.2610.7030.2260.7810.188
AM0.5810.2490.5210.2620.6840.225
AX0.5170.2490.4340.2680.5030.252
SC0.6230.2390.5570.2610.3880.277
SX0.6850.2120.5530.2430.3140.281
XM0.6400.2260.5260.2490.4150.260
XX0.5110.2410.4530.2490.3990.263

Order of models follows the same order they were introduced in the Supporting Information; GM—population pair model with generalized parameterization of migration incorporated within hierarchical co-demographic model; IM—isolation with migration population pair model incorporated within hierarchical co-demographic model; SI—strict isolation population pair model incorporated within hierarchical co-demographic model; AM—ancient migration population pair model incorporated within hierarchical co-demographic model; AX—AM with minor assumption violation in recent phase migration; SC—secondary contact population pair model incorporated within hierarchical co-demographic model; SX—SC with minor assumption violation in ancient phase migration; XM—hierarchical co-demographic model that mixes IM, AM, and SC; XX—hierarchical co-demographic model that mixes GM, AX, and SX.

Table 2

Hierarchical co-demographic model misspecification of three reference tables against nine pseudo-observed datasets testing inference of ζ with random forest regression

GM reference tableIM reference tableSI reference table
rRMSErRMSErRMSE
GM0.6520.2130.6120.2310.4650.247
IM0.4900.2620.7080.2130.3440.285
SI0.5310.2610.7030.2260.7810.188
AM0.5810.2490.5210.2620.6840.225
AX0.5170.2490.4340.2680.5030.252
SC0.6230.2390.5570.2610.3880.277
SX0.6850.2120.5530.2430.3140.281
XM0.6400.2260.5260.2490.4150.260
XX0.5110.2410.4530.2490.3990.263
GM reference tableIM reference tableSI reference table
rRMSErRMSErRMSE
GM0.6520.2130.6120.2310.4650.247
IM0.4900.2620.7080.2130.3440.285
SI0.5310.2610.7030.2260.7810.188
AM0.5810.2490.5210.2620.6840.225
AX0.5170.2490.4340.2680.5030.252
SC0.6230.2390.5570.2610.3880.277
SX0.6850.2120.5530.2430.3140.281
XM0.6400.2260.5260.2490.4150.260
XX0.5110.2410.4530.2490.3990.263

Order of models follows the same order they were introduced in the Supporting Information; GM—population pair model with generalized parameterization of migration incorporated within hierarchical co-demographic model; IM—isolation with migration population pair model incorporated within hierarchical co-demographic model; SI—strict isolation population pair model incorporated within hierarchical co-demographic model; AM—ancient migration population pair model incorporated within hierarchical co-demographic model; AX—AM with minor assumption violation in recent phase migration; SC—secondary contact population pair model incorporated within hierarchical co-demographic model; SX—SC with minor assumption violation in ancient phase migration; XM—hierarchical co-demographic model that mixes IM, AM, and SC; XX—hierarchical co-demographic model that mixes GM, AX, and SX.

For the leave-one-out cross-validations under the empirical data specifications, our experiments strongly suggest that our weakly informed parameterization is a challenging modeling approach, whereas the priors informed through empirical Bayes led to much greater accuracy. Specifically, the SC parameterization achieved powerful inference under the lamprey specifications (Fig. S1; Tables 3 and S8–S10), and the fixed τs1 parameterization likewise offered substantial, albeit less so, gains in sensitivity for the bird analyses (Fig. S2; Tables 4 and S11–S13). Moreover, combining the SC parameterization with a fixed τs1 value yielded additional increase in performance under both the lamprey and the bird sampling configurations, illustrating that either setting M2 parameters to 0.0 or fixing τs1 to a single value separately boosts inferential resolution. Furthermore, we do not see any ostensible changes in the efficacy of the SC and fixed τs1 parameterization when comparing these results to our misspecification test that sampled PODs instead from the preceding reference table generated under the less informed parameterization, that is, under the lamprey SC parameterization (Tables S14-S16) and bird fixed τs1 parameterization (Tables S17–S19), respectively. However, despite making the SC and fixed τs1 parameterization equivalent, we witness quite greater accuracy for the lamprey-inspired cross-validation (e.g., r = 0.965 for hRF estimation of ζ given the 10 haploids data sampling). This is expected given that the lamprey-based simulations contained more population pairs and were much more informed with respect to the prior distributions of τ2 and ε parameters than for the bird-related simulations.

Table 3

Cross-validation of hierarchical co-demographic modeling under lamprey data specifications given 10 haploids for hRF and hABC mean point estimates

Weakly informed parameterizationSC parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.4910.2550.8240.1630.9650.076
hABC hyperparameter estimation/model selection
ζ0.4920.2540.7730.1820.9330.109
ζτ20.3020.1830.8520.1100.8680.102
ζM  1-0:10.7370.1960.4240.0910.6010.088
hABC parameter summary estimation
τs10.301291,7900.541237,101N/AN/A
Δ(τ1)0.34294,8720.87247,6140.92238,797
Ω(τ1)0.361111,8580.86754,3680.92336,098
x¯1)0.668131,9990.96247,9460.90243,535
Δ(τ2)0.59550,9420.87214,8280.92311,469
Ω(τ2)0.270104,3510.84521,9520.90113,866
x¯2)0.70975,3740.96017,1830.92612,609
Δ(M1-0:1)0.9030.2520.7290.3430.7330.295
Ω(M1-0:1)0.7881.4870.8490.2870.8510.267
x¯ (M1-0:1)0.9160.2440.9150.3730.9450.300
Weakly informed parameterizationSC parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.4910.2550.8240.1630.9650.076
hABC hyperparameter estimation/model selection
ζ0.4920.2540.7730.1820.9330.109
ζτ20.3020.1830.8520.1100.8680.102
ζM  1-0:10.7370.1960.4240.0910.6010.088
hABC parameter summary estimation
τs10.301291,7900.541237,101N/AN/A
Δ(τ1)0.34294,8720.87247,6140.92238,797
Ω(τ1)0.361111,8580.86754,3680.92336,098
x¯1)0.668131,9990.96247,9460.90243,535
Δ(τ2)0.59550,9420.87214,8280.92311,469
Ω(τ2)0.270104,3510.84521,9520.90113,866
x¯2)0.70975,3740.96017,1830.92612,609
Δ(M1-0:1)0.9030.2520.7290.3430.7330.295
Ω(M1-0:1)0.7881.4870.8490.2870.8510.267
x¯ (M1-0:1)0.9160.2440.9150.3730.9450.300
Table 3

Cross-validation of hierarchical co-demographic modeling under lamprey data specifications given 10 haploids for hRF and hABC mean point estimates

Weakly informed parameterizationSC parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.4910.2550.8240.1630.9650.076
hABC hyperparameter estimation/model selection
ζ0.4920.2540.7730.1820.9330.109
ζτ20.3020.1830.8520.1100.8680.102
ζM  1-0:10.7370.1960.4240.0910.6010.088
hABC parameter summary estimation
τs10.301291,7900.541237,101N/AN/A
Δ(τ1)0.34294,8720.87247,6140.92238,797
Ω(τ1)0.361111,8580.86754,3680.92336,098
x¯1)0.668131,9990.96247,9460.90243,535
Δ(τ2)0.59550,9420.87214,8280.92311,469
Ω(τ2)0.270104,3510.84521,9520.90113,866
x¯2)0.70975,3740.96017,1830.92612,609
Δ(M1-0:1)0.9030.2520.7290.3430.7330.295
Ω(M1-0:1)0.7881.4870.8490.2870.8510.267
x¯ (M1-0:1)0.9160.2440.9150.3730.9450.300
Weakly informed parameterizationSC parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.4910.2550.8240.1630.9650.076
hABC hyperparameter estimation/model selection
ζ0.4920.2540.7730.1820.9330.109
ζτ20.3020.1830.8520.1100.8680.102
ζM  1-0:10.7370.1960.4240.0910.6010.088
hABC parameter summary estimation
τs10.301291,7900.541237,101N/AN/A
Δ(τ1)0.34294,8720.87247,6140.92238,797
Ω(τ1)0.361111,8580.86754,3680.92336,098
x¯1)0.668131,9990.96247,9460.90243,535
Δ(τ2)0.59550,9420.87214,8280.92311,469
Ω(τ2)0.270104,3510.84521,9520.90113,866
x¯2)0.70975,3740.96017,1830.92612,609
Δ(M1-0:1)0.9030.2520.7290.3430.7330.295
Ω(M1-0:1)0.7881.4870.8490.2870.8510.267
x¯ (M1-0:1)0.9160.2440.9150.3730.9450.300
Table 4

Cross-validation of hierarchical co-demographic modeling under bird data specifications given no monomorphic sites nor normalization for hRF and hABC mean point estimates

Weakly informed parameterizationFixed τs1 parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.5360.2440.6730.2140.7360.199
hABC hyperparameter estimation/model selection
ζ0.4350.2590.6310.2360.6680.227
ζM  1-0:10.5510.2750.3350.2680.5760.240
ζM  1-1:00.6610.2380.6740.2220.7860.182
hABC parameter summary estimation
τs10.494725,994N/AN/AN/AN/A
Δ(τ1)0.563270,0600.721262,0220.542266,144
Ω(τ1)0.518313,4220.750284,5080.493281,559
x¯1)0.867334,9130.652287,5010.715290,100
Δ(M1-0:1)0.7440.1190.6600.1100.7300.119
Ω(M1-0:1)0.6100.3980.4410.4180.6910.351
x¯ (M1-0:1)0.7520.1230.6810.1110.7480.119
Δ(M1-1:0)0.8170.0440.8570.0420.8490.044
Ω(M1-1:0)0.7470.1640.7690.1740.8180.147
x¯ (M1-1:0)0.8170.0450.8590.0430.8720.044
Weakly informed parameterizationFixed τs1 parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.5360.2440.6730.2140.7360.199
hABC hyperparameter estimation/model selection
ζ0.4350.2590.6310.2360.6680.227
ζM  1-0:10.5510.2750.3350.2680.5760.240
ζM  1-1:00.6610.2380.6740.2220.7860.182
hABC parameter summary estimation
τs10.494725,994N/AN/AN/AN/A
Δ(τ1)0.563270,0600.721262,0220.542266,144
Ω(τ1)0.518313,4220.750284,5080.493281,559
x¯1)0.867334,9130.652287,5010.715290,100
Δ(M1-0:1)0.7440.1190.6600.1100.7300.119
Ω(M1-0:1)0.6100.3980.4410.4180.6910.351
x¯ (M1-0:1)0.7520.1230.6810.1110.7480.119
Δ(M1-1:0)0.8170.0440.8570.0420.8490.044
Ω(M1-1:0)0.7470.1640.7690.1740.8180.147
x¯ (M1-1:0)0.8170.0450.8590.0430.8720.044
Table 4

Cross-validation of hierarchical co-demographic modeling under bird data specifications given no monomorphic sites nor normalization for hRF and hABC mean point estimates

Weakly informed parameterizationFixed τs1 parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.5360.2440.6730.2140.7360.199
hABC hyperparameter estimation/model selection
ζ0.4350.2590.6310.2360.6680.227
ζM  1-0:10.5510.2750.3350.2680.5760.240
ζM  1-1:00.6610.2380.6740.2220.7860.182
hABC parameter summary estimation
τs10.494725,994N/AN/AN/AN/A
Δ(τ1)0.563270,0600.721262,0220.542266,144
Ω(τ1)0.518313,4220.750284,5080.493281,559
x¯1)0.867334,9130.652287,5010.715290,100
Δ(M1-0:1)0.7440.1190.6600.1100.7300.119
Ω(M1-0:1)0.6100.3980.4410.4180.6910.351
x¯ (M1-0:1)0.7520.1230.6810.1110.7480.119
Δ(M1-1:0)0.8170.0440.8570.0420.8490.044
Ω(M1-1:0)0.7470.1640.7690.1740.8180.147
x¯ (M1-1:0)0.8170.0450.8590.0430.8720.044
Weakly informed parameterizationFixed τs1 parameterizationSC and fixed τs1 parameterization
rRMSErRMSErRMSE
hRF hyperparameter estimation
ζ0.5360.2440.6730.2140.7360.199
hABC hyperparameter estimation/model selection
ζ0.4350.2590.6310.2360.6680.227
ζM  1-0:10.5510.2750.3350.2680.5760.240
ζM  1-1:00.6610.2380.6740.2220.7860.182
hABC parameter summary estimation
τs10.494725,994N/AN/AN/AN/A
Δ(τ1)0.563270,0600.721262,0220.542266,144
Ω(τ1)0.518313,4220.750284,5080.493281,559
x¯1)0.867334,9130.652287,5010.715290,100
Δ(M1-0:1)0.7440.1190.6600.1100.7300.119
Ω(M1-0:1)0.6100.3980.4410.4180.6910.351
x¯ (M1-0:1)0.7520.1230.6810.1110.7480.119
Δ(M1-1:0)0.8170.0440.8570.0420.8490.044
Ω(M1-1:0)0.7470.1640.7690.1740.8180.147
x¯ (M1-1:0)0.8170.0450.8590.0430.8720.044

Our simulation experiments demonstrated nearly identical sensitivities among the two lamprey sampling projections, suggesting that sampling more individuals throughout this range yields negligible benefit, if any. In contrast, the analyses based on the bird data show a surprising decrease in hABC power when monomorphic sites were incorporated. Notably, there was greater error in including monomorphic bins, which we coupled with the normalization of ajSFS bins, than normalizing the ajSFS containing only SNPs, corroborating that monomorphic sites specifically impacted hABC performance. However, using monomorphic sites slightly improved ζ inference with hRF (r = 0.725 with monomorphic sites; r = 0.717 with normalization only; r = 0.673 without either), which inherently weighs the importance of the various bins through its machine learning algorithm. Together, this indicates that although there is additional signal to be gained from sequence length, exploiting this information is not straightforward (e.g., must optimize weights between monomorphic and SNP bins, which can be several orders of magnitude different in number) and may only marginally improve accuracy.

Finally, it is strongly evident from the hRF confusion matrices (Tables S6, S9, S12, S15, and S18) and mean of the hABC hyperposterior distributions per true value (Tables S7, S10, S13, S16, and S19) across all the simulation experiments, as well as the leave-one-out cross-validation plots (Figs. S1 and S2), that the signal for hyperparameter estimation was substantially stronger when the true value of ζ was 1.000 (i.e., completely synchronous co-divergence times), whereas there was a bit of upward bias when ζ approached idiosyncrasy.

EMPIRICAL ANALYSES OF REPLICATE LAMPREY SPECIES PAIR

There was a range of 7870–11,899 SNPs produced for the eight lamprey joint-SFS vectors when down-projected to 20 haploids, with a minimal decrease to 6263–9580 SNPs after further down-projection to 10 haploids (Table S20). Individual ABC demographic analyses unveiled a universal pattern across all eight population pairs of ancient M2 estimates near zero in value (mode M2-0:1 = 0.004–0.020; mode M2-1:0 = 0.006–0.018), indicative of an SC scenario that followed a period of isolation (Tables 5 and S21) and hence our SC parameterization for the downstream hierarchical co-demographic model. Migration rates from the parasitic anadromous river populations of L. fluviatilis to nonparasitic freshwater brook populations of L. planeri remained minimal to present day, with mode estimates of less than 0.01 for five of the population pairs (mode M1-1:0 = 0.000–0.017). Notably, although this set of migration results would support fixing M1-1:0 to a value of zero in the SC parameterization as we did for the M2 parameters, we chose to allow M1-1:0 vary within a highly constrained range at lower values to remain more consistent with the original modeling from Rougemont et al. (2017) (Table S3). In contrast, migration rates in the opposite direction from the start of SC to present day were high, with M1-0:1 mode estimates >1.0 for all but one population pair and on the order of several haploid migrants per generation for half of the population pairs (mode M1-0:1 = 0.953–7.225).

Table 5

Lamprey individual population pair ABC mode point estimates

Connected population pairs (CPP)Disconnected population pairs (DPP)
AABETOIRRISBRECENODOSAU
τ1523,114558,859882,020819,962854,589884,503845,225861,784
τ2142,646121,057353,965365,931320,608314,055456,163305,911
M  1-0:12.9343.9597.2254.0811.3541.3340.9531.359
M  1-1:00.0030.0020.0170.0000.0000.0120.0050.017
M  2-0:10.0150.0130.0040.0060.0050.0190.0200.017
M  2-1:00.0070.0060.0070.0160.0080.0180.0130.018
ε  00.0930.1870.2990.3590.3070.4930.3070.420
ε  10.3950.3830.4600.5470.4660.5070.6030.399
N  0760,517737,342674,257809,991642,219619,047450,862607,578
N  112,24113,092457212,1595604559639225712
Connected population pairs (CPP)Disconnected population pairs (DPP)
AABETOIRRISBRECENODOSAU
τ1523,114558,859882,020819,962854,589884,503845,225861,784
τ2142,646121,057353,965365,931320,608314,055456,163305,911
M  1-0:12.9343.9597.2254.0811.3541.3340.9531.359
M  1-1:00.0030.0020.0170.0000.0000.0120.0050.017
M  2-0:10.0150.0130.0040.0060.0050.0190.0200.017
M  2-1:00.0070.0060.0070.0160.0080.0180.0130.018
ε  00.0930.1870.2990.3590.3070.4930.3070.420
ε  10.3950.3830.4600.5470.4660.5070.6030.399
N  0760,517737,342674,257809,991642,219619,047450,862607,578
N  112,24113,092457212,1595604559639225712
Table 5

Lamprey individual population pair ABC mode point estimates

Connected population pairs (CPP)Disconnected population pairs (DPP)
AABETOIRRISBRECENODOSAU
τ1523,114558,859882,020819,962854,589884,503845,225861,784
τ2142,646121,057353,965365,931320,608314,055456,163305,911
M  1-0:12.9343.9597.2254.0811.3541.3340.9531.359
M  1-1:00.0030.0020.0170.0000.0000.0120.0050.017
M  2-0:10.0150.0130.0040.0060.0050.0190.0200.017
M  2-1:00.0070.0060.0070.0160.0080.0180.0130.018
ε  00.0930.1870.2990.3590.3070.4930.3070.420
ε  10.3950.3830.4600.5470.4660.5070.6030.399
N  0760,517737,342674,257809,991642,219619,047450,862607,578
N  112,24113,092457212,1595604559639225712
Connected population pairs (CPP)Disconnected population pairs (DPP)
AABETOIRRISBRECENODOSAU
τ1523,114558,859882,020819,962854,589884,503845,225861,784
τ2142,646121,057353,965365,931320,608314,055456,163305,911
M  1-0:12.9343.9597.2254.0811.3541.3340.9531.359
M  1-1:00.0030.0020.0170.0000.0000.0120.0050.017
M  2-0:10.0150.0130.0040.0060.0050.0190.0200.017
M  2-1:00.0070.0060.0070.0160.0080.0180.0130.018
ε  00.0930.1870.2990.3590.3070.4930.3070.420
ε  10.3950.3830.4600.5470.4660.5070.6030.399
N  0760,517737,342674,257809,991642,219619,047450,862607,578
N  112,24113,092457212,1595604559639225712

Expansion magnitude estimates were low to moderate for L. fluviatilis populations (mode ε0 = 2.0 × to 10.8 ×) and consistently low for L. planeri populations (mode ε1 = 1.6 × to 2.4 ×), with the latter result motivating us to model twofold growth in the L. planeri populations (i.e., population “1”) among all population pairs for the SC parameterization. The ranges for the two historical event timings of divergence and migration/size change were moderate (mode τ1 = 523,114–884,503, with six of the eight pairs clumping ca. 850,000; mode τ2 = 121,057–456,163, with five of the eight pairs clumping ca. 350,000), making the degree to which times were in synchrony ambiguous based on these individual population pair analyses alone. Notably, τ1 and τ2 were highly correlated across population pairs (r = 0.868 given mode estimates), although this could largely be an effect of the τ2 prior being conditional on τ1. Nonetheless, this suggests that regardless of the actual timing values, which coalescent theory dictates are inherently scaled by NE and thus absolute time estimates can only be achieved by assuming a value for NE or including information about monomorphic sites while incorporating a mutation rate parameter, there is a degree of congruence in the temporal length of isolation throughout lamprey population pairs.

Our hierarchical co-demographic approach using hRF and hABC (i.e., posterior distributions for ζ, τs1, Δ(τ1), Ω(τ1), and x¯1); see Table S1 for descriptions of parameter symbols) supports a high degree of temporally synchronous co-divergence during the late Pleistocene (i.e., at τ1) of the freshwater brook populations from their anadromous river complements (Figs. 4 and S3; Tables 6 and S22). Additionally, our hierarchical analyses elucidated key questions regarding variability in SC timing as well as strength of SC co-migration. In fact, the collective SC times (i.e., at τ2) among the eight lamprey population pairs were inferred to be similarly synchronous based on posterior distributions for ζτ2, Δ(τ2), and Ω(τ2), a perhaps unexpected insight into the co-migration dynamics between the nonparasitic L. fluviatilis and parasitic L. planeri populations. Notably, the hierarchical model constrained the start of SC to coincide with potential size change, which could have contributed to this co-demographic signal. For co-migration intensities, ζM1-0:1, Δ(M1-0:1), and Ω(M1-0:1) posterior distributions were shifted toward idiosyncrasy with respect to the corresponding priors, representative of incongruent contemporary migration rates from L. fluviatilis freshwater localities to L. planeri anadramous habitats. Point estimates of Δ(M1-0:1) further imply wide variation of migration rates across population pairs, indicating a total range of at least four haploid migrants per generation, assuming the minimum total variability equals 2 × Δ. Moreover, our PCA-based predictive checks constructed against both simulations retained from the prior as well as re-simulations from the posterior, with subsequent projections of the empirical data, demonstrate that our hierarchical co-demographic model could reasonably generate the observed lamprey data (Fig. S4).

Lamprey hABC prior and posterior distributions. Prior histograms in light blue and posterior histograms in dark green of, from left to right and then top to bottom: ζ, Δ(τ1), Ω(τ1), τs1, ζτ2, Δ(τ2), Ω(τ2), x¯ (τ2), ζM1-0:1, Δ(M1-0:1), Ω(M1-0:1), and x¯ (M1-0:1). Distributions are shown for the SC parameterization as well as SC and fixed τs1 parameterization, given 10 haploids. For each distribution, the y-axis is in units of the probability mass, and the x-axis is in units of the corresponding hyperparameter or parameter summary. Vertical lines in green, blue, red, and dotted black represent mean, median, and mode point estimates as well as 50% credibility intervals, respectively.
Figure 4

Lamprey hABC prior and posterior distributions. Prior histograms in light blue and posterior histograms in dark green of, from left to right and then top to bottom: ζ, Δ(τ1), Ω(τ1), τs1, ζτ2, Δ(τ2), Ω(τ2), x¯2), ζM1-0:1, Δ(M1-0:1), Ω(M1-0:1), and x¯ (M1-0:1). Distributions are shown for the SC parameterization as well as SC and fixed τs1 parameterization, given 10 haploids. For each distribution, the y-axis is in units of the probability mass, and the x-axis is in units of the corresponding hyperparameter or parameter summary. Vertical lines in green, blue, red, and dotted black represent mean, median, and mode point estimates as well as 50% credibility intervals, respectively.

Table 6

Lamprey hierarchical co-demographic hRF as well as hABC mean point estimates and 50% credibility intervals given 10 haploids

SC parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.8410.894
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.7850.6251.0000.8610.7501.000
ζτ20.8570.7501.0000.8980.8751.000
ζM  1-0:10.1460.1250.1250.1370.1250.125
hABC parameter summary estimation
τs1452,288357,693510,347N/AN/AN/A
Δ(τ1)85,6250150,08862,5680104,898
Ω(τ1)64,4780112,09655,4390101,084
x¯1)469,010417,493516,582419,315361,117460,631
Δ(τ2)37,36116,68653,53029,55912,83340,861
Ω(τ2)23,026317938,28719,819224434,052
x¯2)151,655134,974165,902137,221121,016148,784
Δ(M1-0:1)2.1001.7612.4052.1551.8162.464
Ω(M1-0:1)1.9471.5422.3632.0001.6012.401
x¯ (M1-0:1)3.7723.4774.0893.7543.4424.063
SC parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.8410.894
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.7850.6251.0000.8610.7501.000
ζτ20.8570.7501.0000.8980.8751.000
ζM  1-0:10.1460.1250.1250.1370.1250.125
hABC parameter summary estimation
τs1452,288357,693510,347N/AN/AN/A
Δ(τ1)85,6250150,08862,5680104,898
Ω(τ1)64,4780112,09655,4390101,084
x¯1)469,010417,493516,582419,315361,117460,631
Δ(τ2)37,36116,68653,53029,55912,83340,861
Ω(τ2)23,026317938,28719,819224434,052
x¯2)151,655134,974165,902137,221121,016148,784
Δ(M1-0:1)2.1001.7612.4052.1551.8162.464
Ω(M1-0:1)1.9471.5422.3632.0001.6012.401
x¯ (M1-0:1)3.7723.4774.0893.7543.4424.063
Table 6

Lamprey hierarchical co-demographic hRF as well as hABC mean point estimates and 50% credibility intervals given 10 haploids

SC parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.8410.894
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.7850.6251.0000.8610.7501.000
ζτ20.8570.7501.0000.8980.8751.000
ζM  1-0:10.1460.1250.1250.1370.1250.125
hABC parameter summary estimation
τs1452,288357,693510,347N/AN/AN/A
Δ(τ1)85,6250150,08862,5680104,898
Ω(τ1)64,4780112,09655,4390101,084
x¯1)469,010417,493516,582419,315361,117460,631
Δ(τ2)37,36116,68653,53029,55912,83340,861
Ω(τ2)23,026317938,28719,819224434,052
x¯2)151,655134,974165,902137,221121,016148,784
Δ(M1-0:1)2.1001.7612.4052.1551.8162.464
Ω(M1-0:1)1.9471.5422.3632.0001.6012.401
x¯ (M1-0:1)3.7723.4774.0893.7543.4424.063
SC parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.8410.894
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.7850.6251.0000.8610.7501.000
ζτ20.8570.7501.0000.8980.8751.000
ζM  1-0:10.1460.1250.1250.1370.1250.125
hABC parameter summary estimation
τs1452,288357,693510,347N/AN/AN/A
Δ(τ1)85,6250150,08862,5680104,898
Ω(τ1)64,4780112,09655,4390101,084
x¯1)469,010417,493516,582419,315361,117460,631
Δ(τ2)37,36116,68653,53029,55912,83340,861
Ω(τ2)23,026317938,28719,819224434,052
x¯2)151,655134,974165,902137,221121,016148,784
Δ(M1-0:1)2.1001.7612.4052.1551.8162.464
Ω(M1-0:1)1.9471.5422.3632.0001.6012.401
x¯ (M1-0:1)3.7723.4774.0893.7543.4424.063

EMPIRICAL ANALYSES OF CO-DISTRIBUTED BIRD TAXON PAIRS

In the case of the six dry forest bird joint-SFS vectors, there was an imputed total of 260,637–849,951 sites given the extracted information about invariants as well as number of SNPs ranging from 5446 to 12,455 (Table S23). Unlike for the lampreys, individual-level ABC estimates for migration parameters were generally quite low, with 18 of the 24 mode M estimates less than 0.01, or one effective haploid migrant every 100 generations (maximum mode M = 1.533; Tables 7 and S24). In fact, both ancient phase M2 migration parameters were approximately zero in value (i.e., <0.01) for all six bird population pairs, whereas both recent phase M1 migration parameters had much more variance. Notably, although this supports an SC parameterization for the downstream hierarchical co-demographic model, to remain more consistent with the original modeling from Oswald et al. (2017), we chose to continue allowing the M2 parameters vary within a highly constrained range at lower values until the final parameterization option, thus we first explored the effect of solely fixing τs1 (Table S4).

Table 7

Bird individual population pair ABC mode point estimates

ArremonCampylorhynchusMelanopareiaMimusSaltatorThamnophilus
τ1553,9781,743,3721,210,602264,790627,849479,811
τ2188,695869,525700,885135,352378,118172,539
M  1-0:10.0051.3690.0031.5330.0180.448
M  1-1:00.0130.0030.0010.0050.0030.446
M  2-0:10.0060.0020.0050.0040.0060.001
M  2-1:00.0000.0010.0010.0000.0010.000
ε  00.0910.0630.0680.0820.0850.052
ε  10.0720.0650.0710.0620.0910.234
N  0444,477406,780357,884453,885460,533273,187
N  1826,113652,565940,162556,931615,656601,824
U1.324 × 10–91.609 × 10–93.869 × 10–91.317 × 10–91.322 × 10–91.131 × 10–9
ArremonCampylorhynchusMelanopareiaMimusSaltatorThamnophilus
τ1553,9781,743,3721,210,602264,790627,849479,811
τ2188,695869,525700,885135,352378,118172,539
M  1-0:10.0051.3690.0031.5330.0180.448
M  1-1:00.0130.0030.0010.0050.0030.446
M  2-0:10.0060.0020.0050.0040.0060.001
M  2-1:00.0000.0010.0010.0000.0010.000
ε  00.0910.0630.0680.0820.0850.052
ε  10.0720.0650.0710.0620.0910.234
N  0444,477406,780357,884453,885460,533273,187
N  1826,113652,565940,162556,931615,656601,824
U1.324 × 10–91.609 × 10–93.869 × 10–91.317 × 10–91.322 × 10–91.131 × 10–9
Table 7

Bird individual population pair ABC mode point estimates

ArremonCampylorhynchusMelanopareiaMimusSaltatorThamnophilus
τ1553,9781,743,3721,210,602264,790627,849479,811
τ2188,695869,525700,885135,352378,118172,539
M  1-0:10.0051.3690.0031.5330.0180.448
M  1-1:00.0130.0030.0010.0050.0030.446
M  2-0:10.0060.0020.0050.0040.0060.001
M  2-1:00.0000.0010.0010.0000.0010.000
ε  00.0910.0630.0680.0820.0850.052
ε  10.0720.0650.0710.0620.0910.234
N  0444,477406,780357,884453,885460,533273,187
N  1826,113652,565940,162556,931615,656601,824
U1.324 × 10–91.609 × 10–93.869 × 10–91.317 × 10–91.322 × 10–91.131 × 10–9
ArremonCampylorhynchusMelanopareiaMimusSaltatorThamnophilus
τ1553,9781,743,3721,210,602264,790627,849479,811
τ2188,695869,525700,885135,352378,118172,539
M  1-0:10.0051.3690.0031.5330.0180.448
M  1-1:00.0130.0030.0010.0050.0030.446
M  2-0:10.0060.0020.0050.0040.0060.001
M  2-1:00.0000.0010.0010.0000.0010.000
ε  00.0910.0630.0680.0820.0850.052
ε  10.0720.0650.0710.0620.0910.234
N  0444,477406,780357,884453,885460,533273,187
N  1826,113652,565940,162556,931615,656601,824
U1.324 × 10–91.609 × 10–93.869 × 10–91.317 × 10–91.322 × 10–91.131 × 10–9

In contrast to migration, inference of size change magnitude was consistently moderate to high (mode ε0 = 10.9 × to 19.2 × expansion; mode ε1 = 4.3 × to 16.1 × expansion). For the two historical event timings, there was a much greater dispersion of estimates among population pairs (mode τ1 = 264,790–1,743,372; median τ2 = 135,352–869,525) in comparison to the lamprey inferences, thus providing a clearer test case whereby the expectation is asynchrony. Interestingly, however, τ1 and τ2 were still highly correlated (r = 0.976 given mode estimates); as with the lamprey analyses, this may be due to the more recent time being inherently conditional on the earlier time, yet implies that the temporal length of isolation is shared across the bird population pairs.

Corroborating the individual-based approach, hRF estimates and hABC posterior distributions for ζ, Δ(τ1), and Ω(τ1) (see Table S1 for descriptions of parameter symbols) suggest temporal idiosyncrasy in co-divergence deep within the past between the Marañón Valley and Tumbesian region (Fig. 5; Tables 8 and S25). Although these inferences allow some degree of intermediate synchrony, there is little room for interpretation of a fully shared, singular co-divergence event. Furthermore, we found similar asynchronous patterns for present-day migration rates in both directions, with the post hoc hyperparameter and parameter summary estimates consistent with a range of migration intensities from nearly nonexistent to moderate among the six taxon pairs. Together, these results from our hierarchical analyses indicate high variability in co-demographic dynamics, portraying a scenario of many independent population pair trajectories. Furthermore, our PCA-based prior and posterior predictive checks exhibit reasonable overall goodness-of-fit for our parameterization options. Notably, although the posterior predictive check for the fixed τs1 parameterization suggests a small degree of model mismatch, this could be due to an insufficient number of simulations for adequately sampling the hyperparameter space, which is implied by the satisfactory model fit for the more informed SC and fixed τs1 parameterization option that succeeded and is in fact nested within this fixed τs1 parameterization (Fig. S5).

Bird hABC prior and posterior distributions. Prior histograms in light blue and posterior histograms in dark green of, from left to right and then top to bottom: ζ, Δ(τ1), Ω(τ1), τs1, ζM1-0:1, Δ(M1-0:1), Ω(M1-0:1), x¯ (M1-0:1), ζM1-1:0, Δ(M1-1:0), Ω(M1-1:0), and x¯ (M1-1:0). Distributions are shown for the fixed τs1 parameterization as well as SC and fixed τs1 parameterization. For each distribution, the y-axis is in units of the probability mass, and the x-axis is in units of the corresponding hyperparameter or parameter summary. Vertical lines in green, blue, red, and dotted black represent mean, median, and mode point estimates as well as 50% credibility intervals, respectively.
Figure 5

Bird hABC prior and posterior distributions. Prior histograms in light blue and posterior histograms in dark green of, from left to right and then top to bottom: ζ, Δ(τ1), Ω(τ1), τs1, ζM1-0:1, Δ(M1-0:1), Ω(M1-0:1), x¯ (M1-0:1), ζM1-1:0, Δ(M1-1:0), Ω(M1-1:0), and x¯ (M1-1:0). Distributions are shown for the fixed τs1 parameterization as well as SC and fixed τs1 parameterization. For each distribution, the y-axis is in units of the probability mass, and the x-axis is in units of the corresponding hyperparameter or parameter summary. Vertical lines in green, blue, red, and dotted black represent mean, median, and mode point estimates as well as 50% credibility intervals, respectively.

Table 8

Bird hierarchical co-demographic hRF as well as hABC mean point estimates and 50% credibility intervals

Fixed τs1 parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.5350.547
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.6340.3330.8330.6350.5000.833
ζτ20.3470.1250.6670.3760.1670.667
ζM  1-0:10.2110.1250.1250.2380.1670.167
hABC parameter summary estimation
Δ(τ1)372,473136,303587,086364,315130,951573,636
Ω(τ1)379,10195,619651,958371,34387,854601,943
x¯1)1,238,1511,089,9751,366,3311,221,7451,043,5831,332,173
Δ(τ2)0.2410.1260.3330.2350.1170.335
Ω(τ2)0.7570.3931.0950.7470.3441.108
x¯2)0.2490.1310.3420.2420.1210.341
Δ(M1-0:1)0.1310.0870.1690.1380.0940.174
Ω(M1-0:1)0.2850.1640.3840.2820.1700.376
x¯ (M1-0:1)0.1420.0970.1810.1510.1030.191
Fixed τs1 parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.5350.547
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.6340.3330.8330.6350.5000.833
ζτ20.3470.1250.6670.3760.1670.667
ζM  1-0:10.2110.1250.1250.2380.1670.167
hABC parameter summary estimation
Δ(τ1)372,473136,303587,086364,315130,951573,636
Ω(τ1)379,10195,619651,958371,34387,854601,943
x¯1)1,238,1511,089,9751,366,3311,221,7451,043,5831,332,173
Δ(τ2)0.2410.1260.3330.2350.1170.335
Ω(τ2)0.7570.3931.0950.7470.3441.108
x¯2)0.2490.1310.3420.2420.1210.341
Δ(M1-0:1)0.1310.0870.1690.1380.0940.174
Ω(M1-0:1)0.2850.1640.3840.2820.1700.376
x¯ (M1-0:1)0.1420.0970.1810.1510.1030.191
Table 8

Bird hierarchical co-demographic hRF as well as hABC mean point estimates and 50% credibility intervals

Fixed τs1 parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.5350.547
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.6340.3330.8330.6350.5000.833
ζτ20.3470.1250.6670.3760.1670.667
ζM  1-0:10.2110.1250.1250.2380.1670.167
hABC parameter summary estimation
Δ(τ1)372,473136,303587,086364,315130,951573,636
Ω(τ1)379,10195,619651,958371,34387,854601,943
x¯1)1,238,1511,089,9751,366,3311,221,7451,043,5831,332,173
Δ(τ2)0.2410.1260.3330.2350.1170.335
Ω(τ2)0.7570.3931.0950.7470.3441.108
x¯2)0.2490.1310.3420.2420.1210.341
Δ(M1-0:1)0.1310.0870.1690.1380.0940.174
Ω(M1-0:1)0.2850.1640.3840.2820.1700.376
x¯ (M1-0:1)0.1420.0970.1810.1510.1030.191
Fixed τs1 parameterizationSC and fixed τs1 parameterization
hRF hyperparameter estimation
ζ0.5350.547
Mean point estimate50% credibility intervalsMean point estimate50% credibility intervals
hABC hyperparameter estimation/model selection
ζ0.6340.3330.8330.6350.5000.833
ζτ20.3470.1250.6670.3760.1670.667
ζM  1-0:10.2110.1250.1250.2380.1670.167
hABC parameter summary estimation
Δ(τ1)372,473136,303587,086364,315130,951573,636
Ω(τ1)379,10195,619651,958371,34387,854601,943
x¯1)1,238,1511,089,9751,366,3311,221,7451,043,5831,332,173
Δ(τ2)0.2410.1260.3330.2350.1170.335
Ω(τ2)0.7570.3931.0950.7470.3441.108
x¯2)0.2490.1310.3420.2420.1210.341
Δ(M1-0:1)0.1310.0870.1690.1380.0940.174
Ω(M1-0:1)0.2850.1640.3840.2820.1700.376
x¯ (M1-0:1)0.1420.0970.1810.1510.1030.191

Discussion

COUPLING HIERARCHICAL CO-DEMOGRAPHIC MODELING OF POPULATION PAIRS WITH THE ajSFS

Bayesian inference of demographic history within the context of population genetics and phylogeography, especially in the genomic era, is often accomplished via approximation through Monte Carlo simulations (Beaumont et al. 2002; Beaumont and Rannala 2004; Beaumont 2010; Hoban et al. 2012; Hoban 2014). This allows flexibility to tractably test complex demographic scenarios assuming a large quantity of data, such as with our hierarchical co-demographic model using the ajSFS. However, as a central tenet of Bayesian statistics, it is best practice to incorporate sensible priors to inform a statistical model (Rannala 2015), which may be derived through empirical Bayes and/or converged toward with sequential ABC. Although model accuracy is then conditional on the assumptions imposed from that prior information, a model that is instead too permissive can become problematic due to identifiability issues and expensive efforts searching probability space. This is an essential consideration for our co-demographic model, which is rich in nuisance parameters from the individual population pairs. Indeed, this concern was highlighted in our weakly informed parameterization options, which may have had parameter draw combinations across different hyperparameter values resulting in similar ajSFS vectors, and/or a large proportion of wasted simulations from improper sampling of parameter space. Fortunately, using more efficient prior distributions that enforced an SC scenario and/or fixed τs1 resulted in drastic improvements in accuracy while maintaining utility of the model as there was a high degree of flexibility still permitted and robustness to misspecification of fixed parameters. Importantly, we illustrated through our preliminary empirical analyses that our generalized population pair model was able to efficiently identify SC as a mutually applicable migration scenario. However, in cases whereby co-demography cannot be compressed into a simplified scenario such as SC (e.g., when migration trajectories are dynamic or the demographic syndrome varies among constituent population pairs), we also demonstrated that incorporating our generalized approach within a hierarchical model is a valid choice. In such instances, parameter space can still be reduced through limiting certain taxon-specific prior distributions (e.g., for ε parameters), and/or likelihood space can be more thoroughly explored through an increase in simulations.

With respect to potential sensitivity to sampling configuration, there was negligible change in accuracy as well as actual estimates between the alternative lamprey sampling projections. This is perhaps unsurprising given similar observations with the aSFS (Xue and Hickerson 2015) and multidimensional SFS (Smith et al. 2017). Regardless, this constitutes a significant practical benefit to potential users because ajSFS vector size scales very quickly relative to haploid sampling, which can incur intense computational cost especially with a large number of simulations. In other words, sampling at a lower number of individuals within this range ostensibly has little direct effect on the statistical power or actual results from this type of analysis, at least assuming the specifications of the lamprey dataset. Importantly, however, sampling more individuals and subsequently down-projecting could perhaps indirectly increase accuracy by producing higher fidelity empirical data vectors (i.e., better representative of the sampled population), due to integrating over subsamples from a more comprehensive sampling scheme as well as minimizing the impact of errant individual calls. Nonetheless, our in silico experiments show promise for scaling down sampling efforts and sequencing costs when designing a project of this nature. However, because this study may not universally apply to other potential datasets, researchers ought to run their own cross-validation analyses to precisely assess optimal sampling. Conversely, more attention should be devoted to sampling more population pairs, as implied when comparing leave-one-out RMSEs given the lamprey dataset with eight pairs versus the bird dataset with six pairs, and as previously demonstrated with related comparative methods (Chan et al. 2014; Xue and Hickerson 2015, 2017).

In contrast, our effort to employ sequence length information under the bird data specifications did not yield substantial improvement. Performance actually worsened for hABC inference, and although hRF estimation indeed benefitted from incorporating monomorphic bins, suggesting that some information can be extracted through optimized weighing of bins, the gain was slight. In fact, hRF accuracy increased much more from normalizing bins than using invariant sites. Hence, rather than incorporating the total set of sites to calibrate absolute estimates, there is perhaps greater value in using informed prior distributions, which can be derived from a separate analysis better suited for exploiting sequence data.

Our leave-one-out experiments demonstrated that useful co-demographic hyperparameters, specifically the post hoc conversion of taxon-specific parameter draws into ζT estimates, can be extracted even when not initially used to construct the hierarchical model. Although hyperprior distributions would be extremely skewed under such post hoc conversion as a consequence of random parameter draws among independent units without hyperparameter governance, this strategy provides convenient exploration of co-demographic results without additional expensive computation. Post hoc converted hyperposterior distributions may also be used to guide and inform subsequent hierarchical co-demographic modeling efforts (e.g., fixing τ2 to the same value for all population pairs based on a result of ζτ2 = 1.000), akin to our empirical Bayes tactic of fixing τs1. Additionally, there was evident power to infer the parameter summary Δ (Table S1) during the simulation exercises. Considering its straightforward co-demographic interpretation, that is, the average difference per population pair from the median value, and lack of assumption in distributions such as normality, this suggests that Δ estimation will provide great utility for future co-demographic analyses.

We introduced an approach that significantly simplifies modeling multiple population pairs by ignoring shared ancestry and assuming exchangeability, yet has minimal information loss with respect to co-demography. Specifically, our findings validate how the ajSFS can be exploited by a hierarchical co-demographic model of population pairs for direct estimation of congruence in parameter values. Importantly, we showcase a wide range of sampling specification, including number of individuals from 8 to 20 haploids, inclusion versus omission of monomorphic sites, and amount of SNPs. Moreover, the organismal structure of the two biological systems is of two extremes, whereby multiple replicates of a single species pair are dispersed throughout different localities and may possibly have a shared recent history in the lamprey case, whereas highly divergent population pairs are co-distributed across a shared barrier and have less a priori expectations of congruence for the birds. As a result, this development provides a broadly applicable method that can beneficially supplement analyses on individual datasets.

COMPARATIVE PHYLOGEOGRAPHIC HISTORY OF RIVER AND BROOK LAMPREYS IN WESTERN FRANCE

Our hierarchical co-demographic model elucidated a strong signal of synchronous co-divergence for the lamprey population pairs, which challenges the suggestion of idiosyncrasy by the neighbor-joining tree from Rougemont et al. (2017). This latter, less parsimonious scenario implies independent regional occurrences of colonization into different freshwater drainages at asynchronous times with complex inter-locality gene flow to maintain each of the two constituent species, also known as the transporter hypothesis (Rougemont et al. 2017). Instead, our results are consistent with either (1) a continental-scale event simultaneously yet separately splitting all of the population pairs, which were already spatially dispersed previously into eight ancestral populations, or (2) spatial expansion into the present total geographic distribution from a singular localized isolation of L. fluviatilis and L. planeri, which perhaps resided at one of the sites that presently harbors one of the population pairs. In the latter case, all the current-day species pair replicates coalesce into this same divergence. Importantly, our analyses do not directly discriminate these two possibilities, although the general correlation between genetic diversity and directionality, specifically an increase in number of SNPs toward more Southern latitudes with a stark jump at the halfway point, corroborates an intermediate scenario of Northern colonization from established population pairs spanning several of the Southern sampling sites (presumably OIR, CEN, ODO, and SAU). This is also consistent with the trajectories of many other organisms that persisted during the last glacial maximum (LGM) within refugia located in Southern Europe and subsequently expanded into the North following global warming ca. 20 KYA (Hewitt 2000). Our hABC approach also indicates a temporally shared collective start of SC, which likely happened before a putative post-LGM Northern expansion given the τ2 and x¯2) estimates. In Southern Europe, another incident of wide impact could have concurrently initiated collective SC across distinct species pair replicates that may have been established within refugia, as with the simultaneous co-divergence. However, the inference of highly variable co-migration magnitudes suggests that environmental differences existing among the sampling locations elicited a range of migration histories, signifying that Northbound spatial expansion coincided with a change in gene flow for colonizing lamprey population pairs. Notably, our results of synchrony within co-divergence and collective SC timings along with idiosyncratic co-migration support that variability in co-migration intensities rather than durations likely caused the heterogeneity in population pair genomic differentiation observed by Rougemont et al. (2017), addressing an open question that they posed.

The question of whether colonization or vicariance incited the species pair split may be resolved by our individual population pair estimates. Namely, the ubiquitous combination of considerable migration from anadromous L. fluviatilis to freshwater L. planeri with extremely low reverse-direction migration, as well as the order of magnitude larger NE for L. fluviatilis populations over corresponding L. planeri, may point to one or more peripatric founder events as the likely beginning for the nonparasitic brook lamprey populations (Mayr 1982). On a related note, the generally increased strengths of expansion in the L. fluviatilis samples (≈2× to 11×) compared to the weak expansions of the L. planeri counterparts (≈2×) could also perhaps be further consistent with patterns left by allele surfing (Travis et al. 2007), such that spatial expansion originating from a geographically constricted distribution in the source resulted in an overall signal of demographic NE expansion for the founding river lamprey, whereas the paired brook lamprey retained this at a lower intensity due to being relegated to the front of the wave.

In agreement with the population pair membership described in Rougemont et al. (2017), our individual population pair ABC inferences recovered appreciably stronger unidirectional migration rates in connected population pairs (CPP: AA, BET, OIR, and RIS; mode M1-0:1 > 2.934 haploids/generation) versus disconnected population pairs (DPP: BRE, CEN, ODO, and SAU; mode M1-1:0 < 1.360 haploids/generation). Notably, the CPP migration intensities potentially encroach on panmixia (Wright 1931; Whitlock and McCauley 1999; Beerli and Palczewski 2010), which is to be expected considering the associated environmental connectivity. However, in contrast to the initial lamprey study, we did not find support for either an AM or IM scenario across DPP. To reconcile this, it is possible that both approaches may be detecting different aspects of the same complex co-demographic trajectory. For example, a scenario of oscillation between isolation and contact could cause an SC, AM, or even IM inference depending on the temporal scale captured. In fact, Rougemont et al. (2017) speculated upon this very case for their AM model choice.

Altogether, our results demonstrate a shared biogeographic history that influenced congruence in co-demographic responses yet with specific characteristics, potentially including those related to environmental connectivity (i.e., CPP vs. DPP), still shaping migration differences throughout localities.

COMPARATIVE PHYLOGEOGRAPHIC HISTORY OF CO-DISTRIBUTED PASSERINES THROUGHOUT SEASONALLY DRY FORESTS OF THE MARAÑÓN VALLEY AND TUMBESIAN REGION IN PERU

Unlike our treatment of the lamprey data, we detected asynchronous patterns across all facets of bird population pair co-demography, with both our hierarchical endeavor as well as individual analyses indicating variability across co-divergence times, migration rates, and size change magnitudes. Consistent with the initial investigation (Oswald et al. 2017), we rejected a single mechanism causing co-divergence, and in fact our findings suggest more widespread idiosyncrasy than those authors first discovered. Specifically, our wide range of τ2 estimates suggests highly variable temporal dynamics for postdivergence migration, addressing an open question posed by Oswald et al. (2017). Moreover, our various estimates for the M parameters demonstrate a spectrum of admixture syndromes. For example, we found convergence toward SI in Melanopareia versus a resemblance to SC in the other five bird taxa. Of those five population pairs, Campylorhynchus, Mimus, and Saltator displayed stronger unidirectional intensity from the Marañón Valley to the Tumbesian region, whereas Arremon had stronger gene flow in the opposite direction, and Thamnophilus exhibited a more symmetric balance. Opposed to the lamprey case, this variation in directionality would imply that neither direction of postdivergence migration was generally more favorable than the other, which was also posited in the initial study. Notably, our results indicate that the greatest exchange of migrants regardless of direction was in Mimus, Campylorhynchus, and Thamnophilus, with the latter two poorly fitting a two cluster model under a population structure method according to Oswald et al. (2017). Additionally, Oswald et al. (2017) stated that Thamnophilus was the only population pair in the dataset previously shown to be in SC within the Marañón Valley. Curiously, Thamnophilus also was estimated to have the strongest Marañón Valley expansion magnitude coupled with the weakest Tumbesian expansion magnitude across the bird pairs.

Such support for idiosyncrasy would then be consistent with independent colonizations occurring throughout time, with exact timing for each individual population pair perhaps dictated by species-specific dispersal ability (Smith et al. 2014) or differences in pre-existing competitors (Hardin 1960). Pleistocene climatic cycles may have also acted on these differences to mediate wide variation in co-divergence times as well as co-migration behavior. Likewise, less stable conditions within the Tumbesian region due to El Niño events may have the consequence of such observations too, as suggested by Oswald et al. (2017). Regardless, we are in strong agreement with the original investigation that vicariant allopatry from the Andean uplift forming a barrier within a wider distribution is a very unlikely explanation due to our timing of isolation being too recent and asynchronous.

Our hierarchical approach concurring with the original findings of idiosyncratic co-demography validates that our model experiences minimal information loss from its simplification of population pairs as exchangeable units, which discards information about identity. Moreover, our generalized population pair model was able to extend the conclusions of Oswald et al. (2017), discovering even greater independence among evolutionary trajectories in the historical assembly of these co-distributed passerine taxon pairs.

Conclusion

Investigating co-demographic histories among population pairs is a central aim of comparative phylogeography that focuses on estimating the geographic distributions and temporal variability of multiple sister species divergences (Taberlet et al. 1998; Avise 2000; Arbogast and Kenagy 2001; Hickerson et al. 2010; Barratt et al. 2018; Peñalba et al. 2018; Rincon-Sandoval et al. 2019). Further comparative insight could be attained, although by exploring congruence in other demographic parameters such as for migration and size change, as these might reflect other important trait-based factors underlying multi-taxa responses to biogeographic and climatic events (Papadopoulou and Knowles 2016). For example, similar migration and expansion behaviors across populations coupled with synchronous co-divergence times could reflect pervasive effects from a common vicariant event with little contribution from species-specific traits, whereas asynchronous migration or expansion trajectories mixed with synchronous co-divergence may suggest that species-specific traits effected differential postdivergence responses from a shared allopatric event. Such inquiries can elucidate other contexts beyond co-vicariance (i.e., allopatry), including heterogeneous environmental gradients that are perhaps facilitated by climate (i.e., parapatry) (Riddle et al. 2000; Hewitt 2001; Moritz et al. 2009), founder/colonization events (i.e., peripatry) (Hickerson and Meyer 2008; Leache et al. 2009), and widespread, commonly experienced biotic interactions such as symbiosis or predation (Stone et al. 2012; Satler and Carstens 2016; Garrick et al. 2017). A classical strategy for addressing questions of this nature is to conduct individual inferential analyses on the constituent taxa within a comparative dataset, the results of which are concatenated to heuristically determine congruence. Although straightforward, this approach deploys an undefined and therefore nonreproducible protocol for interpreting conclusions as well as ignores the uncertainty underlying each single estimate.

To this end, we presented here a hierarchical model that assesses several aspects of co-demography among independent population pairs, including synchrony in divergence and SC timings as well as congruence in migration rates. Our method is targeted for increasingly available reduced representation SNP data (e.g., RAD-seq) through its application of the ajSFS, complementing other recent advances that can likewise treat genomic-scale data from multiple taxon pairs, yet within a multi-level likelihood framework (Bunnefeld et al. 2018; Oaks 2018). However, our approach is differentiated by its flexible co-divergence model, which is designed to easily scale in number of population pairs and can accommodate complex population pair demography. Furthermore, our unique modeling scheme joins population pairs within a unified analysis without assuming shared ancestry among them, and detects shared patterns in focal parameter values while integrating out unknown population-specific nuisance parameters, for example, differences in effective population size. This allows ecological assembly-level questions to be explicitly addressed, measurements of uncertainty that consider stochastic and inter pair variance, and increased accuracy from pooling datasets with shared parameterizations. This methodology could be applied to many classic systems and questions, with future empirical implementations that could include regions such as Isthmus of Panama, Andes uplift, and Hawaiian islands chain, as well as organisms such as African cichlids and Heliconius butterflies. With the increase of studies publishing genomic-scale data for nonmodel population pairs on a comparative scale (Roux et al. 2016; Harvey et al. 2017; Rougeux et al. 2017; Stange et al. 2017; Barratt et al. 2018; Peñalba et al. 2018; Rincon-Sandoval et al. 2019), we anticipate this to be a useful and relevant addition to the suite of tools used within population genetics, biogeography, evolutionary biology, and community ecology.

AUTHOR CONTRIBUTIONS

ATX performed the research. ATX and MJH designed the study and wrote the manuscript.

ACKNOWLEDGMENTS

We would like to thank E. Ma for assistance in designing the model diagram figures; E. A. Myers and I. Prates for access to computational resources; A. D. Kern for thoughtful comments on the manuscript and lab support; A. C. Carnaval, S. Boissinot, and B. M. Henn for thoughtful comments on the manuscript; and A. Siepel for lab support. This work was supported by grants from National Science Foundation (DEB-1253710 to MJH; DEB-1343578 to A. C. Carnaval, MJH, and K. C. McDonald; DEB-1457232 to M. K. Fujita), National Institutes of Health (1R15GM096267-01 to S. Boissinot and MJH), FAPESP (BIOTA, 2013/50297-0 to A. C. Carnaval, MJH, and K. C. McDonald), and NASA through the Dimensions of Biodiversity Program. This work would not have been possible without help from the City University of New York High Performance Computing Center, with support from the National Science Foundation (CNS-0855217 and CNS-0958379).

DATA ARCHIVING

Scripts and result files have been deposited into Dryad (https://doi.org/10.5061/dryad.xksn02vbw).

LITERATURE CITED

Alvarado Bremer
,
J. R.
,
J.
Viñas
,
J.
Mejuto
,
B.
Ely
, and
C.
Pla
.
2005
.
Comparative phylogeography of Atlantic bluefin tuna and swordfish: the combined effects of vicariance, secondary contact, introgression, and population expansion on the regional phylogenies of two highly migratory pelagic fishes
.
Mol. Phylogenet. Evol.
 
36
:
169
187
.

Arbogast
,
B. S.
, and
G. J.
Kenagy
.
2001
.
Comparative phylogeography as an integrative approach to historical biogeography
.
J. Biogeogr.
 
28
:
819
825
.

Avise
,
J. C.
 
2000
.
Phylogeography: the history and formation of species
.
Harvard Univ. Press
,
Cambridge, MA
.

Avise
,
J. C.
,
J.
Arnold
,
R. M.
Ball
,
E.
Bermingham
,
T.
Lamb
,
J. E.
Neigel
,
C. A.
Reeb
, and
N. C.
Saunders
.
1987
.
Intraspecific phylogeography: the mitochondrial DNA bridge between population genetics and systematics
.
Annu. Rev. Ecol. Syst.
 
18
:
489
522
.

Avise
,
J. C.
,
B. W.
Bowen
, and
F. J.
Ayala
.
2016
.
In the light of evolution X: comparative phylogeography
.
Proc. Natl. Acad. Sci.
 
113
:
7957
7961
.

Bacon
,
C. D.
,
D.
Silvestro
,
C.
Jaramillo
,
B. T.
Smith
,
P.
Chakrabarty
, and
A.
Antonelli
.
2015
.
Biological evidence supports an early and complex emergence of the Isthmus of Panama
.
Proc. Natl. Acad. Sci.
 
112
:
6110
6115
.

Barber
,
P. H.
,
M. V.
Erdmann
, and
S. R.
Palumbi
.
2006
.
Comparative phylogeography of three codistributed stomatopods: origins and timing of regional lineage diversification in the coral triangle
.
Evolution
 
60
:
1825
1839
.

Barratt
,
C. D.
,
B. A.
Bwong
,
R.
Jehle
,
H. C.
Liedtke
,
P.
Nagel
,
R. E.
Onstein
,
D. M.
Portik
,
J. W.
Streicher
, and
S. P.
Loader
.
2018
.
Vanishing refuge? Testing the forest refuge hypothesis in coastal East Africa using genome-wide sequence data for seven amphibians
.
Mol. Ecol.
 
27
:
4289
4308
.

Beaumont
,
M. A.
 
2010
.
Approximate Bayesian computation in evolution and ecology
.
Annu. Rev. Ecol. Evol. Syst.
 
41
:
379
406
.

Beaumont
,
M. A.
, and
B.
Rannala
.
2004
.
The Bayesian revolution in genetics
.
Nat. Rev. Genet.
 
5
:
251
261
.

Beaumont
,
M. A.
,
W.
Zhang
, and
D. J.
Balding
.
2002
.
Approximate Bayesian computation in population genetics
.
Genetics
 
162
:
2025
2035
.

Beerli
,
P.
, and
M.
Palczewski
.
2010
.
Unified framework to evaluate panmixia and migration direction among multiple sampling locations
.
Genetics
 
185
:
313
326
.

Berlin
,
S.
,
J.
Fogelqvist
,
M.
Lascoux
,
U.
Lagercrantz
, and
A.-C.
Ronnberg-Wastljung
.
2011
.
Polymorphism and divergence in two willow species, Salix viminalis L. and Salix schwerinii E. Wolf
.
G3
 
1
:
387
400
.

Bermingham
,
E.
, and
C.
Moritz
.
1998
.
Comparative phylogeography: concepts and applications
.
Mol. Ecol.
 
7
:
367
369
.

Bernatchez
,
L.
, and
C. C.
Wilson
.
1998
.
Comparative phylogeography of nearctic and paleartic fishes
.
Mol. Ecol.
 
7
:
431
452
.

Bertorelle
,
G.
,
A.
Benazzo
, and
S.
Mona
.
2010
.
ABC as a flexible framework to estimate demography over space and time: some cons, many pros
.
Mol. Ecol.
 
19
:
2609
2625
.

Bunnefeld
,
L.
,
J.
Hearn
,
G. N.
Stone
, and
K.
Lohse
.
2018
.
Whole-genome data reveal the complex history of a diverse ecological community
.
Proc. Natl. Acad. Sci.
 
115
:
E6507
E6515
.

Carnaval
,
A. C.
,
M. J.
Hickerson
,
C. F. B.
Haddad
,
M. T.
Rodrigues
, and
C.
Moritz
.
2009
.
Stability predicts genetic diversity in the Brazilian Atlantic forest hotspot
.
Science
 
323
:
785
789
.

Carstens
,
B. C.
,
A. L.
Stevenson
,
J. D.
Degenhardt
, and
J.
Sullivan
.
2004
.
Testing nested phylogenetic and phylogeographic hypotheses in the plethodon vandykei species group
.
Syst. Biol.
 
53
:
781
792
.

Carstens
,
B. C.
,
M.
Gruenstaeudl
, and
N. M.
Reid
.
2016
.
Community trees: identifying codiversification in the Páramo dipteran community
.
Evolution
 
70
:
1080
1093
.

Chan
,
Y. L.
,
D.
Schanzenbach
, and
M. J.
Hickerson
.
2014
.
Detecting concerted demographic response across community assemblages using hierarchical approximate Bayesian computation
.
Mol. Biol. Evol.
 
31
:
2501
2515
.

Chaudhuri
,
S.
,
S.
Ghosh
,
D. J.
Nott
, and
K. C.
Pham
.
2018
.
An easy-to-use empirical likelihood ABC method
. arXiv 1810.01675.

Congdon
,
P.
 
2001
.
Bayesian statistical modelling
.
John Wiley & Sons
,
Chichester, West Sussex
.

Cornuet
,
J.-M.
,
V.
Ravigné
, and
A.
Estoup
.
2010
.
Inference on population history and model checking using DNA sequence and microsatellite data with the software DIYABC (v1.0)
.
BMC Bioinformatics
 
11
:
401
.

Csilléry
,
K.
,
M. G. B.
Blum
,
O. E.
Gaggiotti
, and
O.
François
.
2010
.
Approximate Bayesian computation (ABC) in practice
.
Trends Ecol. Evol.
 
25
:
410
418
.

Csilléry
,
K.
,
O.
François
, and
M. G. B.
Blum
.
2012
.
abc: an R package for approximate Bayesian computation (ABC)
.
Methods Ecol. Evol.
 
3
:
475
479
.

Edwards
,
S. V
, and
P.
Beerli
.
2000
.
Perspective: gene divergence, population divergence, and the variance in coalescence time in phylogeographic studies
.
Evolution
 
54
:
1839
1854
.

Edwards
,
S. V.
,
S.
Potter
,
C. J.
Schmitt
,
J. G.
Bragg
, and
C.
Moritz
.
2016
.
Reticulation, divergence, and the phylogeography–phylogenetics continuum
.
Proc. Natl. Acad. Sci.
 
113
:
8025
8032
.

Excoffier
,
L.
,
I.
Dupanloup
,
E.
Huerta-Sánchez
,
V. C.
Sousa
, and
M.
Foll
.
2013
.
Robust demographic inference from genomic and SNP data
.
PLoS Genet.
 
9
:e1003905.

Fedorov
,
V. B.
,
A. V.
Goropashnaya
,
G. G.
Boeskorov
, and
J. A.
Cook
.
2008
.
Comparative phylogeography and demographic history of the wood lemming (Myopus schisticolor): implications for late quaternary history of the taiga species in Eurasia
.
Mol. Ecol.
 
17
:
598
610
.

Garrick
,
R. C.
,
Z. L.
Sabree
,
B. C.
Jahnes
, and
J. C.
Oliver
.
2017
.
Strong spatial-genetic congruence between a wood-feeding cockroach and its bacterial endosymbiont, across a topographically complex landscape
.
J. Biogeogr.
 
44
:
1500
1511
. https://doi.org/10.1111/jbi.12992.

Gelman
,
A.
,
J. B.
Carlin
,
H. S.
Stern
, and
D. B.
Rubin
.
2003
. Exchangeability and setting up hierarchical models. Pp.
121
125
 in  
A.
Gelman
,
J. B.
Carlin
,
H. S.
Stern
,
D. B.
Dunson
,
A.
Vehtari
, and
D. B.
Rubin
, eds.
Bayesian data analysis
.
Chapman & Hall/CRC
,
Boca Raton, FL
.

Goldstien
,
S. J.
,
D. R.
Schiel
, and
N. J.
Gemmell
.
2006
.
Comparative phylogeography of coastal limpets across a marine disjunction in New Zealand
.
Mol. Ecol.
 
15
:
3259
3268
.

Gutenkunst
,
R. N.
,
R. D.
Hernandez
,
S. H.
Williamson
, and
C. D.
Bustamante
.
2009
.
Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data
.
PLoS Genet.
 
5
:e1000695.

Hardin
,
G.
 
1960
.
The competitive exclusion principle
.
Science
 
131
:
1292
1297
.

Harvey
,
M. G.
,
A.
Aleixo
,
C. C.
Ribas
, and
R. T.
Brumfield
.
2017
.
Habitat association predicts genetic diversity and population divergence in Amazonian birds
.
Am. Nat.
 
190
:
631
648
.

Herrey
,
E. M. J.
 
1965
.
Confidence intervals based on the mean absolute deviation of a normal sample
.
J. Am. Stat. Assoc.
 
60
:
257
269
.

Hewitt
,
G.
 
2000
.
The genetic legacy of the quaternary ice ages
.
Nature
 
405
:
907
913
.

Hewitt
,
G. M.
 
2001
.
Speciation, hybrid zones and phylogeography—or seeing genes in space and time
.
Mol. Ecol.
 
10
:
537
549
.

Hickerson
,
M. J.
, and
C. P.
Meyer
.
2008
.
Testing comparative phylogeographic models of marine vicariance and dispersal using a hierarchical Bayesian approach
.
BMC Evol. Biol.
 
8
:
322
.

Hickerson
,
M. J.
,
E.
Stahl
, and
N.
Takebayashi
.
2007
.
msBayes: pipeline for testing comparative phylogeographic histories using hierarchical approximate Bayesian computation
.
BMC Bioinformatics
 
8
:
268
.

Hickerson
,
M. J.
,
B. C.
Carstens
,
J.
Cavender-Bares
,
K. A.
Crandall
,
C. H.
Graham
,
J. B.
Johnson
,
L.
Rissler
,
P. F.
Victoriano
, and
A. D.
Yoder
.
2010
.
Phylogeography's past, present, and future: 10 years after Avise, 2000
.
Mol. Phylogenet. Evol.
 
54
:
291
301
.

Hickerson
,
M. J.
,
G. N.
Stone
,
K.
Lohse
,
T. C.
Demos
,
X.
Xie
,
C.
Landerer
, and
N.
Takebayashi
.
2014
.
Recommendations for using msBayes to incorporate uncertainty in selecting an ABC model prior: a response to Oaks et al
.
Evolution
 
68
:
284
294
.

Hoban
,
S.
 
2014
.
An overview of the utility of population simulation software in molecular ecology
.
Mol. Ecol.
 
23
:
2383
401
.

Hoban
,
S.
,
G.
Bertorelle
, and
O. E.
Gaggiotti
.
2012
.
Computer simulations: tools for population and evolutionary genetics
.
Nat. Rev. Genet.
 
13
:
110
22
.

Hoffmann
,
F. G.
, and
R. J.
Baker
.
2003
.
Comparative phylogeography of short-tailed bats (Carollia: Phyllostomidae)
.
Mol. Ecol.
 
12
:
3403
3414
.

Huang
,
W.
,
N.
Takebayashi
,
Y.
Qi
, and
M. J.
Hickerson
.
2011
.
MTML-msBayes: approximate Bayesian comparative phylogeographic inference from multiple taxa and multiple loci with rate heterogeneity
.
BMC Bioinformatics
 
12
:
1
.

Jackson
,
N. D.
,
A. E.
Morales
,
B. C.
Carstens
, and
B. C.
O'Meara
.
2017
.
PHRAPL: phylogeographic inference using approximate likelihoods
.
Syst. Biol.
 
66
:
1045
1053
. https://doi.org/10.1093/sysbio/syx001.

Leache
,
A. D.
,
M. S.
Koo
,
C. L.
Spencer
,
T. J.
Papenfuss
,
R. N.
Fisher
, and
J. A.
McGuire
.
2009
.
Quantifying ecological, morphological, and genetic variation to delimit species in the coast horned lizard species complex (Phrynosoma)
.
Proc. Natl. Acad. Sci.
 
106
:
12418
12423
.

Lessios
,
H. A.
,
J.
Kane
, and
D. R.
Robertson
.
2003
.
Phylogeography of the pantropical sea urchin tripneustes: contrasting patterns of population structure between oceans
.
Evolution
 
57
:
2026
2036
.

Leys
,
C.
,
C.
Ley
,
O.
Klein
,
P.
Bernard
, and
L.
Licata
.
2013
.
Detecting outliers: do not use standard deviation around the mean, use absolute deviation around the median
.
J. Exp. Soc. Psychol.
 
49
:
764
766
.

Liaw
,
A.
, and
M.
Wiener
.
2002
.
Classification and regression by randomForest
.
R news
 
2
:
18
22
.

Lourie
,
S. A.
,
D. M.
Green
, and
A. C. J.
Vincent
.
2005
.
Dispersal, habitat differences, and comparative phylogeography of Southeast Asian seahorses (Syngnathidae: Hippocampus)
.
Mol. Ecol.
 
14
:
1073
1094
.

Maliouchenko
,
O.
,
A. E.
Palmé
,
A.
Buonamici
,
G. G.
Vendramin
, and
M.
Lascoux
.
2007
.
Comparative phylogeography and population structure of European Betula species, with particular focus on B. pendula and B. pubescens
.
J. Biogeogr.
 
34
:
1601
1610
.

Mastretta-Yanes
,
A.
,
A. T.
Xue
,
A.
Moreno-Letelier
,
T. H.
Jorgensen
,
N.
Alvarez
,
D.
Piñero
, and
B. C.
Emerson
.
2018
.
Long-term in situ persistence of biodiversity in tropical sky islands revealed by landscape genomics
.
Mol. Ecol.
 
27
:
432
448
.

Mayr
,
E.
 
1982
.
Speciation and macroevolution
.
Evolution
 
36
:
1119
1132
.

Moritz
,
C.
,
C. J.
Hoskin
,
J. B.
MacKenzie
,
B. L.
Phillips
,
M.
Tonione
,
N.
Silva
,
J.
VanDerWal
,
S. E.
Williams
, and
C. H.
Graham
.
2009
.
Identification and dynamics of a cryptic suture zone in tropical rainforest
.
Proc. R. Soc. B Biol. Sci.
 
276
:
1235
1244
.

Morris
,
C. N.
 
1983
.
Parametric empirical Bayes inference: theory and applications
.
J. Am. Stat. Assoc.
 
78
:
47
55
.

Oaks
,
J. R.
 
2014
.
An improved approximate-Bayesian model-choice method for estimating shared evolutionary history
.
BMC Evol. Biol.
 
14
:
150
.

Oaks
,
J. R.
 
2018
.
Full Bayesian comparative phylogeography from genomic data
. bioRxiv 324525.

Oswald
,
J. A.
,
I.
Overcast
,
W. M. I.
Mauck
,
M. J.
Andersen
, and
B. T.
Smith
.
2017
.
Isolation with asymmetric gene flow during the nonsynchronous divergence of dry forest birds
.
Mol. Ecol.
 
26
:
1386
1400
.

Overcast
,
I.
,
J. C.
Bagley
, and
M. J.
Hickerson
.
2017
.
Strategies for improving approximate Bayesian computation tests for synchronous diversification
.
BMC Evol. Biol.
 
17
:
203
.

Papadopoulou
,
A.
, and
L. L.
Knowles
.
2016
.
Toward a paradigm shift in comparative phylogeography driven by trait-based hypotheses
.
Proc. Natl. Acad. Sci. USA
 
113
:
8018
8024
.

Pastorini
,
J.
,
U.
Thalmann
, and
R. D.
Martin
.
2003
.
A molecular approach to comparative phylogeography of extant Malagasy lemurs
.
Proc. Natl. Acad. Sci.
 
100
:
5879
5884
.

Pelletier
,
T. A.
, and
B. C.
Carstens
.
2014
.
Model choice for phylogeographic inference using a large set of models
.
Mol. Ecol.
 
23
:
3028
3043
.

Peñalba
,
J. V.
,
L.
Joseph
, and
C.
Moritz
.
2018
.
Current geography masks dynamic history of gene flow during speciation in northern Australian birds
.
Mol. Ecol.
, https://doi.org/10.1111/mec.14978.

Pham-Gia
,
T.
, and
T. L.
Hung
.
2001
.
The mean and median absolute deviations
.
Math. Comput. Model.
 
34
:
921
936
.

Potter
,
S.
,
A. T.
Xue
,
J. G.
Bragg
,
D. F.
Rosauer
,
E. J.
Roycroft
, and
C.
Moritz
.
2018
.
Pleistocene climatic changes drive diversification across a tropical savanna
.
Mol. Ecol.
 
27
:
520
532
.

Prates
,
I.
,
A. T.
Xue
,
J. L.
Brown
,
D. F.
Alvarado-Serrano
,
M. T.
Rodrigues
,
M. J.
Hickerson
, and
A. C.
Carnaval
.
2016
.
Inferring responses to climate dynamics from historical demography in neotropical forest lizards
.
Proc. Natl. Acad. Sci.
 
113
:
7978
7985
.

Pudlo
,
P.
,
J.-M.
Marin
,
A.
Estoup
,
J.-M.
Cornuet
,
M.
Gauthier
, and
C. P.
Robert
.
2016
.
Reliable ABC model choice via random forests
.
Bioinformatics
 
32
:
859
866
.

Rannala
,
B.
 
2015
.
The art and science of species delimitation
.
Curr. Zool.
 
61
:
846
853
.

Richards
,
V. P.
,
J. D.
Thomas
,
M. J.
Stanhope
, and
M. S.
Shivji
.
2007
.
Genetic connectivity in the Florida reef system: comparative phylogeography of commensal invertebrates with contrasting reproductive strategies
.
Mol. Ecol.
 
16
:
139
157
.

Riddle
,
B. R.
,
D. J.
Hafner
,
L. F.
Alexander
, and
J. R.
Jaeger
.
2000
.
Cryptic vicariance in the historical assembly of a Baja California Peninsular Desert biota
.
Proc. Natl. Acad. Sci.
 
97
:
14438
14443
.

Rincon-Sandoval
,
M.
,
R.
Betancur-R
, and
J. A.
Maldonado-Ocampo
.
2019
.
Comparative phylogeography of trans-Andean freshwater fishes based on genome-wide nuclear and mitochondrial markers
.
Mol. Ecol.
 
28
:
1096
1115
. https://doi.org/10.1111/mec.15036.

Rosindell
,
J.
,
S. P.
Hubbell
,
F.
He
,
L. J.
Harmon
, and
R. S.
Etienne
.
2012
.
The case for ecological neutral theory
.
Trends Ecol. Evol.
 
27
:
203
208
.

Rougemont
,
Q.
,
P.-A.
Gagnaire
,
C.
Perrier
,
C.
Genthon
,
A.-L.
Besnard
,
S.
Launey
, and
G.
Evanno
.
2017
.
Inferring the demographic history underlying parallel genomic divergence among pairs of parasitic and non-parasitic lamprey ecotypes
.
Mol. Ecol.
 
26
:
142
162
.

Rougeux
,
C.
,
L.
Bernatchez
, and
P.-A.
Gagnaire
.
2017
.
Modeling the multiple facets of speciation-with-gene-flow toward inferring the divergence history of lake whitefish species pairs (Coregonus clupeaformis)
.
Genome Biol. Evol.
 
9
:
2057
2074
.

Roux
,
C.
,
C.
Fraïsse
,
J.
Romiguier
,
Y.
Anciaux
,
N.
Galtier
, and
N.
Bierne
.
2016
.
Shedding light on the grey zone of speciation along a continuum of genomic divergence
.
PLoS Biol
.
14
:e2000234.

Rubin
,
D. B.
, and
H. S.
Stern
.
1994
. Testing in latent class models using a posterior predictive check distribution. Pp.
420
438
 in  
A.
von Eye
and
C. C
Clogg
, eds.
Latent variables analysis: applications for developmental research
.
Sage Publications, Inc.
,
Thousand Oaks, CA
.

Satler
,
J. D.
, and
B. C.
Carstens
.
2016
.
Phylogeographic concordance factors quantify phylogeographic congruence among co-distributed species in the Sarracenia alata pitcher plant system
.
Evolution
 
70
:
1105
1119
.

Schlüter
,
P. J.
,
J. J.
Deely
, and
A. J.
Nicholson
.
1997
.
Ranking and selecting motor vehicle accident sites by using a hierarchical Bayesian model
.
J. R. Stat. Soc. Ser. D (The Stat.
 
46
:
293
316
.

Schrider
,
D. R.
, and
A. D.
Kern
.
2018
.
Supervised machine learning for population genetics: a new paradigm
.
Trends Genet.
 
34
:
301
312
.

Sisson
,
S. A.
,
Y.
Fan
, and
M. M.
Tanaka
.
2007
.
Sequential Monte Carlo without likelihoods
.
Proc. Natl. Acad. Sci.
 
104
:
1760
1765
.

Sjödin
,
P.
,
A. E.
Sjöstrand
,
M.
Jakobsson
, and
M. G. B.
Blum
.
2012
.
Resequencing data provide no evidence for a human bottleneck in Africa during the penultimate glacial period
.
Mol. Biol. Evol.
 
29
:
1851
1860
.

Smith
,
B. T.
,
J. E.
McCormack
,
A. M.
Cuervo
,
M. J.
Hickerson
,
A.
Aleixo
,
C. D.
Cadena
,
J.
Pérez-Emán
,
C. W.
Burney
,
X.
Xie
,
M. G.
Harvey
, et al.
2014
.
The drivers of tropical speciation
.
Nature
 
515
:
406
409
.

Smith
,
M. L.
,
M.
Ruffley
,
A.
Espíndola
,
D. C.
Tank
,
J.
Sullivan
, and
B. C.
Carstens
.
2017
.
Demographic model selection using random forests and the site frequency spectrum
.
Mol. Ecol.
 
26
:
4562
4573
. https://doi.org/10.1111/mec.14223.

Soltis
,
D. E.
,
A. B.
Morris
,
J. S.
McLachlan
,
P. S.
Manos
, and
P. S.
Soltis
.
2006
.
Comparative phylogeography of unglaciated eastern North America
.
Mol. Ecol.
 
15
:
4261
93
.

Stange
,
M.
,
M. R.
Sánchez-Villagra
,
W.
Salzburger
, and
M.
Matschiner
.
2017
.
Bayesian divergence-time estimation with genome-wide SNP data of sea catfishes (Ariidae) supports Miocene closure of the Panamanian Isthmus
. bioRxiv 102129.

Stone
,
G. N.
,
K.
Lohse
,
J. A.
Nicholls
,
P.
Fuentes-Utrilla
,
F.
Sinclair
,
K.
Schönrogge
,
G.
Csóka
,
G.
Melika
,
J.-L.
Nieves-Aldrey
,
J.
Pujade-Villar
, et al.
2012
.
Reconstructing community assembly in time and space reveals enemy escape in a Western Palearctic insect community
.
Curr. Biol.
 
22
:
532
7
.

Taberlet
,
P.
,
L.
Fumagalli
,
A.-G.
Wust-Saucy
, and
J.-F.
Cosson
.
1998
.
Comparative phylogeography and postglacial colonization routes in Europe
.
Mol. Ecol.
 
7
:
453
464
.

Teh
,
Y. W.
,
M. I.
Jordan
,
M. J.
Beal
, and
D. M.
Blei
.
2005
.
Sharing clusters among related groups: hierarchical Dirichlet processes
.
Adv. Neural Inf. Process. Syst.
 
17
:
1385
1392
.

Travis
,
J. M. J.
,
T.
Münkemüller
,
O. J.
Burton
,
A.
Best
,
C.
Dytham
, and
K.
Johst
.
2007
.
Deleterious mutations can surf to high densities on the wave front of an expanding population
.
Mol. Biol. Evol.
 
24
:
2334
2343
.

Whitlock
,
M. C.
, and
D. E.
McCauley
.
1999
.
Indirect measures of gene flow and migration: FST not equal to 1/(4Nm + 1)
.
Heredity
 
82
:
117
125
.

Wright
,
S.
 
1931
.
Evolution in Mendelian populations
.
Genetics
 
16
:
97
159
.

Xue
,
A. T.
, and
M. J.
Hickerson
.
2015
.
The aggregate site frequency spectrum for comparative population genomic inference
.
Mol. Ecol.
 
24
:
6223
6240
.

Xue
,
A. T.
, and
M. J.
Hickerson
 
2017
.
Multi-DICE: R package for comparative population genomic inference under hierarchical co-demographic models of independent single-population size changes
.
Mol. Ecol. Resour.
 
17
:
e212
e224
. https://doi.org/10.1111/1755-0998.12686.

Zink
,
R. M.
 
2002
.
Methods in comparative phylogeography, and their application to studying evolution in the North American Aridlands
.
Integr. Comp. Biol.
 
42
:
953
959
.

Associate Editor: J. Light

Handling Editor: M. Servedio

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

Supplementary data