Abstract

Although several high-resolution recombination maps exist for European-descent populations, the recombination landscape of African populations remains relatively understudied. Given that there is high genetic divergence among groups in Africa, it is possible that recombination hotspots also diverge significantly. Both limitations and opportunities exist for developing recombination maps for these populations. In this review, we discuss various recombination inference methods, and the strengths and weaknesses of these methods in analyzing recombination in African-descent populations. Furthermore, we provide a decision tree and recommendations for which inference method to use in various research contexts. Establishing an appropriate methodology for recombination rate inference in a particular study will improve the accuracy of various downstream analyses including but not limited to local ancestry inference, haplotype phasing, fine-mapping of GWAS loci and genome assemblies.

Introduction

Genetic recombination is defined as the exchange and rearrangement of genetic material between successive generations. Homologous recombination during meiosis arises when this exchange is between homologous pairs of chromosomes and is initiated by the induction of double-strand breaks (DSBs). When these DSBs are repaired, only a few result in the exchange of genetic material, termed crossover events (1). Crossover events result in a contrasting combination of genotypes in gametes that is passed on to the next generation.

Not all sections of the chromosome are equally likely to contain DSBs or a resulting recombination event. This is governed by numerous factors including sex, age, autosomal versus sex chromosomes, proximity to the telomeres or centromeres, various regulatory enzymes etc., while levels of identity-by-descent, linkage disequilibrium and varying degrees of admixture impact our ability to accurately measure the recombination rate (2–7).

We are particularly interested in the factors that influence our ability to accurately measure not only the rate of recombination but the location and extent of recombination hotspots. This is noteworthy given the vast genetic diversity of the recombination hotspot regulatory protein PRDM9 in African populations (8), the difference in hotspot diversity between European and African populations (9) and both recent and ancient admixture events extending to, in some circumstances, populations with 5-way admixture (10). Therefore, care should be taken when selecting one of the various recombination inference methods that have been developed in recent years.

In this review, we therefore discuss principles underlying a variety of recent methods to infer recombination in human populations and suggest the most appropriate (Fig. 1 and Table 1), given the genetic diversity and levels of admixture in African populations.

A decision tree for recombination inference method selection. The figure is an extreme oversimplification and serves only as a starting point. Use the series of questions to find the recombination rate inference method that would be the most likely fit for a given use case. The questions should not necessarily exclude any method, but serve as a guide. For many use cases there will be more than one appropriate method.
Figure 1

A decision tree for recombination inference method selection. The figure is an extreme oversimplification and serves only as a starting point. Use the series of questions to find the recombination rate inference method that would be the most likely fit for a given use case. The questions should not necessarily exclude any method, but serve as a guide. For many use cases there will be more than one appropriate method.

Recombination Inference Methods

Gamete-based inference

Gamete-based inference uses the phased genetic information of a donor and the genetic information derived from the donor’s gametes to infer crossover events. A crossover is said to have occurred, if there is a shift in phase between haplotypes. However, a phase shift caused by gene conversion is not considered a true crossover (11). True crossovers are counted to calculate the recombination fraction which can then be converted to genetic distances. Peñalba and Wolf (12) provide a detailed explanation on how this is done and they explain the use of the three main mapping functions employed to calculate additive measures of genetic distance. The authors also provide a well written review of the factors that affect recombination rate variation.

Gamete-based inference is a useful method to infer recombination at a high resolution and was used in early recombination hotspot studies (13). However, it has certain limitations. In humans, gamete-based inference refers to sperm-typing due to the large number of gametes necessary to produce adequate results (14). Thus, recombination can only feasibly be studied in males. Due to the relatively high level of heterochiasmy in humans (14), this might not be sufficient to study recombination at a fine scale across populations. It is also expensive to produce fine scale recombination maps using this method (15). Most studies will therefore be limited to specific regions of the genome (16). Despite these limitations, gamete-based inference remains a valuable means of studying chromosomal abnormalities caused by abnormal recombination (17) and the evolutionary history of a genomic region (18).

Table 1

The pros and cons of the different recombination inference methods

ProsCons
Gamete-basedSex-specific (14)
Produces high-resolution maps (14)
Unaffected by past demographic changes (12)
Can be used to study chromosomal abnormalities (17)
Not always feasible for both sexes (14)
Genome-wide inference costly (15)
Pedigree-basedSex-specific (27)
Unaffected by past demographic changes (12)
Requires pedigrees (29)
Large sample size needed (19)
Limited to very recent recombination (12)
LAI-basedProduces high-resolution maps with a few thousand samples (43)Limited to populations with admixture (29)
Limited to recent recombination (29)
Dependent on local ancestry inference (29)
IBD-basedProduces high-resolution maps with a few thousand samples (29)
Not limited by admixture (29)
Limited to recent recombination (29)
Dependent on IBD estimates (29)
LD-basedDoes not require a large sample size (30)
Produces high-resolution maps (16)
Computationally expensive (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Regression-basedDoes not require a large sample size (30)
Computationally fast (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Demography-awareDoes not require a large sample size (43)
Produces high-resolution maps (43)
Accounts for demographic changes (43)
Produces time-averaged estimates (29)
Requires knowledge of a population’s demographic history (43)
ProsCons
Gamete-basedSex-specific (14)
Produces high-resolution maps (14)
Unaffected by past demographic changes (12)
Can be used to study chromosomal abnormalities (17)
Not always feasible for both sexes (14)
Genome-wide inference costly (15)
Pedigree-basedSex-specific (27)
Unaffected by past demographic changes (12)
Requires pedigrees (29)
Large sample size needed (19)
Limited to very recent recombination (12)
LAI-basedProduces high-resolution maps with a few thousand samples (43)Limited to populations with admixture (29)
Limited to recent recombination (29)
Dependent on local ancestry inference (29)
IBD-basedProduces high-resolution maps with a few thousand samples (29)
Not limited by admixture (29)
Limited to recent recombination (29)
Dependent on IBD estimates (29)
LD-basedDoes not require a large sample size (30)
Produces high-resolution maps (16)
Computationally expensive (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Regression-basedDoes not require a large sample size (30)
Computationally fast (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Demography-awareDoes not require a large sample size (43)
Produces high-resolution maps (43)
Accounts for demographic changes (43)
Produces time-averaged estimates (29)
Requires knowledge of a population’s demographic history (43)
Table 1

The pros and cons of the different recombination inference methods

ProsCons
Gamete-basedSex-specific (14)
Produces high-resolution maps (14)
Unaffected by past demographic changes (12)
Can be used to study chromosomal abnormalities (17)
Not always feasible for both sexes (14)
Genome-wide inference costly (15)
Pedigree-basedSex-specific (27)
Unaffected by past demographic changes (12)
Requires pedigrees (29)
Large sample size needed (19)
Limited to very recent recombination (12)
LAI-basedProduces high-resolution maps with a few thousand samples (43)Limited to populations with admixture (29)
Limited to recent recombination (29)
Dependent on local ancestry inference (29)
IBD-basedProduces high-resolution maps with a few thousand samples (29)
Not limited by admixture (29)
Limited to recent recombination (29)
Dependent on IBD estimates (29)
LD-basedDoes not require a large sample size (30)
Produces high-resolution maps (16)
Computationally expensive (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Regression-basedDoes not require a large sample size (30)
Computationally fast (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Demography-awareDoes not require a large sample size (43)
Produces high-resolution maps (43)
Accounts for demographic changes (43)
Produces time-averaged estimates (29)
Requires knowledge of a population’s demographic history (43)
ProsCons
Gamete-basedSex-specific (14)
Produces high-resolution maps (14)
Unaffected by past demographic changes (12)
Can be used to study chromosomal abnormalities (17)
Not always feasible for both sexes (14)
Genome-wide inference costly (15)
Pedigree-basedSex-specific (27)
Unaffected by past demographic changes (12)
Requires pedigrees (29)
Large sample size needed (19)
Limited to very recent recombination (12)
LAI-basedProduces high-resolution maps with a few thousand samples (43)Limited to populations with admixture (29)
Limited to recent recombination (29)
Dependent on local ancestry inference (29)
IBD-basedProduces high-resolution maps with a few thousand samples (29)
Not limited by admixture (29)
Limited to recent recombination (29)
Dependent on IBD estimates (29)
LD-basedDoes not require a large sample size (30)
Produces high-resolution maps (16)
Computationally expensive (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Regression-basedDoes not require a large sample size (30)
Computationally fast (30)
Produces time-averaged estimates (29)
Biased by demographic changes (29,30)
Demography-awareDoes not require a large sample size (43)
Produces high-resolution maps (43)
Accounts for demographic changes (43)
Produces time-averaged estimates (29)
Requires knowledge of a population’s demographic history (43)

Pedigree-based inference

Pedigree based inference relies on having information about parent–offspring (PO) pairs within the data to detect recombination events between successive generations (19). Since recombination events are inferred from phase transitions in the offspring, inaccurate phase data can thus lead to incorrect recombination event inference. Accurately determining the phase of an individual is thus important.

There are various methods used for phasing, but the most common methods use hidden Markov models (HMM) (20–24). Some software implementations, like MaCH (25), choose a random subset of haplotypes to condition upon, whereas Impute2 (26) and SHAPEIT2 (21) select the most similar haplotypes to the region of the sample being considered. This makes both Impute2 and SHAPEIT2 ideal candidates for phasing in admixed individuals, because the subset of haplotypes chosen would most likely contain representative haplotypes from all the ancestries present in the admixed sample being considered. SHAPEIT2 also has a secondary algorithm, duoHMM (21), that uses pedigree information to correct switch errors after phasing. Conveniently, duoHMM’s output is a detailed log of regions in which recombination events occurred for each PO pair followed by the probability of the event having occurred in that region. After applying various filtering steps, the data can then be used to calculate the recombination fraction (the proportion of haplotypes for which a recombination event is inferred at a given locus) which can then be converted to genetic distances and normalized using the relevant mapping function.

Pedigree-based inference is well suited to inferring recombination in populations with complex ancestries, because it is not affected by different patterns of LD within each ancestry (see below) (12). It is also a valuable method for inferring sex-specific recombination rates (27). Since there is an average of 26.2 recombination events in males and 39.6 recombination events in females that occur between successive generations (28), a very large sample size and high density SNP data are necessary to generate high-resolution recombination maps using this method (19,29).

Linkage-disequilibrium-based inference

Linkage-disequilibrium-based (LD-based) methods use patterns of linkage disequilibrium in polymorphism data to detect historical recombination events that stretch into the distant past (30). Many techniques that utilize this method have been developed over the years (31–34), however, LDHat (16) is by far the most widely used and many publicly available maps, including the 1000 Genomes Project maps (35), were made using this method. LD-based methods generally provide an estimate of the population recombination rate (ρ). ρ is the recombination rate per base pair per generation (r) scaled by the effective population size (Ne) and is represented as ρ = 4Ner (30). Since Ne is often unknown, it has become standard practice to use a high-resolution recombination map generated by other means, like pedigree-based inference, to scale ρ to attain r (35). For instance, by using the overlapping segments of the deCODE map.

Unlike pedigree-based and gamete-based inference, LD-based inference provides a genome-wide, high-resolution map with a small number of individuals. Furthermore, having phased data is not a requirement for methods that can make use of genotype data (16). However, LD-based methods have various aspects that need to be considered before being used as the recombination rate inference method of choice. Depending on the population being investigated, a demographic model might have to be specified (36). LD-based methods by default assume that certain demographic parameters, like the population size and mutation rate, remain constant over time (30,36–38). If the population has undergone drastic demographic changes and these changes are not accounted for, the resultant inference will be distorted (30,37–39). The work of Dapper and Payseur (38) explores this topic thoroughly. LD-based methods are also computationally expensive to run, especially when >50 chromosomes (or 25 individuals) are being analyzed (30,40). Therefore, LD-based methods generally require multinode computational clusters for genome-wide inference (40). Additionally, LD-based inference result in time-averaged (29) and sex-averaged recombination maps (16).

Regression-based inference

Methods that make use of regression based on LD summary statistics have been developed recently. Two of the prominent methods in this category are FastEPRR (40) and LDJump (30). These methods work by partitioning the genome into segments and calculating ρ for each segment by regression on specific summary statistics, for instance Watterson’s θ, Tajima’s D estimator or the haplotype heterozygosity. When <50 chromosomes are being used, both LDJump (30) and FastEPRR (40) perform equally well, but when >50 sequences are used, it is recommended to use FastEPRR (30). Both methods also perform on par with LDHat at large scales (30,40), but both are computationally faster than LDHat by several orders of magnitude (30). Furthermore, when regression-based methods include summary statistics that are dependent on demography, like Tajima’s D estimator, they yield more accurate estimates than LD-based methods (30). This improvement in accuracy and the increased computational efficiency are the primary benefits of regression-based methods over LD-based methods. However, regression-based methods have many of the limitations of LD-based methods. For instance, inferred maps are still time-averaged and sex-averaged, and a demographic model is still important in many cases. Some methods also struggle inferring recombination in windows smaller than two kilobases (30). Adrion et al. (41) recently developed a method, called ReLERNN, that makes use of a recurrent neural network and does not rely on summary statistics. It is worth taking note of this method due to its ability to make use of very small sample sizes, while maintaining a high level of accuracy even when the demographic model is misspecified (41).

Demographic-model-aware inference

Building on the success of LD-based methods to infer recombination at fine scales, demographic model-aware inference seeks to address the assumption that a population’s size remains constant over time. Not only does this assumption lead to biased estimates (30,37,39), but it can produce false positives when inferring recombination hotspots (36). Kamm et al. (42) developed LDpop which can compute exact two-locus sampling probabilities under arbitrary piecewise-constant demographic histories. These likelihoods can then be used with other recombination inference software that use likelihood lookup tables, like LDhat, to infer the recombination rate and account for variable population size over time.

Spence and Song (43) extended the methodology in LDpop to include a computationally efficient recombination inference method called pyrho. Pyrho improves upon the runtime of LDhat by at least 10-fold by avoiding the use of Markov chain Monte Carlo methods and instead uses a penalized likelihood framework and gradient-based optimisation. LDpop and pyrho also allow an increase in sample size to a few hundred individuals. Thus, it can include more meioses in the inference than computationally feasible with LDhat. Pyrho produces more accurate results than LDhat at fine scales, whether LDhat is used with or without a demographic model. However, maps produced by pyrho are sex- and time-averaged. The authors suggest that a larger sample size should favor recent recombination events, but it is unclear to what degree. Furthermore, pyrho and LDpop both require population size histories and accommodate a large number of epochs. More recently Barroso et al. (44) developed a method, called iSMC, that simultaneously infers the recombination rate and the demography even with a single unphased diploid individual.

Local-ancestry-based inference

Populations with recent admixture can be utilized to develop a reflective population based relative recombination map. Individuals who are recently admixed have a mosaic of different ancestral segments along their chromosomes. These segments are identified through local ancestry inference (LAI). The location of the switches in ancestry along the chromosome is indicative of recombination events and can therefore be utilized to develop a recombination map (9,45). This method however relies on numerous upstream analyses and the accuracy thereof. The first is the selection of proxy ancestral populations and how closely related these are to the true ancestral populations (45–47). The second, is the accuracy of the phasing of the data; this is greatly improved if related individuals and a well suited reference panel is used during this process (21,48). The third is the software chosen to infer ancestry switch-points and its robustness toward highly admixed populations; RFMix has been shown to be the most accurate tool for this purpose (46,49). The last upstream analysis that can affect the development of a population-specific recombination map is whether the method used to infer switch-points requires a recombination map or whether it infers recombination as part of its algorithm. Although software utilizing recombination maps is more common, it is our opinion that this could sway the placement of recombination events and thus a recombination map independent method is preferable. Once switch-points are inferred, the posterior mean number of ancestry switch-points is summed across individuals with a resulting relative recombination rate (45). This approach has however proven to not account for multiple hits within a defined window and therefore a postprocessing Empirical Bayes Framework has been implemented to account for this (45). This statistical method has been implemented in RASPberry and proven to be accurate in a simulated African population although there has not been any further accuracy testing on populations with differing degrees of admixture (45).

Identity-by-descent-based inference

More recently, Zhou et al. (29) has shown that identity by descent (IBD) can be used to infer high-resolution population-specific recombination maps. Their method, IBDrecomb (29), produces maps with a similar accuracy than LDhat, but is far more computationally efficient. IBDrecomb, like pedigree-based methods and LAI-based methods, use the genomic consequences of recombination to infer the recombination rate. However, IBDrecomb uses the ends of IBD segments rather than phase switches and ancestry switches to infer past recombination events. First, IBDrecomb calculates the IBD coverage for a given interval across the chromosome. Then, the smallest IBD segments in each interval are removed until the IBD coverage of the interval being analyzed is equal to that of the interval with the lowest IBD coverage. The authors also employ coverage equalization to normalize underestimated IBD ends at chromosome ends. Thus, each segment now contains the same number of IBD segments. Finally, IBD ends within each segment are counted to estimate the relative recombination rate of the segment. The user needs to provide genetic map lengths in order to normalize the estimated relative recombination rates.

IBDrecomb includes methods that help correct errors in IBD estimation caused by phasing errors, gene conversion and genotype errors. IBDrecomb considers recent recombination and since it relies on IBD segments, it infers recombination that occurred before, during and after admixture. This would result in higher resolution maps over similar time scales. Furthermore, the recombination rate of populations that are not admixed can be inferred using this method.

Discussion

Due to the influence of recombination on evolutionary processes, it is important that we develop methods that accurately infer the recombination rate. To date there have been many attempts to do so and some differ drastically in their approach. Therefore, when choosing a method for recombination rate inference in African populations, one should ensure that the chosen method is compatible with the demographic history of the population under consideration.

LD-based, regression-based and demographic-model-aware inference are all based on using patterns of linkage disequilibrium to infer past recombination events. As a result these methods all produce sex- and time-averaged maps and are affected by demography. Additionally, less than a couple hundred samples are required by these methods in order to produce high-resolution recombination maps. Some notable differences between these methods are the ability to infer recombination accurately at fine scales and computational efficiency. These methods are ideal for investigating the fine scale differences in the recombination landscape of different populations as well as the evolutionary history of specific regions in the genome. Given the complex demographic histories of the populations in Africa, these methods should be applied with caution.

IBD-based methods, LAI-based methods and pedigree-based methods infer recent recombination events rather than recombination events that occurred over 1–20 generations (29,43). Thus, these methods are useful in investigating contemporary recombination. However, each of these methods have aspects that could affect the accuracy and resolution of the inferred map, if not accounted for. Pedigree-based methods require data from several orders of magnitude more individuals than IBD-based and LAI-based methods to attain a similar resolution (19,29). Some LAI-based methods require a recombination map for LAI inference which could potentially affect the placement of recombination events. IBD-based methods rely on accurate IBD information which could be difficult to obtain in populations that have undergone rapid expansion or contraction in the recent past (50).

In the context of African populations, special attention should be paid to the assumptions and limitations of each of these methods. The majority of these methods were fine tuned to European demographics and although some of the assumptions may hold, others, like assuming a constant population size, should be avoided. Furthermore, a population-specific recombination map might not be necessary (51). A publically available recombination map might be sufficient, if fine-scale recombination rate information is not required. In this review, albeit far from exhaustive, we attempt to capture the essence of some of the prominent recombination inference methods, while highlighting popular software in each category. None of these methods are applicable to all situations in which an estimate of the recombination rate is needed. However, more than one might be appropriate depending on the research goal, the available sample size, the level of admixture, the level of inbreeding and available budget. Therefore, it is important to consider each method on its merit in relation to the overall goal of the project to find the optimal solution.

Conflict of Interest statement: None declared.

Funding

This research was funded (partially or fully) by the South African government through the South African Medical Research Council and the National Research Foundation. The DST-NRF Innovation Doctoral Scholarship (to G.v.E.). Fellowship from the Claude Leon Foundation (to C.U.). National Institutes of Health (NIH) funding of R35GM133531 from NIGMS (to B.M.H.).

References

1.

Hunter
,
N.
(
2007
) Meiotic recombination. In
Aguilera
,
A.
and
Rothstein
,
R.
(eds),
Molecular Genetics of Recombination
.
Springer Berlin Heidelberg
,
Berlin, Heidelberg
, pp.
381
442
.

2.

Stapley
,
J.
,
Feulner
,
P.G.D.
,
Johnston
,
S.E.
,
Santure
,
A.W.
and
Smadja
,
C.M.
(
2017
)
Variation in recombination frequency and distribution across eukaryotes: patterns and processes
.
Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci.
,
372
,
1736, 20160455
.

3.

Cheung
,
V.G.
,
Burdick
,
J.T.
,
Hirschmann
,
D.
and
Morley
,
M.
(
2007
)
Polymorphic variation in human meiotic recombination
.
Am. J. Hum. Genet.
,
80
,
526
530
.

4.

Charlesworth
,
D.
(
2017
)
Evolution of recombination rates between sex chromosomes
.
Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci.
,
372
,
1736, 20160456
.

5.

Bergero
,
R.
and
Charlesworth
,
D.
(
2009
)
The evolution of restricted recombination in sex chromosomes
.
Trends Ecol Evol (Amst)
,
24
,
94
102
.

6.

Vincenten
,
N.
,
Kuhl
,
L.-M.
,
Lam
,
I.
,
Oke
,
A.
,
Kerr
,
A.R.
,
Hochwagen
,
A.
,
Fung
,
J.
,
Keeney
,
S.
,
Vader
,
G.
and
Marston
,
A.L.
(
2015
)
The kinetochore prevents centromere-proximal crossover recombination during meiosis
.
elife
,
4
,
10850
.

7.

Haenel
,
Q.
,
Laurentino
,
T.G.
,
Roesti
,
M.
and
Berner
,
D.
(
2018
)
Meta-analysis of chromosome-scale crossover rate variation in eukaryotes and its significance to evolutionary genomics
.
Mol. Ecol.
,
27
,
2477
2497
.

8.

Berg
,
I.L.
,
Neumann
,
R.
,
Sarbajna
,
S.
,
Odenthal-Hesse
,
L.
,
Butler
,
N.J.
and
Jeffreys
,
A.J.
(
2011
)
Variants of the protein PRDM9 differentially regulate a set of human meiotic recombination hotspots highly active in African populations
.
Proc. Natl. Acad. Sci. U. S. A.
,
108
,
12378
12383
.

9.

Hinch
,
A.G.
,
Tandon
,
A.
,
Patterson
,
N.
,
Song
,
Y.
,
Rohland
,
N.
,
Palmer
,
C.D.
,
Chen
,
G.K.
,
Wang
,
K.
,
Buxbaum
,
S.G.
,
Akylbekova
,
E.L.
 et al. (
2011
)
The landscape of recombination in African Americans
.
Nature
,
476
,
170
175
.

10.

Uren
,
C.
,
Möller
,
M.
,
van
 
Helden
,
P.D.
,
Henn
,
B.M.
and
Hoal
,
E.G.
(
2017
)
Population structure and infectious disease risk in southern Africa
.
Mol. Gen. Genomics.
,
292
,
499
509
.

11.

Chen
,
J.-M.
,
Cooper
,
D.N.
,
Chuzhanova
,
N.
,
Férec
,
C.
and
Patrinos
,
G.P.
(
2007
)
Gene conversion: mechanisms, evolution and human disease
.
Nat. Rev. Genet.
,
8
,
762
775
.

12.

Peñalba
,
J.V.
and
Wolf
,
J.B.W.
(
2020
)
From molecules to populations: appreciating and estimating recombination rate variation
.
Nat. Rev. Genet.
,
21
,
476
492
.

13.

Jeffreys
,
A.J.
,
Murray
,
J.
and
Neumann
,
R.
(
1998
)
High-resolution mapping of crossovers in human sperm defines a minisatellite-associated recombination hotspot
.
Mol. Cell
,
2
,
267
273
.

14.

Bhérer
,
C.
,
Campbell
,
C.L.
and
Auton
,
A.
(
2017
)
Refined genetic maps reveal sexual dimorphism in human meiotic recombination at multiple scales
.
Nat. Commun.
,
8
,
14994
.

15.

Li
,
J.
,
Zhang
,
M.Q.
and
Zhang
,
X.
(
2006
)
A new method for detecting human recombination hotspots and its applications to the HapMap ENCODE data
.
Am. J. Hum. Genet.
,
79
,
628
639
.

16.

Auton
,
A.
and
McVean
,
G.
(
2007
)
Recombination rate estimation in the presence of hotspots
.
Genome Res.
,
17
,
1219
1227
.

17.

Dréau
,
A.
,
Venu
,
V.
,
Avdievich
,
E.
,
Gaspar
,
L.
and
Jones
,
F.C.
(
2019
)
Genome-wide recombination map construction from single individuals using linked-read sequencing
.
Nat. Commun.
,
10
,
4309
.

18.

Carrington
,
M.
and
Cullen
,
M.
(
2004
)
Justified chauvinism: advances in defining meiotic recombination through sperm typing
.
Trends Genet.
,
20
,
196
205
.

19.

Halldorsson
,
B.V.
,
Palsson
,
G.
,
Stefansson
,
O.A.
,
Jonsson
,
H.
,
Hardarson
,
M.T.
,
Eggertsson
,
H.P.
,
Gunnarsson
,
B.
,
Oddsson
,
A.
,
Halldorsson
,
G.H.
,
Zink
,
F.
 et al. (
2019
)
Characterizing mutagenic effects of recombination through a sequence-level genetic map
.
Science
,
363
,
eaau1043
.

20.

Rastas
,
P.
(
2017
)
Lep-MAP3: robust linkage mapping even for low-coverage whole genome sequencing data
.
Bioinformatics
,
33
,
3726
3732
.

21.

O’Connell
,
J.
,
Gurdasani
,
D.
,
Delaneau
,
O.
,
Pirastu
,
N.
,
Ulivi
,
S.
,
Cocca
,
M.
,
Traglia
,
M.
,
Huang
,
J.
,
Huffman
,
J.E.
,
Rudan
,
I.
 et al. (
2014
)
A general approach for haplotype phasing across the full spectrum of relatedness
.
PLoS Genet.
,
10
,
e1004234
.

22.

Delaneau
,
O.
,
Zagury
,
J.-F.
and
Marchini
,
J.
(
2013
)
Improved whole-chromosome phasing for disease and population genetic studies
.
Nat. Methods
,
10
,
5
6
.

23.

Lander
,
E.S.
,
Green
,
P.
,
Abrahamson
,
J.
,
Barlow
,
A.
,
Daly
,
M.J.
,
Lincoln
,
S.E.
and
Newberg
,
L.A.
(
1987
)
MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations
.
Genomics
,
1
,
174
181
.

24.

Tong
,
C.
,
Zhang
,
B.
and
Shi
,
J.
(
2010
)
A hidden Markov model approach to multilocus linkage analysis in a full-sib family
.
Tree Genet. Genomes
,
6
,
651
662
.

25.

Li
,
Y.
,
Willer
,
C.J.
,
Ding
,
J.
,
Scheet
,
P.
and
Abecasis
,
G.R.
(
2010
)
MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes
.
Genet. Epidemiol.
,
34
,
816
834
.

26.

Howie
,
B.N.
,
Donnelly
,
P.
and
Marchini
,
J.
(
2009
)
A flexible and accurate genotype imputation method for the next generation of genome-wide association studies
.
PLoS Genet.
,
5
,
e1000529
.

27.

Kong
,
A.
,
Thorleifsson
,
G.
,
Gudbjartsson
,
D.F.
,
Masson
,
G.
,
Sigurdsson
,
A.
,
Jonasdottir
,
A.
,
Walters
,
G.B.
,
Jonasdottir
,
A.
,
Gylfason
,
A.
,
Kristinsson
,
K.T.
 et al. (
2010
)
Fine-scale recombination rate differences between sexes, populations and individuals
.
Nature
,
467
,
1099
1103
.

28.

Coop
,
G.
,
Wen
,
X.
,
Ober
,
C.
,
Pritchard
,
J.K.
and
Przeworski
,
M.
(
2008
)
High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans
.
Science
,
319
,
1395
1398
.

29.

Zhou
,
Y.
,
Browning
,
B.L.
and
Browning
,
S.R.
(
2020
)
Population-specific recombination maps from segments of identity by descent
.
Am. J. Hum. Genet.
,
107
,
137
148
.

30.

Hermann
,
P.
,
Heissl
,
A.
,
Tiemann-Boege
,
I.
and
Futschik
,
A.
(
2019
)
LDJump: estimating variable recombination rates from population genetic data
.
Mol. Ecol. Resour.
,
19
,
623
638
.

31.

Fearnhead
,
P.
and
Donnelly
,
P.
(
2001
)
Estimating recombination rates from population genetic data
.
Genetics
,
159
,
1299
1318
.

32.

Kuhner
,
M.K.
,
Yamato
,
J.
and
Felsenstein
,
J.
(
2000
)
Maximum likelihood estimation of recombination rates from population data
.
Genetics
,
156
,
1393
1401
.

33.

Myers
,
S.R.
and
Griffiths
,
R.C.
(
2003
)
Bounds on the minimum number of recombination events in a sample history
.
Genetics
,
163
,
375
394
.

34.

Stumpf
,
M.P.H.
and
McVean
,
G.A.T.
(
2003
)
Estimating recombination rates from population-genetic data
.
Nat. Rev. Genet.
,
4
,
959
968
.

35.

1000 Genomes Project Consortium
,
Abecasis
,
G.R.
,
Altshuler
,
D.
,
Auton
,
A.
,
Brooks
,
L.D.
,
Durbin
,
R.M.
,
Gibbs
,
R.A.
,
Hurles
,
M.E.
and
McVean
,
G.A.
(
2010
)
A map of human genome variation from population-scale sequencing
.
Nature
,
467
,
1061
1073
.

36.

Johnston
,
H.R.
and
Cutler
,
D.J.
(
2012
)
Population demographic history can cause the appearance of recombination hotspots
.
Am. J. Hum. Genet.
,
90
,
774
783
.

37.

Smith
,
N.G.C.
and
Fearnhead
,
P.
(
2005
)
A comparison of three estimators of the population-scaled recombination rate: accuracy and robustness
.
Genetics
,
171
,
2051
2062
.

38.

Dapper
,
A.L.
and
Payseur
,
B.A.
(
2018
)
Effects of demographic history on the detection of recombination hotspots from linkage disequilibrium
.
Mol. Biol. Evol.
,
35
,
335
353
.

39.

Chan
,
A.H.
,
Jenkins
,
P.A.
and
Song
,
Y.S.
(
2012
)
Genome-wide fine-scale recombination rate variation in Drosophila melanogaster
.
PLoS Genet.
,
8
,
e1003090
.

40.

Gao
,
F.
,
Ming
,
C.
,
Hu
,
W.
and
Li
,
H.
(
2016
)
New software for the fast estimation of population recombination rates (fasteprr) in the genomic era
.
G3 (Bethesda)
,
6
,
1563
1571
.

41.

Adrion
,
J.R.
,
Galloway
,
J.G.
and
Kern
,
A.D.
(
2020
)
Predicting the landscape of recombination using deep learning
.
Mol. Biol. Evol.
,
37
,
1790
1808
.

42.

Kamm
,
J.A.
,
Spence
,
J.P.
,
Chan
,
J.
and
Song
,
Y.S.
(
2016
)
Two-locus likelihoods under variable population size and fine-scale recombination rate estimation
.
Genetics
,
203
,
1381
1399
.

43.

Spence
,
J.P.
and
Song
,
Y.S.
(
2019
)
Inference and analysis of population-specific fine-scale recombination maps across 26 diverse human populations
.
Sci. Adv.
,
5
,
eaaw9206
.

44.

V Barroso
,
G.
,
Puzović
,
N.
and
Dutheil
,
J.Y.
(
2019
)
Inference of recombination maps from a single pair of genomes and its application to ancient samples
.
PLoS Genet.
,
15
,
e1008449
.

45.

Wegmann
,
D.
,
Kessner
,
D.E.
,
Veeramah
,
K.R.
,
Mathias
,
R.A.
,
Nicolae
,
D.L.
,
Yanek
,
L.R.
,
Sun
,
Y.V.
,
Torgerson
,
D.G.
,
Rafaels
,
N.
,
Mosley
,
T.
 et al. (
2011
)
Recombination rates in admixed individuals identified by ancestry-based inference
.
Nat. Genet.
,
43
,
847
853
.

46.

Maples
,
B.K.
,
Gravel
,
S.
,
Kenny
,
E.E.
and
Bustamante
,
C.D.
(
2013
)
RFMix: a discriminative modeling approach for rapid and robust local-ancestry inference
.
Am. J. Hum. Genet.
,
93
,
278
288
.

47.

Price
,
A.L.
,
Tandon
,
A.
,
Patterson
,
N.
,
Barnes
,
K.C.
,
Rafaels
,
N.
,
Ruczinski
,
I.
,
Beaty
,
T.H.
,
Mathias
,
R.
,
Reich
,
D.
and
Myers
,
S.
(
2009
)
Sensitive detection of chromosomal segments of distinct ancestry in admixed populations
.
PLoS Genet.
,
5
,
e1000519
.

48.

Ongen
,
H.
,
Buil
,
A.
,
Brown
,
A.A.
,
Dermitzakis
,
E.T.
and
Delaneau
,
O.
(
2016
)
Fast and efficient QTL mapper for thousands of molecular phenotypes
.
Bioinformatics
,
32
,
1479
1485
.

49.

Uren
,
C.
,
Hoal
,
E.G.
and
Möller
,
M.
(
2020
)
Putting RFMix and ADMIXTURE to the test in a complex admixed population
.
BMC Genet.
,
21
,
40
.

50.

Palamara, P.F.,

Lencz
,
T.
,
Darvasi
,
A.
and
Pe’er
,
I.
(
2012
)
Length distributions of identity by descent reveal fine-scale demographic history
.
Am. J. Hum. Genet.
,
91
,
809
822
.

51.

Hassan
,
S.
,
Surakka
,
I.
,
Taskinen
,
M.-R.
,
Salomaa
,
V.
,
Palotie
,
A.
,
Wessman
,
M.
,
Tukiainen
,
T.
,
Pirinen
,
M.
,
Palta
,
P.
and
Ripatti
,
S.
(
2020
)
High-resolution population-specific recombination rates and their effect on phasing and genotype imputation
.
Eur. J. Hum. Genet.
doi: .

Author notes

Marlo Möller and Brenna M. Henn Cosenior authors.

Gerald van Eeden and Caitlin Uren Cofirst authors.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)