-
PDF
- Split View
-
Views
-
Cite
Cite
Gregory Clark, Rowena Gray, Geography is not destiny: geography, institutions and literacy in England, 1837–63, Oxford Economic Papers, Volume 66, Issue 4, October 2014, Pages 1042–1069, https://doi.org/10.1093/oep/gpu020
Close -
Share
Abstract
Geography made rural society in the southeast of England unequal. Economies of scale in grain growing created a farmer elite and many landless labourers. In the pastoral northwest, in contrast, family farms dominated, with few hired labourers and modest income disparities. Did this geography driven difference in social structure influence educational outcomes in England 1810–45? Using new micro-level data we show that this geographically driven inequality is not a strong predictor of regional literacy rates. We conclude that regional literacy differences seem to have been influenced more by culture. In particular, areas in northern England with more exposure to the highly literate Scottish society seem to have acquired a higher demand for education, independent of local inequality. Geography is not destiny.
1. Introduction
This article is a test of the thesis of Stanley Engerman and Kenneth Sokoloff—that geography is a key determinant of economic growth through the channels of inequality and institution formation.1 Our testing ground is England, which, despite its small area, has distinct regional climates and topography. This geographic diversity produced substantial differences in rural social structure. The southeast had large-scale grain agriculture with a few substantial farmers and large numbers of landless labourers. The west and north had mainly dairy farming, small operating units, worked by many modest owner-occupiers.
Did those differences in turn lead to differences in investment in education across England's regions that later explain the rapid growth of the north in the Industrial Revolution and the relative decline of the south?
For each of the 9,000 parishes of England we have measures of climate and topography. We also have, from the 1831 census, parish-level measures of social structure including the ratio of farmers to farm labourers, and from the 1851 census, measures of religious affiliation at the local level. Finally, we constructed measures of educational attainment for 100 parishes from archival records on the fraction of brides and grooms able to sign the marriage certificate, 1837–63. The marriage certificates also supply occupation information for all males involved.
We focus on parishes in two northern counties—Lancashire and Northumberland—and two southern counties—Essex and Somerset—with very different agricultural organization. Using these we show that rural inequality poorly predicts average schooling attainment. Other cultural factors, independent of geography, dominate in explaining these variations. In doing this, the article joins a growing part of the literature on inequality and institutions which argues that the consensus of Engerman-Sokoloff and others is not substantiated in within-country, micro-level samples.
2. Geography, institutions, and growth
The idea that geography can play a determining role in institutions and subsequent economic growth has a long history. In its modern form it is associated with a series of papers by Engerman and Sokoloff that argue for a chain of causation in New World growth that runs as follows: geography determines production scale determines inequality determines institutions determines human capital determines economic growth.
In their argument, the geography of the Caribbean and Latin America meant that these areas were best suited for crops such as sugar, cotton, coffee, or bananas, which were most efficiently produced on large-scale plantations. North America, in contrast, was best suited to production of grain and dairy products on small-scale family farms. The economies of scale inherent in these farming systems produced societies that differed greatly in their degrees of inequality (Engerman and Sokoloff, 2012, pp. 31–56).
In North America, with low inequality, there was high demand for education and a large political class in favor of public education. This in turn fostered high rates of literacy, and a population capable of high rates of innovation. In contrast, in the Caribbean and Latin America the political elite had little interest in public provision of education because it did not serve their economic interests and would undermine their political dominance. The mass of unskilled workers was unable to afford education for their children through the private market (Sokoloff and Engerman, 2000, p.229). Thus these societies remained unequal, uneducated, and ultimately poor in a world where human capital became the main engine of growth. This connection between rural inequality and investment in human capital has been modelled theoretically in Galor et al. (2009).
They further argued that inequalities of political power and human capital witnessed in South and Central America caused the economic under-development of these areas, as elites maintained growth-hampering institutions for their own gain and inequality stifled growth in either the stock of skilled labour or a sizeable domestic market for new products. Proving or disproving this institutional channel has become a cottage industry for scholars of economic history and development.
Another related literature is that on the channels through which the natural resource curse may work. It is hypothesized that abundance of natural resources may decrease the incentive to invest in human capital for individuals and for governments, which may be controlled by rich elites who have no need to increase the skills of the workforce. The evidence in this literature is mixed. Gylfason (2001) supports the hypothesis, but more recently Brunnschweiler (2008) and Stijns (2006) have produced cross-country evidence in refutation. In the context of this article, we do not find that the southeast's advantage in terms of higher quality agricultural land translated into worse educational outcomes. Other factors dominated in the educational failure of the region relative to the north.
The Engerman-Sokoloff thesis, as a general view of development, has been empirically tested in a variety of ways with mixed results. Most commonly, a variable proxying for staple crops is included in standard growth regressions. Easterly and Levine (2003), for example, found countries suited to the cultivation of staples such as coffee and sugar experienced lower long-run growth. More ambitiously, Easterly (2007) presents cross-country evidence supporting both aspects of the Engerman-Sokoloff thesis: that endowments determine inequality, and that inequality in turn determines growth.2 However, Islam and Montenegro (2002) find that whilst inequality correlates negatively with institutional quality, this result seems to be driven solely by Latin America and Africa. Inclusion of indicator variables for these continents removes the result. Others have explored whether geography can explain the differential development of the south and north over US history, again with mixed results (Mitchener and McLean, 2003; Lagerlof, 2005; Nunn, 2008).
However, even were such tests to show a correlation between geography, inequality, literacy, institutions, and growth, a complication to the interpretation is that there are other pathways by which geography can influence inequality, literacy, and growth. Acemoglu et al. (2001, 2002), for example, emphasize the role of settler mortality rates and prospects of European settlement in different climates and geographies as important determinants of the types of institutions established by European colonialists and of subsequent economic performance. But settlement patterns could also directly influence economic performance, since different settlement groups, particularly those from Europe and Africa, came with very different cultural backgrounds. Tabellini (2010) has argued that at least within Europe, there is a long cultural legacy underlying modern populations that influences modern economic performance. So for most of the cross-country or even cross-region tests of the Engerman-Sokoloff hypothesis, there is a problem of multiple correlations between geography, institutions, inequality, literacy, and population origins.
A number of more recent studies that make use of within-country, micro-level data throw doubt on the consensus that inequality has long-run implications for institutions and growth. Summerhill (2010) provides evidence from São Paulo to show that high levels of land inequality did not cause lower investment in education. He finds instead that education provision was higher in areas that wished to attract more immigrants. It was a supply-side phenomenon. In fact in São Paulo, such immigrants often provided more education privately as well as pushing for greater public provision.3 This kind of community provision was common during the period examined here in England. Similarly, Acemoglu et al. (2007) find that for Colombian localities the land Gini was positively related to long-run education outcomes, including a measure of literacy identical to our approach here, the opposite of what has been predicted in the literature.
Galor et al. (2009) conducted a similar test to that of this article, using data from early twentieth-century USA. They investigate whether geographic differences in public spending on secondary schooling across states can be explained by differences in geography and land inequality. They find that data from 1880 to 1920 supports these predictions.
Here we perform a similar test to these recent studies, asking whether geographically driven inequality can explain differences in literacy rates within England. In doing so, similar to other studies in the economic growth literature, we are taking the general view of the Engerman-Sokoloff thesis seriously. But the advantage for this test is that the English are a much more homogeneous population than many others that have been studied, thus eliminating much of the potential for geography to influence development through other channels.4 Within such a setting can geographically driven inequality influence the development of educational institutions in important ways? Such institutions certainly played an important role in the second Industrial Revolution and, even in the first, surely increased the population of malleable factory workers from which employers drew.
3. Geography and rural social structure in pre-industrial England
We measure social structure and inequality for rural English parishes using the ratio of male farmers to all men employed in farming in 1831. In Fig. 1 the light squares show parishes and townships with fewer than 1 farmer per 10 male farm workers. The dark squares show parishes and townships with more than five farmers per 10 adult males in farming.5
Social structure in rural England, 1831
Notes: In parishes shown as dark squares, at least half of the adult male population engaged in farming was listed as a farmer in 1831. In parishes shown as light squares, the corresponding farmer share was less than 10%.
Source: British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) (parish grid coordinates).
Social structure in rural England, 1831
Notes: In parishes shown as dark squares, at least half of the adult male population engaged in farming was listed as a farmer in 1831. In parishes shown as light squares, the corresponding farmer share was less than 10%.
Source: British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) (parish grid coordinates).
There was a marked difference in social structure across regions. In the southeast of England a large proportion of the population in rural areas comprised landless labourers, working for wages. In the northwest the majority of the rural adult male population comprised independent farmers, working for themselves. Even in the small compass of rural England there were differences in social structure that echoed those between pre-industrial North America and the Caribbean and Latin America.
This difference in social structure can be largely attributed to geography, in particular to climate and topography. Figure 2 shows one measure of climate and topographical differences across England. Light squares represent parishes which had more than 800 degree-days above 10°C per year, a measure of potential crop growth.6 In black are shown parishes with fewer than 675 degree-days above 10°C.
High and low degree days above 10°C
Notes: Light squares = more than 800 degree-days above 10°C. Dark squares = less than 675 degree days above 10°C.
Sources:Smith (1976). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975).
High and low degree days above 10°C
Notes: Light squares = more than 800 degree-days above 10°C. Dark squares = less than 675 degree days above 10°C.
Sources:Smith (1976). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975).
This simple climate difference clearly echoes the difference in rural social structure. Indeed if we regress parish FARMSHR (share of farmers amongst adult males in farming) on DEGDAY (degree days above 10°C) and DEGDAY2 (DEGDAY squared), the R2 is 0.34.7 One third of the variation in the numbers of farmers per person in farming can be explained by this one variable. Other climate and topographical variables correlate with organizational structure, as Table 1 shows: total rainfall, the growing season, and ‘days at capacity’.8 In the hotter, drier, flatter southeast, grain production predominated, and in this there were significant economies of scale, so average farm sizes were large. In the wetter, cooler, hillier north and west, dairy production was more profitable, with a smaller optimal farm scale. Family farms predominated with modest amounts of hired labour.
Social structure, climate, and topography
| Farmer share in all farm employment . | 0.0–0.1 . | 0.1–0.3 . | 0.3–0.5 . | 0.5+ . |
|---|---|---|---|---|
| Parish Elevation (m) | 79 | 84 | 107 | 156 |
| Parish gradient (m) | 160 | 207 | 318 | 425 |
| Rain (inches) | 27 | 29 | 34 | 39 |
| Growing season (days) | 231 | 223 | 198 | 169 |
| Degree-days above 10°C | 795 | 762 | 691 | 630 |
| Days soil at moisture capacity | 146 | 158 | 191 | 224 |
| Share of soil chalk | 0.11 | 0.05 | 0.03 | 0.02 |
| Share of soil gravel | 0.10 | 0.08 | 0.06 | 0.06 |
| Farmer share in all farm employment . | 0.0–0.1 . | 0.1–0.3 . | 0.3–0.5 . | 0.5+ . |
|---|---|---|---|---|
| Parish Elevation (m) | 79 | 84 | 107 | 156 |
| Parish gradient (m) | 160 | 207 | 318 | 425 |
| Rain (inches) | 27 | 29 | 34 | 39 |
| Growing season (days) | 231 | 223 | 198 | 169 |
| Degree-days above 10°C | 795 | 762 | 691 | 630 |
| Days soil at moisture capacity | 146 | 158 | 191 | 224 |
| Share of soil chalk | 0.11 | 0.05 | 0.03 | 0.02 |
| Share of soil gravel | 0.10 | 0.08 | 0.06 | 0.06 |
Notes: Parishes or townships with 40 or more adult men employed in farming in 1831, and at least 30% of adult men engaged in farming.
Sources:Smith (1976). Clark (1998). British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975)
Social structure, climate, and topography
| Farmer share in all farm employment . | 0.0–0.1 . | 0.1–0.3 . | 0.3–0.5 . | 0.5+ . |
|---|---|---|---|---|
| Parish Elevation (m) | 79 | 84 | 107 | 156 |
| Parish gradient (m) | 160 | 207 | 318 | 425 |
| Rain (inches) | 27 | 29 | 34 | 39 |
| Growing season (days) | 231 | 223 | 198 | 169 |
| Degree-days above 10°C | 795 | 762 | 691 | 630 |
| Days soil at moisture capacity | 146 | 158 | 191 | 224 |
| Share of soil chalk | 0.11 | 0.05 | 0.03 | 0.02 |
| Share of soil gravel | 0.10 | 0.08 | 0.06 | 0.06 |
| Farmer share in all farm employment . | 0.0–0.1 . | 0.1–0.3 . | 0.3–0.5 . | 0.5+ . |
|---|---|---|---|---|
| Parish Elevation (m) | 79 | 84 | 107 | 156 |
| Parish gradient (m) | 160 | 207 | 318 | 425 |
| Rain (inches) | 27 | 29 | 34 | 39 |
| Growing season (days) | 231 | 223 | 198 | 169 |
| Degree-days above 10°C | 795 | 762 | 691 | 630 |
| Days soil at moisture capacity | 146 | 158 | 191 | 224 |
| Share of soil chalk | 0.11 | 0.05 | 0.03 | 0.02 |
| Share of soil gravel | 0.10 | 0.08 | 0.06 | 0.06 |
Notes: Parishes or townships with 40 or more adult men employed in farming in 1831, and at least 30% of adult men engaged in farming.
Sources:Smith (1976). Clark (1998). British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975)
The climate variables in Table 1 all correlate with FARMSHR, because these measures are all highly correlated. Table 2 shows the correlation between the maximum elevation in England within 10-km grid squares, the yearly rainfall, the growing season, the degree days above 10°C, and the days when the soil is at moisture capacity. England is essentially composed of two farming zones: a lower, drier, warmer southeast and a higher, wetter, colder northwest, each with its associated optimal farming technology and organization.
Correlations between climate and topography
| . | Maximum elevation . | Rain . | Growing season . | Degree- days . | Days at capacity . |
|---|---|---|---|---|---|
| Maximum elevation (m) | 1.00 | ||||
| Rain (inches) | 0.78 | 1.00 | |||
| Growing season (days) | −0.90 | −0.78 | 1.00 | ||
| Degree-days above 10°C | −0.80 | −0.60 | 0.84 | 1.00 | |
| Days at soil capacity | 0.87 | 0.94 | −0.90 | −0.74 | 1.00 |
| . | Maximum elevation . | Rain . | Growing season . | Degree- days . | Days at capacity . |
|---|---|---|---|---|---|
| Maximum elevation (m) | 1.00 | ||||
| Rain (inches) | 0.78 | 1.00 | |||
| Growing season (days) | −0.90 | −0.78 | 1.00 | ||
| Degree-days above 10°C | −0.80 | −0.60 | 0.84 | 1.00 | |
| Days at soil capacity | 0.87 | 0.94 | −0.90 | −0.74 | 1.00 |
Source:Smith (1976).
Correlations between climate and topography
| . | Maximum elevation . | Rain . | Growing season . | Degree- days . | Days at capacity . |
|---|---|---|---|---|---|
| Maximum elevation (m) | 1.00 | ||||
| Rain (inches) | 0.78 | 1.00 | |||
| Growing season (days) | −0.90 | −0.78 | 1.00 | ||
| Degree-days above 10°C | −0.80 | −0.60 | 0.84 | 1.00 | |
| Days at soil capacity | 0.87 | 0.94 | −0.90 | −0.74 | 1.00 |
| . | Maximum elevation . | Rain . | Growing season . | Degree- days . | Days at capacity . |
|---|---|---|---|---|---|
| Maximum elevation (m) | 1.00 | ||||
| Rain (inches) | 0.78 | 1.00 | |||
| Growing season (days) | −0.90 | −0.78 | 1.00 | ||
| Degree-days above 10°C | −0.80 | −0.60 | 0.84 | 1.00 | |
| Days at soil capacity | 0.87 | 0.94 | −0.90 | −0.74 | 1.00 |
Source:Smith (1976).
Regressing FARMSHR on all climate, topographical, and soil variables increases the R2 of the prediction of FARMSHR to 0.39.9 At least some of the remaining variation is random error. If, for example, the estimation is done only for parishes with 150 or more farm workers then the R2 rises to 0.45. If the data are aggregated to the level of the 42 counties in England, the R2 from regressing FARMSHR on DEGDAY and DEGDAY2 becomes 0.68. Adding more geographic variables at the county level can bring the R2 above 0.80, though the adjusted R2 is then only 0.73. Figure 3 shows the connection between FARMSHR and DEGDAY at the county level.
Degree-days above 10°C and the share of farmers, by county
Source: British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). Smith (1976). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) for parish locations.
Degree-days above 10°C and the share of farmers, by county
Source: British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833). Smith (1976). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) for parish locations.
4. Social structure and rural education
Social structure could affect literacy in two ways: through demand for and supply of education. With a higher proportion of the population engaged in occupations where literacy had an economic value, as in the northwest, we would expect more demand for education and more literacy. If this was the only effect of social structure, then once we control for father's occupation, there would be no further effect on literacy. In particular, labourers’ children would be no more likely to be educated in the northwest than the southeast. This is the channel that Cinnerella and Hornung (2010) argue for nineteenth-century Prussia—they do not include variables to measure occupational composition, so we cannot tell whether there is any effect beyond the simple demand-side story.
Social structure could also affect education supply through at least three channels. First, if there are scale economies in schooling, greater demand will induce a lower cost supply of private education, so that a social structure with more farmers would also drive down the cost of education for labourers. In this case a labourer in the northwest would be more likely to be literate than a labourer in the southeast.
Second, it has been argued that local citizens may tax themselves to supply subsidized public education if more of them have political voice, as they would as farmers, as opposed to the voiceless landless labourers (Go and Lindert, 2010). Thus, Go and Lindert credit the high level of northern US educational attainment before 1850 to local communities voting to tax themselves to subsidize schools. In England before 1870, however, this mechanism of support for local schools was blocked by law. Local endowments, subscriptions, and bequests were the only funding for schools (Mitch, 1992, p.115).
But in modest-sized rural communities, those with property, who collectively ran the parish governments, could reach agreements to contribute voluntarily to subsidizing local schools. In his study of rural education in the East Riding of Yorkshire, Bamford (1965, p.10) described how ‘the birth, composition and welfare of most schools was usually a joint effort, involving the churches, the lord of the manor and other local celebrities, together with endowments and subscriptions’.
Thus the differences in social structure across England could have an effect through this third supply mechanism, the willingness of the local propertied to combine voluntarily to subsidize local education. If these contributions would mainly serve to subsidize the education of the children of the contributing group, such support would be easier to arrange. But in the southeast, where there were a few large employers per parish, landlords and farmers should have been indifferent to education, which had no value in the agriculture of the time, and would have no economic incentive to band together to offer subsidies for local charity schools. They would want to secure education for their own children. But there would be too few of them in any parish to provide proper public education.
W. K. Jordan's study of rural parishes, analysing the extent of philanthropy across English regions 1480–1660, provides evidence in support of this channel. He describes how, in a large county like Yorkshire, the development of an institution such as schooling varied a lot across parishes, because it was dependent on the prosperity and generosity of local large landowners or the success of groups of smaller, lower status people, yeomen and husbandmen, in clubbing together to fund a school (Jordan, 1962, pp.402–3). Other examples include the northwest parish of Hyde, which created a local school in the 1770s through public subscription, and the parish of Ashton, where parishioners in 1721 were able to fund the rebuilding of a local school (Harrop, 1983, p.41).
5. Education measures
We use literacy, the ability of grooms and brides to sign the marriage register at their wedding, for the years 1837–63 as a proxy for educational institutions at the parish level. After 1837, a uniform system of marriage registration was implemented. Each certificate was signed or marked by the bride and groom. We record the ability to sign along with other relevant details, including age, parish of residence at time of marriage, and occupation and father's occupation. The data were newly collected for this project and represent a significant addition to the stock of available knowledge on English nineteenth-century parishes and rural literacy. The measure used proxies for educational institutions for the years 1810–45, when these brides and grooms were of school-going age. There are other examples in the economics literature of literacy being used to proxy for educational attainment—the Engerman-Sokoloff series, Acemoglu et al. (2007), Tabellini (2010). Furthermore, Romer (1990) showed that literacy had some indirect effect on growth rates across countries and that this effect was almost identical when using direct measures of educational attainment instead. In the empirical analysis, we control for fathers’ occupation, thus partialling out the variation in literacy rates that might be due to family influence, wealth, or attitudes to education.
This data source has been widely used to study literacy and its advantages and shortcomings have been much discussed. The marriage registers cover the entire married population, and 88% got married.10 Anderson (1975, p.55) has shown that the rates of civil marriage were similar across England in 1844–64. Signature data also have the advantage of being comparable across time and space.
Potential flaws include the possibility that literacy and marriage were positively correlated, which would bias our measure upwards, and the likelihood that, since reading was taught first in schools, some of those who could not sign their name were in fact partially literate, which would mean that our measure is biased downwards. But as long as the magnitude of these biases stayed constant over time and place, they will not affect the tests performed here. Most serious perhaps is the claim that ability to sign one's name did not imply an ability to write more generally. But educational manuals from the eighteenth and nineteenth centuries suggest that unlike today, the first thing that a student learned to write was probably not their name but some religious words and phrases. Again, since we are concerned with relative literacy levels across England, what exactly signing implied for overall levels of education is not of concern here.
To examine the effects of local social structure on literacy we use 8,105 individual records of literacy from 100 parishes/townships in four counties. The counties are Essex, with the second highest ratio of labourers to farmers in England, Lancashire with one of the lowest ratios, and Northumberland and Somerset, which both have intermediate ratios. Figure 4 shows the location of the parishes. The main constraint in generating a large sample was identifying parishes that were largely rural, to facilitate a fair test of the Engerman-Sokoloff channels.
Table 3 displays summary statistics. There are more brides than grooms because grooms were more likely to come from a parish outside the sample. Such observations were omitted to maximize the chances that individuals grew up in the parish where we observe the marriage. In terms of representativeness of the sample, we note that there are higher proportions of women in our sample compared to the national average, and that the average age at marriage is slightly higher than that present in the Cambridge group sample of 26 parishes, which is generally accepted as representative of the English population. Specifically, the 1831 census showed that 51% of English inhabitants were female (Wrigley et al., 1997, Table 3.2, pp.46–47). The figures for our sample parishes in Essex, Lancashire, Somerset, and Northumberland, respectively, are 52%, 54%, 51%, and 60%. The Cambridge group age at first marriage for their sample was given as 23 for women and 25 for men for the years 1830–37 (Wrigley et al., 1997, Table 5.3, p.134). Our sample is close for women, but men are usually older for the marriages we observe. This is partly driven by the fact that we observe all marriages, not just first marriages. However, these summary statistics are reassuringly close to these other representative samples. In terms of occupational distribution, by definition our samples are not representative of the entire country because we focus on rural areas that have experienced the least economic change and therefore where endowments should matter most.
Literacy sample, 1837–63, summary statistics
| . | Essex . | Northumberland . | Somerset . | Lancashire . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Parishes/townships | 27 | 18 | 18 | 37 |
| Grooms | 1,325 | 468 | 951 | 1,000 |
| Brides | 1,423 | 703 | 987 | 1,166 |
| % from sample parishes | 93 | 83 | 98 | 87 |
| Literacy rate, grooms | 0.43 | 0.93 | 0.55 | 0.66 |
| Literacy rate, brides | 0.50 | 0.84 | 0.48 | 0.44 |
| Average age, grooms | 26 | 28 | 26 | 27 |
| Average age, brides | 23 | 25 | 24 | 24 |
| Father's occupation (%) | ||||
| Gentleman | 1 | 3 | 2 | 1 |
| Professional | 3 | 6 | 2 | 3 |
| Farmer | 7 | 20 | 18 | 39 |
| Trader | 5 | 5 | 5 | 4 |
| Craftsman | 15 | 21 | 18 | 19 |
| Skilled labourer | 6 | 25 | 5 | 9 |
| Unskilled labourer | 64 | 20 | 51 | 26 |
| Sample size | 2,748 | 1,361 | 1,937 | 2,061 |
| . | Essex . | Northumberland . | Somerset . | Lancashire . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Parishes/townships | 27 | 18 | 18 | 37 |
| Grooms | 1,325 | 468 | 951 | 1,000 |
| Brides | 1,423 | 703 | 987 | 1,166 |
| % from sample parishes | 93 | 83 | 98 | 87 |
| Literacy rate, grooms | 0.43 | 0.93 | 0.55 | 0.66 |
| Literacy rate, brides | 0.50 | 0.84 | 0.48 | 0.44 |
| Average age, grooms | 26 | 28 | 26 | 27 |
| Average age, brides | 23 | 25 | 24 | 24 |
| Father's occupation (%) | ||||
| Gentleman | 1 | 3 | 2 | 1 |
| Professional | 3 | 6 | 2 | 3 |
| Farmer | 7 | 20 | 18 | 39 |
| Trader | 5 | 5 | 5 | 4 |
| Craftsman | 15 | 21 | 18 | 19 |
| Skilled labourer | 6 | 25 | 5 | 9 |
| Unskilled labourer | 64 | 20 | 51 | 26 |
| Sample size | 2,748 | 1,361 | 1,937 | 2,061 |
Notes: Essex average ages are based on a subsample of the total observations, because often individuals simply reported whether they were of full age, so these average ages are biased upwards.
Source: Marriage Certificates from Essex Records Office, Lancashire Records Office, Northumberland Records Office, and Somerset Records Office (the last two from microfilms held by the Church of Jesus Christ of the Latter Day Saints).
Literacy sample, 1837–63, summary statistics
| . | Essex . | Northumberland . | Somerset . | Lancashire . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Parishes/townships | 27 | 18 | 18 | 37 |
| Grooms | 1,325 | 468 | 951 | 1,000 |
| Brides | 1,423 | 703 | 987 | 1,166 |
| % from sample parishes | 93 | 83 | 98 | 87 |
| Literacy rate, grooms | 0.43 | 0.93 | 0.55 | 0.66 |
| Literacy rate, brides | 0.50 | 0.84 | 0.48 | 0.44 |
| Average age, grooms | 26 | 28 | 26 | 27 |
| Average age, brides | 23 | 25 | 24 | 24 |
| Father's occupation (%) | ||||
| Gentleman | 1 | 3 | 2 | 1 |
| Professional | 3 | 6 | 2 | 3 |
| Farmer | 7 | 20 | 18 | 39 |
| Trader | 5 | 5 | 5 | 4 |
| Craftsman | 15 | 21 | 18 | 19 |
| Skilled labourer | 6 | 25 | 5 | 9 |
| Unskilled labourer | 64 | 20 | 51 | 26 |
| Sample size | 2,748 | 1,361 | 1,937 | 2,061 |
| . | Essex . | Northumberland . | Somerset . | Lancashire . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Parishes/townships | 27 | 18 | 18 | 37 |
| Grooms | 1,325 | 468 | 951 | 1,000 |
| Brides | 1,423 | 703 | 987 | 1,166 |
| % from sample parishes | 93 | 83 | 98 | 87 |
| Literacy rate, grooms | 0.43 | 0.93 | 0.55 | 0.66 |
| Literacy rate, brides | 0.50 | 0.84 | 0.48 | 0.44 |
| Average age, grooms | 26 | 28 | 26 | 27 |
| Average age, brides | 23 | 25 | 24 | 24 |
| Father's occupation (%) | ||||
| Gentleman | 1 | 3 | 2 | 1 |
| Professional | 3 | 6 | 2 | 3 |
| Farmer | 7 | 20 | 18 | 39 |
| Trader | 5 | 5 | 5 | 4 |
| Craftsman | 15 | 21 | 18 | 19 |
| Skilled labourer | 6 | 25 | 5 | 9 |
| Unskilled labourer | 64 | 20 | 51 | 26 |
| Sample size | 2,748 | 1,361 | 1,937 | 2,061 |
Notes: Essex average ages are based on a subsample of the total observations, because often individuals simply reported whether they were of full age, so these average ages are biased upwards.
Source: Marriage Certificates from Essex Records Office, Lancashire Records Office, Northumberland Records Office, and Somerset Records Office (the last two from microfilms held by the Church of Jesus Christ of the Latter Day Saints).
It is apparent in Table 3 that there were substantial differences in literacy rates across counties. But there were also surprising differences in relative literacy rates for men and women. Northumberland had the highest rates of both male and female literacy. Lancashire, however, whilst having high rates of male literacy, had the lowest rates for female literacy. Essex, with the lowest male literacy rates, had substantially higher female literacy rates. There were also substantial occupational differences across parishes, principally in the shares of farmers versus labourers.
Table 4 displays summary statistics for the geographic and agricultural structure variables that will be used in the regressions for the sample parishes. This can be compared to Table 1, which summarized all of England's agricultural parishes. We do not expect these tables to be identical, because our sample contains rural parishes, but we find that they are reassuringly close and that there is substantial variation in geography and social structure.
Representativeness of our sample
| . | Essex . | North'land . | Somerset . | Lancs. . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Share farmers, range | 0.04–0.31 | 0.03–0.73 | 0.10–0.69 | 0.07–0.76 |
| Elevation (m) | 44 | 172 | 103 | 99 |
| Rain (inches) | 23 | 34 | 34 | 42 |
| Growing season (days) | 257 | 148 | 212 | 168 |
| Degree-days above 10°C | 837 | 469 | 774 | 669 |
| Days soil at capacity | 109 | 223 | 182 | 234 |
| Share chalk | 0.01 | 0.01 | 0 | 0.01 |
| Share gravel | 0.07 | 0.03 | 0.07 | 0.04 |
| . | Essex . | North'land . | Somerset . | Lancs. . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Share farmers, range | 0.04–0.31 | 0.03–0.73 | 0.10–0.69 | 0.07–0.76 |
| Elevation (m) | 44 | 172 | 103 | 99 |
| Rain (inches) | 23 | 34 | 34 | 42 |
| Growing season (days) | 257 | 148 | 212 | 168 |
| Degree-days above 10°C | 837 | 469 | 774 | 669 |
| Days soil at capacity | 109 | 223 | 182 | 234 |
| Share chalk | 0.01 | 0.01 | 0 | 0.01 |
| Share gravel | 0.07 | 0.03 | 0.07 | 0.04 |
Sources:Smith (1976) (elevation, rain, growing season, degree-days, soil at capacity). Clark (1998) (share chalk, share gravel). British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833) (share farmers). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) (parish locations).
Representativeness of our sample
| . | Essex . | North'land . | Somerset . | Lancs. . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Share farmers, range | 0.04–0.31 | 0.03–0.73 | 0.10–0.69 | 0.07–0.76 |
| Elevation (m) | 44 | 172 | 103 | 99 |
| Rain (inches) | 23 | 34 | 34 | 42 |
| Growing season (days) | 257 | 148 | 212 | 168 |
| Degree-days above 10°C | 837 | 469 | 774 | 669 |
| Days soil at capacity | 109 | 223 | 182 | 234 |
| Share chalk | 0.01 | 0.01 | 0 | 0.01 |
| Share gravel | 0.07 | 0.03 | 0.07 | 0.04 |
| . | Essex . | North'land . | Somerset . | Lancs. . |
|---|---|---|---|---|
| Share farmers | 0.13 | 0.24 | 0.32 | 0.38 |
| Share farmers, range | 0.04–0.31 | 0.03–0.73 | 0.10–0.69 | 0.07–0.76 |
| Elevation (m) | 44 | 172 | 103 | 99 |
| Rain (inches) | 23 | 34 | 34 | 42 |
| Growing season (days) | 257 | 148 | 212 | 168 |
| Degree-days above 10°C | 837 | 469 | 774 | 669 |
| Days soil at capacity | 109 | 223 | 182 | 234 |
| Share chalk | 0.01 | 0.01 | 0 | 0.01 |
| Share gravel | 0.07 | 0.03 | 0.07 | 0.04 |
Sources:Smith (1976) (elevation, rain, growing season, degree-days, soil at capacity). Clark (1998) (share chalk, share gravel). British Parliamentary Papers, 1831 Census: Abstracts of the Answers and Returns, vols. 36–38 (Irish University Press, Shannon, 1833) (share farmers). UK Ordnance Survey, Gazetteer of Great Britain, 6th ed. (The Survey, Southampton, 1975) (parish locations).
This is highlighted particularly in the third row, which displays the range within counties for the share of farmers within agricultural workers—there will be plenty of variation to estimate its effect on literacy, even using county fixed effects.
One final potential data issue is that migration may have caused selection bias in the literacy data we collected. Movers may have different characteristics from stayers. For this period, it is difficult to obtain information on migration rates and characteristics across England. Pooley and Turnbull (2003) present information on migration from 16,091 life histories completed by family historians. They show that from 1750 to 1839, the average move was only 38 km and most moves occurred within region and were across settlements of similar size (Pooley and Turnbull, 2003, pp.65, 104–5.).11 Thus, in rural England a high proportion of moves were to other rural areas. This appears to be true for the counties analysed here. For 1750–1879, 76% of all migration within the north and northwest was classified as within region, as was 67% of migration in the east, and 69% in the southwest (Pooley and Turnbull, 2003, p.81). Similarly, Williamson (1990, pp.20–21) shows that emigration rates from the rural south and rural north of England were similar for the decade 1841–51 (they were both between 500 and 600 per 1,000). Thus, to the best of our knowledge, the patterns were quite similar across our sample regions. We are confident that differential rates of migration will not drive the results. Also individuals appear to have migrated to other similar rural areas for the most part. For example, Clark (1987, p.233) showed that 55% of migrants in a small early modern Norfolk sample migrated from a wood-pasture parish to a wood-pasture parish. It should thus be the case that net migration from our parishes has little effect on our measurement of literacy at the parish level.
Related to this issue is the question of how reasonable is our assumption that individuals grew up in the parish in which they get married. Again, Pooley and Turnbull (2003) are useful here. They explain that couples, or at least one member of a couple, often migrated after marriage, and that a small proportion of migrations occurred before marriage. For the period 1820–49, they estimate that 33% of male migrants and 25% of female migrants moved whilst single. Given that some proportion of these never married, this gives us an upper bound for the potential mismeasurement in the literacy variable (Pooley and Turnbull, 2003, pp.207–13). Kitch (1992), looking at Sussex from the late 1500s to the early 1900s, suggested that only 25–40% of marriages crossed parish boundaries and that it was more likely for higher class individuals to seek a marriage partner outside of their home parish. The mean distance travelled by marriage partners was 4.7 km. Given that our geography data apply to a slightly broader geographic area than parishes, this radius should be in line with the right-hand-side variables.
Finally, our variable choice is in line with those used in research on the Engerman-Sokoloff hypothesis and should not suffer from greater attenuation bias than those studies.
6. Empirical analysis
Does the employment structure explain differences in literacy rates across parishes? To test whether differences in demand and supply created by geography had effects on literacy, we estimate the coefficients in a set of logit regressions, because the dependent variable is dichotomous.12 The first is
which we estimate for grooms and brides together from parishes where at least half of men were employed in farming in 1831. DLIT is an indicator variable, which takes a value of 1 if the individual is literate. FAGR is the share of adult men employed in agriculture in the home parish in 1831; YEAR the year of the wedding (1837–63),13 and FARMSHR is the share of men engaged in agriculture who were farmers in 1831.
Was literacy greater in rural parishes with a larger share of farmers through the direct effect of employment types on education and through the indirect effects from lower priced supply and community provision of education? Table 5 reports the regression estimates as the marginal effect on the probability of being literate of a change in each independent variable of one unit from their mean value.
Explaining literacy rates 1837–63 (logit estimates)
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGR | −0.824** | −0.523** | −0.530** | −0.370** |
| (0.150) | (0.103) | (0.098) | (0.074) | |
| FARMSHR | 0.053 | −0.166 | 0.103 | 0.074 |
| (0.118) | (0.104) | (0.083) | (0.088) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Gentleman | 0.460** | 0.430** | ||
| (0.040) | (0.035) | |||
| Professional | 0.319** | 0.270** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.282** | 0.264** | ||
| (0.039) | (0.035) | |||
| Trader | 0.209** | 0.180** | ||
| (0.047) | (0.045) | |||
| Craftsman | 0.159** | 0.123** | ||
| (0.038) | (0.036) | |||
| Skilled labourer | 0.131** | 0.008 | ||
| (0.044) | (0.043) | |||
| Unskilled labourer | −0.178** | −0.178** | ||
| (0.036) | (0.032) | |||
| Lancashire | 0.019 | −0.117** | ||
| (0.040) | (0.039) | |||
| Northumberland | 0.372** | 0.262** | ||
| (0.032) | (0.031) | |||
| Somerset | 0.004 | −0.040 | ||
| (0.039) | (0.032) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.03 | 0.16 | 0.09 | 0.20 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGR | −0.824** | −0.523** | −0.530** | −0.370** |
| (0.150) | (0.103) | (0.098) | (0.074) | |
| FARMSHR | 0.053 | −0.166 | 0.103 | 0.074 |
| (0.118) | (0.104) | (0.083) | (0.088) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Gentleman | 0.460** | 0.430** | ||
| (0.040) | (0.035) | |||
| Professional | 0.319** | 0.270** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.282** | 0.264** | ||
| (0.039) | (0.035) | |||
| Trader | 0.209** | 0.180** | ||
| (0.047) | (0.045) | |||
| Craftsman | 0.159** | 0.123** | ||
| (0.038) | (0.036) | |||
| Skilled labourer | 0.131** | 0.008 | ||
| (0.044) | (0.043) | |||
| Unskilled labourer | −0.178** | −0.178** | ||
| (0.036) | (0.032) | |||
| Lancashire | 0.019 | −0.117** | ||
| (0.040) | (0.039) | |||
| Northumberland | 0.372** | 0.262** | ||
| (0.032) | (0.031) | |||
| Somerset | 0.004 | −0.040 | ||
| (0.039) | (0.032) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.03 | 0.16 | 0.09 | 0.20 |
Explaining literacy rates 1837–63 (logit estimates)
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGR | −0.824** | −0.523** | −0.530** | −0.370** |
| (0.150) | (0.103) | (0.098) | (0.074) | |
| FARMSHR | 0.053 | −0.166 | 0.103 | 0.074 |
| (0.118) | (0.104) | (0.083) | (0.088) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Gentleman | 0.460** | 0.430** | ||
| (0.040) | (0.035) | |||
| Professional | 0.319** | 0.270** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.282** | 0.264** | ||
| (0.039) | (0.035) | |||
| Trader | 0.209** | 0.180** | ||
| (0.047) | (0.045) | |||
| Craftsman | 0.159** | 0.123** | ||
| (0.038) | (0.036) | |||
| Skilled labourer | 0.131** | 0.008 | ||
| (0.044) | (0.043) | |||
| Unskilled labourer | −0.178** | −0.178** | ||
| (0.036) | (0.032) | |||
| Lancashire | 0.019 | −0.117** | ||
| (0.040) | (0.039) | |||
| Northumberland | 0.372** | 0.262** | ||
| (0.032) | (0.031) | |||
| Somerset | 0.004 | −0.040 | ||
| (0.039) | (0.032) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.03 | 0.16 | 0.09 | 0.20 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGR | −0.824** | −0.523** | −0.530** | −0.370** |
| (0.150) | (0.103) | (0.098) | (0.074) | |
| FARMSHR | 0.053 | −0.166 | 0.103 | 0.074 |
| (0.118) | (0.104) | (0.083) | (0.088) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Gentleman | 0.460** | 0.430** | ||
| (0.040) | (0.035) | |||
| Professional | 0.319** | 0.270** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.282** | 0.264** | ||
| (0.039) | (0.035) | |||
| Trader | 0.209** | 0.180** | ||
| (0.047) | (0.045) | |||
| Craftsman | 0.159** | 0.123** | ||
| (0.038) | (0.036) | |||
| Skilled labourer | 0.131** | 0.008 | ||
| (0.044) | (0.043) | |||
| Unskilled labourer | −0.178** | −0.178** | ||
| (0.036) | (0.032) | |||
| Lancashire | 0.019 | −0.117** | ||
| (0.040) | (0.039) | |||
| Northumberland | 0.372** | 0.262** | ||
| (0.032) | (0.031) | |||
| Somerset | 0.004 | −0.040 | ||
| (0.039) | (0.032) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.03 | 0.16 | 0.09 | 0.20 |
As can be seen in column (1) the share of the agricultural labour force who were farmers has no significant effect on overall literacy rates in these parishes. Even had the estimated marginal coefficient been statistically significant, the structure of farm employment could explain little of the great range in literacy rates (17–96%) across parishes in the sample, since this variable changes by less than 0.75 over the sample and the estimated coefficient is small at 0.053. In contrast, how rural a parish is, measured by the share of adult males engaged in farming, has a very powerful negative effect on literacy. Since this variable ranges from 0.5 to 1.0 for the sample of parishes used in the regression, this explains 40% of the variation in literacy.
The results are robust to using an alternative measure of inequality, the fraction of farms which are ‘small’, meaning hire fewer than 10 labourers. The results also do not change when an indicator for female is included in the specifications in Table 5. Supplementary Data presents these results, which are consistent with the coefficients displayed in Table 5, and which still find no significant effect of FARMSHR on education. A third robustness check included parish-level wealth, proxied by the value of property in a parish in 1842, as a control. The main conclusions hold and we also find only small, positive effects of wealth on literacy. The full results are presented in the Supplementary Data.
The findings are puzzling given two facts. Higher status occupations by fathers implied higher literacy rates by brides and grooms (as we shall see later). And parishes with a higher FARMSHR had more people in higher status occupations, as is shown in Table 3. The explanation lies in the estimation reported in column (2). This shows the estimated coefficients from a regression where we estimate literacy, controlling for the occupation of fathers. Thus,
where OCCi is one of the seven occupational categories given in Table 3. This regression asks whether, controlling for your father's occupation, you were more likely to be literate in a parish with a larger farmer share.
As expected, fathers’ occupations are powerful predictors of literacy. However, the coefficient on FARMSHR is larger but negative (though insignificant). Controlling for occupation, brides and grooms in parishes where a larger share of the farm labour force are farmers are more likely to be illiterate. This is what explains the failure to find any significant connection between the share of the rural population who were farmers and literacy rates. Seemingly where more men were farmers, farmers had lower status and literacy. Where more men were labourers, labourers had higher status and literacy.
In the third and fourth columns we repeat these estimates, controlling for differences at the county level in literacy rates. In these regressions the effect of FARMSHR on literacy is thus being estimated using only the variation across parishes within each county. Now, in columns (3) and (4), FARMSHR shows up as having a larger but still insignificant positive effect on literacy. Over the sample it might explain a 6–8% difference in parish literacy rates. These effects are still dwarfed by the effect of ruralness (which produces differences of 19–26% in parish literacy rates) and by the county fixed effects (which explain as much as a 38% difference in literacy rates). The structure of the farming population, dictated by climate, can play only a modest role in determining literacy rates by parish. The vast bulk of the explanation for varying literacy rates comes from factors such as how rural parishes were and county fixed effects.
Figure 5 shows the relationship between the share of agricultural workers who were farmers and the literacy rate for 70 parishes with 30 or more observations, and at least half of men in 1831 employed in agriculture. The figure illustrates two things. The first is that there are clearly local variations in literacy rates, independent of variations in the structure of agriculture. Northumberland literacy rates were high (89% on average), whatever the share of agriculturalists who were farmers, and Essex rates low (45% on average). The second is that there is at best a weak connection between the employment structure in farming and the literacy rate.
As noted, the lack of connection between occupational structure and literacy implies that in rural parishes with a higher farmer share, literacy rates for farmers are lower. Table 6 tests this implication by looking at the determinants of literacy rates for the children of farmers and for the children of labourers. If we do not include county fixed effects, farmers’ children are less literate in parishes where farmers are a larger share of the agricultural work force (column (1) of Table 6). With county fixed effects there is still a negative connection between FARMSHR and the literacy of farmers’ children, but the effect is not statistically significant. The fixed effect that produces this result is that in the county with the largest share of farmers, Lancashire, farmers were 20% less literate. Going back to the baseline specifications in Table 5, if we include an interaction term between FARMSHR and having a father who is a farmer, we see that there is a negative effect of a farmer father in parishes with larger proportions of farmers. This effect is about the same size as the advantage in general to being born to a farmer. The inclusion of this variable, which bears out our investigations in Fig. 6 and Table 6, still does not change the implication that social structure has little effect on literacy.
Literacy rates of unskilled labourers and farmers, by county
Notes: Dark bars represent the rate for farmers; light bars the rate for labourers.
Literacy rates of unskilled labourers and farmers, by county
Notes: Dark bars represent the rate for farmers; light bars the rate for labourers.
Literacy rates of farmers and labourers
| Dependent variable . | Farmer . | Farmer . | Unskilled labourer . | Unskilled labourer . |
|---|---|---|---|---|
| (Year 1837) | 0.005* | 0.005** | 0.009** | 0.009** |
| (0.002) | (0.002) | (0.001) | (0.001) | |
| FAGRIC | −0.292 | −0.296* | −0.587** | −0.421** |
| (0.171) | (0.140) | (0.121) | (0.095) | |
| FARMSHR | −0.341* | −0.065 | −0.003 | 0.109 |
| (0.152) | (0.116) | (0.097) | (0.092) | |
| Lancashire | — | −0.189** | — | −0.052 |
| (0.036) | (0.044) | |||
| Northumberland | — | 0.067** | — | 0.450** |
| (0.027) | (0.046) | |||
| Somerset | — | −0.033 | — | −0.037 |
| (0.032) | (0.040) | |||
| N | 1,616 | 1,616 | 3,584 | 3,584 |
| Pseudo-R2 | 0.03 | 0.10 | 0.03 | 0.08 |
| Dependent variable . | Farmer . | Farmer . | Unskilled labourer . | Unskilled labourer . |
|---|---|---|---|---|
| (Year 1837) | 0.005* | 0.005** | 0.009** | 0.009** |
| (0.002) | (0.002) | (0.001) | (0.001) | |
| FAGRIC | −0.292 | −0.296* | −0.587** | −0.421** |
| (0.171) | (0.140) | (0.121) | (0.095) | |
| FARMSHR | −0.341* | −0.065 | −0.003 | 0.109 |
| (0.152) | (0.116) | (0.097) | (0.092) | |
| Lancashire | — | −0.189** | — | −0.052 |
| (0.036) | (0.044) | |||
| Northumberland | — | 0.067** | — | 0.450** |
| (0.027) | (0.046) | |||
| Somerset | — | −0.033 | — | −0.037 |
| (0.032) | (0.040) | |||
| N | 1,616 | 1,616 | 3,584 | 3,584 |
| Pseudo-R2 | 0.03 | 0.10 | 0.03 | 0.08 |
Literacy rates of farmers and labourers
| Dependent variable . | Farmer . | Farmer . | Unskilled labourer . | Unskilled labourer . |
|---|---|---|---|---|
| (Year 1837) | 0.005* | 0.005** | 0.009** | 0.009** |
| (0.002) | (0.002) | (0.001) | (0.001) | |
| FAGRIC | −0.292 | −0.296* | −0.587** | −0.421** |
| (0.171) | (0.140) | (0.121) | (0.095) | |
| FARMSHR | −0.341* | −0.065 | −0.003 | 0.109 |
| (0.152) | (0.116) | (0.097) | (0.092) | |
| Lancashire | — | −0.189** | — | −0.052 |
| (0.036) | (0.044) | |||
| Northumberland | — | 0.067** | — | 0.450** |
| (0.027) | (0.046) | |||
| Somerset | — | −0.033 | — | −0.037 |
| (0.032) | (0.040) | |||
| N | 1,616 | 1,616 | 3,584 | 3,584 |
| Pseudo-R2 | 0.03 | 0.10 | 0.03 | 0.08 |
| Dependent variable . | Farmer . | Farmer . | Unskilled labourer . | Unskilled labourer . |
|---|---|---|---|---|
| (Year 1837) | 0.005* | 0.005** | 0.009** | 0.009** |
| (0.002) | (0.002) | (0.001) | (0.001) | |
| FAGRIC | −0.292 | −0.296* | −0.587** | −0.421** |
| (0.171) | (0.140) | (0.121) | (0.095) | |
| FARMSHR | −0.341* | −0.065 | −0.003 | 0.109 |
| (0.152) | (0.116) | (0.097) | (0.092) | |
| Lancashire | — | −0.189** | — | −0.052 |
| (0.036) | (0.044) | |||
| Northumberland | — | 0.067** | — | 0.450** |
| (0.027) | (0.046) | |||
| Somerset | — | −0.033 | — | −0.037 |
| (0.032) | (0.040) | |||
| N | 1,616 | 1,616 | 3,584 | 3,584 |
| Pseudo-R2 | 0.03 | 0.10 | 0.03 | 0.08 |
For unskilled labourers there appears to be no connection between the share of the agricultural population that comprises farmers and their literacy rate, as shown in columns (4) and (5) of Table 6. But when we allow county fixed effects, Northumberland stands out as having dramatically higher literacy rates for labourers. The analysis using unskilled labourers only also serves as a robustness check for our assumption that individuals were educated in the parish in which we observe them at marriage, because mobility was much lower amongst lower status individuals. There is therefore less chance of attenuation bias in these coefficients—it is thus reassuring that we still do not find evidence of a significant effect of inequality. This test also highlights that there does not seem to be any spillover effect of farmer demand for education onto the lower status individuals. Were spillovers a large factor, then we would expect FARMSHR to influence labourer education positively, which we do not find. Similarly, the results are robust to including measures of parish population size and density and those variables are not significant, indicating that spillovers are not present.
Figure 6 shows farmer and labourer literacy rates by county. Idiosyncratic variation across counties dominates any effect of the agricultural system.
The sense that idiosyncratic elements of local culture dominated in determining literacy, and not systematic elements from geography comes when we estimate separately male and female literacy rates by county, controlling for year, the fraction of the male population engaged in farming, and the share of those engaged in farming who were farmers. Figure 7 shows by county estimated male and female literacy rates. Female literacy rates relative to male rates vary widely. In Essex women were more literate than men, whilst in Lancashire they were substantially less literate. The argument that farm scale was the key determinant of literacy does not predict any county-level variation in relative female literacy. Again orthogonal cultural forces appear important.
The only information the 1831 census gives on female employment are the numbers of female ‘servants’. The ratio of female servants to the total population does vary substantially across parishes. Could it be that in areas of extensive employment of women in service, this raised the opportunity cost of female education and drove down literacy rates? This possibility is ruled out, however, by the finding that although Essex had the lowest rate of female employment in service, the rates of such employment were easily highest in Northumberland, which had an average level of difference between male and female literacy rates.
It could be that FARMSHR does not show a strong positive association with literacy because of reverse causation between literacy and FARMSHR. What would have to happen here is that high levels of literacy would themselves induce the creation of larger farms, through a move to more progressive farming structures and technologies which used economies of scale.
We can control for reverse causation by instrumenting for FARMSHR. Since FARMSHR correlates well with geography, as shown in Table 1, there is no shortage of potential instruments. However, for instruments to be valid they must not directly cause changes in literacy themselves. The structure must be
Table 7 shows the effects of FARMSHR when it is instrumented by all the geographic variables listed in Table 2. The F-statistics of the first stage in the estimation are satisfactory, above 10 in every specification. The reduced form for these regressions is presented in Supplementary Data and shows that there is some explanatory power in the geographic variables determining literacy. However the relationship is weak and consistent with our central argument that geographically driven inequality is not the main determinant of literacy.
Instrumental variables estimation
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| Constant | 1.075** | 0.975** | 0.634** | 0.593** |
| (0.128) | (0.091) | (0.160) | (0.142) | |
| FAGRIC | −0.833** | −0.581** | −0.478** | −0.323** |
| (0.149) | (0.099) | (0.153) | (0.125) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| FARMSHR | 0.041 | −0.406** | 0.884 | 0.743 |
| (0.150) | (0.139) | (0.592) | (0.559) | |
| Gentleman | 0.442** | 0.388** | ||
| (0.046) | (0.045) | |||
| Professional | 0.306** | 0.244** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.295** | 0.263** | ||
| (0.042) | (0.033) | |||
| Trader | 0.200** | 0.194** | ||
| (0.050) | (0.041) | |||
| Craftsman | 0.156** | 0.134** | ||
| (0.041) | (0.034) | |||
| Skilled labourer | 0.130** | 0.030 | ||
| (0.048) | (0.037) | |||
| Unskilled labourer | −0.187** | −0.181** | ||
| (0.039) | (0.028) | |||
| Lancashire | −0.171 | −0.271 | ||
| (0.154) | (0.146) | |||
| Northumberland | 0.280** | 0.162** | ||
| (0.077) | (0.075) | |||
| Somerset | −0.152 | −0.172 | ||
| (0.129) | (0.121) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.04 | 0.19 | 0.08 | 0.22 |
| First-stage F-Stat | 33 | 17 | 27 | 17 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| Constant | 1.075** | 0.975** | 0.634** | 0.593** |
| (0.128) | (0.091) | (0.160) | (0.142) | |
| FAGRIC | −0.833** | −0.581** | −0.478** | −0.323** |
| (0.149) | (0.099) | (0.153) | (0.125) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| FARMSHR | 0.041 | −0.406** | 0.884 | 0.743 |
| (0.150) | (0.139) | (0.592) | (0.559) | |
| Gentleman | 0.442** | 0.388** | ||
| (0.046) | (0.045) | |||
| Professional | 0.306** | 0.244** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.295** | 0.263** | ||
| (0.042) | (0.033) | |||
| Trader | 0.200** | 0.194** | ||
| (0.050) | (0.041) | |||
| Craftsman | 0.156** | 0.134** | ||
| (0.041) | (0.034) | |||
| Skilled labourer | 0.130** | 0.030 | ||
| (0.048) | (0.037) | |||
| Unskilled labourer | −0.187** | −0.181** | ||
| (0.039) | (0.028) | |||
| Lancashire | −0.171 | −0.271 | ||
| (0.154) | (0.146) | |||
| Northumberland | 0.280** | 0.162** | ||
| (0.077) | (0.075) | |||
| Somerset | −0.152 | −0.172 | ||
| (0.129) | (0.121) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.04 | 0.19 | 0.08 | 0.22 |
| First-stage F-Stat | 33 | 17 | 27 | 17 |
Instrumental variables estimation
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| Constant | 1.075** | 0.975** | 0.634** | 0.593** |
| (0.128) | (0.091) | (0.160) | (0.142) | |
| FAGRIC | −0.833** | −0.581** | −0.478** | −0.323** |
| (0.149) | (0.099) | (0.153) | (0.125) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| FARMSHR | 0.041 | −0.406** | 0.884 | 0.743 |
| (0.150) | (0.139) | (0.592) | (0.559) | |
| Gentleman | 0.442** | 0.388** | ||
| (0.046) | (0.045) | |||
| Professional | 0.306** | 0.244** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.295** | 0.263** | ||
| (0.042) | (0.033) | |||
| Trader | 0.200** | 0.194** | ||
| (0.050) | (0.041) | |||
| Craftsman | 0.156** | 0.134** | ||
| (0.041) | (0.034) | |||
| Skilled labourer | 0.130** | 0.030 | ||
| (0.048) | (0.037) | |||
| Unskilled labourer | −0.187** | −0.181** | ||
| (0.039) | (0.028) | |||
| Lancashire | −0.171 | −0.271 | ||
| (0.154) | (0.146) | |||
| Northumberland | 0.280** | 0.162** | ||
| (0.077) | (0.075) | |||
| Somerset | −0.152 | −0.172 | ||
| (0.129) | (0.121) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.04 | 0.19 | 0.08 | 0.22 |
| First-stage F-Stat | 33 | 17 | 27 | 17 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| Constant | 1.075** | 0.975** | 0.634** | 0.593** |
| (0.128) | (0.091) | (0.160) | (0.142) | |
| FAGRIC | −0.833** | −0.581** | −0.478** | −0.323** |
| (0.149) | (0.099) | (0.153) | (0.125) | |
| Year 1837 | 0.007** | 0.007** | 0.007** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| FARMSHR | 0.041 | −0.406** | 0.884 | 0.743 |
| (0.150) | (0.139) | (0.592) | (0.559) | |
| Gentleman | 0.442** | 0.388** | ||
| (0.046) | (0.045) | |||
| Professional | 0.306** | 0.244** | ||
| (0.046) | (0.044) | |||
| Farmer | 0.295** | 0.263** | ||
| (0.042) | (0.033) | |||
| Trader | 0.200** | 0.194** | ||
| (0.050) | (0.041) | |||
| Craftsman | 0.156** | 0.134** | ||
| (0.041) | (0.034) | |||
| Skilled labourer | 0.130** | 0.030 | ||
| (0.048) | (0.037) | |||
| Unskilled labourer | −0.187** | −0.181** | ||
| (0.039) | (0.028) | |||
| Lancashire | −0.171 | −0.271 | ||
| (0.154) | (0.146) | |||
| Northumberland | 0.280** | 0.162** | ||
| (0.077) | (0.075) | |||
| Somerset | −0.152 | −0.172 | ||
| (0.129) | (0.121) | |||
| Observations | 8,105 | 8,105 | 8,105 | 8,105 |
| Pseudo-R2 | 0.04 | 0.19 | 0.08 | 0.22 |
| First-stage F-Stat | 33 | 17 | 27 | 17 |
Without fixed effects for counties there is no greater effect of FARMSHR on literacy once it is instrumented, and indeed a significant negative effect once we control for occupation. However, only once we include fixed effects for counties, which are all substantial, FARMSHR enters positively with very strong effects, but the standard errors quadruple so that nothing is even close to statistically significant (presumably because there is a strong association between geography and the county fixed effects). The county effects again seem much better at predicting the literacy variation than differences across parishes in geography.
Even if the instrumental variables (IV) estimates in columns (3) and (4) were significant, then it implies that within a relatively homogeneous society such as England, differences in literacy rates created by factors other than geography will be dramatic, and geography will explain only a modest amount of the variation in literacy rates. To make this effect fit the data, the IV estimation has to assign large negative fixed effects on the counties with large FARMSHR, Lancashire and Somerset. Furthermore, because what we really want to uncover is why Northumberland has higher literacy than Essex, the regression without fixed effects is the most relevant one to consider and, again, shows little evidence of a significant positive effect of land equality on education.
7. Interpretation
It is clear from the foregoing that social structure derived from geography in early nineteenth-century England played little role in explaining variations in literacy across rural parishes, unless we assume large countervailing exogenous differences across counties in underlying literacy rates. Could this be just because the institutional framework was one that prohibited parishes from taxing rate payers to pay for schools, mechanisms that were available elsewhere (such as in North America) and were widely used? Could it be that social structure can only influence education when there is an ability to tax to supply subsidized public education?
A comparison of Northumberland with Berwickshire and Roxburghshire, the adjacent Scottish counties, suggests that local taxing powers for education were in no way required for the achievement of high literacy levels. In Scotland, though schooling was not free, the 1696 education legislation required a publicly supported school in each parish and provided for its support with a tax on landowners. Scotland achieved high literacy rates by the eighteenth century, which has been attributed to this mechanism of public support. Yet literacy rates in the Scottish lowlands, which includes these border counties, were little higher than in Northumberland by the early nineteenth century (Stephens, 1990, p.561).
The example of Northumberland shows that universal literacy was feasible even in an employment structure dominated by labourers, and where public provision of education through compulsory taxation was prohibited. There appears to have been a general culture in northern England that favoured education, which Howkins (1991, p.178) describes as an ‘almost Scottish stress on the merits of democratic education’. This pro-education sentiment was also, crucially, shared by labourers in Northumberland, and many children of labourers were educated at their parents’ expense, often in private schools (Stephens, 1987, pp.49–57). Houston's study of literacy and education in Scotland draws many comparisons between lowland Scotland and the northern English counties and argues that they shared many ‘cultural patterns of which literacy was one’ (Houston, 2002, p.257 and ch.2). Further evidence supporting the link to Scottish culture is found in O'Grada (2010) which finds that once the effects of economic structure and income are controlled for, Ulster stood out as having higher literacy rates than the rest of Ireland in 1841. The back-and-forth migration and long-standing connections of this region to Scotland are well known and may have contributed to this effect.
Taking this argument further, we measured the correlations between literacy rates in Northumberland townships and their distance to Edinburgh. We also measured the correlation of literacy and the percentage of Scots born living in each township, revealed by the 1841 census.14 Distance from Edinburgh was negatively associated with literacy, significant at the 5% level. The percentage of Scots born was positively associated, significant at the 10% level. These are simply correlations but they support a Scottish cultural influence in determining literacy rates. Figure 8 shows the relationship between literacy and percent of a township Scots born. Regressing literacy rates on this measure of ‘Scottishness’ shows that a 1 percentage point increase in the percent of a parish who was born in Scotland leads to a one percentage point increase in literacy rates.
Other compelling evidence against the geography hypothesis comes from looking at the earlier history of literacy in England. Geography was constant over time, and consequently even 200 years earlier, Essex already had few farmers and Lancashire many. There should thus already have been higher literacy in Lancashire than in Essex. But using signature data from the Protestation and Covenant Oaths of the 1640s, Cressy (1980, pp.76–85) shows that Lancashire had only a 24% male literacy rate, whilst the rate in Essex was substantially higher at 37%.
8. Religion and literacy
We have argued that culture must be responsible for differences in literacy rates across English regions. Was that cultural difference evidenced through religious affiliation? Since religious orientation can itself be heavily influenced by literacy and other cultural influences, we cannot show that religious affiliation determines literacy. But was it at least correlated with literacy? A recent study by Becker and Woessmann (2010) finds evidence of a causal link between Protestantism and literacy and education rates in nineteenth-century Germany. They use Prussian data to show that counties with proportionately more Protestants had higher school enrollment rates and more schools per inhabitant. This is true even after instrumenting for the Protestant share with distance to Wittenburg. This effect was particularly evident in the education of women (Becker and Woessmann, 2008).
We have information on the religious composition of England's parishes from the 1851 census. This information identifies the share of the population affiliated with the established Church of England, with the Catholic Church, and with other Protestant denominations. We take as our measure of Protestantism the share affiliated with these other Protestant denominations, since the Church of England was an institution that, even though reformed from Catholicism, retained many aspects of Catholicism. Table 8 summarizes the data.
Fraction nonconformist by county, 1851
| Fraction nonconformist . | Northumberland . | Lancashire . | Essex . | Somerset . |
|---|---|---|---|---|
| Average | 0.52 | 0.29 | 0.37 | 0.39 |
| Standard deviation | 0.10 | 0.01 | 0.08 | 0.06 |
| Fraction nonconformist . | Northumberland . | Lancashire . | Essex . | Somerset . |
|---|---|---|---|---|
| Average | 0.52 | 0.29 | 0.37 | 0.39 |
| Standard deviation | 0.10 | 0.01 | 0.08 | 0.06 |
Fraction nonconformist by county, 1851
| Fraction nonconformist . | Northumberland . | Lancashire . | Essex . | Somerset . |
|---|---|---|---|---|
| Average | 0.52 | 0.29 | 0.37 | 0.39 |
| Standard deviation | 0.10 | 0.01 | 0.08 | 0.06 |
| Fraction nonconformist . | Northumberland . | Lancashire . | Essex . | Somerset . |
|---|---|---|---|---|
| Average | 0.52 | 0.29 | 0.37 | 0.39 |
| Standard deviation | 0.10 | 0.01 | 0.08 | 0.06 |
Since the nonconformist share is high in the county with exceptional literacy rates, Northumberland, and similar in the three lower literacy counties, it seems promising that nonconformism will be correlated with literacy at the parish level.
Table 9 displays the results from logit regressions including a measure of the share of the population who was nonconformist.15 The results show that the share nonconformist does significantly correlate with higher literacy rates, in line with the Becker and Woessmann findings. This result holds even when we control for the occupational status of parents.
Logit regressions including religious composition
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGRIC | −0.763** | −0.479** | −0.512** | −0.356** |
| (0.145) | (0.102) | (0.103) | (0.080) | |
| FARMSHR | 0.176 | −0.060 | 0.077 | 0.054 |
| (0.103) | (0.095) | (0.084) | (0.090) | |
| Year 1837 | 0.008** | 0.008** | 0.008** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Lancashire | −0.003 | −0.133** | ||
| (0.042) | (0.040) | |||
| Northumberland | 0.406** | 0.303** | ||
| (0.034) | (0.036) | |||
| Somerset | 0.020 | −0.028 | ||
| (0.043) | (0.036) | |||
| Fraction nonconformist | 0.693** | 0.593** | −0.347 | −0.312 |
| (0.198) | (0.169) | (0.190) | (0.166) | |
| Occupation controls | No | Yes | No | Yes |
| Observations | 7,927 | 7,927 | 7,927 | 7,927 |
| Pseudo-R2 | 0.05 | 0.16 | 0.09 | 0.20 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGRIC | −0.763** | −0.479** | −0.512** | −0.356** |
| (0.145) | (0.102) | (0.103) | (0.080) | |
| FARMSHR | 0.176 | −0.060 | 0.077 | 0.054 |
| (0.103) | (0.095) | (0.084) | (0.090) | |
| Year 1837 | 0.008** | 0.008** | 0.008** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Lancashire | −0.003 | −0.133** | ||
| (0.042) | (0.040) | |||
| Northumberland | 0.406** | 0.303** | ||
| (0.034) | (0.036) | |||
| Somerset | 0.020 | −0.028 | ||
| (0.043) | (0.036) | |||
| Fraction nonconformist | 0.693** | 0.593** | −0.347 | −0.312 |
| (0.198) | (0.169) | (0.190) | (0.166) | |
| Occupation controls | No | Yes | No | Yes |
| Observations | 7,927 | 7,927 | 7,927 | 7,927 |
| Pseudo-R2 | 0.05 | 0.16 | 0.09 | 0.20 |
Logit regressions including religious composition
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGRIC | −0.763** | −0.479** | −0.512** | −0.356** |
| (0.145) | (0.102) | (0.103) | (0.080) | |
| FARMSHR | 0.176 | −0.060 | 0.077 | 0.054 |
| (0.103) | (0.095) | (0.084) | (0.090) | |
| Year 1837 | 0.008** | 0.008** | 0.008** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Lancashire | −0.003 | −0.133** | ||
| (0.042) | (0.040) | |||
| Northumberland | 0.406** | 0.303** | ||
| (0.034) | (0.036) | |||
| Somerset | 0.020 | −0.028 | ||
| (0.043) | (0.036) | |||
| Fraction nonconformist | 0.693** | 0.593** | −0.347 | −0.312 |
| (0.198) | (0.169) | (0.190) | (0.166) | |
| Occupation controls | No | Yes | No | Yes |
| Observations | 7,927 | 7,927 | 7,927 | 7,927 |
| Pseudo-R2 | 0.05 | 0.16 | 0.09 | 0.20 |
| . | (1) . | (2) . | (3) . | (4) . |
|---|---|---|---|---|
| FAGRIC | −0.763** | −0.479** | −0.512** | −0.356** |
| (0.145) | (0.102) | (0.103) | (0.080) | |
| FARMSHR | 0.176 | −0.060 | 0.077 | 0.054 |
| (0.103) | (0.095) | (0.084) | (0.090) | |
| Year 1837 | 0.008** | 0.008** | 0.008** | 0.007** |
| (0.001) | (0.001) | (0.001) | (0.001) | |
| Lancashire | −0.003 | −0.133** | ||
| (0.042) | (0.040) | |||
| Northumberland | 0.406** | 0.303** | ||
| (0.034) | (0.036) | |||
| Somerset | 0.020 | −0.028 | ||
| (0.043) | (0.036) | |||
| Fraction nonconformist | 0.693** | 0.593** | −0.347 | −0.312 |
| (0.198) | (0.169) | (0.190) | (0.166) | |
| Occupation controls | No | Yes | No | Yes |
| Observations | 7,927 | 7,927 | 7,927 | 7,927 |
| Pseudo-R2 | 0.05 | 0.16 | 0.09 | 0.20 |
But this result is not robust to the inclusion of county fixed effects, as is shown in the last two columns of the table. Once we control for county effects, so that the coefficient on the nonconformist share is estimated only on within-county variation, the estimated coefficient is now a substantial negative (though now these estimates are not statistically significantly different from zero). So there is actually little evidence that Protestantism itself is a driver of literacy rates, as opposed to Protestantism just being accidentally associated with the county with unusual high literacy rates. As Table 8 shows, however, there is little variation within counties at the parish level in nonconformism rates, so the failure to find any association may owe to this lack of variation.
9. Conclusions
A central component of the Engerman-Sokoloff hypothesis on the effects of geography on development is that pre-industrial agricultural systems, which create greater inequality, will reduce investments in education. For pre-industrial England, we see that geography created great variations in social structure. We also see significant variations in the literacy rates of rural areas. However, social inequality was not an important driver of these variations. Thus whilst Cinnirella and Hornung (2010) find that in nineteenth-century Prussia greater land ownership inequality is associated with lower levels of primary schooling, large-scale farming has no connection with illiteracy in England. There is no law linking geography and education. This is reinforced by the changing pattern of literacy in England 1640–1840, despite an unchanging geography and relative farm size composition.
However, whilst by exclusion the explanation of most of these literacy differences seems to be culture, we do not find evidence for the Becker-Woessmann conclusion that Protestantism was an important driver of literacy. Nonconformism is positively correlated with literacy, but this correlation occurs only at the county level and not within counties. Protestantism may matter, indirectly, in that the source of the cultural difference that drove higher literacy rates in Northumberland may be the emulation of the high levels of education in the neighboring Scottish counties of Berwickshire and Roxburghshire. Those high levels of education can plausibly be attributed to the religious doctrines of the established Church in Scotland, which was Calvinist in inspiration.
Supplementary material
Supplementary Data is available online at the OUP website.
Funding
National Science Foundation Grant SES, 02-41376 to [G.C]. All-University of California Economic History Group Summer Research Award 2007 to [R.G.].
Acknowledgements
Claire Phan provided excellent research assistance. We thank seminar participants at UC Davis and the Oxford FRESH Meeting for helpful comments on early drafts of the article and two anonymous referees for their suggestions.
" Engerman and Sokoloff (2012) summarizes a series of papers, including Engerman and Sokoloff (1994, 2002).
" Easterly instrumented with the ratio of land suitable for wheat production to that suited to producing sugar.
" For details of this, see de Carvalho Filho and Colistete (2010).
" Cinnirella and Hornung (2010) perform a similar exercise for Prussia, using nineteenth-century land ownership concentration. However, Prussia was much more culturally and institutionally diverse than England, embracing German, Polish, and Eastern Jewish populations whose concentrations correlated with the inequality measure used.
" The figure is drawn for parishes or townships with 40 or more men employed in farming, and at least 30% of the male population engaged in farming.
" Many plants do not grow unless the temperature exceeds a minimum, frequently taken as 10°C. The measure thus looks at the total days above this minimum times the amount the average temperature exceeded the minimum on each of these days. Crops such as wheat can only mature if this number exceeds a certain minimum.
" Weighting by the numbers engaged in farming in the parish.
" The length of the growing season is just the number of days where the average temperature exceeds 10°C. ‘Days at capacity’ is a measure of the number of days each year when land cannot be ploughed because it is waterlogged. It is a measure of the suitability of land for arable cultivation.
" The regression estimates are shown in Supplementary Data.
" In fact, 1750–1839, over 76% of moves were either within region or to a new settlement of the same size.
" The results are robust to using ordinary least squares. When the regressions are run using data aggregated to the parish level, the conclusions are unchanged but the coefficients are less precisely estimated. Similarly with specifications including parish fixed effects.
" Using year dummies in place of the time trend yields the same results.
" Using the website http://www.freemaptools.com/how-far-is-it-between.htm (accessed 7 March 2013).
" We also ran the instrumental variables specification, including religion. The results were similar to those shown in the OLS table but are not reported here.
