friends from afar: the taiping rebellion, cultural ...this paper tests the hypothesis that the...
TRANSCRIPT
Friends from Afar: The Taiping Rebellion, CulturalProximity and Primary Schooling in the Lower Yangzi,
1850-1949∗
Yu Hao‡
Peking University, School of Economics
Melanie Meng Xue§
UCLA Anderson School of Management
This Version: June 2016
Abstract
This paper tests the hypothesis that the cultural distance between migrants and na-tives impedes public goods provision. The Taiping Rebellion was a shock that causedgroups without a history of shared governance to be relocated into the same region.We use a unique historical dataset of surnames in the Lower Yangzi of China to con-struct a measure of the cultural distance between migrants and natives (MNCD).We find an one-standard-deviation increase in MNCD is associated with a decreaseof over 0.19 public primary schools per 10,000 persons in the early 20th century.Results survive various robustness checks and an instrumental analysis exploitingpre-existing cultural distances between native and the nearby population. Evidencefrom the timing of MNCD taking effect, suggests that the primary mechanism runsfrom migrant-native cultural distance through quality of collective decision-makingto modern primary education.
Keywords: Cultural Distance; Primary Education; Local Public Goods; Quasi-ExogenousMigrationJEL Codes: D72, J15, N45, N95, O15, Z1
∗We thank the editor Debin Ma, two anonymous referees, as well as Ying Bai, Zhiwu Chen, Qiang Chen,Gregory Clark, Christian Dippel, Mark Koyama, James Kung, Nan Li, Kris Mitchener, Jean-Laurent Rosenthal,Tuan Hwee Sng, Romain Wacziarg, Noam Yuchtman and conference participants at the All-UC Conference on“Frontiers in Chinese Economic History” , Chinese University of Hong Kong, Shandong University, Xiamen Uni-versity and Peking University for helpful comments and suggestions. All remaining errors are the responsibilityof the authors.
I. INTRODUCTION
An extensive literature documents the negative impact of population heterogeneity on public
goods provision (Easterly and Levine, 1997; Alesina, Baqir, and Easterly, 1999; Alesina and
Ferrara, 2005). However, recent research suggests the lack of history of shared and centralized
governance between groups is just as likely to be responsible for the adverse outcomes associated
with the coexistence of different ethnic groups(Gennaioli and Rainer, 2007; Michalopoulos and
Papaioannou, 2013). This raises the question whether ethnic cleavages or artificial jurisdictions
has caused poor economic performance. Dippel (2014) contributes to the debate by showing that
a lack of a history of shared governance can negatively affect even ethnically and linguistically
homogeneous populations. We instead show that even for previously detached groups, cultural
distance can matter for the coexistence of different ethnic groups.
We exploit variation in cultural distance between previously detached groups following an exter-
nal shock: the Taiping Rebellion. We use a unique dataset of Chinese surnames of approximately
100,000 individuals over the course of 150 years. The same dataset also allows us to build com-
mon measures of population heterogeneity such as fractionalization and polarization.
The Taiping Rebellion (1850-1864) was a massive civil war in South China that constituted a
one-time shock to the population makeup. The rebellion led to the loss of 17 million lives in the
Lower Yangzi, or half of the native population (Cao, 1998). After the war, migrants flocked into
the region and began to coexist with natives. This shock created two groups without a history
of shared governance or prior interaction in a region. Cultural proximity between migrants and
natives varied. We hypothesize that cultural distance between migrants and natives (“MNCD”),
who lived in the same community after the rebellion, had a negative impact on public goods
provision.
We provide the historical context to show that migration was plausibly exogenous to the cultural
distance between migrants and natives. First, migrants moved into the area with little prior
contact with natives. Migrants were not selected based on their cultural proximity with the
natives (as is the case with chain migration). Ex-ante sorting was minimum. Second, in tradi-
tional China where ancestral land was of cultural prominence, natives were not able to move out
as freely in response to the arrival of migrants whose preferences differed. Hence ex-post self-
sorting was not a concern, either.1 To further establish causality, we introduce an instrumental
variable approach exploiting variation in pre-existing native-nearby cultural distance.
We go on to test our hypothesis that a greater migrant-native cultural distance lowers public
goods provision. Our proxy for public goods provision is the number of public primary schools
at county level. In the baseline model, we find that a one-standard-deviation increase in MNCD
is associated with a decrease of 0.19 public primary schools per 10,000 persons between 1900
1Compared to Ager and Bruckner (2013), we use arguably more exogenous migration as a treatment, asnatives and migrants had few opportunities to engage in ex ante screening, or ex-post self-sorting.
1
and 1910. That is a fifth of the mean of the number of public primary schools by population, or
40% of the standard devision. We then include in the controls share of arable land, distance to
the Grand Canal, distance to the Yangtze River, distance to the provincial capital, and distance
to Shanghai. We show that MNCD wins horse races against alternative explanatory variables
including the traditional fractionalization index and the polarization index. For robustness,
we control for initial conditions, interventions in education (missionary activities and temple
conversion), and confront possible effects of war (battle exposure, demographic shock and human
capital shock) on schools.
While our key finding is that cultural distance has an independent effect on public goods pro-
vision outcomes, we find evidence that the negative effects of the cultural distance between
surname groups can be mitigated by the history of shared governance. Our finding is mainly
built off the horse race results of MNCD against other measures of population heterogeneity
that ignore the history of shared governance within native surname groups.
We provide suggestive evidence on the mechanisms through which MNCD prevented the estab-
lishment of public primary schools. First, we exploit institutional features of early 20th century
China to form a testable hypothesis : MNCD should have the strongest effect on lower-primary
and dual-primary schools, since (a.) MNCD should matter the most in an environment of self-
governance, (b.) villages were traditionally self-governed, and (c.) villages (and townships) were
responsible for the building of lower-primary schools, and sometimes, dual-primary schools.2
Consistent with our prediction, we find that MNCD only affects schools at lower-primary and
dual-primary schools, not at upper-primary and secondary schools. Second, we exploit time
variation in institutions through the first half of the 20th century. Massive institutional changes
in 20th century China provides an excellent laboratory to observe the impact of MNCD. In our
sample, we have periods of decentralization and centralization of the education system, when
decisions to education children were made locally and when those decisions were made by the
national or provincial government, respectively. This allows us to use evidence from timing to
interpret our finding. We expect to see a larger effect of MNCD during the period featuring
more self-governance and more decentralization of fiscal authority. And consistent with this
prediction, we find the effect of MNCD on modern education is pronounced in the early 20th
century but is muted in China for much of the 20th century under autocratic rule and fiscal
centralization. Soon after the centralization of the educational system in 1927, MNCD no longer
had a significant effect on schools. We conclude that MNCD resulted in fewer primary schools
being build due to lower quality of collective decision-making in local communities.
Our study builds on the literature on the relationship between diversity of individual preferences
and public goods provision. Alesina, Baqir, and Easterly (1999) show theoretically that the
2At the time, the entire phase of primary education was divided into two: upper- and lower-primary educa-tion.“Upper primary education” refers to the more advanced stage of primary education. Most schools specializedin either upper- or lower-primary education. Those providing both upper and lower primary education were called“dual-primary schools”.
2
median distance from the preference of the median voter can be considered as an indication of
how polarized preferences are. The model predicts that public goods provision will be adversely
affected in a polarized society characterized by two separate groups with relatively homogeneous
preferences within the group, but very distinct preferences across groups. More recent work
shows that in the process of decentralization and redistricting, the benefits of reduced diversity
can be undone if the newly governed population is highly polarized (Bazzi et al., 2015). In our
paper, we use the cultural distance between migrants and natives as a proxy for the difference
in preferences between these two groups. We find that cultural distances between groups indeed
matter for public goods provision, whereas the traditional fragmentation measure that assigns
the same distance to all groups does not produce the same effects.
Our study also contributes to the literature of the effect of genetic dissimilarity on economic
development. Ashraf and Galor (2013) find the beneficial and detrimental effects of diversity on
productivity, and conclude that an immediate level of diversity is the most conducive for eco-
nomic development. Desmet, Le Breton, Ortuno-Ortın, and Weber (2011) link genetic distance
to the stability and breakup of nations, and provides empirical support for the use of genetic dis-
tance as a proxy of cultural heterogeneity. Spolaore and Wacziarg (2009) show genetic distance
affects income differences across countries through a barrier effect to the diffusion of develop-
ment from the world technological frontier. Our paper similarly uses genetic distance as a proxy
for cultural distance, and focuses on the public goods provision consequences of greater genetic
and cultural distances between groups.
This paper is organized as follows. Section II explains the historical context. Section III discusses
data sources and the basis for constructing our measure of migrant-native cultural distance. Sec-
tion IV summarizes my baseline results and the comparison of migrant-native cultural distance
to the fractionalization and polarization of the population. Section V introduces a number of
robustness checks, accounting for initial conditions, interventions in education and war-related
conditions. Section VI comprises an instrumental variable analysis. Section VII identifies the
quality of collective decision-making as a possible channel for migrant-native cultural distance
to influence the supply of modern primary education. Section VIII concludes the paper.
II. HISTORICAL CONTEXT
A The Taiping Rebellion
The Taiping Rebellion was a massive civil war in South China which lasted from 1850 to 1864.
At least 17 million people, or half of the populace, died in the lower Yangzi (Cao, 1998; Cao and
Li, 2000). Battles broke out throughout the Lower Yangzi and all counties, with the exception
of Shanghai, were occupied for at least 3 months. The area around Nanjing, the capital city
of Taiping regime since 1853, had lingering conflicts for over ten years. The most prosperous
and important cities in the Lower Yangzi, Hangzhou and Suzhou were occupied by the Taiping
3
army after 1860. Shanghai, protected by foreign powers, was the least affected, and it served
as a shelter for over 200,000 refugees (Ge, 2002a, pp.62–63). Famine and plague followed the
battles. So did mass migration.
A.1 In-Flow Migration
Migration occurred both during the Taiping Rebellion itself and in the aftermath of the rebellion.
While migration internal to the Lower Yangzi was certainly common, post-Taiping migration
was best characterized by long-distance migration from North China and from the Middle Yangzi
River. A crucial difference between pre- and post-Taiping migration is that pre-Taiping migra-
tion was largely driven by income differences, job opportunities, and based on ethnic bonds
and geographic proximity (Li, 2011), whereas post-Taping rebellion migration originated from
a very wide range of geographic areas and featured diverse economic and cultural backgrounds.
Another difference is the scale and pace of migration—post-Taiping migration was far more
rapid and broader in scale. For this reason, in this paper we focus on post-Taiping migration.3
The mass migration led to conflicts between natives and migrants, and between different migrant
groups. In villages and townships conflicts arose as a result of clashing preferences and interests,
different dialects, skills and social customs. Conflicts, as documented in local gazetteers, took
place over a wide range of issues such as usage of public water, property rights of ownerless land
and eligibility for imperial exams (Ge, 2002a, pp. 303-308).
A.2 The Economic and Political Consequences of the Taiping Rebellion
The Taiping Rebellion constituted a multi-dimensional shock to the region. Most likely, it had
more than one way to affect primary schools. Those effects could be at play on both the supply
and demand side of education. In Section V.C, we provide a quantitative analysis of how various
outcomes of the Taiping Rebellion might have affected the building of primary schools in the
early 20th century.
The rebellion damaged local infrastructure. In the Jiaxin prefecture of Zhejiang, 21% of Bud-
dhist and Tao temples were destroyed by rebels affiliated with the God Worshiping Cult (Li,
2002). County public schools and private schools, where lower degree holders received further in-
struction to prepare for the higher level exams, were also destroyed or damaged in large numbers.
The destruction of local infrastructure could have arguably undermined the resources useful to
the launch of modern schools fifty years after the rebellion. That said, it is widely documented
that most temples and schools were restored shortly and even more were built in the late 19th
century. For example, Li (2002) found that 98 temples were destroyed by the rebels but 220
were built (or restored) within twenty years after the rebellion because living standards was
3The provincial governments advertised all around China for migrants and depicted the Lower Yangzi asa ‘kingdom of free land; and the ‘land of opportunities’. Farmers from Henan, Anhui, Hubei, Hunan, NorthJiangsu, and South Zhejiang came for a better living (Ge, 2002a, pp.100-106). After 1900, industrialization drewmassive immigrants into the urban area of Shanghai (Junya and Wright, 2010). Its population increased byfour-fold from 1907 to 1947.
4
even better than before the war and trade and commercial network was quickly restored. Kuhn
(2002), to the contrary, interpreted this trend (along with the rise of local charity organization)
as the rise of local gentry at provincial and county level overseeing local public affairs, which
eventually led to the formalization of local self-governance in the early 20th century.
The rebellion dismantled kinship networks. Clans used to provide financial aid for clan members
to receive education. During the rebellion rich families migrated to the urban area with their
less well-off relatives left behind in the countryside. Clans also lost land property to the war, the
rent from which were assigned as public funds for supporting education (Li, 1981). As discussed
by Xu and Yao (2015), kinship networks are an alternative to formal institutions in providing
public goods by effectively overcoming free-riding problems. However, in the context of education
reform in the 1900s, strong kinship networks can be a double-edged sword. Clans sometimes
would prefer the option of funding informal and private tutoring exclusively enjoyed by clan
members to establishing a school accessible by both clan and non-clan members. A weaker
kinship network, in that case, may have reduced within-kinship public goods but enhanced
cross-kinship public goods.
The rebellion led to huge population losses, which induced higher land-labor ratios and higher
wages (Cao and Chen, 2002). High wages forced war-stricken areas to abandon subsistence
agriculture and switch to labor-saving technologies and industries. In Wujin and Wuxi, the
silk industry superseded rice farming to be the largest employer in the rural area (Mickey and
Shiroyama, 2009). Lin and Li (2014) show that areas with a larger impact of war saw more
modern industrial enterprises in the late 19th century and had a higher level of urbanization
in the 1930s. In addition, the rebellion inadvertently created political room for institutions in
favor of modernization to set roots. Pro-reform officials were assigned to post-war provinces.
They established formal institutions to promote industrialization.
B Educational Reforms: From Traditional to Modern Education
Fifty years after the Taiping Rebellion, Qing Government put forward an educational reform.
The abolition of the imperial exam system went hand in hand with the attempt to establish a
western-style, modern school system.
Prior to 1905, education focused on Confucian classics and aimed at preparing students for the
imperial examinations. The traditional educational system included two stages: mass primary
education aimed for basic literacy and talent spotting, and more advanced education that drilled
candidates selected from the first stage to pass the exams (Leung, 1994). In the late 19th century
growing economic openness gave rise to higher demand for education in science, technology, and
other non-exam skills (Yuchtman, 2015). Attempts by missionaries to build modern schools
began in some coastal cities as early as the 1860s. But only until the abolishment of the
exam system in 1905 did modern education begin to expand. The Ministry of Education was
established, and Offices of Provincial Education was founded, along with county-level agencies
known as “Education Exhorting Offices” (quan xue suo).
5
Educational reform was not a smooth process. Despite ambitious political and educational
reforms, few things changed on the ground. For villages and townships, the process of building
modern schools was slow and painful. County officials often found it difficult to raise county
taxes, and make within-county transfers to ensure universal primary schools. Clans sometimes
would prefer to provide direct financial aid to clan members for them to take cheaper informal
and private tutorships, rather than establish a school open to both clan and non-clan members.
More details concerning both the institutional features of the traditional exam system and of
the modern education system are included in the appendix (Appendix D and Appendix E).
III. DATA AND MEASUREMENT
Table A-1 provides summary statistics of all the variables used in the paper and their data
sources. Below we focus on the underlying logic of our independent variable, migrant-native
cultural distance.
A Independent Variable: Migrant-Native Cultural Distance
Our independent variable is the cultural distance between migrants and natives. We rely on
surname data to construct our measure. To be specific, we use differences in the surname mix
to proxy for the cultural distance between migrants and natives:
MNCDi =1
normalized isonomyN,MN,i
, (1)
where normalized isonomyN,MN,i =∑S
k Pk,N,iPk,MN,i√∑Sk P 2
k,N,i
∑Sk P 2
k,MN,i
. S is the number of surnames in the two
groups. Pk,N,i and Pk,MN,i are the relative frequencies of surname k within natives and within the
entire population including natives and migrants.4 The isonomy between the native population
and the entire population,∑S
k Pk,N,iPk,MN,i, measures how likely any individual randomly drawn
from within natives bears the same surname as one drawn from within the entire population.
We normalize it with the isonomy of the native population,∑S
k P2k,N,i, and the isonomy of the
entire population,∑S
k P2k,MN,i. MNCD captures how culturally dissimilar natives and migrants
were. Figure 1 illustrates migrant-native cultural distance in the Lower Yangzi.
Our approach is in line with Bai and Kung (2011, 2015); Li (2011); Spolaore and Wacziarg
(2009). Li (2011) uses surname distances between pair of countries or regions at a given time to
measure multilateral genetic and cultural distance. Following Du et al. (1997), Bai and Kung
(2011) and Bai and Kung (2015) use isonomy (similarity in surname distribution between any
4In practice, we extract migrant-native cultural distance from the distance in the surname distribution ofa county’s population before and after the rebellion, with the assumption that the surname mix of the nativesremained relatively stable during that period. It is clear that the total population of natives declined after therebellions, but as long as the proportion of each surname population to the total population did not change, ourassumption remains valid.
6
Figure 1: Migrant-Native Cultural Distance
pair of population) to approximate genetic and cultural distance across regions. They show that
this surname-based measure is strongly correlated to a measure of genetic distance based on the
frequency distribution of the A, B and O alleles of the ABO gene at the province level. They
also find that surname-based measure is strongly correlated to a measure of cultural distance
based on dialects. Similar to Spolaore and Wacziarg (2009), they find that the smaller genetic
or cultural distance to the technological frontier from a region, the faster the technology diffused
to that region and hence faster growth. In this paper, we adopt the same measure as in Bai
and Kung (2011); but instead of using isonomy between two regions, we use isonomy between
before-migration population (natives) and after-migration population (natives and migrants) to
proxy for the consanguinity between natives and migrants.
We do not have data on which surnames were associated with migrants but it is generally
the case that migrants as a group were different to natives in terms of surname distribution,
especially when they came from far away. To obtain the surname distribution of a county before
migration, we hand-collect data from county chronicles (a.) name lists of civilians who died
during the rebellion 1851-1865, (b.) exam degree holders (1645-1850) (c.), surnames of chaste
women, (d.) surnames of their husbands if they are also recorded. The number of records for
7
each county ranges from 800 to 3000. To obtain the surname distribution of a county after
migration, we use the following sources: (a.) surnames of dead soldiers (1927-1953) and (b.)
college students born in that county who graduated between 1900 and 1949. The number of
surnames from the above sources ranges from 500 to 2500 per county. More details can be found
in Appendix B.
We draw surnames of individuals from a wide range of social backgrounds to improve the repre-
sentativeness of the surname sample. One concern might be that these samples seem too small
to correctly estimate the true surname distribution of population if each surname accounts for a
small fraction of population. However, this is not the case for China. In each county the largest
few surnames each accounts for more than 5% of the population. To correctly estimate surname
distribution of population at least for the largest few surnames, one needs a population sample
of as small as a few hundred. For the six counties we indeed have a small sample problem, we do
not include them in most of our regressions. As counties with a small sample are not random, we
impute our outcome variable for those six counties, and run regressions on the sample including
them as well.
One way to check the validity of our measure is to cross-check with measures reflecting cultural
or ethnolingustic devisions. One indicator of cultural devisions in the Lower Yangzi is linguistic
enclaves. Before the rebellion, almost the entire lower Yangzi spoke Wu. After the rebellion,
migrants settled in this area, giving rise to thousands of Mandarin-speaking villages and com-
munities (Huang, 2004; Cao, 2006; Simmons et al., 2006). In Figure 2, we mark counties with
linguistic enclaves where Mandarin (guan hua) is used. As shown on the map, counties with
some of the highest values of MNCD harbor linguistic enclaves even today. This enhances our
confidence in the validity of our measure.
Ideally, we would also like to construct a weighted fragmentation index that takes into account
whether individuals with the same surname to be both natives, both migrants, or one native
and one migrant. The fractionalization of the population comes down to dissimilarity of options
preferred by each other. Rather than to simply stipulate that options are either similar or
dissimilar, (Bossert et al., 2003) propose alternative frameworks that permit more degrees of
similarity between options. A generalized index of fractionalization is described in (Bossert
et al., 2011). Drawing on the insights from Bossert et al. (2003), Alesina and Ferrara (2005)
and Caselli and Coleman (2013), we can assume the distance between two individuals with the
same surname, if one is a migrant and the other is a native, to be positive; and the distance
between two individuals with the same surname, both being natives or being migrants, to be
zero. Unfortunately, without individual-level or surname-level data on migrant-native status,
our ability to operationalize this index with our data is limited.
B Control Variables
To account for other factors shaping modern education, we include in the baseline controls pri-
mary schools rate in 1880, population size and urbanization rate. Culturally dissimilar migrants
8
Figure 2: Mandarin Linguistic Enclaves in the Lower Yangzi (Wu-Speaking) Sources: Cao(2006)
might be selected into places with economic conditions that are not in favor of public goods
provision and schooling. So in the full set of controls, we include ruggedness, share of arable
land, agricultural suitability, distance to the Yangtze River, distance to the Grand Canal, dis-
tance to the provincial capital and distance to Shanghai to capture other differences in economic
conditions between counties. For robustness, we account for initial conditions—the number of
charitable organizations by 1850, population and population density in 1820, and measure of the
impact of war—battle exposure (months), % of elderly and youth (under 20 or over 40), % of
adult men (between 20 and 40) and a measure of differences in human capital between migrants
and natives which we call the “human capital shock”. We infer that natives who were able to set
up more charitable organizations have higher social capital, which is likely correlated with both
the type of migrants they admit and retain, as well as with their own ability to provide public
education. We also discuss other potential shocks to education, such as missionary activities,
measured by the log of one plus communicants per 10,000, and temple conversion, measured
by the log of one plus number of temple-converted schools. We expect both variables to be
positively associated with our dependent variable.
IV. BASELINE RESULTS
We use an OLS model to estimate the impact of migrant-native cultural distance on the number
9
of public primary schools:
#public primary schools per 10,000 personsi = α + βMNCDi + XiΩ + εi . (2)
The dependent variable #public primary schools per 10,000 persons is the number of public
primary schools per 10,000 persons right after the educational reform.5 MNCDi is the migrant-
native cultural distance in County i. MNCD exclusively focuses on the cultural distance between
migrants and natives—two groups with no history of shared governance.6 XiΩ are a vector of
county-level controls. εi is a disturbance term.
Table 1 summarizes estimates of the effect of migrant-native cultural distance on public primary
schools. With all controls, a one-standard-deviation of MNCD reduces the number of public
primary schools by approximately 0.18 school per 10,000 persons, which is equal to a fifth of
the mean or 40% of the standard deviation. We show an unconditional regression of MNCD on
primary schools during the decade of 1900-1910 in Column 1. The relationship is both negative
and significant. In Columns 2 and 3 we add population and urbanization rate sequentially. In
Alesina et al. (1999), a larger population means greater economy of scale to provide public goods,
but a higher transaction cost in raising taxes. Urbanization may enhance the economic return
of education, affecting the demand side of education. In Column 4, we include basic education
access in 1880. We interpret this as a measure of the stock of human capital in an area, and as a
summary statistic of those slow-moving components in local culture and institutions that shape
the decision to receive education in the long run. By isolating the influence of past educational
achievement under the private education system, we are one step closer to focusing on the impact
of MNCD on public goods provision. From Column 5 to Column 10, we introduce geographic
controls such as ruggedness, share of arable land and agricultural suitability, distance to the
Grand Canal, distance to the Yangzi River, distance to Shanghai, and distance to the provincial
capital.7 Share of arable land and agricultural suitability can proxy the opportunity cost of
receiving modern education. Distance to the Grand Canal and distance to the Yangzi River are
used to proxy the market potential and access to trade. Distance to the provincial capital is
included to account for the reach of provincial government or state capacity. Shanghai, which
became a treaty port as early as 1845, was exposed to rapid modernization and industrialization.
We control for distance to Shanghai to account for the spillover effects of Shanghai on the rest of
the region. The coefficient estimate of MNCD in Column 1 (-0.21) with no control is of similar
magnitude to that in Column 10 (-0.18) with all controls, when there is a sizable increase in R2
(from 0.174 to 0.399).
5Data are from the 1907, 1908 and 1909 Census. Those are the only censuses in the decade of 1900 thatcontain information on schools.
6Dippel (2014) stresses the role of the history of shared governance in the discussion of ethnically andlinguistically fragmented jurisdictions having poorer economic performances.
7Nanjing for Jiangsu, and Hangzhou for Zhejiang.
10
Tab
le1:
Th
eIm
pac
tof
ofM
igra
nt-
Nat
ive
Cu
ltu
ral
Dis
tan
ce:
Mai
nS
pec
ifica
tion
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
Dep
end
ent
vari
ab
le:
#p
ub
lic
pri
mary
sch
ools
per
10,0
00
per
son
s
MN
CD
-0.2
19∗∗
∗-0
.272∗
∗∗-0
.272∗∗
∗-0
.271∗∗
∗-0
.272∗
∗∗-0
.255∗
∗∗-0
.246∗∗
∗-0
.192∗∗
∗-0
.161∗
∗∗-0
.180∗∗
∗-0
.185∗∗
∗
(0.0
51)
(0.0
52)
(0.0
52)
(0.0
58)
(0.0
59)
(0.0
61)
(0.0
67)
(0.0
60)
(0.0
59)
(0.0
51)
(0.0
41)
Log
pop
ula
tion
-0.2
37∗
∗∗-0
.240∗∗
-0.2
45∗∗
∗-0
.252∗∗
∗-0
.228∗
∗-0
.203∗
-0.1
85
-0.1
97∗
-0.3
33∗∗
∗-0
.347∗∗
∗
(0.0
86)
(0.0
91)
(0.0
85)
(0.0
87)
(0.0
93)
(0.1
07)
(0.1
13)
(0.1
07)
(0.1
08)
(0.0
92)
Urb
aniz
atio
n0.0
00
0.0
00
0.0
00
0.0
01
0.0
01
-0.0
00
-0.0
02
0.0
03
0.0
03
(0.0
03)
(0.0
03)
(0.0
02)
(0.0
02)
(0.0
03)
(0.0
03)
(0.0
03)
(0.0
03)
(0.0
03)
Log
pri
mar
yen
roll
men
t18
800.0
35
0.0
23
-0.0
56
-0.0
03
-0.0
80
0.0
18
0.0
76
0.2
44
(0.2
64)
(0.2
72)
(0.2
64)
(0.3
10)
(0.2
96)
(0.2
89)
(0.2
63)
(0.2
60)
Ru
gged
nes
s-0
.013
0.0
10
0.0
26
-0.0
02
0.0
23
0.0
16
0.0
41
(0.0
40)
(0.0
64)
(0.0
77)
(0.0
78)
(0.0
84)
(0.0
68)
(0.0
41)
%A
rab
lela
nd
0.5
88
0.6
98∗
0.4
24
0.1
32
0.4
83
0.5
35
(0.3
58)
(0.4
07)
(0.4
30)
(0.5
02)
(0.4
28)
(0.3
87)
Agr
icu
ltu
ral
suit
abil
ity
0.0
14
0.0
08
-0.0
36
-0.0
39
-0.0
06
-0.0
04
(0.0
45)
(0.0
47)
(0.0
41)
(0.0
44)
(0.0
39)
(0.0
37)
Dis
t.to
Gra
nd
Can
al0.1
96
0.0
21
-0.0
67
-0.5
94
-0.4
69
(0.3
28)
(0.3
61)
(0.3
69)
(0.4
00)
(0.3
47)
Dis
t.to
Yan
gtze
0.2
46∗
0.2
05∗
0.1
44
0.0
99
(0.1
30)
(0.1
21)
(0.1
19)
(0.0
99)
Dis
t.to
Sh
angh
ai-0
.002∗
0.0
02
0.0
02
(0.0
01)
(0.0
01)
(0.0
01)
Dis
t.to
pro
vin
cial
cap
ital
0.0
04∗∗
∗0.0
04∗∗
∗
(0.0
01)
(0.0
01)
Ob
serv
atio
ns
5454
54
54
54
54
54
54
54
54
60
Ad
just
edR
20.
174
0.267
0.2
53
0.2
38
0.2
25
0.2
42
0.2
31
0.2
71
0.3
05
0.3
99
0.4
07
Not
es:
Th
eta
ble
rep
orts
the
imp
act
ofm
igra
nt-
nat
ive
cult
ura
ld
ista
nce
onth
enu
mb
erof
pu
bli
cp
rim
ary
sch
ools
inth
ed
ecad
eof
1900
-191
0.T
he
un
itof
anal
ysi
sis
aco
unty
inR
epu
bli
can
Ch
ina.
All
bas
elin
eco
ntr
ols
are
incl
ud
edin
Colu
mn
4:
the
natu
ral
log
ofp
opu
lati
on,
urb
aniz
atio
nra
tean
dth
en
atu
ral
log
ofpri
mar
ysc
hool
enro
llm
ent
in18
80.
Both
base
lin
eco
ntr
ols
an
dgeo
gra
ph
icco
ntr
ols
are
incl
ud
edin
Col
um
n10
.C
olu
mn
11in
clu
des
six
add
itio
nal
obse
rvat
ions
inw
hic
hM
NC
Dis
imp
ute
d.
Rob
ust
stand
ard
erro
rsar
ein
clu
ded
inal
lsp
ecifi
cati
ons.
∗p<
0.1
0,∗∗
p<
0.05
,∗∗
∗p<
0.0
1
11
This assures us that selection on unobservables are likely limited.8 In Column 11 we add six
counties with imputed values of MNCD to the sample. Our coefficients of interest remain stable
throughout the columns. The coefficient estimate in Column 11 is somewhat smaller. We
attribute this to the addition of six counties with imputed values of MNCD resulting in an
increase in measurement error. We use Column 4 as our baseline for the rest of the paper.
We next show how MNCD compares to measures of population heterogeneity. We use the
same surname data to construct the traditional fractionalization index and two polarization
indices (See Appendix G). In Table 2, we estimate the effects of MNCD, fractionalization and
polarization on public primary schools. Our main finding is that MNCD has a strong and
positive effect on public primary schools, and neither fractionalization nor polarization in the
entire population has a statistically significant effect.
The traditional fractionalization index is positively correlated with the number of public primary
schools per 10,000 persons (Col. 1-3), once MNCD is controlled for. This suggests that once
partialling out the effects of the cultural distance between migrants and natives, the fraction-
alization of the population does not necessarily reduce public goods provision. The coefficient
estimates of the polarization indices are negative across the columns (Col. 4-9). By comparing
MNCD to the polarization index adjusting for intergroup distances between surname groups
(Col. 7-9), we find that MNCD can reduce both the statistical significance and magnitude of
the coefficient estimate of the distance-based polarization index. From Column 7 to Column 8,
the coefficient estimate of polarization of the entire population moves from -0.165 (the p-value
is 0.138) to -0.118 (the p-value is 0.269).
In face of other measures of population heterogeneity, the effects of MNCD largely remain.
This first suggests that MNCD likely captures an important component distinct from what is
encapsulated in the fractionalization and polarization indices. Secondly, MNCD likely shares a
few traits in common with polarization. And they both have negative effects on public primary
schools. Third, I can also infer that the history of shared governance mitigates the effects of
cultural distance on local public goods provision. Likely by taking into account the history of
shared governance, MNCD outperforms polarization in explaining outcomes of public primary
schools. Our interpretation is that the cultural distance between previously detached groups is
a more important type of cultural distance. Unfortunately, given the nature of the between-
surname-group “distance” measure we use (see Appendix G), we can only draw a tentative
conclusion about the mitigating role of the history of shared governance on the negative effects
of the cultural distance between groups.
8Oster (2014) formalizes this approach. The results of this analysis suggest that the ratio of selection onunobservables relative to selection on observables has to be five times larger to explain away my results. Based onthe reasoning outlined by Altonji et al. (2005) that unobservables should not be more important than observablesin explaining the treatment, it is highly unlikely that unobservables are biasing my results.
12
Table 2: MNCD, Fractionalization and Polarization
(1) (2) (3) (4) (5) (6) (7) (8) (9)Dependent variable: #public primary schools per 10,000 persons
MNCD -0.214∗∗∗ -0.215∗∗∗ -0.206∗∗∗ -0.173∗∗ -0.163∗∗∗ -0.175∗∗
(0.056) (0.059) (0.056) (0.072) (0.052) (0.066)FRAC 0.013 0.023∗∗ 0.023∗
(0.012) (0.011) (0.012)Native FRAC -0.001
(0.015)POL -1.621 -3.722 -2.196
(2.851) (2.664) (3.539)Native POL -1.193
(1.716)POL DIST -0.165 -0.118 -0.091
(0.109) (0.106) (0.155)Native POL DIST -0.114
(0.386)Baseline controls Y Y Y Y Y Y Y Y YGeographic controls Y Y Y Y Y Y Y Y Y
Observations 54 54 54 54 54 54 54 54 54Adjusted R2 0.318 0.435 0.421 0.306 0.412 0.405 0.341 0.405 0.392
Notes: The table reports OLS results of the impact of fractionalization and polarization onpublic primary schools. The unit of observation is a county in Republican China. Robuststandard errors are used in all specifications. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
V. ROBUSTNESS CHECKS
A Subsamples
We first check if our results are robust to the omission of outliers. Table 3 provides evidence
that outliers do not have a substantial impact on our results. In Column 2 we exclude counties
with extreme values of MNCD. Column 3 excludes counties with extremely large human capital
shocks from the sample. Column 4 drops Shanghai. Coefficients in Columns 3 and 4 are fairly
comparable to those in the baseline. The coefficient of MNCD is slightly larger in Column 2.
This indicates possible attenuation bias due to measurement errors in MNCD. Our measurement
of MNCD may be particularly poor for those outliers, and MNCD may become more sensitive
to sampling errors in the range of large values. In such cases, removing those observations with
mis-measured values should improve our estimation.
B Initial Conditions
Public goods provision depends on the level of social capital. At the same time, higher social
capital within natives could imply more exclusivity, which could result in fewer migrants, and
especially, fewer migrants who disagree with values possessed by natives. To check if social
capital drives both MNCD and public primary schools, we explicitly include initial social capital
13
Table 3: Robustness: Subsamples
(1) (2) (3) (4)Dependent variable: #public primary schools per 10,000
MNCD -0.180∗∗∗ -0.201∗∗ -0.191∗∗∗ -0.180∗∗∗
(0.051) (0.074) (0.056) (0.051)Baseline controls Y Y Y Y
Subsamples Full sample MNCD outliers Human capital outliers Shanghai
Observations 54 52 51 53Adjusted R2 0.399 0.374 0.371 0.394
Notes: The table reports the impact of migrant-native cultural distance on public primaryschools during the decade of 1900-1910 on various subsamples. The unit of observationis a county in Republican China. Column 1 provides benchmark results from Column 10of Table 1. Column 2 excludes three counties with the highest MNCD (MNCD>5.25).Column 3 excludes counties with the largest decline in human capital (Human capital-afterminus Human capital-before is less than -0.1). Column 4 excludes the important treaty portcity—Shanghai. Robust standard errors are used in all columns. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01
Table 4: Initial Conditions
Dependent variable: #public primary schools per 10,000 persons(1) (2) (3) (4) (5)
MNCD -0.180∗∗∗ -0.181∗∗∗ -0.201∗∗∗ -0.170∗∗∗ -0.242∗∗∗
(0.051) (0.052) (0.055) (0.053) (0.063)#initial charities 0.003 0.005
(0.006) (0.006)Initial population 0.608∗∗ 0.887∗∗
(0.239) (0.329)Initial pop. density 0.119 -0.351
(0.259) (0.278)
Baseline controls Y Y Y Y YGeographic controls Y Y Y Y Y
Observations 54 54 54 54 54Adjusted R2 0.399 0.385 0.461 0.388 0.458
Notes: The table reports reports OLS results of the impact of MNCD on public primaryschools, accounting for initial conditions. The unit of observation is a county in RepublicanChina. Initial # charities are number of charities before the Taiping Rebellion. Initial pop-ulation and initial population density are population and population density in 1820. Allspecifications include baseline controls and geographic controls. Baseline controls includelogged primary enrollment rate in 1880, urbanization rate and logged population. Geo-graphic controls include ruggedness, share of arable land, agricultural suitability, distanceto the Grand Canal, distance to the Yangtze River, distance to the provincial capital anddistance to Shanghai. Robust standard errors are used in all specifications. ∗ p < 0.10, ∗∗
p < 0.05, ∗∗∗ p < 0.01
14
among the natives in our regression, proxied by number of charitable organizations in 1850.9 To
overcome endogeneity, we use the number of charitable organizations before the mass migration.
In addition, initial population size can contain crucial information about the socioeconomic
traits of the natives. So we also include initial population size or initial population density in
our regression, and find that initial population size is positively correlated with the number of
public primary schools per 10,000 in the 1900s.
C War-Related Conditions
The Taiping Rebellion was undoubtedly a major blow to the region. Other than bringing
in migrants with cultural dissimilarity with natives, the rebellion likely had an independent
effect on schools by affecting infrastructure, income and the demographics. Below we carefully
evaluate the overall impact of war on primary schools, as well as the specific effects of the
Taiping Rebellion on the human capital stock and the demographic profile of a county. We
show coefficient estimates of MNCD partialling out the other effects of war, as well as coefficient
estimates of those variables without MNCD. In the next section, we propose an instrumental
variable strategy to establish the causality going from cultural distance to primary schools.
Battle Exposure As mentioned in Section II.A, war could have lingering consequences several
decades after. The scale of damage can be a function of battle exposure. We measure battle
exposure by the number of months a county was exposed to war. Overall, we find no evidence
that battle exposure had an impact on the number of public primary schools fifty years later
(Col.2 & 3).
Demographic Shock War and migration could have affected schools by altering the demo-
graphic structure. Population tend to rebound rapidly after major disturbances (Davis and
Weinstein, 2002), resulting in a youthful population. Migrants tend to be younger, male. If a
larger number of migrants flock into a county, who are systematically younger, from the supply
side, education can become more affordable in the short run, due to lower dependency ratio, but
from the demand side, there could be less demand for education if opportunities for unskilled
work are abundant. We include both the %elderly and youth and %adult men (aged 20 to 40)
9During the period we study, charitable organizations were mostly funded by local communities.
15
Tab
le5:
War
-Rel
ated
Con
dit
ion
s
Dep
end
ent
vari
ab
le:
#p
ub
lic
pri
mary
sch
ools
per
10,0
00
per
son
s(1
)(2
)(3
)(4
)(5
)(6
)(7
)(8
)(9
)
MN
CD
-0.1
80∗∗
∗-0
.181∗
∗∗-0
.200∗∗
∗-0
.182∗∗
∗-0
.205∗∗
∗-0
.264∗
∗∗
(0.0
51)
(0.0
52)
(0.0
67)
(0.0
53)
(0.0
67)
(0.0
77)
Bat
tle
exp
osu
re0.0
07
0.0
08
0.0
08
0.0
03
(0.0
12)
(0.0
11)
(0.0
12)
(0.0
11)
%el
der
lyan
dyo
uth
-0.0
18
-0.0
45
-0.0
46
-0.0
39
(0.0
34)
(0.0
39)
(0.0
37)
(0.0
33)
%ad
ult
men
-0.0
23
0.0
04
0.0
07
0.0
12
(0.0
26)
(0.0
28)
(0.0
31)
(0.0
27)
Hu
man
cap
ital
shock
-0.0
06
0.315
0.2
61
1.0
46
(1.2
50)
(1.2
09)
(1.3
50)
(1.3
65)
Init
ial
con
dit
ion
sN
NN
NN
NN
NY
Inve
nti
ons
ined
uca
tion
NN
NN
NN
NN
YB
asel
ine
contr
ols
YY
YY
YY
YY
YG
eogr
aph
icco
ntr
ols
YY
YY
YY
YY
Y
Ob
serv
atio
ns
5454
54
54
54
54
54
54
54
Ad
just
edR
20.
399
0.3
06
0.3
92
0.2
97
0.3
87
0.3
00
0.3
85
0.3
65
0.5
16
Not
es:
Th
eta
ble
rep
orts
rep
orts
OL
Sre
sult
sof
the
imp
act
ofw
aron
pu
bli
cp
rim
ary
sch
ool
s.T
he
un
itof
ob
serv
ati
on
isa
cou
nty
inR
epu
bli
can
Ch
ina.
Bat
tle
exp
osu
reis
mea
sure
dby
mon
th.
%el
der
lyan
dyou
thre
fers
toth
ose
un
der
20
or
over
40.
%ad
ult
men
refe
rsto
men
bet
wee
n20
and
40.
Hu
man
cap
ital
shock
isth
ed
iffer
ence
inhu
man
cap
ital
bet
wee
nn
ati
ves
an
dth
een
tire
pop
ula
tion
incl
ud
ing
nat
ives
and
mig
rants
.A
llsp
ecifi
cati
ons
incl
ud
eb
asel
ine
contr
ols
and
geog
rap
hic
con
rtro
ls.
Robu
stst
an
dard
erro
rsare
use
din
all
spec
ifica
tion
s.∗p<
0.1
0,∗∗
p<
0.05
,∗∗
∗p<
0.0
1
16
in Columns 4 and 5, and we do not find a statistically significant effect of either variable.
Human Capital Shock Human capital could be another channel through which the Taiping
Rebellion affected the long-run prospects of a county. Even though migrants and natives had
little contact before the settlement of migrants, some selection could be on the amount of human
capital migrants had. In the event that migrants simultaneously posed a cultural shock and a
human capital shock to the native population, the human capital shock may be a confounder.
We define a human capital shock as the difference between migrant and native human capital
(See Appendix H for variable construction details). When the migrant human capital stock is
higher than native human capital, we conclude there was a positive human capital shock. In
Columns 6 and 7, we control for the difference between migrant and native human capital. We
find no effect of differences in human capital on schools. However, once MNCD is added to
the regression, human capital shock attains a positive coefficient which is consistent with our
prior. The advantage of having migrants with high human capital on school formation is likely
partially offset by the cultural distance between migrants and natives.
D Interventions in Education
Missionary Activities Christian missionaries came to China in the 19th century to spread
Christianity. In the process, they also built a large number of schools. Compared to secondary
education, however, primary education was not affected to the same degree by local missionaries
(Bai and Kung, 2014). Demand for public or semi-public modern primary education, may still
have been affected by long-standing missionary activities. For example, if church-sponsored
secondary schools already existed in the area before the campaign for modern education, it
might increase demand for modern primary education through a complementarity mechanism.
Due to the earliest missionary expansion and the Taiping Rebellion being concurrent events, we
cannot rule out the possibility that a hostile native-migrant relationship could deter missionaries,
or that migrants are more likely to settle in areas with missionaries, for aid and help. We include
number of communicants per 10,000 as a rough proxy for missionary activities. As predicted,
the density of communicants has a positive effect on public primary schools, but the effect is
not statistically significant (Col. 2). The coefficient estimate of MNCD remains very similar to
baseline estimates.
Temple Conversion In the first few decades of the 20th century, government closed tens
of thousands of Buddhist or Taoist temples and took over temple assets to support modern
education. Two million Buddhist and Taoist temples are estimated to have closed in late Qing.
At the time, they owned about 16 million houses, 13,000 square kilometers of land and millions
teal of silver altogether (Xu, 2010). In Column 3, temple conversion is positively associated
with the density of public primary schools. The coefficient estimate of MNCD slightly declines,
suggesting that temple conversion might be a channel for MNCD to impede the establishment
of public primary schools.
17
Table 6: Interventions: Missionary Activities and Temple Conversion
Dependent variable: #public primary schools per 10,000 persons(1) (2) (3) (4) (5)
MNCD -0.180∗∗∗ -0.178∗∗∗ -0.168∗∗∗ -0.167∗∗∗ -0.220∗∗∗
(0.051) (0.047) (0.053) (0.053) (0.068)Missionary activities 0.148 0.134 0.107
(0.095) (0.085) (0.082)Temple conversion 0.183∗∗ 0.169∗ 0.173∗∗
(0.085) (0.085) (0.084)Initial conditions N N N N YBaseline controls Y Y Y Y YGeographic controls Y Y Y Y Y
Observations 54 54 54 54 54Adjusted R2 0.399 0.443 0.458 0.493 0.543
Notes: The table reports OLS results of the impact of MNCD on public primary schools,accounting for interventions in education. The unit of observation is a county in Republi-can China. “Missionary activities” is measured by the log of number of communicants plusone. “Temple conversation” is measured by the log of number of one plus temple-convertedschools. Baseline controls include logged primary enrollment rate in 1880, urbanizationrate and logged population. Geographic controls include ruggedness, share of arable land,agricultural suitability, distance to the Grand Canal, distance to the Yangtze River, dis-tance to the provincial capital and distance to Shanghai. Robust standard errors are usedin all specifications. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
VI. INSTRUMENTAL VARIABLE STRATEGY
A Construction of the Instrument
We have provided evidence that “selective migration” was as unlikely as was self-sorting after
the arrival of the migrants. “Selective migration” refers to migrants and natives make decisions
on entry and acceptance based on their cultural proximity. Nevertheless, the observed migrant-
native cultural distance might still be correlated with unobserved characteristics of the native
or migrant population, or with unobserved characteristics of the county of destination or county
of origin. To sidestep omitted variable bias, we introduce an instrumental variable based on the
pre-existing cultural distance between the native population and the population just outside of
the Lower Yangzi (“nearby population”).10
We use the following equation to construct a surname-based measure of the pre-existing cultural
distance between the native population and the nearby population:
Pre-existing native-nearby culture distancei =1
normalized isonomynative,nearby,i
, (3)
10The nearby population refers to residents in Zhejiang and Jiangsu Province excluding the Lower Yangzi.
18
Tab
le7:
Inst
rum
enta
lV
aria
ble
An
alysi
s:P
re-e
xis
tin
gN
ativ
e-N
earb
yC
ult
ure
Dis
tan
ce
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
OL
SIV
OL
SIV
IVIV
IVIV
Sec
on
dS
tage
#p
ub
lic
pri
mary
sch
ools
per
10,0
00
per
son
s
MN
CD
-0.2
19∗∗
∗-0
.250∗
-0.2
71∗
∗∗-0
.384∗
∗-0
.354∗∗
-0.3
24∗∗
-0.3
23∗
∗-0
.312∗∗
(0.0
51)
(0.1
49)
(0.0
58)
(0.1
83)
(0.1
73)
(0.1
55)
(0.1
56)
(0.1
54)
Ad
just
edR
20.1
74
0.1
70
0.2
38
0.1
90
0.3
05
0.3
91
0.3
76
0.3
70
Fir
stS
tage
Dep
end
ent
vari
ab
le:
mig
rant-
nati
vecu
ltu
ral
dis
tan
ce
Nat
ive-
nea
rby
cult
ure
dis
tan
ce1.1
23
∗∗0.9
68∗∗
1.0
21∗
∗0.9
77∗
∗0.9
81
∗∗1.0
21∗
∗∗
(0.4
20)
(.4
77)
(0.4
19)
(0.3
88)
(0.3
99)
(0.3
75)
Fir
st-s
tage
FS
tati
stic
7.1
34.1
25.9
56.3
56.0
47.4
0
Hu
man
cap
ital
shock
NN
NN
NN
NY
Bat
tle
exp
osu
reN
NN
NN
NN
YF
ract
ion
aliz
atio
nN
NN
NN
NY
YIn
itia
lso
cial
cap
ital
NN
NN
NY
YY
Pop
.18
20N
NN
NN
YY
YG
eorg
rap
hic
contr
ols
NN
NN
YY
YY
Bas
elin
eco
ntr
ols
NN
YY
YY
YY
Ob
serv
atio
ns
54
54
54
54
54
54
54
54
Not
es:
Th
eta
ble
rep
orts
IVes
tim
ates
.T
he
inst
rum
ent
isn
ativ
e-n
earb
ycu
ltu
ral
dis
tan
ce.
Th
eu
nit
of
ob
serv
ati
on
isa
county
inR
epu
bli
can
Ch
ina.
“Fra
ctio
nal
izat
ion
”in
clu
des
bot
hw
ith
in-n
ativ
efr
acti
onal
izat
ion
and
over
all
fract
ion
aliza
tion
.F
irst
-sta
ge
Fst
atis
tic
isK
leib
erge
n-P
aap
rkW
ald
Fst
atis
tic.
Rob
ust
stan
dar
der
rors
are
use
din
all
spec
ifica
tion
s.∗p<
0.10,∗∗
p<
0.0
5,∗∗
∗
p<
0.0
1
19
where normalized isonomynative,nearby,i =∑S
k Pk,native,iPk,nearby√∑Sk P 2
k,native,i
∑Sk P 2
k,nearby
.S is the number of the same
surnames in the two groups. Pk,native,i and Pk,nearby are the relative frequencies of surname
k within natives and within the nearby population. For the nearby population, we use the
proportion of each surname to the population total in the 2005 Population Census to proxy for
Pk,nearby. The denominator measures how likely any individual randomly drawn from within
natives bears the same surname as one drawn from within the nearby population. The variable,
native-nearby cultural distance (“NNCD”), is intended to capture the cultural dissimilarity
between the native population and the nearby population.
The intuition behind the instrument is that natives are more likely to be very different from the
incoming migrants, if ex ante, natives are more different from all prospective migrants. I treat
the nearby population as the pool of prospective migrants. This is a reasonable assumption as
the vast majority of migrants in the Lower Yangzi after the war are found to have originated
from that area (Cao, 1998; Cao and Li, 2000; Ge, 2002a; Liu, 2012; Zhe and Zhe, 1896). The
cultural distance between natives and the pool of prospective migrants is independent of the
Taiping Rebellion.
B IV Results
Table 7 compares six IV specifications. We run our regressions with no controls in Columns 1
and 2. The first stage is slightly above 7. The IV estimate is -0.25, only slight larger than the
OLS estimate in Column 1 (-0.219) in terms of magnitude. We add baseline controls in Columns
3 and 4. In Column 5, we add geographic controls, resulting in an IV estimate of 0.354. We then
add initial conditions, fractionalization and war-related conditions sequentially from Column 6
to Column 8. IV estimates in those specifications are highly comparable ranging from -0.312 to
-0.354. Our first-stage F statistics are not particularly high, but this is likely due to the small
sample size. The first-stage F statistics do improve when MNCD outliers are excluded, possibly
due to improved linearity. We also show reduced-form estimates in Table A-2. Our instrument
is negatively correlated with the density of public primary schools in specifications. In Columns
1, 5 and 6, the coefficient estimates drops slightly below the conventional cutoff of statistical
significance, with a p-value of 0.15.
Across the columns, IV estimates are slightly larger than the OLS estimates for the same speci-
fications. We attribute the discrepancies partly to the measurement error in our migrant-native
cultural distance variable. As our surname sample is on the small side for some counties, we
may have a lot of sampling error in determining the relative frequency of a surname in the native
or overall population. In addition, when constructing our measure of MNCD, we assume people
with different surnames in the same location, had similar exposure to the war, and had similar
recovery growths in population after the war. While this assumption is unlikely systematically
violated, our measure can suffer measurement error and be noisy. In comparison, our instru-
ment uses a much larger surname sample from the 1% population census. We have reason to
believe that our instrument has less measurement error with regards to the relative frequency of
20
a surname in a population; hence, our IV estimates do not suffer the same attenuation bias as
our OLS estimates do. In addition, MNCD is partially determined by population losses during
the rebellion, whereas our instrument is not affected by differential population losses during
the rebellion. We control for battle exposure, but it is far from being the perfect indicator of
population losses.11 IV estimates can differ from OLS estimates as they are unaffected by the
magnitude of population losses.12
VII. MECHANISMS: THEORY AND SUGGESTIVE EVIDENCE
Our results suggest cultural differences between migrants and natives have a negative impact on
public goods provision. In this section, we provide suggestive evidence on possible mechanisms.
The Lower Yangzi, in spite of being more developed than the rest of the country, remained
highly agrarian at the time. The average urbanization rate was merely 12%, which means the
vast majority of people lived in the countryside. In the sixty counties we study (fifty-four in
the main sample), every county comprises hundreds of villages. Many natives in those villages
typically have lived there for hundreds of years. Close kinship ties ensured that levels of trust
between villagers were relatively high, and this allowed them to collaborate in the provision of
public goods. In some villages all the inhabitants were from the same linage, in which case,
most of them would share the surname; other villages had a few lineages. Many of the public
goods, such as security and education, were provided by and within the clan. But some of
those goods were provided by the gentry to the whole village in the form of private donations.
The long tradition of village life, which went back centuries, if not millennium, constituted a
deep institutional memory of how to deal with problems such as the provision of local public
goods. Both natives and migrants (back in their own hometowns) had adapted to this stable
and slow-moving environment. So when migrants first showed up in the villages of the natives,
they had to sort out an arrangement in which their coexistence with natives would be possible;
a few decades after, migrants and natives were confronted with a new task: to build modern
schools for their next generation.
Based on theoretical and empirical research, there are several ways migrant-native cleavages
can affect public goods provision and prevent them from building modern schools together:
first, differences in preferences between migrants and natives would mean a lower chance for
migrants and natives to reach a consensus on whether to provide a public good or the best way
to provide that good. Second, mutual dissatisfaction, which is often a function of the cultural
11Despite scholarly efforts to estimate population losses due to the rebellion, the actual damage remainsunknown (Cao and Li, 2000; Cao, 1998).
12Population losses could have an independent effect on primary schools, and MNCD could be picking up theeffects of population losses in addition to the effects of the cultural distance between migrants and natives. Also,although population losses were substantial for most counties in the Lower Yangzi, there were a few exceptions.As places with little to none exposure to the war are “never-takers”, the local average treatment effect is estimatedon counties with actual in-flow migration after the rebellion.
21
distance between migrants and natives, can block any bilateral collaboration and negotiation.
For example, when a village leader originates from the group of migrants, natives would decide
to actively oppose whatever policies he initiates. Esteban et al. (2012a) shows empirically that
ethnic polarization, which accounts for distances between groups, will influence conflict if the
prize is “public”.13 And a closely related scenario is that a leader from the side of natives (or
migrants) indeed looks out for the best interest of natives (or migrants), and at the expense of
the other. When a leader from the side of natives (or migrants) takes advantage of the other
side, often in the name of public interest, it is to be expected for the other side to oppose
expropriatory policies proposed by the leader.
We find qualitative evidence in line with the mechanisms laid out above. First, villagers found it
difficult to raise taxes or use clan or temple assets to establish modern schools, when they were
from different dialectal and cultural background, due to mutual misunderstanding or outright
disagreement between migrants and natives. Second, migrants opposed the policies put forward
by natives. Tian and Chen (2008) document cases where migrants into rural areas refused to pay
county taxes designated to finance upper-primary schools on the grounds that “the schools only
serve the rich in the cities and towns”. In this example, we see both migrants being victims to
expropriatory polices, and a strong identity held by migrants reflected in their statement. Third,
native-migrant conflicts had a direct impact on public goods provision by intervening with the
day-to-day operation of a village. Community leaders were reluctant to build public modern
schools in the midst of frequent native-migrant conflicts. lowered the ability of a community
to perform effective decision-making in the provision of public education. Below we present
quantitative evidence to corroborate our analysis.
A Evidence from Types of Schools
We exploit a feature of the school system in early 20th century China: the financing of lower-
primary and dual-primary schools differed from that of upper-primary and secondary schools.
Overall, the financing of different types of educational institutions after 1905 was parallel with
that of the traditional system. The role of central and provincial governments was limited
to financing universities, students studying abroad, and secondary schools (including teacher
training schools) at provincial capitals.14 County governments financed primarily upper-primary
schools, which were mostly located in cities and towns, with a combination of county tax receipts,
business tax surcharges, the reallocation of endowments from traditional schools, and private
contributions by local elites (Chaudhary et al., 2012). Village communities worked to finance
13Esteban et al. (2012b) develops a theory of conflict across groups allowing for “public” and “private” prizes.A leadership position is an example of a “public” prize. In our context, possible sources of dissatisfaction, oreven conflict, include a lack of trust in migrants or a loss of pride due to losing the election as natives. Thelack of trust can arise from the perception of the village leader being susceptible to predating on the interest ofnatives.
14The decentralization of fiscal authority after the Taiping Rebellion left the majority of tax revenues tocounty-level authorities. So the central state had very limited resources to finance primary schooling at the locallevel.
22
dual-and-lower-primary schools in rural villages. A frequent practice was for community leaders
to transform clan schools and other properties into modern dual-and-lower-primary schools,
and to cover the day-to-day expenses with local taxes and private donations. Those schools
received very little financial support from the county government and its affiliates.15 Figure 3
lists distribution of financial sources for different types of schools in Hangzhou County (Zhang,
2008). According to Figure 3, lower-and dual-primary schools had a higher share of their budget
made up by local funds.
Another institutional feature we exploit is that Chinese villages are highly autonomous and self-
governed. Constrained by state capacity, the reach of the central government tended to end at
the county level (Qu, 2003, pp.11-15, 179-201, 248-255).At the county level, again, each country
comprised of a large number of villages, making it difficult to project power into those villages.
The self governance of villages has a long tradition in China (Fei, 1939, 1992; Kung-Chuan,
1967; Shi et al., 1988).In an environment of self-governance, migrant-native cultural distance
might manifest itself most forcefully in the outcomes of collective decision-making processes, as
there will not be countervailing forces to their decisions.
With these two institutional features in mind, we can derive the following testable implication:
ceteris paribus, MNCD should have a greater impact on lower-primary and dual-primary schools
than on other types of schools. Fortunately, the educational census (1907-08) reports number
of schools and schools separately for five types of schools: secondary schools, upper-primary
schools, dual-primary schools and lower-primary schools and pre-schools. So we can empirically
test how MNCD affects the density of different types of schools.
We begin our analysis by running a simple regression of county government spending on density
of lower-primary and dual-primary schools. Panel A of Table 8 suggests that government funding
is strongly correlated with upper-primary and secondary schools rate (Col. 3, 4), but not with
lower-primary and dual-public primary schools (Col. 1, 2). These are highly consistent with
the information contained in Figure 3: the county government was not a major contributor to
the budget of the lower-primary and dual-primary schools, but provided funding to secondary
schools and upper-primary schools. It is not surprising that both upper-primary school and
secondary schools increase in county government spending on education, while there are no such
relationships for lower-primary or dual-primary schools.
When we replace county government spending with migrant-native cultural distance in Panel
B of Table 8, we find the type of schools with coefficients that are significant, flipped. We
find MNCD has a strong impact on schools at lower-primary and dual-primary schools, but
not on schools at upper-primary and secondary schools. This is consistent with our testable
implication that MNCD should matter the most for the setting of self-governance—given the
period we examine, both villages and townships were self-governed, and lower-primary and dual-
15Schools also charged tuition to supplement other sources of funding. Tuition accounted for 10%-20% ofschool budgets in 1917.
23
County
or
townshi
p raised
85%
Commu
nity or
industry
raised
13%
Tuition
and fees
2%
Secondary Schools
County
or
township
raised
86%
Commu
nity or
industry
raised
6%
Tuition
and fees
8%
Upper-Primary Schools
County
or
township
raised
42%
Commu
nity or
industry
raised
56%
Tuition
and fees
2%
Dual-Primary Schools County
or
township
raised
7%
Commu
nity or
industry
raised
86%
Tuition
and fees
7%
Lower-Primary Schools
Figure 3: The Makeup of the Budget of Different Types of Schools
primary schools were determined, established and financed locally. When migrants and natives
could not get along, they could face severe challenges in solving important and urgent issues
including establishing modern schools for their children.
B Evidence from Timing
Having showed the differential impact of MNCD on different types of schools in the 1900s, we
next explore the time-varying impact of MNCD as funding responsibilities over basic education
changed over time. This task is complicated by the fact that public primary schools did not
exist prior to 1905. So we instead look at access to basic education over time.
We exploit two structural breaks in funding responsibilities over basic education. The first
structural break came around in 1905. Prior to 1905, individual households or extended families
were responsible for basic education (as well as for advanced education). There is no evidence for
either village taxes, government aid, or publicly funded basic education. As basic education was
essentially a private good before 1905, exposure to basic education should not vary in MNCD.
The second structural break kicked in close to the end of the 1920s. From then on, the fiscal
authority over lower-primary and dual-primary schools migrated to upper-level governments. In
24
Table 8: Government Funding, Migrant-Native Cultural Distance and the Density ofSchools
Lower-primary Dual-primary Upper-primary Secondary
Panel A. Government Funding: By School Type
Government spending on education 0.067 0.252 0.327∗∗∗ 0.242∗∗∗
(0.979) (0.338) (0.073) (0.053)Baseline controls Yes Yes Yes Yes
Adjusted R2 -0.078 0.069 0.171 0.328
Panel B. Migrant-Native Cultural Distance: By School Type
MNCD -0.185∗∗∗ -0.052∗∗ -0.003 -0.003(0.042) (0.021) (0.005) (0.003)
Baseline controls Yes Yes Yes Yes
Observations 54 54 54 54Adjusted R2 0.142 0.155 0.122 0.178
Notes: Panel A reports the impact of government spending on then density of differenttypes of schools (#schools per 10,000 persons). Panel B reports the impact of migrant-native cultural distance by school type. The unit of observation is a county in RepublicanChina. Robust standard errors are included in all specifications. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01
the meanwhile, upper-level governments become much more autocratic over time. The combined
effect of schools being funded by upper-level governments and upper governments becoming more
autocratic, should render local decision making increasingly less relevant, which should yield a
less economically significant coefficient of MNCD. These two structural breaks give rise to the
prediction that the impact of MNCD should be the most relevant between 1905 and the end of
1920s.
We test this prediction in a fully flexible difference in differences specification. Our variables of
interest are interaction terms between MNCD and time period dummies:
%access to basic educationi,t =∑t
βtMNCDi × Timeperiodt +∑t
γtXi × Timeperiodt
+Zi, tζ + ρt + ηi + εi, t ,
(4)
where the year 1960 is left out as the comparison group. The dependent variable is log share of
population receiving basic education in County i during time period t. Xiis a vector of other
time invariant controls and we allow these variables to have time-varying effects on access to
basic education. Zi,tζ is a vector of time variant controls.ρt denotes a full set of time fixed effect
and ηi denotes a full set of county fixed effects. εi,t is a disturbance term. Our time periods are
defined as follows: ∈ 1820, 1850, 1880, 1900, 1910, 1920, 1930, 1940, 1950.
Table 9 summarizes our results. Across the specifications, the only MNCD × Timeperiod
25
Table 9: The Dynamic Impact of of Migrant-Native Cultural Distance: Baselines
(1) (2) (3) (4)%access to basic education
MNCD×1820 -0.054∗∗ -0.025 -0.029 -0.021(0.025) (0.031) (0.030) (0.025)
MNCD×1850 -0.077∗∗∗ -0.048 -0.050∗ -0.041∗
(0.022) (0.030) (0.029) (0.024)MNCD×1880 -0.078∗∗∗ -0.049 -0.025 -0.031
(0.025) (0.035) (0.037) (0.031)MNCD×1900 -0.339∗∗∗ -0.309∗∗∗ -0.294∗∗∗ -0.266∗∗∗
(0.062) (0.072) (0.076) (0.067)MNCD×1910 -0.168∗∗∗ -0.138∗∗∗ -0.129∗∗∗ -0.087∗
(0.040) (0.045) (0.046) (0.045)MNCD×1920 -0.087∗∗ -0.058 -0.051 -0.044
(0.043) (0.036) (0.037) (0.027)MNCD×1930 -0.031 -0.002 0.001 0.010
(0.045) (0.039) (0.040) (0.028)MNCD×1940 -0.043∗ -0.014 -0.012 -0.006
(0.023) (0.017) (0.019) (0.014)MNCD×1950 -0.039∗ -0.009 -0.008 -0.002
(0.023) (0.011) (0.013) (0.010)Log population 0.145∗ 0.103
(0.084) (0.084)County FE N Y Y YTime period FE Y Y Y Y
Observations 540 540 540 600Adjusted R2 0.904 0.936 0.937 0.937
Standard errors in parentheses∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01
Notes: The table reports the dynamic impact of migrant-native cultural distance before andafter the Taiping Rebellion.Column 1 only includes time period fixed effectds. Both timeperiod effeccts and county fixed are included in all other specifications. Column 3 controlsfor log population. Column 4 includes the six counties with imputed MNCD. Column 3 isour preferred specification. Standard errors are clusterered at the county level.
coefficients that are consistently significant are MNCD × 1900 and MNCD × 1910 . This
is highly consistent with our prediction that MNCD has the most impact on access to basic
education, during the period where basic education is a public good and fiscal authority is
decentralized. We begin our analysis by including only MNCD × Timeperiod, time period
FE. To make full use of the panel data, in Column 2, we add county FE to the regression.
We find MNCD × Timeperiod is negative but insignificant before 1900, and there is no pre-
trend. It is not the case that the counties became better or worse in providing basic education
over the time. Consistent with our prediction, MNCD did not matter for the period between
the end of the war (1864) and the abolition of the exam (1905), when basic education was
26
Figure 4: The Dynamic Impact of Migrant-Native Cultural Distance
largely financed privately within clans. For 1900-1920, MNCD has a negative effect on schools
of great economic magnitude, and coefficient estimates of interactions between MNCD and time
periods drop substantially after 1920, and are no longer significant.16 In Column 3, we add
a possibly endogenous variable, log population, to the specification, and we get fairly similar
estimates to Column 2.17 For the decade of 1900-1910, a one-standard-deviation increase in
MNCD is associated with 32.3% decrease (0.294*1.1) in access to basic education, relative to
the effect of MNCD on schools in 1960, our omitted category. This estimate is comparable to
our baseline OLS estimate in Column 4 of Table 1. In Column 4, we run the same regression
as in Column 3, but on a sample including counties with imputed MNCD. The estimates are
somewhat smaller, likely due to the attenuation bias posed by less well measured MNCD of the
newly added counties.
The differential impact of MNCD in Table 9 before and after two structure breaks, are most
similar to the approach featured in Dippel (2014). Dippel (2014) finds the impact of forced
coexistence on economic prosperity only emerged after 1990, when Indian reservations were
given substantially more autonomy. We find the impact of MNCD only began to surface after
the modern educational reform, when the responsibilities of individual households and clans
in providing basic education was partly taken over by villages, and only died out after 1920,
when village and small town autonomy began to be eroded by upper-governments. Evidence
from the timing of MNCD kicking in and dying out suggests that the primary mechanism runs
16accesstobasiceducation1910is the average of public primary schools recorded in the 1914, 1915 and 1916Census.
17We would like to include time-varying urbanization rates as well, just to be comparable to the specificationsin Table 1. Unfortunately, we only have urbanization for one period.
27
from migrant-native cultural distance through quality of collective decision-making to public
primary schools. In addition, the general consistency between our panel and OLS estimates
further enhances our confidence in the findings.
VIII. CONCLUSION
This paper uses quasi-exogenous migration to identify the impact of cultural proximity between
migrants and natives on public goods provision outcomes. Our key finding is that cultural
distance has an independent effect on public goods provision outcomes, conditional on the
history of shared governance.
In addition to our key finding, we also find partial evidence that a history of shared governance
mitigates the negative effects of the cultural distance. We find that the culture distance between
previously detached groups has a statistically significant effect on public goods provision, whereas
the polarization measure taking into account differences between surname groups (“surname
group distance”) without making any adjustment for their shared history, does not have the same
effect. A most likely explanation for this is that the culture distance between previously detached
groups (migrants and natives) does more damage to public goods provision than does the cultural
distance between groups with a history of shared governance. A history of shared governance
might have nurtured trust and increased the level of collaboration. This is consistent with
Dippel’s (2014) finding that a history of shared governance facilitates public goods provision.
Therefore, we believe our findings are consistent with both sides of the debate as to whether
ethnic cleavages or artificial jurisdictions has caused poor economic performance.
Our analysis sheds light on how migrant-native cultural distance, as a key aspect of population
heterogeneity, is related to the failure of educational reforms in the early 20th century. Our
finding that a greater cultural distance between migrants and natives is associated with a lower
density of public primary schools reveals the unique challenge faced by communities with many
migrants from afar face in their attempt to modernize their education. For those communities,
educational reforms proved to be incredibly challenging and frustrating, due to intense social
antagonism between two groups. The under provision of modern education could prevent have
prevented the accumulation of modern human capital and increased barriers to industrialization
for those communities.
A more general point is that cultural distance is associated with a higher cost in modernizing the
governance structure in China. Following China’s constitutional reforms, village communities
aimed to provide a wider range of public goods, as well as to formalize its governance structure
(Kuhn, 2002, Chapter 4). The failure to provide a key public good—public primary schools—
actually reveals a deeper issue with the process of modernizing the village governance structure.
The idea that low social capital can prevent effective self-governance in Chinese villages is
28
explored in Qian, Xu, Yao, et al. (2015).18 Formalization and democratization of village decision-
making following the constitutional reforms was hampered by strong social antagonism between
migrants and natives, the two groups with the most mistrust for each other. Social antagonism
between different groups in the village, created substantial barriers for post-1905 villages to
modern its political structure and to provide a wider range public goods than before. We argue
that cultural distance can affect the quality of village governance through social capital and
delay the emergence of full-fledged modern village governance.
Lacking the support of well-functioning local villages, Qing China made little meaningful progress
in its political reforms before its collapse in 1911. While both constitutional reforms and ”mod-
ern school movement” continued into Republican China, they still faced very similar constraints
that Qing China had during its last ten years. By 1930s, modern schools eventually attained
rapid growth but only at the expense of village self-governance and political decentralization
(Kuhn, 2002). The stagnant growth of public primary schools in a time of decentralization
parallels the experience nineteenth-century Prussia had with a decentralized education system
in a linguistically polarized society (Cinnirella and Schueler, 2016). The transition to autocratic
rule after 1930 seems to be consistent with Galor, Klemp, et al. (2015) which establishes the
point that population heterogeneity can provide a breeding ground for autocratic rule.
For those interested in the historical role of the gentry class, our paper also sheds some light
on the role of gentry men in modernizing village governance in a discussion of the mechanisms
linking MNCD to poor public goods provision outcomes. Gentry men were elite members of their
village. Many of them were enthusiastic about making contributions to their village. Schools
before 1905 relied heavily on their private donations, and remained so to a certain degree even
after 1905. However, our findings most likely indicate that gentry men were not best suited to
fulfill leadership roles in a modern context—gentry men, because they were viewed as closely
associated with their clan, encountered criticism about their impartiality; these tensions helped
fuel political conflict as leadership positions were seen as “public” prizes by a heterogeneous
population.
18 Qian, Xu, Yao, et al. (2015) find the presence of village temples, their proxy for social capital, enhancespublic goods provision through the channel of elections.
29
References
Ager, P. and M. Bruckner (2013). Cultural diversity and economic growth: Evidence from the usduring the age of mass migration. European Economic Review 64, 76–97.
Aghion, P., A. Alesina, and F. Trebbi (2005). Choosing electoral rules: theory and evidence from uscities. Technical report, National Bureau of Economic Research.
Alesina, A., R. Baqir, and W. Easterly (1999). Public goods and ethnic divisions. The QuarterlyJournal of Economics 114 (4), 1243–1284.
Alesina, A. and E. L. Ferrara (2005). Ethnic diversity and economic performance. Journal of EconomicLiterature 43 (3), 762–800.
Altonji, J. G., T. E. Elder, and C. R. Taber (2005). Selection on observed and unobserved variables:Assessing the effectiveness of catholic schools. Journal of Political Economy 113 (1), 151–184.
Ashraf, Q. and O. Galor (2013). The’out of Africa’hypothesis, human genetic diversity, and comparativeeconomic development. The American Economic Review 103 (1).
Bai, Y. and J. K.-s. Kung (2011). Genetic distance and income difference: Evidence from changes inchina’s cross-strait relations. Economics Letters 110 (3), 255–258.
Bai, Y. and J. K.-s. Kung (2014). Diffusing knowledge while spreading god’s message: Protestantismand economic prosperity in china, 1840–1920. Journal of the European Economic Association.
Bai, Y. and J. K.-s. Kung (2015). Does genetic distance have a barrier effect on technology diffusion?evidence from historical china.
Bazzi, S., M. Gudgeon, et al. (2015). Local government proliferation, diversity, and conflict. Technicalreport, Boston University-Department of Economics.
Bossert, W., C. D’AMBROSIO, and E. La Ferrara (2011). A generalized index of fractionalization.Economica 78 (312), 723–750.
Bossert, W., P. K. Pattanaik, and Y. Xu (2003). Similarity of options and the measurement of diversity.Journal of Theoretical Politics 15 (4), 405–421.
Buck, J. L. and C. Press (1937). Land Utilization in China: a study of 16,786 farms in 168 localities,and 38,256 farm families in twenty-two provinces in China, 1929-1933, Volume 2. University ofChicago Press.
Cao, S. (1998). The impact of the taiping rebellion on population in south jiangsu. History Research 2,64. .
Cao, S. and Y. Chen (2002). Malthusian theory and chinese population since qing: comments on recentstudies by american scholars. History Study (lishi yanjiu) 1, 41–54.
Cao, S. and Y. Li (2000). The impact of the taiping rebellion on population in zhejiang. Journal ofFudan University: Social Sciences 5, 005. .
Cao, z. (2006). On the rise and fall of linguistic enclaves: the case of wu-hui mandarin dialect. Linguisticstudy (Yuyan yanjiu).
Caselli, F. and W. J. Coleman (2013). On the theory of ethnic conflict. Journal of the EuropeanEconomic Association 11 (s1), 161–192.
Chaudhary, L., A. Musacchio, S. Nafziger, and S. Yan (2012). Big brics, weak foundations: Thebeginning of public elementary education in brazil, russia, india, and china. Explorations in EconomicHistory 49 (2), 221–240.
CHGIS, H. Y. I. (2007). CHGIS.Cinnirella, F. and R. M. Schueler (2016). Dp11274 the cost of decentralization: Linguistic polarization
and the provision of education.Crayen, D. and J. Baten (2010). Global trends in numeracy 1820–1949 and its implications for long-
term growth. Explorations in Economic History 47 (1), 82–99.Davis, D. R. and D. E. Weinstein (2002). Bones, bombs, and break points: the geography of economic
activity. The American Economic Review 92 (5), 1269–1289.Desmet, K., M. Le Breton, I. Ortuno-Ortın, and S. Weber (2011). The stability and breakup of nations:
a quantitative analysis. Journal of Economic Growth 16 (3), 183–213.
30
Dippel, C. (2014). Forced coexistence and economic development: evidence from native americanreservations. Econometrica: Journal of the Econometric Society 82 (6), 2131–2165.
Du, R., C. Xiao, and L. Cavalli-Sforza (1997). Genetic distances between chinese populations calculatedon gene frequencies of 38 loci. Science in China Series C: Life Sciences 40 (6), 613–621.
Duclos, J.-Y., J. Esteban, and D. Ray (2004). Polarization: concepts, measurement, estimation. Econo-metrica, 1737–1772.
Easterly, W. and R. Levine (1997, November). Africa’s growth tragedy: Policies and ethnic divisions.The Quarterly Journal of Economics 112 (4), 1203–50.
Esteban, J., L. Mayoral, and D. Ray (2012a). Ethnicity and conflict: An empirical study. The AmericanEconomic Review , 1310–1342.
Esteban, J., L. Mayoral, and D. Ray (2012b). Ethnicity and conflict: Theory and facts. sci-ence 336 (6083), 858–865.
Esteban, J. and D. Ray (2011). Linking conflict to inequality and polarization. The American EconomicReview 101 (4), 1345–1374.
Esteban, J.-M. and D. Ray (1994). On the measurement of polarization. Econometrica: Journal of theEconometric Society , 819–851.
Fei, H.-T. (1939). Peasant life in China. 19, 39.Fei, X. (1992). From the Soil, the Foundations of Chinese Society: A Translation of Fei Xiaotong’s
Xiangtu Zhongguo, with an Introduction and Epilogue. University of California Press.Galor, O., M. Klemp, et al. (2015). Roots of autocracy. Technical report.Ge, J. and S. Cao (2001). Zhongguo renkoushi qingdaijuan (History of China’s Population: Qing
Dynasty). Fudan University Press.Ge, q. (2002a). jindai suzhewan jiaojie diqu renkou qianxi yanjiu (Migrants from Hunan and Hubei in
the intersection of Jiangsu, Zhejiang and Anhui). Shanghai: Shanghai Academy of Social SciencePress.
Ge, Q. (2002b). Minguo shiqi zhejiang huzheng yu renkou diaocha (Household and Population Censusin Zhejiang During the Republican Period ). Chinese Academy of Social Sciences press. .
Gennaioli, N. and I. Rainer (2007). The modern impact of precolonial centralization in africa. Journalof Economic Growth 12 (3), 185–234.
Guo, y. (1989). Historical geography of the taiping rebellion (taiping tianguo lishi dituji.Huang, X. (2004). An study on the mandarin dialect enclaves of Anji County, Zhejiang province
(Zhejiang anjixian fangyandao yanjiu). Dissertation, Beijing Language University.Junya, M. and T. Wright (2010). Industrialisation and handicraft cloth: The jiangsu peasant economy
in the late nineteenth and early twentieth centuries. Modern Asian Studies 44 (06), 1337–1372.Kuhn, P. A. (2002). Origins of the modern Chinese state. Stanford University Press.Kung-Chuan, H. (1967). Rural China: Imperial control in the nineteenth century. University of Wash-
ington Press.Leung, A. K. C. (1994). Elementary education in the lower yangtze region in the seventeenth and
eighteenth centuries. Education and Society in Late Imperial China, 1600-1900 , 381–416.Li, N. (2011). Essays on cultural diffusion, migration, and human capital: investigation from china’s
historical experience.Li, W. (1981). The change in land ownership in the area occupied by the taiping rebels in jiangsu,
zhejiang and anhui in late qing. History Study (lishi yanjiu) 6, 81–96.Li, W. (2002). Restoration of temples and religious networks in rural lower yangzi after the taiping
rebellion. Zhejiang Chronicles (Zhejiang Fangzhi) 1, 62–69.Liang, q. (2001). Charity and Edification: Charitable Organizations in Ming and Qing (Cishan yu
jiaohua: mingqing de cishan zuzhi). Hebei Education Press.Lin, C. and N. Li (2014). The impacts of taiping rebellion on long-term economic development. Journal
of Translation from Foreign Literature of Economics (jinji ziliao yicong) 2.Liu, Y. (2012). Conflict and assimulation in Jiangnan during Late Qing and Early Republic China—
Jurong in Jiangsu. History (Shi lin) 2, 016.Mann, S. (1994). The education of daughters in the mid-ch’ing period. Education and Society in Late
31
Imperial China, 1600–1900.Michalopoulos, S. and E. Papaioannou (2013). Pre-colonial ethnic institutions and contemporary
african development. Econometrica 81 (1), 113–152.Mickey, G. A. and T. Shiroyama (2009). China during the great depression: Market, state, and the
world economy, 1929-1937.Oster, E. (2014). Unobservable selection and coefficient stability: Theory and evidence. University of
Chicago Booth School of Business Working Paper .Ouyang, N. and W. Zhang (2010). Spatial-temporal analysis of temple-converted schools in Jiangnan
in Late Qing and Republican China. Historical Geography (Lishi dili), 011.Qian, N., Y. Xu, Y. Yao, et al. (2015). Making democracy work: Culture, social capital and elections
in China. Technical report, National Bureau of Economic Research.Qu, t. (2003). Local Government in Qing (Qingdai de difang zhengfu). Ph. D. thesis, Beijing: Law
Press.Rawski, E. S. (1979). Education and popular literacy in Ch’ing China. University of Michigan Press
Ann Arbor, MI.Reynal-Querol, M. (2002). Ethnicity, political systems, and civil wars. Journal of Conflict Resolu-
tion 46 (1), 29–54.Reynal-Querol, M. and J. G. Montalvo (2005). Ethnic polarization, potential conflict and civil war.
American Economic Review 95 (3), 796–816.Shi, J., H. Wu, and X. Fei (1988). Emperor and gentry (huangquan he shenquan).Simmons, R. V., R. Shi, and Q. Gu (2006). An historical geographic study on the division line
of Jianghuai-mandarin dialect and Wu-mandarin dialect. (Wu-huai guanhua yu wuyu bianjie defangyan dilixue yanjiu). Shanghai Education Press.
Spolaore, E. and R. Wacziarg (2009). The diffusion of development. Quarterly Journal of Eco-nomics 124 (2).
Tian, Z. and S. Chen (2008). Taxation for education and conflicts arising from provision of villageeducation. Journal of Zhejiang University: Humanities 38 (3), 142–149. .
Xu, X. (2010). Temple property in the Late Qing Dynasty and the Republic of China (1895-1916).Technical report.
Xu, Y. and Y. Yao (2015). Informal institutions, collective action, and public investment in rural china.American Political Science Review 109 (02), 371–391.
Yin, M. and Q. Tian (2009). Minguo renkou huji shiliao huibian (Compilation of Historical PopulationStatistics During the Republican Period). National Library Press. .
Yuchtman, N. (2015). Teaching to the tests: An economic analysis of traditional and modern educationin late imperial and republican china. Unpublished manuscript 27.
Zhang, x. (2008). Qingdai jiangnan gonggong jiaoyu ziyuan choucuo peizhi de lishi dili xue fenxi (AHistorical Geographical Analysis of Public Education Resource Allocation in Jiangnan from 1644 to1911). Ph. D. thesis, Fudan University. (1644-1911 ).
Zhe, c. and c. Zhe (1896). Draft Gazetteer of Yuhang County (Guangxu yuhangxian zhigao(fu buyi)).
32
Appendix
A Additional Tables
Table A-1: Summary Statistics
Variable Mean Std. Dev. Observations Source
#public primary schools per 10,000 persons 0.937 0.518 60 b
Measures of population heterogeneity:Migrant-native cultural distance 2.12 1.024 54 aOverall diversity 35.357 5.984 54 aWithin-native diversity 30.166 6.311 54 aOverall polarization 1.297 0.729 54 aWithin-native polarization 0.85 0.293 54 a
Instrumental variables:Native-nearby cultural distance 1.578 0.421 54 a
Baseline controls:Log(%access to basic education), 1880 3.136 0.293 60 bUrbanization rate, 1917 12.614 16.093 60 dLog population 12.456 0.869 60 b
Geographic controls:Ruggedness 1.857 2.532 60 g% arable land 0.535 0.242 60 eAgricultural suitability 2.851 2.054 60 gDist. to Grand Canal 0.27 0.26 60 gDist. to provincial capital 137.617 95.719 60 gDist. to Shanghai 171.15 87.681 60 gDist. to Yangzi 1.158 0.694 60 g
Initial conditions:#initial charities 2.583 4.928 60 hLog population, 1820 12.978 0.739 60 bLog population density, 1820 6.095 0.655 60 b
Impact of war:Battle exposure (months) 31.167 16.955 60 i% adult men 20.108 2.090 60 c% elderly and youth 16.527 1.661 60 cHuman capital shock -0.022 0.053 54 b
Interventions in education:Log (#communicants per 10,000+1) 2.921 1.152 60 fLog (#temple-converted schools+1) 1.158 0.930 60 j
33
Variable Mean Std. Dev. Observations Source
Mechanisms:Government spending on education 0.014 0.036 60 b#lower-primary schools per 10,000 0.589 0.4 60 b#dual-primary schools per 10,000 0.179 0.169 60 b#upper-primary schools per 10,000 0.063 0.055 60 b#secondary schools per 10,000 0.01 0.02 60 b
For panel estimates:Log(%access to basic education) 2.971 1.105 600 bLog population 12.707 0.859 600 b
a. Surname data sources (see Appendix B)b. Primary schools data sources (see Appendix C)b. Ge and Cao (2001)c. Yin and Tian (2009, Volumn 4), Compilation of Historical Population Statistics Duringthe Republican Period; Ge (2002b), Household and Population Census in Zhejiang Duringthe Republican Period.d. Yin and Tian (2009, Volumn 1), Compilation of Historical Population Statistics Duringthe Republican Period.e. Buck and Press (1937), Land Utilization in China: a study of 16,786 farms in 168localities, and 38,256 farm families in twenty-two provinces in China, 1929-1933.f. Local gazetteers on religious facilities.g. CHGIS (2007)h. Liang (2001)i. Guo (1989)j. Ouyang and Zhang (2010)
Table A-2: Instrumental Variable Analysis: Reduced-Form Estimates
(1) (2) (3) (4) (5) (6)Dependent variable: #public primary schools per 10,000 persons
Native-nearby culture distance -0.280 -0.371∗ -0.362∗ -0.367∗ -0.317 -0.319(0.194) (0.191) (0.182) (0.184) (0.216) (0.212)
Human capital shock N N N N N YBattle exposure N N N N N YFractionalization N N N N Y YInitial social capital N N N Y Y YPop. 1820 N N N Y Y YGeorgraphic controls N N Y Y Y YBaseline controls N Y Y Y Y Y
Observations 54 54 54 54 54 54Adjusted R2 0.034 0.047 0.381 0.367 0.348 0.331
Notes: The table reports reduced-form estimates of native-nearby culture distance. Theunit of observation is a county in Republican China. The specifications correspond to thosein Table 7. Robust standard errors are used in all specifications. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01
34
B Data Sources for Surnames
An Overview of Surname Data Sources
Pre-Taiping period Post-Taiping period
Deaths during the Taiping Rebelliona
Exam degree holders 1645-1850b
Diseased soldiers 1927-52c
College students 1906-1949d
31126
13340
39419
15584
Sources:
a: Traditional county gazetteers (pre-1949): chapter for deaths of high moral worth. 旧志-咸同忠烈
姓名录
b: Traditional county gazetteers (pre-1949): chapter for exam degree holders. 旧志-选举志(by year
of degree)
c: Modern county gazetteers(post-1949): chapter for deceased soldiers. 新志-抗战英烈和革命英烈
姓名录 (by year of death)
d: College yearbooks during the Republican era (by year of admission). See Hao and Clark (2014) for more details.
35
Fig
ure
A-1
:D
ata
Sou
rces
for
Su
rnam
es
36
C Data Sources for Basic Education
Year Region Sources
1820-1900 The Lower Yangzi Same as surname data sources. See “Some notes on surname data”.
The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Ministry of Education (教育部)
1907 Nation 光绪三十三年第一次教育统计图表
1908 Nation 光绪三十四年第二次教育统计图表
1909 Nation 宣统元年第一次教育统计图表
1915 Nation 中华民国第三次教育统计图表
1916 Nation 中华民国第四次教育统计图表
1917 Nation 中华民国第五次教育统计图表
1923
Jiangsu
江苏政治年鉴
The Yearbook of Politics: Jiangsu Province (1923)
The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Department of Education (教育
厅), Zhejiang Province (1925-1929)
1925 Zhejiang 中华民国十四年度浙江省教育统计图表
1927 Zhejiang 中华民国十六年度浙江省教育统计图表
1929 Zhejiang 中华民国十八年度浙江省教育统计图表
1929 Jiangsu 江苏教育概览 The Overview of education: Jiangsu province (1929)
The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Department of Education (教育
厅), Jiangsu Province (1932-1937)
1932 Jiangsu 民国二十一年度江苏省教育经费统计图表
1933 Jiangsu 民国二十二年度江苏省教育经费统计图表
1935-37 Jiangsu 江苏省教育统计图表
1935-37
Zhejiang
浙江省三年来教育概况
Education for the Last Three Years: Zhejiang province
1947 Jiangsu 江苏省 36 年教育统计
1947 Zhejiang
浙江经济年鉴
The Yearbook of Economy: Jiangsu Province (1923)
1950-60
Jiangsu
江苏五十年(1949-1999)
The fifty-year statistics of Jiangsu (1949-1999)
1950-60
Zhejiang
新浙江五十年统计资料汇编(1949-1999)
The fifty-year statistics of Zhejiang (1949-1999)
Figure A-2: Data Sources for public primary schools (1820-1960)
37
D The Traditional Education System
Prior to 1905, the primary education system was based upon Confucian classics and aimed at
success in the imperial examinations. At national, provincial and county levels, highly competi-
tive exams selected a few degree holders, and brought the individual, his clans, and communities
“prestige, power, and wealth through government service (Rawski, 1979, p.21). Within a com-
munity or a clan, an individual’s literacy helped determine his social status, occupation and
wealth. The return to passing exams generated considerable demand for privately or publicly
provided traditional schooling throughout the country. On the other hand, due to commercial-
ization in this area since the 16th century there was a growing demand for education, especially
basic education. The educational system consisted of two phases: mass primary education
teaching basic literacy and select students of talent, and the more advanced education drilling
candidates to prepare for exams (Leung, 1994). Those who reached the threshold level in literacy
to attend the exam (4,000 characters) accounted for only 1%-2% of the male population(Rawski,
1979, p.96). A much higher percentage of male population, roughly 30% to 40%, acquired ba-
sic literacy (numeracy and about 1000 characters) Crayen and Baten (2010). Female literacy
(2%-5%) was confined to those from elite families (Mann, 1994; Rawski, 1979).
Both basic and advanced education were mainly provided privately. Children in elite house-
holds were instructed by family members to begin with (age 3-6), and by hired private tutors
between age 6 and 15 (typically exam degree holders who had not been to a government po-
sition). Households of modest backgrounds pooled their resources and paid local teachers for
instructions. There is no evidence that a village tax was collected, nor any aid from government
was received to finance those educational instructions. Parents paid the teacher in either money
or commodities. Each tutor taught 1-30 students in the tutor’s own house or the village temple.
Hours and schedules were flexible, adjusted to weather and season. The cost varied with the
quality of instructions, and degree holders could demand a higher salary as tutors. At the age
of nine or ten, decisions were made based on the observed talent of children. Families would
support the talented ones to pursue exam careers, while they prepare others for various lesser
occupations.
Boys from families too poor to pay for schooling were not necessarily barred from the classroom.
Clan schools (zu xue) were often established primarily to aid such students (Rawski, 1979, pp.
30-32). These schools were financed by contributions, a clan tax, and clan land (xue tian) with
the rent going to special funds for education and exam preparation. Clan temples (zong miao)
often served as classrooms. For the talented students, their future study and exam taking were
fully covered by land funds. Most clan schools limited admission to kin members but exceptions
were made for talented non-kin members within the community . Overall, in the lower Yangzi
about 5-8% of males were supported by their kinships for their education.
The central and local government in late imperial China took a hands-off approach to financing
education. Local magistrates advocated setting up primary schools for the poor but there
38
was no record of direct funding from the government (Rawski, 1979, pp. 38-40). The education
establishments with direct government funding, called ”government-based county and prefecture
schools”, were for no more than a few hundred lower degree holders to accomplish higher degrees
(an allowance was provided to cover basic living costs and travel costs). There were a few
exceptions to the rule: in frontier regions populated by non-Han minorities, the government
funded the establishment and maintenance of schools; in areas with recent exposure to war and
famine, the government often helped to establish schools as a means to restoring the Confucian
order (Rawski, 1979, p.89). For instance, after the Taiping rebellion, the magistrate of the
county of Wujiang set up sixteen charity schools with the county budget, but those schools
admitted no more than 500 students county-wide, which accounted for only 1% of school-aged
children. Even so, these schools were closed after a decade because of ”a tight budget” and
the restoration of ”private and clan schools to a pre-war level.In most cases, such government-
initiated charity schools only admitted 0.5%-1% of the school-aged children, or 2%-4% of the
entire educated population.
E The Modern Education System
Content Pedagogy
Early childhood(4-7)
Basic Chinese charac-ters
Memorizing
TraditionalTutored at home byfamily members
Primary (7-12) Poetry and Confuciusclassics
Memorizing; Tutoredat home or at sishu byprivate tutors
Juvenile (13-17)and youth (18-25)
Preparing for exams(writing eight-leggedessays based on Con-fucius classics)
Memorizing andwriting; Studied athome or in privateacademies
Modern
Early childhood(4-7)
Basic Chinese charac-ters
Tutored at pre-schools
Primary (7-12) Chinese, art, mathe-matics
Studied in primaryschools
Juvenile (13-18) Chinese, art, math-ematics, western sci-ence
Studied in high pri-mary schools and sec-ondary schools
Youth (19-25) Art, western sciences,foreign languages, lawand engineering
Studied in universities
Table A-3: Traditional versus Modern Education. Sources: Yuchtman (2015)
39
Example county: County of Wu (吴县)
Upper-Primary Schools
Secondary Schools
City of Suzhou (苏州)
Small
town
Village
Lower-primary
schools
and dual-primary
schools
District-level
agency 区劝学员
Bureau of Education
县劝学所,教育局
Figure A-3: Types of Schools and The Fiscal Authority in Charge
40
F The Taiping Rebellion
Figure A-4: Battle Exposure
G Measures of Population Heterogeneity
Below we construct measures of population heterogeneity as alternative explanatory variables,
mainly, the fractionalization index and the polarization index. Those indices do not contain
information on migrant-native status. We compare MNCD to measures of population hetero-
geneity to put the explanatory power of the cultural distance between migrants and natives in
perspective.
G.1 Fractionalization
Akin to the fractionalization measure in (Alesina et al., 1999), we use the inverse of isonomy
to define fractionalization. Rather than ethnic groups, our measure relies on surnames (k). We
measure the probability of randomly drawn two individuals sharing the same surname for a
population at a given time:
I =S∑k
P 2k,i , (5)
41
where S is the number of surnames and Pk is the relative frequency of surname k, which is the
proportion of the population with surname k to the population. This is a Herfindahl-Hirschman
index of surname distribution. And,
FRAC =1
I. (6)
A higher FRAC corresponds with a lower likelihood that any two individuals randomly drawn
from a population bear the same surname (which indicate closeness in dialect, culture and even
genetics). This fractionalization index treats all groups symmetrically.
MNCD can change without increasing fractionalization. An example is that when people car-
rying a certain surname are wiped out by the rebellion, and replaced by migrants carrying a
different surname, the overall fractionalization of a population remains unchanged, but MNCD
can still change.
G.2 Polarization
Our first polarization index is taken from Reynal-Querol (2002) and Reynal-Querol and Montalvo
(2005):
POL =S∑
i=1
(12− Pk
12
)2Pk (7)
where S is the number of surnames and Pk is the relative frequency of surname k, which is
the proportion of the population with surname k to the population. This measure employs a
weighted sum of population shares and assumes the any two groups are either completely similar
or completely dissimilar.
We then rely on Duclos, Esteban, and Ray (2004); Esteban and Ray (2011); Esteban, Mayoral,
and Ray (2012a) to build a second measure of polarization:
POL DIST =S∑
i=1
S∑j=1
P 2i Pjdij . (8)
The group i here is defined as a surname group. The population share of each surname group is
Pi, the intergroup distance dij. Our measure of intergroup distance is the difference in human
capital between groups. This is not the perfect measure, but this is the best measure we can find
given our data. A similar measure to ours is the difference in incomes between groups (Esteban
and Ray, 1994) and (Aghion et al., 2005). For that reason, our polarization measure should be
interpreted with caution.
We report correlations between MNCD and fractionalization and MNCD and polarization (Table
A-4). We find MNCD is positively correlated with overall fractionalization and polarization, but
42
Table A-4: Correlations Between MNCD, Fractionalization and Polarization
MNCD Overall FRAC Overall POL Overall POL DIST
MNCD 1Overall FRAC 0.203 1Overall POL -0.229 -0.786∗∗∗ 1Overall POL DIST 0.151 -0.506∗∗∗ 0.434∗∗ 1∗ p < 0.1, ∗∗ p < 0.05, ∗∗∗ p < 0.01
not with within-native fractionalization and polarization. This is consistent with the nature of
MNCD: it captures the relationship between migrants and natives, not the relationship between
natives. The high correlations between MNCD and overall fractionalization and polarization
suggest that MNCD captures crucial information in overall population heterogeneity.
H Human Capital Shock
Migrants do not always have the same level of human capital as natives. When their levels of
human capital differ, this measure captures the resulting “shock” to the stock of local human
capital:
Human capital shock =∑k
(Pk,MN − Pk,N)RRk (9)
where RRk is a surname-specific variable that denotes the representation of surname k among
the educated elites relative to its population, i.e. RRk is the ratio of the share of k in education
elites to the share of k in the total population. Supposing that natives have two surnames:
Wang and Li. Each surname accounts for 50% of the population. Li’s are over represented
among education elites by a factor of 2, in other words, Li’s are two times more likely to show
up among national elites than in an average population, whereas Wang’s are just as likely to be
educated as is a member of the average population. For illustrative purposes, take the extreme
case where all Li’s die in the war. A family of Zhangs came to the village, with a RR of 0.5.
The human capital shock is (50%-50%)*1+(0%-0.5%)*2+(50%-0%)*0.5=-0.5.
We acknowledge that our measure of human capital shock can have measurement errors due to
migrant selectivity—migrants do not have to have the same capital level as the overall population
with their surnames. When skilled migrants migrate to more industrialized areas, our measure
can underestimate the level of their human capital. However, we do not see how this can bias our
results aside from introducing an attenuation bias. Also, this sort of measurement error is going
to be minor for most counties in our sample, with the exception of industrial Shanghai. Figure
A-5 shows geographic pattern of human capital shocks. The human capital shock appears to be
the most positive in the east, consistent with the fact that Shanghai often attracted migrants
with higher human capital relative to its native population.
43
Figure A-5: Human Capital Shock
I Evidence from Timing
I.1 Pre-1900 Access to Basic Education
We construct our time-varying access to basic education variable using two methods: for 1900-
1960, we obtain primary school enrollment rates from national and provincial censuses on modern
education. For 1820-1900, there were no primary schools in the modern sense, but a percentage
of people did acquire basic education as explained in Appendix D, but data on this are generally
not available. Below we explain our method in estimating pre-1900 rates of access to basic
education and how it affects the validity of our panel analysis.
We estimate pre-1900 %access to basic education from time-varying pre-1900 rates of college-
equivalent education (juren), college schools rate in 1947, and literary rate in 1947. While the
content of provincial-exams was vastly different from modern university education, successful
candidates of provincial exams were similarly at the top of the distribution of the educated.
This yields a method to construct a rough estimate of the share of population with a basic
education prior to 1900. We employ the following parallel projection to estimate time-varying
rates of receiving basic education:
%basic education accessi,1947 = a+ b ∗ college student per 10,000i, 1947 + µit . (10)
Second, using the obtained a and b (1.72 and 13.56 respectively), we project %access to basic
44
education in the exam era from juren per 10,000,
%basic education accessi,t = a+ b ∗ ft ∗ juren per 10000i,t , (11)
where ft is a time-varying ratio that transforms juren per 10,000 into college students in 1947.
For example, in 1820 and 1850, the proportion of juren is 0.03% of male population in Lower
Yangzi and in 1880 the proportion of juren is 0.06% (juren quota changed little but population
halved after 1860), whereas college students account for 0.3% of total population in 1947. So
the ft=10 for 1820 and 1850, and ft=5 for 1880.
We should point out that this method does assume a fixed ratio between advanced and basic
education in 1947 and in the 19th century. We want to point out that in our panel, our results
are unlikely affected by differential advanced-to-basic education ratios in 1947 and in the 19th
century. The difference between two ratios will be captured by our county fixed effects.
It should be noted that official 1900-1950 schools rates only comprise schools at modern schools
that provided western-style education and was open to the public. Comparing the primary
schools rate in the decade of 1900, and the rate of basic education in the period just before that,
we can estimate the differential response to modern basic education in relation to migrant-native
cultural distance, holding constant a county’s past coverage of basic education, and factors
common to all counties in modernizing their education. Due to data limitations, we cannot
track the evolution of traditional schools during the same period.19 Therefore, we do not know
whether the gap left by the differential response to modern education was fully compensated by
additional provision of traditional education.
19In 1935, and in 1949, both Zhejiang and Jiangsu collected data on and facilitated the registration of tra-ditional schools. Unfortunately, data quality was low and varied greatly from county to county. By 1953, alltraditional schools had been converted to formal public schools. Our basic education measure captures theamount of county- and community-level public and semi-public education. Schools initiated by individuals andclans, but did provide education to the wider population, are not excluded from the measure.
45