friends from afar: the taiping rebellion, cultural ...this paper tests the hypothesis that the...

Friends from Afar: The Taiping Rebellion, CulturalProximity and Primary Schooling in the Lower Yangzi,

1850-1949∗

Yu Hao‡

Peking University, School of Economics

Melanie Meng Xue§

UCLA Anderson School of Management

This Version: June 2016

Abstract

This paper tests the hypothesis that the cultural distance between migrants and na-tives impedes public goods provision. The Taiping Rebellion was a shock that causedgroups without a history of shared governance to be relocated into the same region.We use a unique historical dataset of surnames in the Lower Yangzi of China to con-struct a measure of the cultural distance between migrants and natives (MNCD).We find an one-standard-deviation increase in MNCD is associated with a decreaseof over 0.19 public primary schools per 10,000 persons in the early 20th century.Results survive various robustness checks and an instrumental analysis exploitingpre-existing cultural distances between native and the nearby population. Evidencefrom the timing of MNCD taking effect, suggests that the primary mechanism runsfrom migrant-native cultural distance through quality of collective decision-makingto modern primary education.

Keywords: Cultural Distance; Primary Education; Local Public Goods; Quasi-ExogenousMigrationJEL Codes: D72, J15, N45, N95, O15, Z1

∗We thank the editor Debin Ma, two anonymous referees, as well as Ying Bai, Zhiwu Chen, Qiang Chen,Gregory Clark, Christian Dippel, Mark Koyama, James Kung, Nan Li, Kris Mitchener, Jean-Laurent Rosenthal,Tuan Hwee Sng, Romain Wacziarg, Noam Yuchtman and conference participants at the All-UC Conference on“Frontiers in Chinese Economic History” , Chinese University of Hong Kong, Shandong University, Xiamen Uni-versity and Peking University for helpful comments and suggestions. All remaining errors are the responsibilityof the authors.

‡[email protected]§[email protected]

I. INTRODUCTION

An extensive literature documents the negative impact of population heterogeneity on public

goods provision (Easterly and Levine, 1997; Alesina, Baqir, and Easterly, 1999; Alesina and

Ferrara, 2005). However, recent research suggests the lack of history of shared and centralized

governance between groups is just as likely to be responsible for the adverse outcomes associated

with the coexistence of different ethnic groups(Gennaioli and Rainer, 2007; Michalopoulos and

Papaioannou, 2013). This raises the question whether ethnic cleavages or artificial jurisdictions

has caused poor economic performance. Dippel (2014) contributes to the debate by showing that

a lack of a history of shared governance can negatively affect even ethnically and linguistically

homogeneous populations. We instead show that even for previously detached groups, cultural

distance can matter for the coexistence of different ethnic groups.

We exploit variation in cultural distance between previously detached groups following an exter-

nal shock: the Taiping Rebellion. We use a unique dataset of Chinese surnames of approximately

100,000 individuals over the course of 150 years. The same dataset also allows us to build com-

mon measures of population heterogeneity such as fractionalization and polarization.

The Taiping Rebellion (1850-1864) was a massive civil war in South China that constituted a

one-time shock to the population makeup. The rebellion led to the loss of 17 million lives in the

Lower Yangzi, or half of the native population (Cao, 1998). After the war, migrants flocked into

the region and began to coexist with natives. This shock created two groups without a history

of shared governance or prior interaction in a region. Cultural proximity between migrants and

natives varied. We hypothesize that cultural distance between migrants and natives (“MNCD”),

who lived in the same community after the rebellion, had a negative impact on public goods

provision.

We provide the historical context to show that migration was plausibly exogenous to the cultural

distance between migrants and natives. First, migrants moved into the area with little prior

contact with natives. Migrants were not selected based on their cultural proximity with the

natives (as is the case with chain migration). Ex-ante sorting was minimum. Second, in tradi-

tional China where ancestral land was of cultural prominence, natives were not able to move out

as freely in response to the arrival of migrants whose preferences differed. Hence ex-post self-

sorting was not a concern, either.1 To further establish causality, we introduce an instrumental

variable approach exploiting variation in pre-existing native-nearby cultural distance.

We go on to test our hypothesis that a greater migrant-native cultural distance lowers public

goods provision. Our proxy for public goods provision is the number of public primary schools

at county level. In the baseline model, we find that a one-standard-deviation increase in MNCD

is associated with a decrease of 0.19 public primary schools per 10,000 persons between 1900

1Compared to Ager and Bruckner (2013), we use arguably more exogenous migration as a treatment, asnatives and migrants had few opportunities to engage in ex ante screening, or ex-post self-sorting.

1

and 1910. That is a fifth of the mean of the number of public primary schools by population, or

40% of the standard devision. We then include in the controls share of arable land, distance to

the Grand Canal, distance to the Yangtze River, distance to the provincial capital, and distance

to Shanghai. We show that MNCD wins horse races against alternative explanatory variables

including the traditional fractionalization index and the polarization index. For robustness,

we control for initial conditions, interventions in education (missionary activities and temple

conversion), and confront possible effects of war (battle exposure, demographic shock and human

capital shock) on schools.

While our key finding is that cultural distance has an independent effect on public goods pro-

vision outcomes, we find evidence that the negative effects of the cultural distance between

surname groups can be mitigated by the history of shared governance. Our finding is mainly

built off the horse race results of MNCD against other measures of population heterogeneity

that ignore the history of shared governance within native surname groups.

We provide suggestive evidence on the mechanisms through which MNCD prevented the estab-

lishment of public primary schools. First, we exploit institutional features of early 20th century

China to form a testable hypothesis : MNCD should have the strongest effect on lower-primary

and dual-primary schools, since (a.) MNCD should matter the most in an environment of self-

governance, (b.) villages were traditionally self-governed, and (c.) villages (and townships) were

responsible for the building of lower-primary schools, and sometimes, dual-primary schools.2

Consistent with our prediction, we find that MNCD only affects schools at lower-primary and

dual-primary schools, not at upper-primary and secondary schools. Second, we exploit time

variation in institutions through the first half of the 20th century. Massive institutional changes

in 20th century China provides an excellent laboratory to observe the impact of MNCD. In our

sample, we have periods of decentralization and centralization of the education system, when

decisions to education children were made locally and when those decisions were made by the

national or provincial government, respectively. This allows us to use evidence from timing to

interpret our finding. We expect to see a larger effect of MNCD during the period featuring

more self-governance and more decentralization of fiscal authority. And consistent with this

prediction, we find the effect of MNCD on modern education is pronounced in the early 20th

century but is muted in China for much of the 20th century under autocratic rule and fiscal

centralization. Soon after the centralization of the educational system in 1927, MNCD no longer

had a significant effect on schools. We conclude that MNCD resulted in fewer primary schools

being build due to lower quality of collective decision-making in local communities.

Our study builds on the literature on the relationship between diversity of individual preferences

and public goods provision. Alesina, Baqir, and Easterly (1999) show theoretically that the

2At the time, the entire phase of primary education was divided into two: upper- and lower-primary educa-tion.“Upper primary education” refers to the more advanced stage of primary education. Most schools specializedin either upper- or lower-primary education. Those providing both upper and lower primary education were called“dual-primary schools”.

2

median distance from the preference of the median voter can be considered as an indication of

how polarized preferences are. The model predicts that public goods provision will be adversely

affected in a polarized society characterized by two separate groups with relatively homogeneous

preferences within the group, but very distinct preferences across groups. More recent work

shows that in the process of decentralization and redistricting, the benefits of reduced diversity

can be undone if the newly governed population is highly polarized (Bazzi et al., 2015). In our

paper, we use the cultural distance between migrants and natives as a proxy for the difference

in preferences between these two groups. We find that cultural distances between groups indeed

matter for public goods provision, whereas the traditional fragmentation measure that assigns

the same distance to all groups does not produce the same effects.

Our study also contributes to the literature of the effect of genetic dissimilarity on economic

development. Ashraf and Galor (2013) find the beneficial and detrimental effects of diversity on

productivity, and conclude that an immediate level of diversity is the most conducive for eco-

nomic development. Desmet, Le Breton, Ortuno-Ortın, and Weber (2011) link genetic distance

to the stability and breakup of nations, and provides empirical support for the use of genetic dis-

tance as a proxy of cultural heterogeneity. Spolaore and Wacziarg (2009) show genetic distance

affects income differences across countries through a barrier effect to the diffusion of develop-

ment from the world technological frontier. Our paper similarly uses genetic distance as a proxy

for cultural distance, and focuses on the public goods provision consequences of greater genetic

and cultural distances between groups.

This paper is organized as follows. Section II explains the historical context. Section III discusses

data sources and the basis for constructing our measure of migrant-native cultural distance. Sec-

tion IV summarizes my baseline results and the comparison of migrant-native cultural distance

to the fractionalization and polarization of the population. Section V introduces a number of

robustness checks, accounting for initial conditions, interventions in education and war-related

conditions. Section VI comprises an instrumental variable analysis. Section VII identifies the

quality of collective decision-making as a possible channel for migrant-native cultural distance

to influence the supply of modern primary education. Section VIII concludes the paper.

II. HISTORICAL CONTEXT

A The Taiping Rebellion

The Taiping Rebellion was a massive civil war in South China which lasted from 1850 to 1864.

At least 17 million people, or half of the populace, died in the lower Yangzi (Cao, 1998; Cao and

Li, 2000). Battles broke out throughout the Lower Yangzi and all counties, with the exception

of Shanghai, were occupied for at least 3 months. The area around Nanjing, the capital city

of Taiping regime since 1853, had lingering conflicts for over ten years. The most prosperous

and important cities in the Lower Yangzi, Hangzhou and Suzhou were occupied by the Taiping

3

army after 1860. Shanghai, protected by foreign powers, was the least affected, and it served

as a shelter for over 200,000 refugees (Ge, 2002a, pp.62–63). Famine and plague followed the

battles. So did mass migration.

A.1 In-Flow Migration

Migration occurred both during the Taiping Rebellion itself and in the aftermath of the rebellion.

While migration internal to the Lower Yangzi was certainly common, post-Taiping migration

was best characterized by long-distance migration from North China and from the Middle Yangzi

River. A crucial difference between pre- and post-Taiping migration is that pre-Taiping migra-

tion was largely driven by income differences, job opportunities, and based on ethnic bonds

and geographic proximity (Li, 2011), whereas post-Taping rebellion migration originated from

a very wide range of geographic areas and featured diverse economic and cultural backgrounds.

Another difference is the scale and pace of migration—post-Taiping migration was far more

rapid and broader in scale. For this reason, in this paper we focus on post-Taiping migration.3

The mass migration led to conflicts between natives and migrants, and between different migrant

groups. In villages and townships conflicts arose as a result of clashing preferences and interests,

different dialects, skills and social customs. Conflicts, as documented in local gazetteers, took

place over a wide range of issues such as usage of public water, property rights of ownerless land

and eligibility for imperial exams (Ge, 2002a, pp. 303-308).

A.2 The Economic and Political Consequences of the Taiping Rebellion

The Taiping Rebellion constituted a multi-dimensional shock to the region. Most likely, it had

more than one way to affect primary schools. Those effects could be at play on both the supply

and demand side of education. In Section V.C, we provide a quantitative analysis of how various

outcomes of the Taiping Rebellion might have affected the building of primary schools in the

early 20th century.

The rebellion damaged local infrastructure. In the Jiaxin prefecture of Zhejiang, 21% of Bud-

dhist and Tao temples were destroyed by rebels affiliated with the God Worshiping Cult (Li,

2002). County public schools and private schools, where lower degree holders received further in-

struction to prepare for the higher level exams, were also destroyed or damaged in large numbers.

The destruction of local infrastructure could have arguably undermined the resources useful to

the launch of modern schools fifty years after the rebellion. That said, it is widely documented

that most temples and schools were restored shortly and even more were built in the late 19th

century. For example, Li (2002) found that 98 temples were destroyed by the rebels but 220

were built (or restored) within twenty years after the rebellion because living standards was

3The provincial governments advertised all around China for migrants and depicted the Lower Yangzi asa ‘kingdom of free land; and the ‘land of opportunities’. Farmers from Henan, Anhui, Hubei, Hunan, NorthJiangsu, and South Zhejiang came for a better living (Ge, 2002a, pp.100-106). After 1900, industrialization drewmassive immigrants into the urban area of Shanghai (Junya and Wright, 2010). Its population increased byfour-fold from 1907 to 1947.

4

even better than before the war and trade and commercial network was quickly restored. Kuhn

(2002), to the contrary, interpreted this trend (along with the rise of local charity organization)

as the rise of local gentry at provincial and county level overseeing local public affairs, which

eventually led to the formalization of local self-governance in the early 20th century.

The rebellion dismantled kinship networks. Clans used to provide financial aid for clan members

to receive education. During the rebellion rich families migrated to the urban area with their

less well-off relatives left behind in the countryside. Clans also lost land property to the war, the

rent from which were assigned as public funds for supporting education (Li, 1981). As discussed

by Xu and Yao (2015), kinship networks are an alternative to formal institutions in providing

public goods by effectively overcoming free-riding problems. However, in the context of education

reform in the 1900s, strong kinship networks can be a double-edged sword. Clans sometimes

would prefer the option of funding informal and private tutoring exclusively enjoyed by clan

members to establishing a school accessible by both clan and non-clan members. A weaker

kinship network, in that case, may have reduced within-kinship public goods but enhanced

cross-kinship public goods.

The rebellion led to huge population losses, which induced higher land-labor ratios and higher

wages (Cao and Chen, 2002). High wages forced war-stricken areas to abandon subsistence

agriculture and switch to labor-saving technologies and industries. In Wujin and Wuxi, the

silk industry superseded rice farming to be the largest employer in the rural area (Mickey and

Shiroyama, 2009). Lin and Li (2014) show that areas with a larger impact of war saw more

modern industrial enterprises in the late 19th century and had a higher level of urbanization

in the 1930s. In addition, the rebellion inadvertently created political room for institutions in

favor of modernization to set roots. Pro-reform officials were assigned to post-war provinces.

They established formal institutions to promote industrialization.

B Educational Reforms: From Traditional to Modern Education

Fifty years after the Taiping Rebellion, Qing Government put forward an educational reform.

The abolition of the imperial exam system went hand in hand with the attempt to establish a

western-style, modern school system.

Prior to 1905, education focused on Confucian classics and aimed at preparing students for the

imperial examinations. The traditional educational system included two stages: mass primary

education aimed for basic literacy and talent spotting, and more advanced education that drilled

candidates selected from the first stage to pass the exams (Leung, 1994). In the late 19th century

growing economic openness gave rise to higher demand for education in science, technology, and

other non-exam skills (Yuchtman, 2015). Attempts by missionaries to build modern schools

began in some coastal cities as early as the 1860s. But only until the abolishment of the

exam system in 1905 did modern education begin to expand. The Ministry of Education was

established, and Offices of Provincial Education was founded, along with county-level agencies

known as “Education Exhorting Offices” (quan xue suo).

5

Educational reform was not a smooth process. Despite ambitious political and educational

reforms, few things changed on the ground. For villages and townships, the process of building

modern schools was slow and painful. County officials often found it difficult to raise county

taxes, and make within-county transfers to ensure universal primary schools. Clans sometimes

would prefer to provide direct financial aid to clan members for them to take cheaper informal

and private tutorships, rather than establish a school open to both clan and non-clan members.

More details concerning both the institutional features of the traditional exam system and of

the modern education system are included in the appendix (Appendix D and Appendix E).

III. DATA AND MEASUREMENT

Table A-1 provides summary statistics of all the variables used in the paper and their data

sources. Below we focus on the underlying logic of our independent variable, migrant-native

cultural distance.

A Independent Variable: Migrant-Native Cultural Distance

Our independent variable is the cultural distance between migrants and natives. We rely on

surname data to construct our measure. To be specific, we use differences in the surname mix

to proxy for the cultural distance between migrants and natives:

MNCDi =1

normalized isonomyN,MN,i

, (1)

where normalized isonomyN,MN,i =∑S

k Pk,N,iPk,MN,i√∑Sk P 2

k,N,i

∑Sk P 2

k,MN,i

. S is the number of surnames in the two

groups. Pk,N,i and Pk,MN,i are the relative frequencies of surname k within natives and within the

entire population including natives and migrants.4 The isonomy between the native population

and the entire population,∑S

k Pk,N,iPk,MN,i, measures how likely any individual randomly drawn

from within natives bears the same surname as one drawn from within the entire population.

We normalize it with the isonomy of the native population,∑S

k P2k,N,i, and the isonomy of the

entire population,∑S

k P2k,MN,i. MNCD captures how culturally dissimilar natives and migrants

were. Figure 1 illustrates migrant-native cultural distance in the Lower Yangzi.

Our approach is in line with Bai and Kung (2011, 2015); Li (2011); Spolaore and Wacziarg

(2009). Li (2011) uses surname distances between pair of countries or regions at a given time to

measure multilateral genetic and cultural distance. Following Du et al. (1997), Bai and Kung

(2011) and Bai and Kung (2015) use isonomy (similarity in surname distribution between any

4In practice, we extract migrant-native cultural distance from the distance in the surname distribution ofa county’s population before and after the rebellion, with the assumption that the surname mix of the nativesremained relatively stable during that period. It is clear that the total population of natives declined after therebellions, but as long as the proportion of each surname population to the total population did not change, ourassumption remains valid.

6

Figure 1: Migrant-Native Cultural Distance

pair of population) to approximate genetic and cultural distance across regions. They show that

this surname-based measure is strongly correlated to a measure of genetic distance based on the

frequency distribution of the A, B and O alleles of the ABO gene at the province level. They

also find that surname-based measure is strongly correlated to a measure of cultural distance

based on dialects. Similar to Spolaore and Wacziarg (2009), they find that the smaller genetic

or cultural distance to the technological frontier from a region, the faster the technology diffused

to that region and hence faster growth. In this paper, we adopt the same measure as in Bai

and Kung (2011); but instead of using isonomy between two regions, we use isonomy between

before-migration population (natives) and after-migration population (natives and migrants) to

proxy for the consanguinity between natives and migrants.

We do not have data on which surnames were associated with migrants but it is generally

the case that migrants as a group were different to natives in terms of surname distribution,

especially when they came from far away. To obtain the surname distribution of a county before

migration, we hand-collect data from county chronicles (a.) name lists of civilians who died

during the rebellion 1851-1865, (b.) exam degree holders (1645-1850) (c.), surnames of chaste

women, (d.) surnames of their husbands if they are also recorded. The number of records for

7

each county ranges from 800 to 3000. To obtain the surname distribution of a county after

migration, we use the following sources: (a.) surnames of dead soldiers (1927-1953) and (b.)

college students born in that county who graduated between 1900 and 1949. The number of

surnames from the above sources ranges from 500 to 2500 per county. More details can be found

in Appendix B.

We draw surnames of individuals from a wide range of social backgrounds to improve the repre-

sentativeness of the surname sample. One concern might be that these samples seem too small

to correctly estimate the true surname distribution of population if each surname accounts for a

small fraction of population. However, this is not the case for China. In each county the largest

few surnames each accounts for more than 5% of the population. To correctly estimate surname

distribution of population at least for the largest few surnames, one needs a population sample

of as small as a few hundred. For the six counties we indeed have a small sample problem, we do

not include them in most of our regressions. As counties with a small sample are not random, we

impute our outcome variable for those six counties, and run regressions on the sample including

them as well.

One way to check the validity of our measure is to cross-check with measures reflecting cultural

or ethnolingustic devisions. One indicator of cultural devisions in the Lower Yangzi is linguistic

enclaves. Before the rebellion, almost the entire lower Yangzi spoke Wu. After the rebellion,

migrants settled in this area, giving rise to thousands of Mandarin-speaking villages and com-

munities (Huang, 2004; Cao, 2006; Simmons et al., 2006). In Figure 2, we mark counties with

linguistic enclaves where Mandarin (guan hua) is used. As shown on the map, counties with

some of the highest values of MNCD harbor linguistic enclaves even today. This enhances our

confidence in the validity of our measure.

Ideally, we would also like to construct a weighted fragmentation index that takes into account

whether individuals with the same surname to be both natives, both migrants, or one native

and one migrant. The fractionalization of the population comes down to dissimilarity of options

preferred by each other. Rather than to simply stipulate that options are either similar or

dissimilar, (Bossert et al., 2003) propose alternative frameworks that permit more degrees of

similarity between options. A generalized index of fractionalization is described in (Bossert

et al., 2011). Drawing on the insights from Bossert et al. (2003), Alesina and Ferrara (2005)

and Caselli and Coleman (2013), we can assume the distance between two individuals with the

same surname, if one is a migrant and the other is a native, to be positive; and the distance

between two individuals with the same surname, both being natives or being migrants, to be

zero. Unfortunately, without individual-level or surname-level data on migrant-native status,

our ability to operationalize this index with our data is limited.

B Control Variables

To account for other factors shaping modern education, we include in the baseline controls pri-

mary schools rate in 1880, population size and urbanization rate. Culturally dissimilar migrants

8

Figure 2: Mandarin Linguistic Enclaves in the Lower Yangzi (Wu-Speaking) Sources: Cao(2006)

might be selected into places with economic conditions that are not in favor of public goods

provision and schooling. So in the full set of controls, we include ruggedness, share of arable

land, agricultural suitability, distance to the Yangtze River, distance to the Grand Canal, dis-

tance to the provincial capital and distance to Shanghai to capture other differences in economic

conditions between counties. For robustness, we account for initial conditions—the number of

charitable organizations by 1850, population and population density in 1820, and measure of the

impact of war—battle exposure (months), % of elderly and youth (under 20 or over 40), % of

adult men (between 20 and 40) and a measure of differences in human capital between migrants

and natives which we call the “human capital shock”. We infer that natives who were able to set

up more charitable organizations have higher social capital, which is likely correlated with both

the type of migrants they admit and retain, as well as with their own ability to provide public

education. We also discuss other potential shocks to education, such as missionary activities,

measured by the log of one plus communicants per 10,000, and temple conversion, measured

by the log of one plus number of temple-converted schools. We expect both variables to be

positively associated with our dependent variable.

IV. BASELINE RESULTS

We use an OLS model to estimate the impact of migrant-native cultural distance on the number

9

of public primary schools:

#public primary schools per 10,000 personsi = α + βMNCDi + XiΩ + εi . (2)

The dependent variable #public primary schools per 10,000 persons is the number of public

primary schools per 10,000 persons right after the educational reform.5 MNCDi is the migrant-

native cultural distance in County i. MNCD exclusively focuses on the cultural distance between

migrants and natives—two groups with no history of shared governance.6 XiΩ are a vector of

county-level controls. εi is a disturbance term.

Table 1 summarizes estimates of the effect of migrant-native cultural distance on public primary

schools. With all controls, a one-standard-deviation of MNCD reduces the number of public

primary schools by approximately 0.18 school per 10,000 persons, which is equal to a fifth of

the mean or 40% of the standard deviation. We show an unconditional regression of MNCD on

primary schools during the decade of 1900-1910 in Column 1. The relationship is both negative

and significant. In Columns 2 and 3 we add population and urbanization rate sequentially. In

Alesina et al. (1999), a larger population means greater economy of scale to provide public goods,

but a higher transaction cost in raising taxes. Urbanization may enhance the economic return

of education, affecting the demand side of education. In Column 4, we include basic education

access in 1880. We interpret this as a measure of the stock of human capital in an area, and as a

summary statistic of those slow-moving components in local culture and institutions that shape

the decision to receive education in the long run. By isolating the influence of past educational

achievement under the private education system, we are one step closer to focusing on the impact

of MNCD on public goods provision. From Column 5 to Column 10, we introduce geographic

controls such as ruggedness, share of arable land and agricultural suitability, distance to the

Grand Canal, distance to the Yangzi River, distance to Shanghai, and distance to the provincial

capital.7 Share of arable land and agricultural suitability can proxy the opportunity cost of

receiving modern education. Distance to the Grand Canal and distance to the Yangzi River are

used to proxy the market potential and access to trade. Distance to the provincial capital is

included to account for the reach of provincial government or state capacity. Shanghai, which

became a treaty port as early as 1845, was exposed to rapid modernization and industrialization.

We control for distance to Shanghai to account for the spillover effects of Shanghai on the rest of

the region. The coefficient estimate of MNCD in Column 1 (-0.21) with no control is of similar

magnitude to that in Column 10 (-0.18) with all controls, when there is a sizable increase in R2

(from 0.174 to 0.399).

5Data are from the 1907, 1908 and 1909 Census. Those are the only censuses in the decade of 1900 thatcontain information on schools.

6Dippel (2014) stresses the role of the history of shared governance in the discussion of ethnically andlinguistically fragmented jurisdictions having poorer economic performances.

7Nanjing for Jiangsu, and Hangzhou for Zhejiang.

10

Tab

le1:

Th

eIm

pac

tof

ofM

igra

nt-

Nat

ive

Cu

ltu

ral

Dis

tan

ce:

Mai

nS

pec

ifica

tion

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

(11)

Dep

end

ent

vari

ab

le:

#p

ub

lic

pri

mary

sch

ools

per

10,0

00

per

son

s

MN

CD

-0.2

19∗∗

∗-0

.272∗

∗∗-0

.272∗∗

∗-0

.271∗∗

∗-0

.272∗

∗∗-0

.255∗

∗∗-0

.246∗∗

∗-0

.192∗∗

∗-0

.161∗

∗∗-0

.180∗∗

∗-0

.185∗∗

∗

(0.0

51)

(0.0

52)

(0.0

52)

(0.0

58)

(0.0

59)

(0.0

61)

(0.0

67)

(0.0

60)

(0.0

59)

(0.0

51)

(0.0

41)

Log

pop

ula

tion

-0.2

37∗

∗∗-0

.240∗∗

-0.2

45∗∗

∗-0

.252∗∗

∗-0

.228∗

∗-0

.203∗

-0.1

85

-0.1

97∗

-0.3

33∗∗

∗-0

.347∗∗

∗

(0.0

86)

(0.0

91)

(0.0

85)

(0.0

87)

(0.0

93)

(0.1

07)

(0.1

13)

(0.1

07)

(0.1

08)

(0.0

92)

Urb

aniz

atio

n0.0

00

0.0

00

0.0

00

0.0

01

0.0

01

-0.0

00

-0.0

02

0.0

03

0.0

03

(0.0

03)

(0.0

03)

(0.0

02)

(0.0

02)

(0.0

03)

(0.0

03)

(0.0

03)

(0.0

03)

(0.0

03)

Log

pri

mar

yen

roll

men

t18

800.0

35

0.0

23

-0.0

56

-0.0

03

-0.0

80

0.0

18

0.0

76

0.2

44

(0.2

64)

(0.2

72)

(0.2

64)

(0.3

10)

(0.2

96)

(0.2

89)

(0.2

63)

(0.2

60)

Ru

gged

nes

s-0

.013

0.0

10

0.0

26

-0.0

02

0.0

23

0.0

16

0.0

41

(0.0

40)

(0.0

64)

(0.0

77)

(0.0

78)

(0.0

84)

(0.0

68)

(0.0

41)

%A

rab

lela

nd

0.5

88

0.6

98∗

0.4

24

0.1

32

0.4

83

0.5

35

(0.3

58)

(0.4

07)

(0.4

30)

(0.5

02)

(0.4

28)

(0.3

87)

Agr

icu

ltu

ral

suit

abil

ity

0.0

14

0.0

08

-0.0

36

-0.0

39

-0.0

06

-0.0

04

(0.0

45)

(0.0

47)

(0.0

41)

(0.0

44)

(0.0

39)

(0.0

37)

Dis

t.to

Gra

nd

Can

al0.1

96

0.0

21

-0.0

67

-0.5

94

-0.4

69

(0.3

28)

(0.3

61)

(0.3

69)

(0.4

00)

(0.3

47)

Dis

t.to

Yan

gtze

0.2

46∗

0.2

05∗

0.1

44

0.0

99

(0.1

30)

(0.1

21)

(0.1

19)

(0.0

99)

Dis

t.to

Sh

angh

ai-0

.002∗

0.0

02

0.0

02

(0.0

01)

(0.0

01)

(0.0

01)

Dis

t.to

pro

vin

cial

cap

ital

0.0

04∗∗

∗0.0

04∗∗

∗

(0.0

01)

(0.0

01)

Ob

serv

atio

ns

5454

54

54

54

54

54

54

54

54

60

Ad

just

edR

20.

174

0.267

0.2

53

0.2

38

0.2

25

0.2

42

0.2

31

0.2

71

0.3

05

0.3

99

0.4

07

Not

es:

Th

eta

ble

rep

orts

the

imp

act

ofm

igra

nt-

nat

ive

cult

ura

ld

ista

nce

onth

enu

mb

erof

pu

bli

cp

rim

ary

sch

ools

inth

ed

ecad

eof

1900

-191

0.T

he

un

itof

anal

ysi

sis

aco

unty

inR

epu

bli

can

Ch

ina.

All

bas

elin

eco

ntr

ols

are

incl

ud

edin

Colu

mn

4:

the

natu

ral

log

ofp

opu

lati

on,

urb

aniz

atio

nra

tean

dth

en

atu

ral

log

ofpri

mar

ysc

hool

enro

llm

ent

in18

80.

Both

base

lin

eco

ntr

ols

an

dgeo

gra

ph

icco

ntr

ols

are

incl

ud

edin

Col

um

n10

.C

olu

mn

11in

clu

des

six

add

itio

nal

obse

rvat

ions

inw

hic

hM

NC

Dis

imp

ute

d.

Rob

ust

stand

ard

erro

rsar

ein

clu

ded

inal

lsp

ecifi

cati

ons.

∗p<

0.1

0,∗∗

p<

0.05

,∗∗

∗p<

0.0

1

11

This assures us that selection on unobservables are likely limited.8 In Column 11 we add six

counties with imputed values of MNCD to the sample. Our coefficients of interest remain stable

throughout the columns. The coefficient estimate in Column 11 is somewhat smaller. We

attribute this to the addition of six counties with imputed values of MNCD resulting in an

increase in measurement error. We use Column 4 as our baseline for the rest of the paper.

We next show how MNCD compares to measures of population heterogeneity. We use the

same surname data to construct the traditional fractionalization index and two polarization

indices (See Appendix G). In Table 2, we estimate the effects of MNCD, fractionalization and

polarization on public primary schools. Our main finding is that MNCD has a strong and

positive effect on public primary schools, and neither fractionalization nor polarization in the

entire population has a statistically significant effect.

The traditional fractionalization index is positively correlated with the number of public primary

schools per 10,000 persons (Col. 1-3), once MNCD is controlled for. This suggests that once

partialling out the effects of the cultural distance between migrants and natives, the fraction-

alization of the population does not necessarily reduce public goods provision. The coefficient

estimates of the polarization indices are negative across the columns (Col. 4-9). By comparing

MNCD to the polarization index adjusting for intergroup distances between surname groups

(Col. 7-9), we find that MNCD can reduce both the statistical significance and magnitude of

the coefficient estimate of the distance-based polarization index. From Column 7 to Column 8,

the coefficient estimate of polarization of the entire population moves from -0.165 (the p-value

is 0.138) to -0.118 (the p-value is 0.269).

In face of other measures of population heterogeneity, the effects of MNCD largely remain.

This first suggests that MNCD likely captures an important component distinct from what is

encapsulated in the fractionalization and polarization indices. Secondly, MNCD likely shares a

few traits in common with polarization. And they both have negative effects on public primary

schools. Third, I can also infer that the history of shared governance mitigates the effects of

cultural distance on local public goods provision. Likely by taking into account the history of

shared governance, MNCD outperforms polarization in explaining outcomes of public primary

schools. Our interpretation is that the cultural distance between previously detached groups is

a more important type of cultural distance. Unfortunately, given the nature of the between-

surname-group “distance” measure we use (see Appendix G), we can only draw a tentative

conclusion about the mitigating role of the history of shared governance on the negative effects

of the cultural distance between groups.

8Oster (2014) formalizes this approach. The results of this analysis suggest that the ratio of selection onunobservables relative to selection on observables has to be five times larger to explain away my results. Based onthe reasoning outlined by Altonji et al. (2005) that unobservables should not be more important than observablesin explaining the treatment, it is highly unlikely that unobservables are biasing my results.

12

Table 2: MNCD, Fractionalization and Polarization

(1) (2) (3) (4) (5) (6) (7) (8) (9)Dependent variable: #public primary schools per 10,000 persons

MNCD -0.214∗∗∗ -0.215∗∗∗ -0.206∗∗∗ -0.173∗∗ -0.163∗∗∗ -0.175∗∗

(0.056) (0.059) (0.056) (0.072) (0.052) (0.066)FRAC 0.013 0.023∗∗ 0.023∗

(0.012) (0.011) (0.012)Native FRAC -0.001

(0.015)POL -1.621 -3.722 -2.196

(2.851) (2.664) (3.539)Native POL -1.193

(1.716)POL DIST -0.165 -0.118 -0.091

(0.109) (0.106) (0.155)Native POL DIST -0.114

(0.386)Baseline controls Y Y Y Y Y Y Y Y YGeographic controls Y Y Y Y Y Y Y Y Y

Observations 54 54 54 54 54 54 54 54 54Adjusted R2 0.318 0.435 0.421 0.306 0.412 0.405 0.341 0.405 0.392

Notes: The table reports OLS results of the impact of fractionalization and polarization onpublic primary schools. The unit of observation is a county in Republican China. Robuststandard errors are used in all specifications. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01

V. ROBUSTNESS CHECKS

A Subsamples

We first check if our results are robust to the omission of outliers. Table 3 provides evidence

that outliers do not have a substantial impact on our results. In Column 2 we exclude counties

with extreme values of MNCD. Column 3 excludes counties with extremely large human capital

shocks from the sample. Column 4 drops Shanghai. Coefficients in Columns 3 and 4 are fairly

comparable to those in the baseline. The coefficient of MNCD is slightly larger in Column 2.

This indicates possible attenuation bias due to measurement errors in MNCD. Our measurement

of MNCD may be particularly poor for those outliers, and MNCD may become more sensitive

to sampling errors in the range of large values. In such cases, removing those observations with

mis-measured values should improve our estimation.

B Initial Conditions

Public goods provision depends on the level of social capital. At the same time, higher social

capital within natives could imply more exclusivity, which could result in fewer migrants, and

especially, fewer migrants who disagree with values possessed by natives. To check if social

capital drives both MNCD and public primary schools, we explicitly include initial social capital

13

Table 3: Robustness: Subsamples

(1) (2) (3) (4)Dependent variable: #public primary schools per 10,000

MNCD -0.180∗∗∗ -0.201∗∗ -0.191∗∗∗ -0.180∗∗∗

(0.051) (0.074) (0.056) (0.051)Baseline controls Y Y Y Y

Subsamples Full sample MNCD outliers Human capital outliers Shanghai

Observations 54 52 51 53Adjusted R2 0.399 0.374 0.371 0.394

Notes: The table reports the impact of migrant-native cultural distance on public primaryschools during the decade of 1900-1910 on various subsamples. The unit of observationis a county in Republican China. Column 1 provides benchmark results from Column 10of Table 1. Column 2 excludes three counties with the highest MNCD (MNCD>5.25).Column 3 excludes counties with the largest decline in human capital (Human capital-afterminus Human capital-before is less than -0.1). Column 4 excludes the important treaty portcity—Shanghai. Robust standard errors are used in all columns. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01

Table 4: Initial Conditions

Dependent variable: #public primary schools per 10,000 persons(1) (2) (3) (4) (5)

MNCD -0.180∗∗∗ -0.181∗∗∗ -0.201∗∗∗ -0.170∗∗∗ -0.242∗∗∗

(0.051) (0.052) (0.055) (0.053) (0.063)#initial charities 0.003 0.005

(0.006) (0.006)Initial population 0.608∗∗ 0.887∗∗

(0.239) (0.329)Initial pop. density 0.119 -0.351

(0.259) (0.278)

Baseline controls Y Y Y Y YGeographic controls Y Y Y Y Y

Observations 54 54 54 54 54Adjusted R2 0.399 0.385 0.461 0.388 0.458

Notes: The table reports reports OLS results of the impact of MNCD on public primaryschools, accounting for initial conditions. The unit of observation is a county in RepublicanChina. Initial # charities are number of charities before the Taiping Rebellion. Initial pop-ulation and initial population density are population and population density in 1820. Allspecifications include baseline controls and geographic controls. Baseline controls includelogged primary enrollment rate in 1880, urbanization rate and logged population. Geo-graphic controls include ruggedness, share of arable land, agricultural suitability, distanceto the Grand Canal, distance to the Yangtze River, distance to the provincial capital anddistance to Shanghai. Robust standard errors are used in all specifications. ∗ p < 0.10, ∗∗

p < 0.05, ∗∗∗ p < 0.01

14

among the natives in our regression, proxied by number of charitable organizations in 1850.9 To

overcome endogeneity, we use the number of charitable organizations before the mass migration.

In addition, initial population size can contain crucial information about the socioeconomic

traits of the natives. So we also include initial population size or initial population density in

our regression, and find that initial population size is positively correlated with the number of

public primary schools per 10,000 in the 1900s.

C War-Related Conditions

The Taiping Rebellion was undoubtedly a major blow to the region. Other than bringing

in migrants with cultural dissimilarity with natives, the rebellion likely had an independent

effect on schools by affecting infrastructure, income and the demographics. Below we carefully

evaluate the overall impact of war on primary schools, as well as the specific effects of the

Taiping Rebellion on the human capital stock and the demographic profile of a county. We

show coefficient estimates of MNCD partialling out the other effects of war, as well as coefficient

estimates of those variables without MNCD. In the next section, we propose an instrumental

variable strategy to establish the causality going from cultural distance to primary schools.

Battle Exposure As mentioned in Section II.A, war could have lingering consequences several

decades after. The scale of damage can be a function of battle exposure. We measure battle

exposure by the number of months a county was exposed to war. Overall, we find no evidence

that battle exposure had an impact on the number of public primary schools fifty years later

(Col.2 & 3).

Demographic Shock War and migration could have affected schools by altering the demo-

graphic structure. Population tend to rebound rapidly after major disturbances (Davis and

Weinstein, 2002), resulting in a youthful population. Migrants tend to be younger, male. If a

larger number of migrants flock into a county, who are systematically younger, from the supply

side, education can become more affordable in the short run, due to lower dependency ratio, but

from the demand side, there could be less demand for education if opportunities for unskilled

work are abundant. We include both the %elderly and youth and %adult men (aged 20 to 40)

9During the period we study, charitable organizations were mostly funded by local communities.

15

Tab

le5:

War

-Rel

ated

Con

dit

ion

s

Dep

end

ent

vari

ab

le:

#p

ub

lic

pri

mary

sch

ools

per

10,0

00

per

son

s(1

)(2

)(3

)(4

)(5

)(6

)(7

)(8

)(9

)

MN

CD

-0.1

80∗∗

∗-0

.181∗

∗∗-0

.200∗∗

∗-0

.182∗∗

∗-0

.205∗∗

∗-0

.264∗

∗∗

(0.0

51)

(0.0

52)

(0.0

67)

(0.0

53)

(0.0

67)

(0.0

77)

Bat

tle

exp

osu

re0.0

07

0.0

08

0.0

08

0.0

03

(0.0

12)

(0.0

11)

(0.0

12)

(0.0

11)

%el

der

lyan

dyo

uth

-0.0

18

-0.0

45

-0.0

46

-0.0

39

(0.0

34)

(0.0

39)

(0.0

37)

(0.0

33)

%ad

ult

men

-0.0

23

0.0

04

0.0

07

0.0

12

(0.0

26)

(0.0

28)

(0.0

31)

(0.0

27)

Hu

man

cap

ital

shock

-0.0

06

0.315

0.2

61

1.0

46

(1.2

50)

(1.2

09)

(1.3

50)

(1.3

65)

Init

ial

con

dit

ion

sN

NN

NN

NN

NY

Inve

nti

ons

ined

uca

tion

NN

NN

NN

NN

YB

asel

ine

contr

ols

YY

YY

YY

YY

YG

eogr

aph

icco

ntr

ols

YY

YY

YY

YY

Y

Ob

serv

atio

ns

5454

54

54

54

54

54

54

54

Ad

just

edR

20.

399

0.3

06

0.3

92

0.2

97

0.3

87

0.3

00

0.3

85

0.3

65

0.5

16

Not

es:

Th

eta

ble

rep

orts

rep

orts

OL

Sre

sult

sof

the

imp

act

ofw

aron

pu

bli

cp

rim

ary

sch

ool

s.T

he

un

itof

ob

serv

ati

on

isa

cou

nty

inR

epu

bli

can

Ch

ina.

Bat

tle

exp

osu

reis

mea

sure

dby

mon

th.

%el

der

lyan

dyou

thre

fers

toth

ose

un

der

20

or

over

40.

%ad

ult

men

refe

rsto

men

bet

wee

n20

and

40.

Hu

man

cap

ital

shock

isth

ed

iffer

ence

inhu

man

cap

ital

bet

wee

nn

ati

ves

an

dth

een

tire

pop

ula

tion

incl

ud

ing

nat

ives

and

mig

rants

.A

llsp

ecifi

cati

ons

incl

ud

eb

asel

ine

contr

ols

and

geog

rap

hic

con

rtro

ls.

Robu

stst

an

dard

erro

rsare

use

din

all

spec

ifica

tion

s.∗p<

0.1

0,∗∗

p<

0.05

,∗∗

∗p<

0.0

1

16

in Columns 4 and 5, and we do not find a statistically significant effect of either variable.

Human Capital Shock Human capital could be another channel through which the Taiping

Rebellion affected the long-run prospects of a county. Even though migrants and natives had

little contact before the settlement of migrants, some selection could be on the amount of human

capital migrants had. In the event that migrants simultaneously posed a cultural shock and a

human capital shock to the native population, the human capital shock may be a confounder.

We define a human capital shock as the difference between migrant and native human capital

(See Appendix H for variable construction details). When the migrant human capital stock is

higher than native human capital, we conclude there was a positive human capital shock. In

Columns 6 and 7, we control for the difference between migrant and native human capital. We

find no effect of differences in human capital on schools. However, once MNCD is added to

the regression, human capital shock attains a positive coefficient which is consistent with our

prior. The advantage of having migrants with high human capital on school formation is likely

partially offset by the cultural distance between migrants and natives.

D Interventions in Education

Missionary Activities Christian missionaries came to China in the 19th century to spread

Christianity. In the process, they also built a large number of schools. Compared to secondary

education, however, primary education was not affected to the same degree by local missionaries

(Bai and Kung, 2014). Demand for public or semi-public modern primary education, may still

have been affected by long-standing missionary activities. For example, if church-sponsored

secondary schools already existed in the area before the campaign for modern education, it

might increase demand for modern primary education through a complementarity mechanism.

Due to the earliest missionary expansion and the Taiping Rebellion being concurrent events, we

cannot rule out the possibility that a hostile native-migrant relationship could deter missionaries,

or that migrants are more likely to settle in areas with missionaries, for aid and help. We include

number of communicants per 10,000 as a rough proxy for missionary activities. As predicted,

the density of communicants has a positive effect on public primary schools, but the effect is

not statistically significant (Col. 2). The coefficient estimate of MNCD remains very similar to

baseline estimates.

Temple Conversion In the first few decades of the 20th century, government closed tens

of thousands of Buddhist or Taoist temples and took over temple assets to support modern

education. Two million Buddhist and Taoist temples are estimated to have closed in late Qing.

At the time, they owned about 16 million houses, 13,000 square kilometers of land and millions

teal of silver altogether (Xu, 2010). In Column 3, temple conversion is positively associated

with the density of public primary schools. The coefficient estimate of MNCD slightly declines,

suggesting that temple conversion might be a channel for MNCD to impede the establishment

of public primary schools.

17

Table 6: Interventions: Missionary Activities and Temple Conversion

Dependent variable: #public primary schools per 10,000 persons(1) (2) (3) (4) (5)

MNCD -0.180∗∗∗ -0.178∗∗∗ -0.168∗∗∗ -0.167∗∗∗ -0.220∗∗∗

(0.051) (0.047) (0.053) (0.053) (0.068)Missionary activities 0.148 0.134 0.107

(0.095) (0.085) (0.082)Temple conversion 0.183∗∗ 0.169∗ 0.173∗∗

(0.085) (0.085) (0.084)Initial conditions N N N N YBaseline controls Y Y Y Y YGeographic controls Y Y Y Y Y

Observations 54 54 54 54 54Adjusted R2 0.399 0.443 0.458 0.493 0.543

Notes: The table reports OLS results of the impact of MNCD on public primary schools,accounting for interventions in education. The unit of observation is a county in Republi-can China. “Missionary activities” is measured by the log of number of communicants plusone. “Temple conversation” is measured by the log of number of one plus temple-convertedschools. Baseline controls include logged primary enrollment rate in 1880, urbanizationrate and logged population. Geographic controls include ruggedness, share of arable land,agricultural suitability, distance to the Grand Canal, distance to the Yangtze River, dis-tance to the provincial capital and distance to Shanghai. Robust standard errors are usedin all specifications. ∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01

VI. INSTRUMENTAL VARIABLE STRATEGY

A Construction of the Instrument

We have provided evidence that “selective migration” was as unlikely as was self-sorting after

the arrival of the migrants. “Selective migration” refers to migrants and natives make decisions

on entry and acceptance based on their cultural proximity. Nevertheless, the observed migrant-

native cultural distance might still be correlated with unobserved characteristics of the native

or migrant population, or with unobserved characteristics of the county of destination or county

of origin. To sidestep omitted variable bias, we introduce an instrumental variable based on the

pre-existing cultural distance between the native population and the population just outside of

the Lower Yangzi (“nearby population”).10

We use the following equation to construct a surname-based measure of the pre-existing cultural

distance between the native population and the nearby population:

Pre-existing native-nearby culture distancei =1

normalized isonomynative,nearby,i

, (3)

10The nearby population refers to residents in Zhejiang and Jiangsu Province excluding the Lower Yangzi.

18

Tab

le7:

Inst

rum

enta

lV

aria

ble

An

alysi

s:P

re-e

xis

tin

gN

ativ

e-N

earb

yC

ult

ure

Dis

tan

ce

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

OL

SIV

OL

SIV

IVIV

IVIV

Sec

on

dS

tage

#p

ub

lic

pri

mary

sch

ools

per

10,0

00

per

son

s

MN

CD

-0.2

19∗∗

∗-0

.250∗

-0.2

71∗

∗∗-0

.384∗

∗-0

.354∗∗

-0.3

24∗∗

-0.3

23∗

∗-0

.312∗∗

(0.0

51)

(0.1

49)

(0.0

58)

(0.1

83)

(0.1

73)

(0.1

55)

(0.1

56)

(0.1

54)

Ad

just

edR

20.1

74

0.1

70

0.2

38

0.1

90

0.3

05

0.3

91

0.3

76

0.3

70

Fir

stS

tage

Dep

end

ent

vari

ab

le:

mig

rant-

nati

vecu

ltu

ral

dis

tan

ce

Nat

ive-

nea

rby

cult

ure

dis

tan

ce1.1

23

∗∗0.9

68∗∗

1.0

21∗

∗0.9

77∗

∗0.9

81

∗∗1.0

21∗

∗∗

(0.4

20)

(.4

77)

(0.4

19)

(0.3

88)

(0.3

99)

(0.3

75)

Fir

st-s

tage

FS

tati

stic

7.1

34.1

25.9

56.3

56.0

47.4

0

Hu

man

cap

ital

shock

NN

NN

NN

NY

Bat

tle

exp

osu

reN

NN

NN

NN

YF

ract

ion

aliz

atio

nN

NN

NN

NY

YIn

itia

lso

cial

cap

ital

NN

NN

NY

YY

Pop

.18

20N

NN

NN

YY

YG

eorg

rap

hic

contr

ols

NN

NN

YY

YY

Bas

elin

eco

ntr

ols

NN

YY

YY

YY

Ob

serv

atio

ns

54

54

54

54

54

54

54

54

Not

es:

Th

eta

ble

rep

orts

IVes

tim

ates

.T

he

inst

rum

ent

isn

ativ

e-n

earb

ycu

ltu

ral

dis

tan

ce.

Th

eu

nit

of

ob

serv

ati

on

isa

county

inR

epu

bli

can

Ch

ina.

“Fra

ctio

nal

izat

ion

”in

clu

des

bot

hw

ith

in-n

ativ

efr

acti

onal

izat

ion

and

over

all

fract

ion

aliza

tion

.F

irst

-sta

ge

Fst

atis

tic

isK

leib

erge

n-P

aap

rkW

ald

Fst

atis

tic.

Rob

ust

stan

dar

der

rors

are

use

din

all

spec

ifica

tion

s.∗p<

0.10,∗∗

p<

0.0

5,∗∗

∗

p<

0.0

1

19

where normalized isonomynative,nearby,i =∑S

k Pk,native,iPk,nearby√∑Sk P 2

k,native,i

∑Sk P 2

k,nearby

.S is the number of the same

surnames in the two groups. Pk,native,i and Pk,nearby are the relative frequencies of surname

k within natives and within the nearby population. For the nearby population, we use the

proportion of each surname to the population total in the 2005 Population Census to proxy for

Pk,nearby. The denominator measures how likely any individual randomly drawn from within

natives bears the same surname as one drawn from within the nearby population. The variable,

native-nearby cultural distance (“NNCD”), is intended to capture the cultural dissimilarity

between the native population and the nearby population.

The intuition behind the instrument is that natives are more likely to be very different from the

incoming migrants, if ex ante, natives are more different from all prospective migrants. I treat

the nearby population as the pool of prospective migrants. This is a reasonable assumption as

the vast majority of migrants in the Lower Yangzi after the war are found to have originated

from that area (Cao, 1998; Cao and Li, 2000; Ge, 2002a; Liu, 2012; Zhe and Zhe, 1896). The

cultural distance between natives and the pool of prospective migrants is independent of the

Taiping Rebellion.

B IV Results

Table 7 compares six IV specifications. We run our regressions with no controls in Columns 1

and 2. The first stage is slightly above 7. The IV estimate is -0.25, only slight larger than the

OLS estimate in Column 1 (-0.219) in terms of magnitude. We add baseline controls in Columns

3 and 4. In Column 5, we add geographic controls, resulting in an IV estimate of 0.354. We then

add initial conditions, fractionalization and war-related conditions sequentially from Column 6

to Column 8. IV estimates in those specifications are highly comparable ranging from -0.312 to

-0.354. Our first-stage F statistics are not particularly high, but this is likely due to the small

sample size. The first-stage F statistics do improve when MNCD outliers are excluded, possibly

due to improved linearity. We also show reduced-form estimates in Table A-2. Our instrument

is negatively correlated with the density of public primary schools in specifications. In Columns

1, 5 and 6, the coefficient estimates drops slightly below the conventional cutoff of statistical

significance, with a p-value of 0.15.

Across the columns, IV estimates are slightly larger than the OLS estimates for the same speci-

fications. We attribute the discrepancies partly to the measurement error in our migrant-native

cultural distance variable. As our surname sample is on the small side for some counties, we

may have a lot of sampling error in determining the relative frequency of a surname in the native

or overall population. In addition, when constructing our measure of MNCD, we assume people

with different surnames in the same location, had similar exposure to the war, and had similar

recovery growths in population after the war. While this assumption is unlikely systematically

violated, our measure can suffer measurement error and be noisy. In comparison, our instru-

ment uses a much larger surname sample from the 1% population census. We have reason to

believe that our instrument has less measurement error with regards to the relative frequency of

20

a surname in a population; hence, our IV estimates do not suffer the same attenuation bias as

our OLS estimates do. In addition, MNCD is partially determined by population losses during

the rebellion, whereas our instrument is not affected by differential population losses during

the rebellion. We control for battle exposure, but it is far from being the perfect indicator of

population losses.11 IV estimates can differ from OLS estimates as they are unaffected by the

magnitude of population losses.12

VII. MECHANISMS: THEORY AND SUGGESTIVE EVIDENCE

Our results suggest cultural differences between migrants and natives have a negative impact on

public goods provision. In this section, we provide suggestive evidence on possible mechanisms.

The Lower Yangzi, in spite of being more developed than the rest of the country, remained

highly agrarian at the time. The average urbanization rate was merely 12%, which means the

vast majority of people lived in the countryside. In the sixty counties we study (fifty-four in

the main sample), every county comprises hundreds of villages. Many natives in those villages

typically have lived there for hundreds of years. Close kinship ties ensured that levels of trust

between villagers were relatively high, and this allowed them to collaborate in the provision of

public goods. In some villages all the inhabitants were from the same linage, in which case,

most of them would share the surname; other villages had a few lineages. Many of the public

goods, such as security and education, were provided by and within the clan. But some of

those goods were provided by the gentry to the whole village in the form of private donations.

The long tradition of village life, which went back centuries, if not millennium, constituted a

deep institutional memory of how to deal with problems such as the provision of local public

goods. Both natives and migrants (back in their own hometowns) had adapted to this stable

and slow-moving environment. So when migrants first showed up in the villages of the natives,

they had to sort out an arrangement in which their coexistence with natives would be possible;

a few decades after, migrants and natives were confronted with a new task: to build modern

schools for their next generation.

Based on theoretical and empirical research, there are several ways migrant-native cleavages

can affect public goods provision and prevent them from building modern schools together:

first, differences in preferences between migrants and natives would mean a lower chance for

migrants and natives to reach a consensus on whether to provide a public good or the best way

to provide that good. Second, mutual dissatisfaction, which is often a function of the cultural

11Despite scholarly efforts to estimate population losses due to the rebellion, the actual damage remainsunknown (Cao and Li, 2000; Cao, 1998).

12Population losses could have an independent effect on primary schools, and MNCD could be picking up theeffects of population losses in addition to the effects of the cultural distance between migrants and natives. Also,although population losses were substantial for most counties in the Lower Yangzi, there were a few exceptions.As places with little to none exposure to the war are “never-takers”, the local average treatment effect is estimatedon counties with actual in-flow migration after the rebellion.

21

distance between migrants and natives, can block any bilateral collaboration and negotiation.

For example, when a village leader originates from the group of migrants, natives would decide

to actively oppose whatever policies he initiates. Esteban et al. (2012a) shows empirically that

ethnic polarization, which accounts for distances between groups, will influence conflict if the

prize is “public”.13 And a closely related scenario is that a leader from the side of natives (or

migrants) indeed looks out for the best interest of natives (or migrants), and at the expense of

the other. When a leader from the side of natives (or migrants) takes advantage of the other

side, often in the name of public interest, it is to be expected for the other side to oppose

expropriatory policies proposed by the leader.

We find qualitative evidence in line with the mechanisms laid out above. First, villagers found it

difficult to raise taxes or use clan or temple assets to establish modern schools, when they were

from different dialectal and cultural background, due to mutual misunderstanding or outright

disagreement between migrants and natives. Second, migrants opposed the policies put forward

by natives. Tian and Chen (2008) document cases where migrants into rural areas refused to pay

county taxes designated to finance upper-primary schools on the grounds that “the schools only

serve the rich in the cities and towns”. In this example, we see both migrants being victims to

expropriatory polices, and a strong identity held by migrants reflected in their statement. Third,

native-migrant conflicts had a direct impact on public goods provision by intervening with the

day-to-day operation of a village. Community leaders were reluctant to build public modern

schools in the midst of frequent native-migrant conflicts. lowered the ability of a community

to perform effective decision-making in the provision of public education. Below we present

quantitative evidence to corroborate our analysis.

A Evidence from Types of Schools

We exploit a feature of the school system in early 20th century China: the financing of lower-

primary and dual-primary schools differed from that of upper-primary and secondary schools.

Overall, the financing of different types of educational institutions after 1905 was parallel with

that of the traditional system. The role of central and provincial governments was limited

to financing universities, students studying abroad, and secondary schools (including teacher

training schools) at provincial capitals.14 County governments financed primarily upper-primary

schools, which were mostly located in cities and towns, with a combination of county tax receipts,

business tax surcharges, the reallocation of endowments from traditional schools, and private

contributions by local elites (Chaudhary et al., 2012). Village communities worked to finance

13Esteban et al. (2012b) develops a theory of conflict across groups allowing for “public” and “private” prizes.A leadership position is an example of a “public” prize. In our context, possible sources of dissatisfaction, oreven conflict, include a lack of trust in migrants or a loss of pride due to losing the election as natives. Thelack of trust can arise from the perception of the village leader being susceptible to predating on the interest ofnatives.

14The decentralization of fiscal authority after the Taiping Rebellion left the majority of tax revenues tocounty-level authorities. So the central state had very limited resources to finance primary schooling at the locallevel.

22

dual-and-lower-primary schools in rural villages. A frequent practice was for community leaders

to transform clan schools and other properties into modern dual-and-lower-primary schools,

and to cover the day-to-day expenses with local taxes and private donations. Those schools

received very little financial support from the county government and its affiliates.15 Figure 3

lists distribution of financial sources for different types of schools in Hangzhou County (Zhang,

2008). According to Figure 3, lower-and dual-primary schools had a higher share of their budget

made up by local funds.

Another institutional feature we exploit is that Chinese villages are highly autonomous and self-

governed. Constrained by state capacity, the reach of the central government tended to end at

the county level (Qu, 2003, pp.11-15, 179-201, 248-255).At the county level, again, each country

comprised of a large number of villages, making it difficult to project power into those villages.

The self governance of villages has a long tradition in China (Fei, 1939, 1992; Kung-Chuan,

1967; Shi et al., 1988).In an environment of self-governance, migrant-native cultural distance

might manifest itself most forcefully in the outcomes of collective decision-making processes, as

there will not be countervailing forces to their decisions.

With these two institutional features in mind, we can derive the following testable implication:

ceteris paribus, MNCD should have a greater impact on lower-primary and dual-primary schools

than on other types of schools. Fortunately, the educational census (1907-08) reports number

of schools and schools separately for five types of schools: secondary schools, upper-primary

schools, dual-primary schools and lower-primary schools and pre-schools. So we can empirically

test how MNCD affects the density of different types of schools.

We begin our analysis by running a simple regression of county government spending on density

of lower-primary and dual-primary schools. Panel A of Table 8 suggests that government funding

is strongly correlated with upper-primary and secondary schools rate (Col. 3, 4), but not with

lower-primary and dual-public primary schools (Col. 1, 2). These are highly consistent with

the information contained in Figure 3: the county government was not a major contributor to

the budget of the lower-primary and dual-primary schools, but provided funding to secondary

schools and upper-primary schools. It is not surprising that both upper-primary school and

secondary schools increase in county government spending on education, while there are no such

relationships for lower-primary or dual-primary schools.

When we replace county government spending with migrant-native cultural distance in Panel

B of Table 8, we find the type of schools with coefficients that are significant, flipped. We

find MNCD has a strong impact on schools at lower-primary and dual-primary schools, but

not on schools at upper-primary and secondary schools. This is consistent with our testable

implication that MNCD should matter the most for the setting of self-governance—given the

period we examine, both villages and townships were self-governed, and lower-primary and dual-

15Schools also charged tuition to supplement other sources of funding. Tuition accounted for 10%-20% ofschool budgets in 1917.

23

County

or

townshi

p raised

85%

Commu

nity or

industry

raised

13%

Tuition

and fees

2%

Secondary Schools

County

or

township

raised

86%

Commu

nity or

industry

raised

6%

Tuition

and fees

8%

Upper-Primary Schools

County

or

township

raised

42%

Commu

nity or

industry

raised

56%

Tuition

and fees

2%

Dual-Primary Schools County

or

township

raised

7%

Commu

nity or

industry

raised

86%

Tuition

and fees

7%

Lower-Primary Schools

Figure 3: The Makeup of the Budget of Different Types of Schools

primary schools were determined, established and financed locally. When migrants and natives

could not get along, they could face severe challenges in solving important and urgent issues

including establishing modern schools for their children.

B Evidence from Timing

Having showed the differential impact of MNCD on different types of schools in the 1900s, we

next explore the time-varying impact of MNCD as funding responsibilities over basic education

changed over time. This task is complicated by the fact that public primary schools did not

exist prior to 1905. So we instead look at access to basic education over time.

We exploit two structural breaks in funding responsibilities over basic education. The first

structural break came around in 1905. Prior to 1905, individual households or extended families

were responsible for basic education (as well as for advanced education). There is no evidence for

either village taxes, government aid, or publicly funded basic education. As basic education was

essentially a private good before 1905, exposure to basic education should not vary in MNCD.

The second structural break kicked in close to the end of the 1920s. From then on, the fiscal

authority over lower-primary and dual-primary schools migrated to upper-level governments. In

24

Table 8: Government Funding, Migrant-Native Cultural Distance and the Density ofSchools

Lower-primary Dual-primary Upper-primary Secondary

Panel A. Government Funding: By School Type

Government spending on education 0.067 0.252 0.327∗∗∗ 0.242∗∗∗

(0.979) (0.338) (0.073) (0.053)Baseline controls Yes Yes Yes Yes

Adjusted R2 -0.078 0.069 0.171 0.328

Panel B. Migrant-Native Cultural Distance: By School Type

MNCD -0.185∗∗∗ -0.052∗∗ -0.003 -0.003(0.042) (0.021) (0.005) (0.003)

Baseline controls Yes Yes Yes Yes


Notes: Panel A reports the impact of government spending on then density of differenttypes of schools (#schools per 10,000 persons). Panel B reports the impact of migrant-native cultural distance by school type. The unit of observation is a county in RepublicanChina. Robust standard errors are included in all specifications. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01

the meanwhile, upper-level governments become much more autocratic over time. The combined

effect of schools being funded by upper-level governments and upper governments becoming more

autocratic, should render local decision making increasingly less relevant, which should yield a

less economically significant coefficient of MNCD. These two structural breaks give rise to the

prediction that the impact of MNCD should be the most relevant between 1905 and the end of

1920s.

We test this prediction in a fully flexible difference in differences specification. Our variables of

interest are interaction terms between MNCD and time period dummies:

%access to basic educationi,t =∑t

βtMNCDi × Timeperiodt +∑t

γtXi × Timeperiodt

+Zi, tζ + ρt + ηi + εi, t ,

(4)

where the year 1960 is left out as the comparison group. The dependent variable is log share of

population receiving basic education in County i during time period t. Xiis a vector of other

time invariant controls and we allow these variables to have time-varying effects on access to

basic education. Zi,tζ is a vector of time variant controls.ρt denotes a full set of time fixed effect

and ηi denotes a full set of county fixed effects. εi,t is a disturbance term. Our time periods are

defined as follows: ∈ 1820, 1850, 1880, 1900, 1910, 1920, 1930, 1940, 1950.

Table 9 summarizes our results. Across the specifications, the only MNCD × Timeperiod

25

Table 9: The Dynamic Impact of of Migrant-Native Cultural Distance: Baselines

(1) (2) (3) (4)%access to basic education

MNCD×1820 -0.054∗∗ -0.025 -0.029 -0.021(0.025) (0.031) (0.030) (0.025)

MNCD×1850 -0.077∗∗∗ -0.048 -0.050∗ -0.041∗

(0.022) (0.030) (0.029) (0.024)MNCD×1880 -0.078∗∗∗ -0.049 -0.025 -0.031

(0.025) (0.035) (0.037) (0.031)MNCD×1900 -0.339∗∗∗ -0.309∗∗∗ -0.294∗∗∗ -0.266∗∗∗

(0.062) (0.072) (0.076) (0.067)MNCD×1910 -0.168∗∗∗ -0.138∗∗∗ -0.129∗∗∗ -0.087∗

(0.040) (0.045) (0.046) (0.045)MNCD×1920 -0.087∗∗ -0.058 -0.051 -0.044

(0.043) (0.036) (0.037) (0.027)MNCD×1930 -0.031 -0.002 0.001 0.010

(0.045) (0.039) (0.040) (0.028)MNCD×1940 -0.043∗ -0.014 -0.012 -0.006

(0.023) (0.017) (0.019) (0.014)MNCD×1950 -0.039∗ -0.009 -0.008 -0.002

(0.023) (0.011) (0.013) (0.010)Log population 0.145∗ 0.103

(0.084) (0.084)County FE N Y Y YTime period FE Y Y Y Y


Standard errors in parentheses∗ p < 0.10, ∗∗ p < 0.05, ∗∗∗ p < 0.01

Notes: The table reports the dynamic impact of migrant-native cultural distance before andafter the Taiping Rebellion.Column 1 only includes time period fixed effectds. Both timeperiod effeccts and county fixed are included in all other specifications. Column 3 controlsfor log population. Column 4 includes the six counties with imputed MNCD. Column 3 isour preferred specification. Standard errors are clusterered at the county level.

coefficients that are consistently significant are MNCD × 1900 and MNCD × 1910 . This

is highly consistent with our prediction that MNCD has the most impact on access to basic

education, during the period where basic education is a public good and fiscal authority is

decentralized. We begin our analysis by including only MNCD × Timeperiod, time period

FE. To make full use of the panel data, in Column 2, we add county FE to the regression.

We find MNCD × Timeperiod is negative but insignificant before 1900, and there is no pre-

trend. It is not the case that the counties became better or worse in providing basic education

over the time. Consistent with our prediction, MNCD did not matter for the period between

the end of the war (1864) and the abolition of the exam (1905), when basic education was

26

Figure 4: The Dynamic Impact of Migrant-Native Cultural Distance

largely financed privately within clans. For 1900-1920, MNCD has a negative effect on schools

of great economic magnitude, and coefficient estimates of interactions between MNCD and time

periods drop substantially after 1920, and are no longer significant.16 In Column 3, we add

a possibly endogenous variable, log population, to the specification, and we get fairly similar

estimates to Column 2.17 For the decade of 1900-1910, a one-standard-deviation increase in

MNCD is associated with 32.3% decrease (0.294*1.1) in access to basic education, relative to

the effect of MNCD on schools in 1960, our omitted category. This estimate is comparable to

our baseline OLS estimate in Column 4 of Table 1. In Column 4, we run the same regression

as in Column 3, but on a sample including counties with imputed MNCD. The estimates are

somewhat smaller, likely due to the attenuation bias posed by less well measured MNCD of the

newly added counties.

The differential impact of MNCD in Table 9 before and after two structure breaks, are most

similar to the approach featured in Dippel (2014). Dippel (2014) finds the impact of forced

coexistence on economic prosperity only emerged after 1990, when Indian reservations were

given substantially more autonomy. We find the impact of MNCD only began to surface after

the modern educational reform, when the responsibilities of individual households and clans

in providing basic education was partly taken over by villages, and only died out after 1920,

when village and small town autonomy began to be eroded by upper-governments. Evidence

from the timing of MNCD kicking in and dying out suggests that the primary mechanism runs

16accesstobasiceducation1910is the average of public primary schools recorded in the 1914, 1915 and 1916Census.

17We would like to include time-varying urbanization rates as well, just to be comparable to the specificationsin Table 1. Unfortunately, we only have urbanization for one period.

27

from migrant-native cultural distance through quality of collective decision-making to public

primary schools. In addition, the general consistency between our panel and OLS estimates

further enhances our confidence in the findings.

VIII. CONCLUSION

This paper uses quasi-exogenous migration to identify the impact of cultural proximity between

migrants and natives on public goods provision outcomes. Our key finding is that cultural

distance has an independent effect on public goods provision outcomes, conditional on the

history of shared governance.

In addition to our key finding, we also find partial evidence that a history of shared governance

mitigates the negative effects of the cultural distance. We find that the culture distance between

previously detached groups has a statistically significant effect on public goods provision, whereas

the polarization measure taking into account differences between surname groups (“surname

group distance”) without making any adjustment for their shared history, does not have the same

effect. A most likely explanation for this is that the culture distance between previously detached

groups (migrants and natives) does more damage to public goods provision than does the cultural

distance between groups with a history of shared governance. A history of shared governance

might have nurtured trust and increased the level of collaboration. This is consistent with

Dippel’s (2014) finding that a history of shared governance facilitates public goods provision.

Therefore, we believe our findings are consistent with both sides of the debate as to whether

ethnic cleavages or artificial jurisdictions has caused poor economic performance.

Our analysis sheds light on how migrant-native cultural distance, as a key aspect of population

heterogeneity, is related to the failure of educational reforms in the early 20th century. Our

finding that a greater cultural distance between migrants and natives is associated with a lower

density of public primary schools reveals the unique challenge faced by communities with many

migrants from afar face in their attempt to modernize their education. For those communities,

educational reforms proved to be incredibly challenging and frustrating, due to intense social

antagonism between two groups. The under provision of modern education could prevent have

prevented the accumulation of modern human capital and increased barriers to industrialization

for those communities.

A more general point is that cultural distance is associated with a higher cost in modernizing the

governance structure in China. Following China’s constitutional reforms, village communities

aimed to provide a wider range of public goods, as well as to formalize its governance structure

(Kuhn, 2002, Chapter 4). The failure to provide a key public good—public primary schools—

actually reveals a deeper issue with the process of modernizing the village governance structure.

The idea that low social capital can prevent effective self-governance in Chinese villages is

28

explored in Qian, Xu, Yao, et al. (2015).18 Formalization and democratization of village decision-

making following the constitutional reforms was hampered by strong social antagonism between

migrants and natives, the two groups with the most mistrust for each other. Social antagonism

between different groups in the village, created substantial barriers for post-1905 villages to

modern its political structure and to provide a wider range public goods than before. We argue

that cultural distance can affect the quality of village governance through social capital and

delay the emergence of full-fledged modern village governance.

Lacking the support of well-functioning local villages, Qing China made little meaningful progress

in its political reforms before its collapse in 1911. While both constitutional reforms and ”mod-

ern school movement” continued into Republican China, they still faced very similar constraints

that Qing China had during its last ten years. By 1930s, modern schools eventually attained

rapid growth but only at the expense of village self-governance and political decentralization

(Kuhn, 2002). The stagnant growth of public primary schools in a time of decentralization

parallels the experience nineteenth-century Prussia had with a decentralized education system

in a linguistically polarized society (Cinnirella and Schueler, 2016). The transition to autocratic

rule after 1930 seems to be consistent with Galor, Klemp, et al. (2015) which establishes the

point that population heterogeneity can provide a breeding ground for autocratic rule.

For those interested in the historical role of the gentry class, our paper also sheds some light

on the role of gentry men in modernizing village governance in a discussion of the mechanisms

linking MNCD to poor public goods provision outcomes. Gentry men were elite members of their

village. Many of them were enthusiastic about making contributions to their village. Schools

before 1905 relied heavily on their private donations, and remained so to a certain degree even

after 1905. However, our findings most likely indicate that gentry men were not best suited to

fulfill leadership roles in a modern context—gentry men, because they were viewed as closely

associated with their clan, encountered criticism about their impartiality; these tensions helped

fuel political conflict as leadership positions were seen as “public” prizes by a heterogeneous

population.

18 Qian, Xu, Yao, et al. (2015) find the presence of village temples, their proxy for social capital, enhancespublic goods provision through the channel of elections.

29

References

Ager, P. and M. Bruckner (2013). Cultural diversity and economic growth: Evidence from the usduring the age of mass migration. European Economic Review 64, 76–97.

Aghion, P., A. Alesina, and F. Trebbi (2005). Choosing electoral rules: theory and evidence from uscities. Technical report, National Bureau of Economic Research.

Alesina, A., R. Baqir, and W. Easterly (1999). Public goods and ethnic divisions. The QuarterlyJournal of Economics 114 (4), 1243–1284.

Alesina, A. and E. L. Ferrara (2005). Ethnic diversity and economic performance. Journal of EconomicLiterature 43 (3), 762–800.

Altonji, J. G., T. E. Elder, and C. R. Taber (2005). Selection on observed and unobserved variables:Assessing the effectiveness of catholic schools. Journal of Political Economy 113 (1), 151–184.

Ashraf, Q. and O. Galor (2013). The’out of Africa’hypothesis, human genetic diversity, and comparativeeconomic development. The American Economic Review 103 (1).

Bai, Y. and J. K.-s. Kung (2011). Genetic distance and income difference: Evidence from changes inchina’s cross-strait relations. Economics Letters 110 (3), 255–258.

Bai, Y. and J. K.-s. Kung (2014). Diffusing knowledge while spreading god’s message: Protestantismand economic prosperity in china, 1840–1920. Journal of the European Economic Association.

Bai, Y. and J. K.-s. Kung (2015). Does genetic distance have a barrier effect on technology diffusion?evidence from historical china.

Bazzi, S., M. Gudgeon, et al. (2015). Local government proliferation, diversity, and conflict. Technicalreport, Boston University-Department of Economics.

Bossert, W., C. D’AMBROSIO, and E. La Ferrara (2011). A generalized index of fractionalization.Economica 78 (312), 723–750.

Bossert, W., P. K. Pattanaik, and Y. Xu (2003). Similarity of options and the measurement of diversity.Journal of Theoretical Politics 15 (4), 405–421.

Buck, J. L. and C. Press (1937). Land Utilization in China: a study of 16,786 farms in 168 localities,and 38,256 farm families in twenty-two provinces in China, 1929-1933, Volume 2. University ofChicago Press.

Cao, S. (1998). The impact of the taiping rebellion on population in south jiangsu. History Research 2,64. .

Cao, S. and Y. Chen (2002). Malthusian theory and chinese population since qing: comments on recentstudies by american scholars. History Study (lishi yanjiu) 1, 41–54.

Cao, S. and Y. Li (2000). The impact of the taiping rebellion on population in zhejiang. Journal ofFudan University: Social Sciences 5, 005. .

Cao, z. (2006). On the rise and fall of linguistic enclaves: the case of wu-hui mandarin dialect. Linguisticstudy (Yuyan yanjiu).

Caselli, F. and W. J. Coleman (2013). On the theory of ethnic conflict. Journal of the EuropeanEconomic Association 11 (s1), 161–192.

Chaudhary, L., A. Musacchio, S. Nafziger, and S. Yan (2012). Big brics, weak foundations: Thebeginning of public elementary education in brazil, russia, india, and china. Explorations in EconomicHistory 49 (2), 221–240.

CHGIS, H. Y. I. (2007). CHGIS.Cinnirella, F. and R. M. Schueler (2016). Dp11274 the cost of decentralization: Linguistic polarization

and the provision of education.Crayen, D. and J. Baten (2010). Global trends in numeracy 1820–1949 and its implications for long-

term growth. Explorations in Economic History 47 (1), 82–99.Davis, D. R. and D. E. Weinstein (2002). Bones, bombs, and break points: the geography of economic

activity. The American Economic Review 92 (5), 1269–1289.Desmet, K., M. Le Breton, I. Ortuno-Ortın, and S. Weber (2011). The stability and breakup of nations:

a quantitative analysis. Journal of Economic Growth 16 (3), 183–213.

30

Dippel, C. (2014). Forced coexistence and economic development: evidence from native americanreservations. Econometrica: Journal of the Econometric Society 82 (6), 2131–2165.

Du, R., C. Xiao, and L. Cavalli-Sforza (1997). Genetic distances between chinese populations calculatedon gene frequencies of 38 loci. Science in China Series C: Life Sciences 40 (6), 613–621.

Duclos, J.-Y., J. Esteban, and D. Ray (2004). Polarization: concepts, measurement, estimation. Econo-metrica, 1737–1772.

Easterly, W. and R. Levine (1997, November). Africa’s growth tragedy: Policies and ethnic divisions.The Quarterly Journal of Economics 112 (4), 1203–50.

Esteban, J., L. Mayoral, and D. Ray (2012a). Ethnicity and conflict: An empirical study. The AmericanEconomic Review , 1310–1342.

Esteban, J., L. Mayoral, and D. Ray (2012b). Ethnicity and conflict: Theory and facts. sci-ence 336 (6083), 858–865.

Esteban, J. and D. Ray (2011). Linking conflict to inequality and polarization. The American EconomicReview 101 (4), 1345–1374.

Esteban, J.-M. and D. Ray (1994). On the measurement of polarization. Econometrica: Journal of theEconometric Society , 819–851.

Fei, H.-T. (1939). Peasant life in China. 19, 39.Fei, X. (1992). From the Soil, the Foundations of Chinese Society: A Translation of Fei Xiaotong’s

Xiangtu Zhongguo, with an Introduction and Epilogue. University of California Press.Galor, O., M. Klemp, et al. (2015). Roots of autocracy. Technical report.Ge, J. and S. Cao (2001). Zhongguo renkoushi qingdaijuan (History of China’s Population: Qing

Dynasty). Fudan University Press.Ge, q. (2002a). jindai suzhewan jiaojie diqu renkou qianxi yanjiu (Migrants from Hunan and Hubei in

the intersection of Jiangsu, Zhejiang and Anhui). Shanghai: Shanghai Academy of Social SciencePress.

Ge, Q. (2002b). Minguo shiqi zhejiang huzheng yu renkou diaocha (Household and Population Censusin Zhejiang During the Republican Period ). Chinese Academy of Social Sciences press. .

Gennaioli, N. and I. Rainer (2007). The modern impact of precolonial centralization in africa. Journalof Economic Growth 12 (3), 185–234.

Guo, y. (1989). Historical geography of the taiping rebellion (taiping tianguo lishi dituji.Huang, X. (2004). An study on the mandarin dialect enclaves of Anji County, Zhejiang province

(Zhejiang anjixian fangyandao yanjiu). Dissertation, Beijing Language University.Junya, M. and T. Wright (2010). Industrialisation and handicraft cloth: The jiangsu peasant economy

in the late nineteenth and early twentieth centuries. Modern Asian Studies 44 (06), 1337–1372.Kuhn, P. A. (2002). Origins of the modern Chinese state. Stanford University Press.Kung-Chuan, H. (1967). Rural China: Imperial control in the nineteenth century. University of Wash-

ington Press.Leung, A. K. C. (1994). Elementary education in the lower yangtze region in the seventeenth and

eighteenth centuries. Education and Society in Late Imperial China, 1600-1900 , 381–416.Li, N. (2011). Essays on cultural diffusion, migration, and human capital: investigation from china’s

historical experience.Li, W. (1981). The change in land ownership in the area occupied by the taiping rebels in jiangsu,

zhejiang and anhui in late qing. History Study (lishi yanjiu) 6, 81–96.Li, W. (2002). Restoration of temples and religious networks in rural lower yangzi after the taiping

rebellion. Zhejiang Chronicles (Zhejiang Fangzhi) 1, 62–69.Liang, q. (2001). Charity and Edification: Charitable Organizations in Ming and Qing (Cishan yu

jiaohua: mingqing de cishan zuzhi). Hebei Education Press.Lin, C. and N. Li (2014). The impacts of taiping rebellion on long-term economic development. Journal

of Translation from Foreign Literature of Economics (jinji ziliao yicong) 2.Liu, Y. (2012). Conflict and assimulation in Jiangnan during Late Qing and Early Republic China—

Jurong in Jiangsu. History (Shi lin) 2, 016.Mann, S. (1994). The education of daughters in the mid-ch’ing period. Education and Society in Late

31

Imperial China, 1600–1900.Michalopoulos, S. and E. Papaioannou (2013). Pre-colonial ethnic institutions and contemporary

african development. Econometrica 81 (1), 113–152.Mickey, G. A. and T. Shiroyama (2009). China during the great depression: Market, state, and the

world economy, 1929-1937.Oster, E. (2014). Unobservable selection and coefficient stability: Theory and evidence. University of

Chicago Booth School of Business Working Paper .Ouyang, N. and W. Zhang (2010). Spatial-temporal analysis of temple-converted schools in Jiangnan

in Late Qing and Republican China. Historical Geography (Lishi dili), 011.Qian, N., Y. Xu, Y. Yao, et al. (2015). Making democracy work: Culture, social capital and elections

in China. Technical report, National Bureau of Economic Research.Qu, t. (2003). Local Government in Qing (Qingdai de difang zhengfu). Ph. D. thesis, Beijing: Law

Press.Rawski, E. S. (1979). Education and popular literacy in Ch’ing China. University of Michigan Press

Ann Arbor, MI.Reynal-Querol, M. (2002). Ethnicity, political systems, and civil wars. Journal of Conflict Resolu-

tion 46 (1), 29–54.Reynal-Querol, M. and J. G. Montalvo (2005). Ethnic polarization, potential conflict and civil war.

American Economic Review 95 (3), 796–816.Shi, J., H. Wu, and X. Fei (1988). Emperor and gentry (huangquan he shenquan).Simmons, R. V., R. Shi, and Q. Gu (2006). An historical geographic study on the division line

of Jianghuai-mandarin dialect and Wu-mandarin dialect. (Wu-huai guanhua yu wuyu bianjie defangyan dilixue yanjiu). Shanghai Education Press.

Spolaore, E. and R. Wacziarg (2009). The diffusion of development. Quarterly Journal of Eco-nomics 124 (2).

Tian, Z. and S. Chen (2008). Taxation for education and conflicts arising from provision of villageeducation. Journal of Zhejiang University: Humanities 38 (3), 142–149. .

Xu, X. (2010). Temple property in the Late Qing Dynasty and the Republic of China (1895-1916).Technical report.

Xu, Y. and Y. Yao (2015). Informal institutions, collective action, and public investment in rural china.American Political Science Review 109 (02), 371–391.

Yin, M. and Q. Tian (2009). Minguo renkou huji shiliao huibian (Compilation of Historical PopulationStatistics During the Republican Period). National Library Press. .

Yuchtman, N. (2015). Teaching to the tests: An economic analysis of traditional and modern educationin late imperial and republican china. Unpublished manuscript 27.

Zhang, x. (2008). Qingdai jiangnan gonggong jiaoyu ziyuan choucuo peizhi de lishi dili xue fenxi (AHistorical Geographical Analysis of Public Education Resource Allocation in Jiangnan from 1644 to1911). Ph. D. thesis, Fudan University. (1644-1911 ).

Zhe, c. and c. Zhe (1896). Draft Gazetteer of Yuhang County (Guangxu yuhangxian zhigao(fu buyi)).

32

Appendix

A Additional Tables

Table A-1: Summary Statistics

Variable Mean Std. Dev. Observations Source

#public primary schools per 10,000 persons 0.937 0.518 60 b

Measures of population heterogeneity:Migrant-native cultural distance 2.12 1.024 54 aOverall diversity 35.357 5.984 54 aWithin-native diversity 30.166 6.311 54 aOverall polarization 1.297 0.729 54 aWithin-native polarization 0.85 0.293 54 a

Instrumental variables:Native-nearby cultural distance 1.578 0.421 54 a

Baseline controls:Log(%access to basic education), 1880 3.136 0.293 60 bUrbanization rate, 1917 12.614 16.093 60 dLog population 12.456 0.869 60 b

Geographic controls:Ruggedness 1.857 2.532 60 g% arable land 0.535 0.242 60 eAgricultural suitability 2.851 2.054 60 gDist. to Grand Canal 0.27 0.26 60 gDist. to provincial capital 137.617 95.719 60 gDist. to Shanghai 171.15 87.681 60 gDist. to Yangzi 1.158 0.694 60 g

Initial conditions:#initial charities 2.583 4.928 60 hLog population, 1820 12.978 0.739 60 bLog population density, 1820 6.095 0.655 60 b

Impact of war:Battle exposure (months) 31.167 16.955 60 i% adult men 20.108 2.090 60 c% elderly and youth 16.527 1.661 60 cHuman capital shock -0.022 0.053 54 b

Interventions in education:Log (#communicants per 10,000+1) 2.921 1.152 60 fLog (#temple-converted schools+1) 1.158 0.930 60 j

33

Variable Mean Std. Dev. Observations Source

Mechanisms:Government spending on education 0.014 0.036 60 b#lower-primary schools per 10,000 0.589 0.4 60 b#dual-primary schools per 10,000 0.179 0.169 60 b#upper-primary schools per 10,000 0.063 0.055 60 b#secondary schools per 10,000 0.01 0.02 60 b

For panel estimates:Log(%access to basic education) 2.971 1.105 600 bLog population 12.707 0.859 600 b

a. Surname data sources (see Appendix B)b. Primary schools data sources (see Appendix C)b. Ge and Cao (2001)c. Yin and Tian (2009, Volumn 4), Compilation of Historical Population Statistics Duringthe Republican Period; Ge (2002b), Household and Population Census in Zhejiang Duringthe Republican Period.d. Yin and Tian (2009, Volumn 1), Compilation of Historical Population Statistics Duringthe Republican Period.e. Buck and Press (1937), Land Utilization in China: a study of 16,786 farms in 168localities, and 38,256 farm families in twenty-two provinces in China, 1929-1933.f. Local gazetteers on religious facilities.g. CHGIS (2007)h. Liang (2001)i. Guo (1989)j. Ouyang and Zhang (2010)

Table A-2: Instrumental Variable Analysis: Reduced-Form Estimates

(1) (2) (3) (4) (5) (6)Dependent variable: #public primary schools per 10,000 persons

Native-nearby culture distance -0.280 -0.371∗ -0.362∗ -0.367∗ -0.317 -0.319(0.194) (0.191) (0.182) (0.184) (0.216) (0.212)

Human capital shock N N N N N YBattle exposure N N N N N YFractionalization N N N N Y YInitial social capital N N N Y Y YPop. 1820 N N N Y Y YGeorgraphic controls N N Y Y Y YBaseline controls N Y Y Y Y Y

Observations 54 54 54 54 54 54Adjusted R2 0.034 0.047 0.381 0.367 0.348 0.331

Notes: The table reports reduced-form estimates of native-nearby culture distance. Theunit of observation is a county in Republican China. The specifications correspond to thosein Table 7. Robust standard errors are used in all specifications. ∗ p < 0.10, ∗∗ p < 0.05,∗∗∗ p < 0.01

34

B Data Sources for Surnames

An Overview of Surname Data Sources

Pre-Taiping period Post-Taiping period

Deaths during the Taiping Rebelliona

Exam degree holders 1645-1850b

Diseased soldiers 1927-52c

College students 1906-1949d

31126

13340

39419

15584

Sources:

a: Traditional county gazetteers (pre-1949): chapter for deaths of high moral worth. 旧志-咸同忠烈

姓名录

b: Traditional county gazetteers (pre-1949): chapter for exam degree holders. 旧志-选举志（by year

of degree）

c: Modern county gazetteers(post-1949): chapter for deceased soldiers. 新志-抗战英烈和革命英烈

姓名录 (by year of death)

d: College yearbooks during the Republican era (by year of admission). See Hao and Clark (2014) for more details.

35

Fig

ure

A-1

:D

ata

Sou

rces

for

Su

rnam

es

36

C Data Sources for Basic Education

Year Region Sources

1820-1900 The Lower Yangzi Same as surname data sources. See “Some notes on surname data”.

The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Ministry of Education (教育部)

1907 Nation 光绪三十三年第一次教育统计图表

1908 Nation 光绪三十四年第二次教育统计图表

1909 Nation 宣统元年第一次教育统计图表

1915 Nation 中华民国第三次教育统计图表

1916 Nation 中华民国第四次教育统计图表

1917 Nation 中华民国第五次教育统计图表

1923

Jiangsu

江苏政治年鉴

The Yearbook of Politics: Jiangsu Province (1923)

The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Department of Education (教育

厅), Zhejiang Province (1925-1929)

1925 Zhejiang 中华民国十四年度浙江省教育统计图表

1927 Zhejiang 中华民国十六年度浙江省教育统计图表

1929 Zhejiang 中华民国十八年度浙江省教育统计图表

1929 Jiangsu 江苏教育概览 The Overview of education: Jiangsu province (1929)

The “Statistical Chart of Education” (Jiaoyu Tongji Tubiao) by Department of Education (教育

厅), Jiangsu Province (1932-1937)

1932 Jiangsu 民国二十一年度江苏省教育经费统计图表

1933 Jiangsu 民国二十二年度江苏省教育经费统计图表

1935-37 Jiangsu 江苏省教育统计图表

1935-37

Zhejiang

浙江省三年来教育概况

Education for the Last Three Years: Zhejiang province

1947 Jiangsu 江苏省 36 年教育统计

1947 Zhejiang

浙江经济年鉴

The Yearbook of Economy: Jiangsu Province (1923)

1950-60

Jiangsu

江苏五十年(1949-1999)

The fifty-year statistics of Jiangsu (1949-1999)

1950-60

Zhejiang

新浙江五十年统计资料汇编(1949-1999)

The fifty-year statistics of Zhejiang (1949-1999)

Figure A-2: Data Sources for public primary schools (1820-1960)

37

D The Traditional Education System

Prior to 1905, the primary education system was based upon Confucian classics and aimed at

success in the imperial examinations. At national, provincial and county levels, highly competi-

tive exams selected a few degree holders, and brought the individual, his clans, and communities

“prestige, power, and wealth through government service (Rawski, 1979, p.21). Within a com-

munity or a clan, an individual’s literacy helped determine his social status, occupation and

wealth. The return to passing exams generated considerable demand for privately or publicly

provided traditional schooling throughout the country. On the other hand, due to commercial-

ization in this area since the 16th century there was a growing demand for education, especially

basic education. The educational system consisted of two phases: mass primary education

teaching basic literacy and select students of talent, and the more advanced education drilling

candidates to prepare for exams (Leung, 1994). Those who reached the threshold level in literacy

to attend the exam (4,000 characters) accounted for only 1%-2% of the male population(Rawski,

1979, p.96). A much higher percentage of male population, roughly 30% to 40%, acquired ba-

sic literacy (numeracy and about 1000 characters) Crayen and Baten (2010). Female literacy

(2%-5%) was confined to those from elite families (Mann, 1994; Rawski, 1979).

Both basic and advanced education were mainly provided privately. Children in elite house-

holds were instructed by family members to begin with (age 3-6), and by hired private tutors

between age 6 and 15 (typically exam degree holders who had not been to a government po-

sition). Households of modest backgrounds pooled their resources and paid local teachers for

instructions. There is no evidence that a village tax was collected, nor any aid from government

was received to finance those educational instructions. Parents paid the teacher in either money

or commodities. Each tutor taught 1-30 students in the tutor’s own house or the village temple.

Hours and schedules were flexible, adjusted to weather and season. The cost varied with the

quality of instructions, and degree holders could demand a higher salary as tutors. At the age

of nine or ten, decisions were made based on the observed talent of children. Families would

support the talented ones to pursue exam careers, while they prepare others for various lesser

occupations.

Boys from families too poor to pay for schooling were not necessarily barred from the classroom.

Clan schools (zu xue) were often established primarily to aid such students (Rawski, 1979, pp.

30-32). These schools were financed by contributions, a clan tax, and clan land (xue tian) with

the rent going to special funds for education and exam preparation. Clan temples (zong miao)

often served as classrooms. For the talented students, their future study and exam taking were

fully covered by land funds. Most clan schools limited admission to kin members but exceptions

were made for talented non-kin members within the community . Overall, in the lower Yangzi

about 5-8% of males were supported by their kinships for their education.

The central and local government in late imperial China took a hands-off approach to financing

education. Local magistrates advocated setting up primary schools for the poor but there

38

was no record of direct funding from the government (Rawski, 1979, pp. 38-40). The education

establishments with direct government funding, called ”government-based county and prefecture

schools”, were for no more than a few hundred lower degree holders to accomplish higher degrees

(an allowance was provided to cover basic living costs and travel costs). There were a few

exceptions to the rule: in frontier regions populated by non-Han minorities, the government

funded the establishment and maintenance of schools; in areas with recent exposure to war and

famine, the government often helped to establish schools as a means to restoring the Confucian

order (Rawski, 1979, p.89). For instance, after the Taiping rebellion, the magistrate of the

county of Wujiang set up sixteen charity schools with the county budget, but those schools

admitted no more than 500 students county-wide, which accounted for only 1% of school-aged

children. Even so, these schools were closed after a decade because of ”a tight budget” and

the restoration of ”private and clan schools to a pre-war level.In most cases, such government-

initiated charity schools only admitted 0.5%-1% of the school-aged children, or 2%-4% of the

entire educated population.

E The Modern Education System

Content Pedagogy

Early childhood(4-7)

Basic Chinese charac-ters

Memorizing

TraditionalTutored at home byfamily members

Primary (7-12) Poetry and Confuciusclassics

Memorizing; Tutoredat home or at sishu byprivate tutors

Juvenile (13-17)and youth (18-25)

Preparing for exams(writing eight-leggedessays based on Con-fucius classics)

Memorizing andwriting; Studied athome or in privateacademies

Modern

Early childhood(4-7)

Basic Chinese charac-ters

Tutored at pre-schools

Primary (7-12) Chinese, art, mathe-matics

Studied in primaryschools

Juvenile (13-18) Chinese, art, math-ematics, western sci-ence

Studied in high pri-mary schools and sec-ondary schools

Youth (19-25) Art, western sciences,foreign languages, lawand engineering

Studied in universities

Table A-3: Traditional versus Modern Education. Sources: Yuchtman (2015)

39

Example county: County of Wu （吴县）

Upper-Primary Schools

Secondary Schools

City of Suzhou (苏州）

Small

town

Village

Lower-primary

schools

and dual-primary

schools

District-level

agency 区劝学员

Bureau of Education

县劝学所，教育局

Figure A-3: Types of Schools and The Fiscal Authority in Charge

40

F The Taiping Rebellion

Figure A-4: Battle Exposure

G Measures of Population Heterogeneity

Below we construct measures of population heterogeneity as alternative explanatory variables,

mainly, the fractionalization index and the polarization index. Those indices do not contain

information on migrant-native status. We compare MNCD to measures of population hetero-

geneity to put the explanatory power of the cultural distance between migrants and natives in

perspective.

G.1 Fractionalization

Akin to the fractionalization measure in (Alesina et al., 1999), we use the inverse of isonomy

to define fractionalization. Rather than ethnic groups, our measure relies on surnames (k). We

measure the probability of randomly drawn two individuals sharing the same surname for a

population at a given time:

I =S∑k

P 2k,i , (5)

41

where S is the number of surnames and Pk is the relative frequency of surname k, which is the

proportion of the population with surname k to the population. This is a Herfindahl-Hirschman

index of surname distribution. And,

FRAC =1

I. (6)

A higher FRAC corresponds with a lower likelihood that any two individuals randomly drawn

from a population bear the same surname (which indicate closeness in dialect, culture and even

genetics). This fractionalization index treats all groups symmetrically.

MNCD can change without increasing fractionalization. An example is that when people car-

rying a certain surname are wiped out by the rebellion, and replaced by migrants carrying a

different surname, the overall fractionalization of a population remains unchanged, but MNCD

can still change.

G.2 Polarization

Our first polarization index is taken from Reynal-Querol (2002) and Reynal-Querol and Montalvo

(2005):

POL =S∑

i=1

(12− Pk

12

)2Pk (7)

where S is the number of surnames and Pk is the relative frequency of surname k, which is

the proportion of the population with surname k to the population. This measure employs a

weighted sum of population shares and assumes the any two groups are either completely similar

or completely dissimilar.

We then rely on Duclos, Esteban, and Ray (2004); Esteban and Ray (2011); Esteban, Mayoral,

and Ray (2012a) to build a second measure of polarization:

POL DIST =S∑

i=1

S∑j=1

P 2i Pjdij . (8)

The group i here is defined as a surname group. The population share of each surname group is

Pi, the intergroup distance dij. Our measure of intergroup distance is the difference in human

capital between groups. This is not the perfect measure, but this is the best measure we can find

given our data. A similar measure to ours is the difference in incomes between groups (Esteban

and Ray, 1994) and (Aghion et al., 2005). For that reason, our polarization measure should be

interpreted with caution.

We report correlations between MNCD and fractionalization and MNCD and polarization (Table

A-4). We find MNCD is positively correlated with overall fractionalization and polarization, but

42

Table A-4: Correlations Between MNCD, Fractionalization and Polarization

MNCD Overall FRAC Overall POL Overall POL DIST

MNCD 1Overall FRAC 0.203 1Overall POL -0.229 -0.786∗∗∗ 1Overall POL DIST 0.151 -0.506∗∗∗ 0.434∗∗ 1∗ p < 0.1, ∗∗ p < 0.05, ∗∗∗ p < 0.01

not with within-native fractionalization and polarization. This is consistent with the nature of

MNCD: it captures the relationship between migrants and natives, not the relationship between

natives. The high correlations between MNCD and overall fractionalization and polarization

suggest that MNCD captures crucial information in overall population heterogeneity.

H Human Capital Shock

Migrants do not always have the same level of human capital as natives. When their levels of

human capital differ, this measure captures the resulting “shock” to the stock of local human

capital:

Human capital shock =∑k

(Pk,MN − Pk,N)RRk (9)

where RRk is a surname-specific variable that denotes the representation of surname k among

the educated elites relative to its population, i.e. RRk is the ratio of the share of k in education

elites to the share of k in the total population. Supposing that natives have two surnames:

Wang and Li. Each surname accounts for 50% of the population. Li’s are over represented

among education elites by a factor of 2, in other words, Li’s are two times more likely to show

up among national elites than in an average population, whereas Wang’s are just as likely to be

educated as is a member of the average population. For illustrative purposes, take the extreme

case where all Li’s die in the war. A family of Zhangs came to the village, with a RR of 0.5.

The human capital shock is (50%-50%)*1+(0%-0.5%)*2+(50%-0%)*0.5=-0.5.

We acknowledge that our measure of human capital shock can have measurement errors due to

migrant selectivity—migrants do not have to have the same capital level as the overall population

with their surnames. When skilled migrants migrate to more industrialized areas, our measure

can underestimate the level of their human capital. However, we do not see how this can bias our

results aside from introducing an attenuation bias. Also, this sort of measurement error is going

to be minor for most counties in our sample, with the exception of industrial Shanghai. Figure

A-5 shows geographic pattern of human capital shocks. The human capital shock appears to be

the most positive in the east, consistent with the fact that Shanghai often attracted migrants

with higher human capital relative to its native population.

43

Figure A-5: Human Capital Shock

I Evidence from Timing

I.1 Pre-1900 Access to Basic Education

We construct our time-varying access to basic education variable using two methods: for 1900-

1960, we obtain primary school enrollment rates from national and provincial censuses on modern

education. For 1820-1900, there were no primary schools in the modern sense, but a percentage

of people did acquire basic education as explained in Appendix D, but data on this are generally

not available. Below we explain our method in estimating pre-1900 rates of access to basic

education and how it affects the validity of our panel analysis.

We estimate pre-1900 %access to basic education from time-varying pre-1900 rates of college-

equivalent education (juren), college schools rate in 1947, and literary rate in 1947. While the

content of provincial-exams was vastly different from modern university education, successful

candidates of provincial exams were similarly at the top of the distribution of the educated.

This yields a method to construct a rough estimate of the share of population with a basic

education prior to 1900. We employ the following parallel projection to estimate time-varying

rates of receiving basic education:

%basic education accessi,1947 = a+ b ∗ college student per 10,000i, 1947 + µit . (10)

Second, using the obtained a and b (1.72 and 13.56 respectively), we project %access to basic

44

education in the exam era from juren per 10,000,

%basic education accessi,t = a+ b ∗ ft ∗ juren per 10000i,t , (11)

where ft is a time-varying ratio that transforms juren per 10,000 into college students in 1947.

For example, in 1820 and 1850, the proportion of juren is 0.03% of male population in Lower

Yangzi and in 1880 the proportion of juren is 0.06% (juren quota changed little but population

halved after 1860), whereas college students account for 0.3% of total population in 1947. So

the ft=10 for 1820 and 1850, and ft=5 for 1880.

We should point out that this method does assume a fixed ratio between advanced and basic

education in 1947 and in the 19th century. We want to point out that in our panel, our results

are unlikely affected by differential advanced-to-basic education ratios in 1947 and in the 19th

century. The difference between two ratios will be captured by our county fixed effects.

It should be noted that official 1900-1950 schools rates only comprise schools at modern schools

that provided western-style education and was open to the public. Comparing the primary

schools rate in the decade of 1900, and the rate of basic education in the period just before that,

we can estimate the differential response to modern basic education in relation to migrant-native

cultural distance, holding constant a county’s past coverage of basic education, and factors

common to all counties in modernizing their education. Due to data limitations, we cannot

track the evolution of traditional schools during the same period.19 Therefore, we do not know

whether the gap left by the differential response to modern education was fully compensated by

additional provision of traditional education.

19In 1935, and in 1949, both Zhejiang and Jiangsu collected data on and facilitated the registration of tra-ditional schools. Unfortunately, data quality was low and varied greatly from county to county. By 1953, alltraditional schools had been converted to formal public schools. Our basic education measure captures theamount of county- and community-level public and semi-public education. Schools initiated by individuals andclans, but did provide education to the wider population, are not excluded from the measure.

45

friends from afar: the taiping rebellion, cultural ...this paper tests the hypothesis that the...

Documents