
Post on 22-Dec-2015


Page 1: IPUMS-EurAsia, 2009-2014: Changing Patterns of Microdata Use  * * * Robert McCaa, Professor of Population History University

IPUMS-EurAsia, 2009-2014: Changing Patterns of Microdata Use

www.ipums.org/international

Robert McCaa, Professor of Population History
University of Minnesota

[email protected]
For additional details, please see: www.hist.umn.edu/~rmccaa

Page 2:

IPUMS-EurAsia in global context

[Map, Mollweide projection. Legend: Microdata: integrated into IPUMS; entrusted to IPUMS; none entrusted; none inventoried]

dark green = integrated 2002-2009 (44 countries, 130 censuses, 279 million person records)
green = to be integrated (40 countries, 120 censuses, ~200 million person records)

Page 3:

Integration: IPUMS-EurAsia in global context

[Map, Mollweide projection. Legend: Microdata: integrated into IPUMS; entrusted to IPUMS; none entrusted; none inventoried]

dark green = integrated 2002-2009 (44 countries, 130 censuses, 279 million person records)

Page 4:

1. Data recovery

Example: Bangladesh Bureau of Statistics, 1981 census, 276 tapes (recovered Sep. '08)

>3,000 tapes recovered: 1971 Germany, 1980 Mexico, Mali 1976, Sudan 73, and many more

[Photo of a tape] Microdata on this tape were recovered!!

Page 5:

2. Microdata integration

Composite codes (multiple digits) retain not only significant distinctions but also integrate comparable concepts.

Code  Label                                      Chile 1992  Chile 2002  México 1990  México 2000
0     NIU                                        X           X           X            X
      ACTIVE (In Labor Force)
100   EMPLOYED, not specified                    ·           ·           ·            ·
110   At work                                    X           X           X            X
111   At work, and 'student'                     ·           ·           ·            X
112   At work, and 'housework'                   ·           ·           ·            X
113   At work, and 'seeking work'                ·           ·           ·            X
114   At work, and 'retired'                     ·           ·           ·            X
115   At work, and 'no work'                     ·           ·           ·            X
116   At work, and 'other'                       ·           ·           ·            X
117   At work, family holding, not specified     ·           ·           ·            ·
118   At work, family holding, not agricultural  ·           ·           ·            ·
119   At work, family holding, agricultural      ·           ·           ·            ·
120   Have job, not at work last week            X           X           X            X

INDEC-Argentina evaluated IPUMS integration: a couple of minor errors and misinterpretations
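The composite-code idea can be sketched in a few lines of Python. The sketch assumes, as the code table suggests but the slide does not state outright, that the leading digits carry the more general category, so truncating a detailed code gives a code that is comparable across censuses recording less detail. The function names and the list of sample codes are hypothetical.

```python
from collections import Counter

# Detailed employment-status codes and labels, copied from the slide's table.
EMPSTAT_LABELS = {
    0: "NIU",
    100: "EMPLOYED, not specified",
    110: "At work",
    111: "At work, and 'student'",
    112: "At work, and 'housework'",
    113: "At work, and 'seeking work'",
    114: "At work, and 'retired'",
    115: "At work, and 'no work'",
    116: "At work, and 'other'",
    117: "At work, family holding, not specified",
    118: "At work, family holding, not agricultural",
    119: "At work, family holding, agricultural",
    120: "Have job, not at work last week",
}


def two_digit_level(code: int) -> int:
    """Drop the last digit of detail, e.g. 113 -> 110 (assumed hierarchy)."""
    return code // 10 * 10


def one_digit_level(code: int) -> int:
    """Keep only the first digit, e.g. 113 -> 1 (assumed hierarchy)."""
    return code // 100


def tabulate(codes, level=two_digit_level):
    """Count cases after collapsing each detailed code to the chosen level."""
    return Counter(level(code) for code in codes)


# Hypothetical person records carrying detailed codes.
sample_codes = [110, 111, 112, 120, 0]
print(tabulate(sample_codes))                   # counts at the 110/120 level
print(tabulate(sample_codes, one_digit_level))  # counts at the 0/1 level
```

A census that records only "At work" (110) can then be compared with one that records 111 to 116 by tabulating both at the two-digit level, while the full detail stays available for single-census work.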

Page 6:

3. Metadata integration

» Comprehensive source documentation:
  » Data dictionaries and codebooks
  » Questionnaires, manuals, etc.
  » All translated to English and converted into metadatabase for each census
» New metadata for each census and sample
  » Census title, year, universe, de-jure/de-facto, census day, forms, field work period, etc.
  » Sample: source, design, density, unit, weights, etc.

Page 7:

3. Metadata integration

» New, systematic metadata for each variable
  » Codes
  » Universes
  » Definitions
  » Comparability
» Dynamic system: facilitates comparing the wording of questionnaires and instructions for any combination of countries and censuses
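One way to picture the per-variable metadata and the side-by-side comparison of questionnaire wording is the sketch below. It is not the IPUMS metadatabase schema: the class, its field names, and the `compare_wording` helper are assumptions made for illustration, and the wording strings are placeholders rather than real questionnaire text.

```python
from dataclasses import dataclass, field


@dataclass
class VariableMetadata:
    """Hypothetical container for the four metadata items listed on the slide."""
    name: str
    definition: str
    comparability: str
    codes: dict = field(default_factory=dict)             # code -> label
    universes: dict = field(default_factory=dict)         # (country, year) -> universe
    enumeration_text: dict = field(default_factory=dict)  # (country, year) -> wording


def compare_wording(variable, samples):
    """Return the questionnaire wording for any chosen set of censuses."""
    return {
        sample: variable.enumeration_text.get(sample, "(not available)")
        for sample in samples
    }


empstat = VariableMetadata(
    name="EMPSTAT",
    definition="<definition placeholder>",
    comparability="<comparability notes placeholder>",
    codes={110: "At work", 120: "Have job, not at work last week"},
    enumeration_text={
        ("Chile", 2002): "<wording placeholder, Chile 2002>",
        ("México", 2000): "<wording placeholder, México 2000>",
    },
)

for sample, text in compare_wording(empstat, [("Chile", 2002), ("México", 2000)]).items():
    print(sample, "->", text)
```

Keying the wording by (country, census year) is what lets any combination of censuses be pulled up together.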

Page 8:

EMPSTAT, General Version, Case-Count View
Example of IPUMS Metadata “Codes”

Page 9:

EMPSTAT, Variable Description
Example of IPUMS Metadata

Page 10:

EMPSTAT, Variable Description
Example of IPUMS Metadata

Page 11:

EMPSTAT, “Enumeration Text” = form and instructions
Example of IPUMS Metadata

Click above for text or image in official language

Page 12:

4. Statistical confidentiality:
Conference of European Statisticians: “Good practice”

Dennis Trewin on-site inspection:

» “The best practice for an international repository of microdata”
» “The security of IPUMS is first class…the standard of the best national statistical offices”
» “in full compliance with the principles and recommendations of the CES [Conference of European Statisticians]”

Page 13:

5. Microdata access:
IPUMS is a restricted-access, web-based system

» Password protected: to make extracts and retrieve microdata
» Licensed researcher selects:
  » Countries,
  » Censuses,
  » Cases/sub-populations,
  » Variables, and
  » Sample densities
» Extract engine queues request, generates extract
» Researcher retrieves extract via web with SSL 128-bit encryption and analyzes using own wares (soft/hard/wet)
» NO source files. NO complete datasets.
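The access workflow on this slide (a licensed researcher picks countries, censuses, cases, variables and a sample density; an engine queues the request and builds the extract) can be sketched as below. This is a toy model under stated assumptions, not the IPUMS extract system or its API: `ExtractRequest`, `ExtractEngine`, the `keep_every` density parameter and the in-memory records are all hypothetical.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ExtractRequest:
    user: str
    samples: set          # {(country, census_year), ...}
    variables: list       # names of the variables to include
    case_filter: Optional[Callable[[dict], bool]] = None  # sub-population
    keep_every: int = 1   # 1 = full sample density, 2 = every second case, ...


class ExtractEngine:
    def __init__(self, records, licensed_users):
        self._records = records
        self._licensed = set(licensed_users)
        self._queue = deque()

    def submit(self, request):
        """Queue a request; only licensed researchers may submit."""
        if request.user not in self._licensed:
            raise PermissionError("extracts are limited to licensed researchers")
        self._queue.append(request)

    def run_next(self):
        """Build the extract for the oldest queued request."""
        request = self._queue.popleft()
        selected = [
            r for r in self._records
            if (r["country"], r["year"]) in request.samples
        ]
        if request.case_filter is not None:
            selected = [r for r in selected if request.case_filter(r)]
        selected = selected[::request.keep_every]
        # Only the requested variables leave the system, never whole records.
        return [{name: r[name] for name in request.variables} for r in selected]
```

Returning only the requested variables for the requested cases mirrors the slide's last bullet: the researcher receives a custom extract, not source files or complete datasets.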

Page 14:

C. Whither IPUMS-EurAsia?

» Users: 2,482 researchers, 69 countries
» 90% are university researchers; but also WHO, ILO, World Bank
  » Economists: 45.7%
  » Demographers: 19.0%
  » Sociologists: 10.1%
  » Public policy: 5.1%
  » Statisticians: 2.7%
  » Historians: 2.4%
» Asia and Pacific region: not so many users, due to few samples?
  » China 37
  » Japan 25
  » Australia 24
  » Singapore 10
  » India 8
» Looking ahead

Page 15:

Looking ahead

» Countries:
  » Soon: Bangladesh, Indonesia, Nepal, Pakistan, Thailand
  » Later: ???
» 2010 census round
» New methods: variance estimation
  » Imputing pseudo-strata to simplify variance estimation for complex samples
» 2015: 200 censuses, 75 countries, 10,000 users??
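The slide names pseudo-strata without describing the procedure, so the following is only a generic textbook-style sketch, not the method IPUMS adopted. It assumes clusters are listed in sample order, groups neighbouring clusters into pseudo-strata, and then applies the usual between-cluster variance estimator for a weighted total within each pseudo-stratum. All function names and the input layout are hypothetical.

```python
from collections import defaultdict


def assign_pseudo_strata(cluster_ids, clusters_per_stratum=2):
    """Group clusters, taken in their given (sample) order, into pseudo-strata."""
    n_strata = max(1, len(cluster_ids) // clusters_per_stratum)
    strata = {}
    for position, cluster in enumerate(cluster_ids):
        # Leftover clusters at the end are folded into the last pseudo-stratum.
        strata[cluster] = min(position // clusters_per_stratum, n_strata - 1)
    return strata


def stratified_total_variance(records, strata):
    """Estimate the variance of a weighted total.

    records: iterable of (cluster_id, weight, value) tuples.
    strata:  mapping cluster_id -> pseudo-stratum.
    """
    cluster_totals = defaultdict(float)
    for cluster, weight, value in records:
        cluster_totals[cluster] += weight * value

    totals_by_stratum = defaultdict(list)
    for cluster, total in cluster_totals.items():
        totals_by_stratum[strata[cluster]].append(total)

    variance = 0.0
    for totals in totals_by_stratum.values():
        n = len(totals)
        if n < 2:
            continue  # a single cluster carries no between-cluster information
        mean = sum(totals) / n
        variance += n / (n - 1) * sum((t - mean) ** 2 for t in totals)
    return variance
```

With two clusters per pseudo-stratum, each stratum's contribution reduces to the squared difference of the two cluster totals, which is what makes the approach simple to apply to samples whose original design strata are not available.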

Page 16:

IPUMS at the 57th ISI (Durban, Aug 16-21, 2009)
http://www.statssa.gov.za/isi2009/index.aspx

» IPUMS-NSI Workshop (Aug 15-16)
» STCPM session: cross-national microdata
» IPUMS-Users Workshop
» IPUMS: modest funding for delegates from developing countries

Page 17:

IPUMS Global workshop, 56th ISI (Lisbon, Aug 2007)

Page 18:

Thank you.

[email protected]@umn.edu