predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · a. satellite images c....

30
Predicting poverty from satellite imagery Neal Jean, Michael Xie, Stefano Ermon Department of Computer Science Stanford University Matt Davis, Marshall Burke, David Lobell Department of Earth Systems Science Stanford University 1

Upload: others

Post on 01-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Predicting poverty fromsatellite imagery

Neal Jean, Michael Xie, Stefano Ermon

Department of Computer Science

Stanford University

Matt Davis, Marshall Burke, David Lobell

Department of Earth Systems Science

Stanford University

1

Page 2: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Why poverty?

• #1 UN Sustainable Development Goal

– Global poverty line: $1.90/person/day

• Understanding poverty can lead to:

– Informed policy-making

– Targeted NGO and aid efforts

2

Page 3: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Data scarcity

3

Page 4: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Lack of quality data is a huge challenge

• Expensive to conduct surveys:

– $400,000 to $1.5 million

• Data scarcity:

– <0.01% of total households covered by surveys

• Poor spatial and temporal resolution

4

Page 5: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Satellite imagery is low-cost and globally available

5

Simultaneously becoming cheaper and higher resolution(DigitalGlobe, Planet Labs, Skybox, etc.)

Shipping records

Inventory estimatesAgricultural yield

Deforestation rate

Page 6: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

What if…

6

we could infer socioeconomic indicators from large-scale, remotely-sensed data?

Page 7: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Standard supervised learning won’t work

7

- Very little training data (few thousand data points)

- Nontrivial for humans (hard to crowdsource labels)

Poverty, wealth, child mortality, etc.Model

Input Output

Page 8: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Transfer learning overcomes data scarcity

8

Transfer learning: Use knowledge gained from one task to solve a different (but related) task

Train here Perform here

Transfer

Page 9: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Transfer learning bridges the data gap

9

A. Satellite images C. Poverty measures

Not enough data!

B. Proxy outputs

Deep learning

model

Plenty of data!

Less data needed

Would this work?

Page 10: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Nighttime lights as proxy for economic development

10

Page 11: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Why not use nightlights directly?

11

A. Satellite images C. Poverty measures

B. Nighttime light intensities

Page 12: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Not so fast…

12

Almost no variation

below the poverty line

Page 13: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Lights aren’t useful for helping the poorest

13

Page 14: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Step 1: Predict nighttime light intensities

14

training

images

sampled

from these

locations

A. Satellite images C. Poverty measures

Deep learning

model

B. Nighttime light intensities

Page 15: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Training data on the proxy task is plentiful

15

Millions of training images

training

images

sampled

from these

locations

Low nightlight intensity

,

High nightlight intensity

,

(

(

)

)

Labeled input/output training pairs

Page 16: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Images summarized as low-dimensional feature vectors

16

f1

f2

f4096

Night-LightNonlinear

mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities

Convolutional Neural Network

(CNN)

Linear

regression

Page 17: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Feature learning

17

f1

f2

f4096

Night-LightNonlinear

mappingLinear

regression

min𝜃,𝜃′

𝑖=1𝑚 𝑙 𝑦𝑖 , 𝑦𝑖 = min

𝜃,𝜃′ 𝑖=1𝑚 𝑙 𝑦𝑖 , 𝜃

𝑇 𝑓(𝑥𝑖 , 𝜃′ )

Over 50 million parameters to fit

Run gradient descent for a few days

𝜃𝜃′

Page 18: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Transfer Learning

18

Have we learned to identify useful features?

f1

f2

f4096

PovertyNonlinear mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities

Feature Learning

Target task

Page 19: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Model learns relevant features automatically

19

Satellite image

Filter activation map

Overlaid image

Page 20: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

• Living Standards Measurement Study (LSMS) data in Uganda (World Bank)

– Collected data on household features

• Roof type, number of rooms, distance to major road, etc.

– Report household consumption expenditures

• Task: Predict if the majority of households in a cluster are above or below the poverty line

Target task: Binary poverty classification

20

Page 21: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

How does our model compare?

21

Survey-based model is the gold standard for accuracy but…– Relies on expensively collected data

– Is difficult to scale, not comprehensive in coverage

0.75

0.53

0.71

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Survey Nightlights Transfer

Accuracy

Page 22: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Transfer learning model approaches survey accuracy

22

0.75

0.53

0.71

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

Survey Nightlights Transfer

Accuracy

Advantages of transfer learning approach:– Relies on inexpensive, publicly available data

– Globally scalable, doesn’t require unifying disparate datasets

Page 23: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Our model maps poverty at high resolution

23

Case study: Uganda

- Most recent poverty map over a decade old- Lack of ground truth highlights need for more data

Smoothed predictions District aggregated Official map (2005)

Page 24: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

We can differentiate different levels of poverty

24

2 continuous measures ofwealth:• Consumption expenditures

• Household assets

We outperform recent methods based on mobile call record data

Blumenstock et al. (2015) Predicting Poverty and Wealth

from Mobile Phone Metadata, Science

Page 25: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Transfer Learning

25

f1

f2

f4096

ExpendituresAssetsNonlinear

mapping

Inputs: daytime satellite images

{Low, Medium, High}

Outputs: Nighttime light intensities

Feature Learning

Target task

Page 26: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Models travels well across borders

26

Models trained in one country perform well in other countries

Can make predictions in countries where no data exists at all

Page 27: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

What do we still need?

• Develop models that account for spatial and

temporal dependencies of poverty and health

27

Page 28: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

Take full advantage of incredible richness of images

28

Page 29: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

We have introduced an accurate, inexpensive, and scalable approach

to predicting poverty and wealth

29

A new approach based on satellite imagery

Page 30: Predicting poverty from satellite imageryermon/cs325/slides/poverty.pdf · A. Satellite images C. Poverty measures Not enough data! B. Proxy outputs Deep learning model Plenty of

30