1
OZOEKWE, CHARLES IFEANYI
PG / M.Sc. / 12 / 63531
APPLICATION OF REMOTE SENSING AND GIS IN THE POPULATION STUDY OF ACHARA LAYOUT ENUGU, ENUGU
STATE, NIGERIA
FACULTY OF ENVIRONMENTAL STUDIES
DEPARTMENT OF GEOINFORMATICS AND SURVEYING
Azuka Ijomah
Digitally Signed by: Content manager’s Name
DN : CN = Webmaster’s name
O= University of Nigeria, Nsukka
OU = Innovation Centre
2
APPLICATION OF REMOTE SENSING AND GIS IN THE POPULATION STUDY OF ACHARA
LAYOUT ENUGU, ENUGU STATE, NIGERIA
BY
OZOEKWE, CHARLES IFEANYI
PG / M.Sc. / 12 / 63531
DEPARTMENT OF GEOINFORMATICS AND SURVEYING
FACULTY OF ENVIRONMENTAL STUDIES
UNIVERSITY OF NIGERIA
ENUGU CAMPUS
DECEMBER, 2015
3
APPLICATION OF REMOTE SENSING AND GIS IN THE POPULATION STUDY OF ACHARA
LAYOUT ENUGU, ENUGU STATE, NIGERIA
Submitted By
OZOEKWE, CHARLES IFEANYI
PG / M.Sc. / 12 / 63531
BEING A PROJECT REPORT PRESENTED TO THE DEPARTMENT OF GEOINFORMATICS AND SURVEYING, FACULTY OF ENVIRONMENTAL STUDIES, UNIVERSITY OF NIGERIA, ENUGU
CAMPUS, IN PARTIAL FULFILLMENT OF THE REQUIREMENT FOR THE AWARD MASTER OF
SCIENCE (M.Sc.) IN GEOINFORMATICS AND SURVEYING
SUPERVISOR
PROF. F. I. OKEKE
DECEMBER 2015
4
CERTIFICATION
This is to certify that I, OZOEKWE CHARLES IFEANYI with registration
number PG/MSc/12/63531, a postgraduate student of the department of
Geoinformatics and Surveying, have satisfactorily completed the requirement
for this Project for the award of the M.Sc in Geoinformatics and Surveying.
The work embodied in this Project is original, and has not to my knowledge
been submitted in part or full for any other degree of this or other University
……….………………………… …............................... OZOEKWE, CHARLES IFEANYI DATE
STUDENT
5
APPROVAL
This is to certify that I, OZOEKWE CHARLES IFEANYI with registration
number PG/MSc/12/63531, a postgraduate student of the department of
Geoinformatics and Surveying, have satisfactorily completed the requirement
for this Project for the award of the M.Sc in Geoinformatics and Surveying.
The work embodied in this Project is original, and has not to my knowledge
been submitted in part or full for any other degree of this or other University
………………………………. …………….............. PROF. F. I. OKEKE DATE PROJECT SUPERVISOR ……………………………… ……………………….. DR. E.C. MOKA DATE HEAD OF DEPARTMENT ……………………………… ..………………….…… EXTERNAL EXAMINER DATE ……………………………….. …………..………….. PROF. F. I. OKEKE DATE DEAN, FACULTY OF ENVIRONMENTAL STUDIES ………………………………………… ..………………… DEAN, SCHOOL OF POSTGRADUATE STUDIES DATE
6
DEDICATION
This Research Work is dedicated to God Almighty who through His Grace,
Ace, and Pace, i was able to brace up and face this race to the end.
7
ACKNOWLEDGEMENT
I am very grateful to my parents Sir (Dr) & Lady M. A. Ozoekwe for their
unlimited moral and parental support given to me during the course of my
study in school, post school and up till this moment. Also to a big thanks goes
to my siblings for their love and words of encouragement,
I also appreciate my supervisor Prof F.I Okeke for offering his time, wisdom
and expertise, for that I say more grease to his elbows. To my fellow students
and friends, I appreciate your contributions toward this project.
My thanks also goes to Dr. E.C Moka (Head of Department) and the entire
staff and Lecturers of the department of Geoinformatics and Surveying,
University of Nigeria Enugu Campus.
8
ABSTRACT
This project is an attempt to study population distribution in Achara Layout Enugu, in Enugu State through integration of Remote sensing and Geographic Information System (GIS). The Remote sensing data used in this study included high spatial resolution Quick Bird imagery. Questionnaires were given out for use in generating the desired product in an interactive GIS environment. Field site capturing was based on basic survey principle, technique and instruction. Field work was done to determine the type of building appearing in the satellite imagery and the no of rooms estimated, thus a standard family size gotten. After estimating the population size of Achara Layout, the result was compared to the population data gotten from National Population Commission for Achara Layout Enugu state which was divided into two as it comprised of data for Achara Layout and Idaw River Layout Enugu as directed by National Population Commission Enugu. The methodology involved carrying out a user needs assessment, collection of map from Enugu State Town Planning Department of Ministry of Land, collection of Population census data from National Population Commission, rigorous field work, data processing, comparison of results and presentation of results. The average difference between the population recorded in the 2006 then projected to 2015, and that estimated from Quick Bird images for Achara Layout Enugu, is found to be equal to 1.61%. Results from Arc GIS were presented in form of map while results from Microsoft Excel environment were presented in form of table.
9
TABLE OF CONTENT
Cover page..…………………………………………………………………….i
Title page ….…………………………………………………………………..ii
Certification……….…………….………………………………...….……....iii
Approval…..………………….…………………………………………….…iv
Dedication …….………………………………………………………………v
Acknowledgements ..………………………………………………………....vi
Abstract ….…………………………………………………………………..vii
Table of Contents …………………………………………………….……..viii
List of Tables ………………………………………………………………..xi
List of Figures …………………………………………………….….……...xii
CHAPTER ONE:
INTRODUCTION………………………………………………….……......1
1.0 Introduction………………………...…………………….………….…….1
1.1 Background of study…………………………….………….….…….……1
1.2 Statement of problem……………..………………….……….……….…..2
1.3 Aim………………….. …..…………………………..………..……….….3
1.4 Objectives…………………………………………………….…….……..3
1.5 Scope of work ……………………………………………..……….……...3
1.6 Study Area…………………………...…………………...……….……….4
1.7 Benefits And Contribution…………………………………..…..………...5
CHAPTER TWO: LITERATURE REVIEW……………..……………….6
2.1 Description of Geographic information system……………………...….………...7
2.2 Population studies using Remote sensing and GIS...…………..……...……..8
2.2.1 Background……………………………..……...….…………..……......8
2.2.2 Population density model from urban Geography…..……….…….........9
2.3 Population Estimation methods…………………………………………...9
2.3.1 Area interpolation methods…………………...………………….….....10
10
2.3.2 Point Based Methods……………………………………...…………...11
2.3.3 Area based method………………………………...………………......13
2.3.4 Statisitcal Modelling method…………………………………………..19
2.4 Summary…….………………….……………...………………...............26
CHAPTER THREE: METHODOLOGY
3.1 Flow Chart………………………………….…………….……….….....29
3.2Project Planning.........................................................................................30
3.3Data Acquisition………………..................................................................30
3.4 Tools and components used…………………………………..…...........32
3.5Data Sources………………………………………......………….……..33
3.6Data Processing…………….....………………………..…………..……34
3.6.1 Scanning ……………………………………..…………..………....34
3.6.2 Data Georefrencing………………………………………………….....34
3.6.3Data Vectorization………………………………………………………38
3.6.4. Shape Files in Arc GIS………………..…………………………….....39
3.6.4.1Features in Shape files ……………………………...….…………......39
3.6.5 Digitizing a map………………………….………………..……....40
3.6.6Data base structure for Population census…………………...….……....41
3.6.6.1Conceptual Design ………………….....………………………..........42
3.6.6.2Entity Relationship Model……………………………….…………...43
3.6.6.3Logical Design……………………………………………..……..…..44
3.6.6.4Physical Database Design…………………….…………………..…..45
3.7 User Requirement……………………………………….…...............47
3.7.1 Database Design for Achara Layout…………………….....………….47
CHAPTER FOUR: RESULTS AND ANALYSIS
4.1 Results……………………….……….…………………………..…......49
4.1.1 Analyses…………………….………….…………..…………….……49
4.1.2 Estimation of population data per household through satellite image....49
11
4.1.3 Estimation of population study by Block Parcel……………….……...51
4.1.4 Population Density………………………………………………..........54
4.2 Comparative analysis of population data from National Population
Commission and that gotten from Remote sensing and GIS application….....57
CHAPTER FIVE: CONCLUSION AND RECOMMENDATIONS
5.1 Conclusion……………………………………..……………………........59
5.2 limitations of using remote sensing method for population studies….......59
5.2.1 Image Classification……………………………..………………..……60
5.2.2 3D nature of urban areas…………………………………………….…60
5.3 Recommendations………………………………….…………………….61
REFERENCES………………………………………...................................68
APPENDICES…………………………………...………….………………69
12
LIST OF FIGURES PAGE
Figure 1 Overview Map Showing Achara Layout ……………..………...…...5
Figure 3.1 Flow Chart of Project Plan……………………………...………...29
Figure 3.2 Map of Achara Layout as Designed by Town Planning unit……..31
Figure 3.3 QuickBird Image of Enugu Metropolis……………......................32
Figure 3.4 Selecting Georefrencing in ArcMap……………………...…........35
Figure 3.5 Selecting Georefrencing in ArcMap .……………………..……..35
Figure 3.6 Positioning of The Cross Hair…………………….…….………..36
Figure 3.7 Georefrencing The Scanned Map……………………….….…….37
Figure 3.8 Digitized Block Parcel....................................................................38
Figure 3.9 Digitizing Of Building Through Satellite Imagery Of Study Area.39
Figure 4.0 Joining Table in ArcGIS Environment……………….……..........42
Figure 4.1 Building ER Diagram……………………………….……….…...43
Figure 4.2 Block of ER Diagram…………………………………….….……43
Figure 4.3 Population of ER Diagram…………….………………………….43
Figure 4.4 Relationship Between Entities……………………………...…….44
Figure 4.5 Population Concentration Map By Block Parcel………….….…..51
Figure 4.6 Population Density Map By Block Parcel .....................................55
Figure 4.7 Enugu state 2006 Population Census Result………….…….…….56
Figure 4.8 Enugu South L.G.A. 2006 Population Census Result...…...……..57
13
LIST OF TABLES PAGE
Table 1.0 Geodatabase of Extracted Houses from Imagery………...………..45
Table 2.0 Field Survey data of Achara Layout using MS Excel………….….46
Table 3.0 Field Survey data of Achara Layout using ArcGIS…………...…..47
Table 4.1 Sample Census Per Building………………………………………50
Table 4.2 Detail of Population Study Area Per Block Parcel…………..……53
Table 4.4 Comparative Analysis of Population Data From NPC And Data Got
From Remote Sensing for Achara Layout Enugu State ………………….....59
14
CHAPTER ONE
INTRODUCTION
1.1 BACK GROUND OF STUDY
Although numerous estimates of the Nigerian population were made
during the colonial period, the first attempt at a nationwide census was during
1952-53. This attempt yielded a total population figure of 31.6 million within
the current boundaries of the country. This census has usually been considered
an undercount for a number of reasons: apprehension that the census was
related to tax collection; political tension at the time in eastern Nigeria;
logistical difficulties in reaching many remote areas; and inadequate training
of enumerators in some areas. The extent of undercounting has been estimated
at 10 percent or less, although accuracy probably varied among the regions.
Despite its difficulties, the 1952-53 census has generally been seen as less
problematic than any of its successors. Subsequent attempts to conduct a
reliable post-independence census have been mired in controversy, and only
one was officially accepted. A good census estimate of a country population
can help a country plan for present and future development and so much more.
The data generated from population census are analyzed and used to calculate
for various needs like gender ratio, indigenes and non-indigenes, migrants and
immigrants e.t.c However, remotely sensed data offers the means to measure
spatial attributes of the urban landscape. In the past, researchers remain
depended on “Aerial Photographs” because of their fine spatial resolution to
get accurate data about size of houses and their volumes and consequently
estimates of population (Adeniyi, 1983; Lo and Chan, 1980).In addition to
that, remotely sensed data provide a wider spectral coverage (number of
bands). Due to these advantages, there is a general trend towards using
remotely sensed data for census studies.
15
The rapid growth of computer technology, such as Geographic Information
System (GIS), has been widely used in various fields since it was being born
in year 1989. A GIS is a powerful configuration of computer hardware and
software used for compiling, storing, managing, manipulating, analyzing, and
mapping (displaying) spatially-referenced information. (Haestad Methods,
2003).
The use of GIS for population studies ranges from creation of maps to
modelling the relationship between population variables (Baudot, 2001;
Forbes, 1984; Langford and Unwin, 1994; Martin, 1989; Rhind, 1991.
This project presents the integration of Geographic Information System
(GIS) and remote sensing to carry out some modeling related to population
density. The integration normally takes the form of using remote sensing data
as a source of data or using GIS as in-situ tool. The results of analysis are
integrated for further spatial analysis.
1.2 STATEMENT OF THE PROBLEM
There is need to represent properly the number of populace living in
every location. The inability to properly represent the figures or get a fairly
accurate census figure has contributed to poor planning, poor decision making
and implementation. The need to adequately represent fair estimate of the
population size is paramount and can never be over emphasized. Population
census process has been marred logistical difficulties in reaching many remote
areas, inadequate training of enumerators, inconsistent data management
process e.t.c Inability to manage acquired data can lead to poor data analysis
and usage. Previous studies have shown the importance of integrating remote
sensing with GIS for population studies (Donnay, 1992; Harris and Ventura,
1995; Wilkinson, 1996). The integration normally takes the form of using
remote sensing data as a source of data or using GIS as in-situ tool (Sadler and
Barnsley, 1990).
16
1.3 AIM
The aim of this project is to estimate the population of Achara Layout in Enugu
State using remote Sensing and GIS Technology.
1.4 OBJECTIVES
The objectives of this project are to:
i. To estimate the population of Achara Layout using Remote Sensing
and GIS to population studies in Enugu State of Nigeria.
ii. Compare the population data gotten from National population
commission for Achara Layout, Enugu state and that estimated
using GIS and Remote sensing methods for Achara Layout, Enugu
State.
iii. Show the population density in Achara Layout Enugu State Nigeria.
1.5 SCOPE OF WORK
A good census estimate of a country population can help a country
plan for present and future development and so much more. The data
generated from population census are analyzed and used to calculate for
various needs like gender ratio, indigenes and non-indigenes, migrants and
immigrants e.t.c. Attempts to conduct a reliable post-independence census
have been mired in controversy especially using the traditional methods of
head counting. Modern GIS & Remote Sensing is capable of fulfilling many
of these requirements for an automated Population Census estimation within a
short period of time. The Data so obtained shall be used for future projections
using certain parameter obtained on field visitation..
The scope of work revolves around Achara Layout Enugu, Enugu State as this
is a good site to show case some of the capabilities of GIS because this area is
fully built up and its right about the oldest layout settlement in Enugu
Metropolis.
17
1.6 STUDY AREA
Enugu is a large city in the Eastern region of the Nigeria with
geographic coordinates 6°30′N 7°30′E. Enugu State is one of the states in the
eastern part of Nigeria and occupies a total area of 7,161 km2 (2,765 Sq.
mi)The state shares borders with Abia State and Imo State to the south, Ebonyi
State to the east, Benue State to the northeast, Kogi State to the northwest and
Anambra State to the west. Enugu State has a population of over 3.3 million
people and counting. It is home of the Igbo of south-eastern Nigeria.
Achara Layout is located in Enugu South of Enugu State of Nigeria.
Enugu South is a local Government Area of Enugu State. Its headquarters are
in the city of Enugu. It has an area of about 106 km² and a population of about
244,852 at the 2006 census.
Achara Layout is one of the areas of Enugu that was mapped out in the
1960s and has grown from a residential suburb to a major commercial area
especially along Agbani Road, the main high street; the streets are heavily
built up. It can be assessed from all parts of Enugu. The figure 1 is the Google
Image of Achara Layout.
18
Figure 1 overview map showing achara layout and environs
1.7 BENEFITS AND CONTRIBUTIONS
The use of Geographic information system shall benefit National
Population Commission and contribute to the development of Nigeria in the
following ways:
1. Site location
2. Provision and Access to social amenities information where detailed
attributes about individual age distribution and human concentration
are obtained, etc.
3. Availability of population data which is accessible via GIS and is
easily updated
4. Availability of digital maps that can be used as guides.
5. Creation of as-built drawings that have been geo-referenced and
digitized.
19
6. Field access to flexible data, based on street- aids in finding street and
information about areas, and other information can still be inputted.
7. Used existing building, to build an integrated building / block system
data source.
8. It can be used to define long and short term action for map updates and
data cleanup that will move the City forward for the long term data set
and data management.
9. Creation of Zone maps, boundary maps were very important; this will
assist in planning and organizing excavation, rehabilitation and
election voting systematically.
10. It can be used to coordinate and manage Service calls since location of
streets and building information can be viewable in GIS.
11. Ability to view other City facilities and data in conjunction with street
data.
12. Draw and design plan and profile of streets.
13. It can improve better head counting of populates.
14. Provision of improved operational efficiencies.
15. Ability to query to extract information
16. Time and money savings
17. Better and timely decisions
20
CHAPTER TWO
LITERATURE REVIEW 2.1 DESCRIPTION OF GEOGRAPHIC INFORMATION SYSTEM
A GIS is defined as “an organized collection of computer hardware,
software, geographic data, and personnel designed to efficiently capture, store,
update, manipulate, analyze, and display all forms of geographically
referenced information”. (ESRI, 1992) GIS technology has been widely used
in various fields, such as agriculture, business geographic, ecology, electricity
and gas, emergency management and public safety, environmental
management, forestry, health care, education, mining and geosciences, real
estate, remote sensing, telecommunication, transportation and water
distribution and resources.
More commonly, people use GIS to make maps; a GIS can also be
used as a powerful analysis tool. It can be used to create and link spatial and
descriptive data for problem solving, spatial modeling and to present the
results in tables, graphics or maps. The most powerful feature of a GIS, from a
planner’s perspective, is probably the ability of GIS to integrate databases,
through their spatial relationships, that would be difficult or impossible to do
outside a GIS environment, (Methods, 2003). Remotely sensed data offers the
means to measure spatial attributes of the urban landscape.
In the past, researchers remain dependent on “Aerial Photographs”
because of their fine spatial resolution to get accurate data about size of houses
and their volumes and consequently estimates of population (Adeniyi, 1983;
Lo and Chan, 1980). Aerial photography is not allowed to be exported from
some countries, but satellite images are occasionally available and can be
collected from anywhere. One meter resolution satellite sensor usually covers
a wider field of view compare to aerial photographs, meaning that satellite
data are usually less expensive than aerial photographs per km2. In addition to
that, remotely sensed data provide a wider spectral coverage (number of
21
bands). Due to these advantages, there is a general trend towards using
remotely sensed data for census studies.
(Brugioni, 1983; Iisaka and Hegedua, 1982; Lo, 1995; Lo and Chan,
1980; Lo and Faber, 1997). While these efforts have shown promise, they
have all lacked the use of high-spatial resolution imagery. This work extends
previous studies that have tested the accuracy of the resultant population
estimates using remote sensing. Progressively switching over of remotely
sensed data obtained from government sponsored satellite programs, such as
LANDSAT, SPOT, RADARSAT, IRS to commercial satellites such as
IKONOS, QuickBird, and OrbView after 1999 has improved the state-of-art.
These satellites promised to provide unprecedented access to accurate and
timely information (Baker et al., 2001).
2.2 POPULATION CENSUS STUDIES USING REMOTE SENSING
AND GIS APPLICATION
2.2.1 Background
Many methods for population estimation have been reported in the GIS
and remote sensing literatures. Depending on the intended goal and the
required information, these methods can be grouped into two categories: areal
interpolation and statistical modeling. Areal interpolation methods are
primarily designed for the zone transformation problem that involves
transforming data from one set of spatial units to another. This approach uses
census population data as the input and applies interpolation or disaggregation
techniques to obtain a refined population surface. In contrast, the statistical
modeling approach is more interested in inferring the relationship between
population and other variables for the purpose of estimating the total
population for an area. The statistical modeling approach does not directly use
census data as the input. Rather, it makes use of socioeconomic variables and
applies theories in urban geography for population estimation; census
population data only participate in the model training process. This approach
22
is originally designed to estimate the intercensal population or population of
an area difficult to enumerate, though it can also be incorporated into the
process of interpolating census population. Before reviewing these two
approaches of population estimation, we would like to review the early
population density models from urban geography.
2.2.2 Population Density Models From Urban Geography
The simple gravitational population density model from urban
geography is the heart of what has been called social physics (Stewart and
Warntz, 1958). Although many people have noticed the decrease of population
density from inner city to outer.
2.3 POPULATION ESTIMATION METHODS
it was Clark (1951) who first associated this observation with specific
mathematical functions (Liu, 2003) in the following negative exponential
function: d (r) = K * e–λ r , (1) where d(r) is the population density at distance
r from the center of the city (r = 0); K is a constant that equals the central
density d(0); M describes the rate of decline of density. This relationship has
been demonstrated to exist for many cities of the United States (Weiss, 1961),
as well as for many cities outside the U.S. (Newling, 1965). Although the
goodness of fit varies, the model always holds statistically significantly in
every place studied. Kramer (1958) also incorporated a sectoral model of a
city and showed how various models of urban forms interact with each other.
Some studies have explored other mathematical forms to describe the
relationship between population density and location. For example, Sutton et
al. (1997) examined the Gaussian and the parabolic forms and found that both
are statistically significant. Some studies have criticized the use of the
negative exponential function. For example, Batty and Longley (1994) stated
that the exponential population density function has been used solely for its
23
convenience and elegance, rather than its appropriateness to empirical data.
Parr (1985) suggested that an inverse power function is more appropriate to
the urban fringe and hinterland, and the negative exponential function is more
appropriate for describing density in the urban area. A similar suggestion to
make a modification at the fringe of the urban area is also found in Tobler’s
(1999) comment on Martin’s (1996) population interpolation algorithm.
Tobler pointed out that “the exponential distance decay function is a relevant
approximation for the whole of an urban area, yet its repeated use farther out
from the urban center hardly seems reasonable. In the periphery, far from the
center, the density gradient is much more nearly linear” (1999, p. 85). To
correct this problem, Tobler proposed a “tent function” by first decomposing
each census unit into triangles, with one vertex being the geometric centroid of
the unit; populations inside triangles are then assigned based on the
coordinates and population of the vertices.
2.3.1 Areal Interpolation Methods
The negative exponential function can only be regarded as the empirical
results showing how population is distributed in urban areas. It has not been
used, in existing literatures, for practical population estimation that concerns
accuracy. Instead, most studies used areal interpolation or statistical modeling
methods for population estimation. Areal interpolation, as mentioned above, is
primarily designed for zone transformation that involves transforming data
from one set of spatial units to another. The two sets of spatial units could be
referred as the source zone and the target zone (Lam, 1983). The general
strategy for zone transformation is to apply certain areal interpolation
operations to transform source zone data to finer-scale raster data and then
aggregate them for target zones. In the context of population interpolation,
census data are the vector-based source zone data and are interpolated to finer-
scale raster data by a certain interpolation method. Areal interpolation is
24
subject to errors from original areal aggregation. The quality of the
interpolation estimates depends largely on how source zones and target zones
are defined, the degree of generalization in interpolation process, and the
characteristics of the partitioned surface (Lam, 1983). Areal interpolation
methods can be further separated into two categories depending on whether
ancillary information is used. Areal Interpolation without Ancillary
Information For areal interpolation methods without ancillary information,
there are point based methods and areal-based methods (Lam, 1983). In point-
based interpolation, a control point is assigned to represent each source zone
and a grid map is generated with grid point values estimated from control
points. In contrast, area-based interpolation uses the source zone itself as the
unit of operation instead of arbitrarily assigned control points. Also, area-
based interpolation is more concerned with volume preservation; i.e., the
summation of population data to the original set of areal units is preserved in
the transformation to a new set of areal units. Based on theoretical and limited
empirical evidence, volume preservation is an essential requirement for
accurate interpolation estimates (Lam, 1983).
2.3.2 Point-Based Methods.
There have been many point-based interpolation methods developed in
the past. Some researchers put such methods into two groups, global and local,
depending on whether they consider all of the data values at once or the values
within a pre-defined neighborhood of each point. Here we adopt Lam’s (1983)
approach to group point-based methods into exact methods and approximate
methods, depending on whether they are concerned with preserving the
original sample point values or with determining an overall surface function
f(x, y). The reason for this categorization is that whether interpolation methods
preserve original data values on the inferred surface is fundamental in
analyzing their accuracy (Lam, 1983). The exact methods include
interpolating polynomials, most distance-weighting methods, kriging, spline
25
functions, and finite difference methods, while the approximate methods
include power-series trend models, Fourier series models, distance weighted
least squares, and least-squares fitting with splines. Each of these methods has
its own advantages and disadvantages, and none of them is superior to all
others for all applications (Lam, 1983).
Furthermore, the results from all the methods are seriously affected by
the quality of the original data, especially the density and the spatial
arrangement of data points, and the complexity of the surface. The choice of
an appropriate interpolation method depends largely on the type of data, the
degree of accuracy desired, and the amount of computational effort afforded.
In general, exact methods are more reliable than approximate methods because
of the former’s simplicity, flexibility, and reliability (Lam, 1983). One of the
point-based methods widely used in the UK census is a kernel-based
interpolation proposed by Martin (1989) (Bracken, 1991; Martin and Bracken,
1991; Bracken and Martin, 1989). This method uses a source zone centroid as
the control point. A window is positioned over each control point in turn and
the source zone population is allocated to grid cells falling inside the window
using a unique weighting based on the distance decay function between the
source zone centroid and the grid cell. Point-based areal interpolation methods
experience a few problems (Lam, 1983; Liu, 2003). First, the use of a control
point, usually the centroid or the center of an area, to represent the source zone
often introduces errors. The calculation of the centroid or center of an area
depends on the coordinates of the points defining the boundary of a source
zone. If the source zone is symmetrical and relatively simple, the center or
centroid would be a convenient control point, and the estimated value for each
grid cell would be reliable. However, if the boundaries are not symmetrical or
well generalized, the location of the centroid can be significantly effected, and
the interpolation results may be biased. In reality, census units are rarely
symmetrical, and the non-uniform distribution of population within a census
unit further complicates this issue. Another problem associated with point-
26
based interpolation methods is that all have some kind of a priori assumption
about the surface involved. This rather arbitrary assumption rarely fits the
complex geographical phenomena in the real world. Nevertheless, it is worth
noting that the task of areal interpolation is to search for the best method,
whose output is as close to the ground truth as possible. Violation of the
assumption of a method only implies that the results obtained may not be
optimal, but does not mean that this method is necessarily inferior to others.
For example, in the context of kriging interpolation, if the source zones can be
reduced to control points and the population distribution can be described by
the semi-variogram, kriging is the best linear unbiased estimation. If the
assumption is not satisfied, results from kriging interpolation are not
necessarily inferior to those from others. Comparative studies using empirical
datasets are needed to further research this issue. The most important problem
of point-based methods is that they mostly do not conserve the total value
within each source zone. Volume preservation is important in that it gives
reliability to the approximation of grid values for source zone, and thus the
subsequent estimation for target zone is less subject to error. Besides, in the
context of population interpolation, people should not be “destroyed” or
“manufactured” during the redistribution process (Langford and Unwin,
1994). To correct this problem, Martin (1996) modified the original kernel-
based interpolation algorithm (which is a point-based method) to ensure that
the populations reported for target zones are constrained to match the overall
sum of the source units.
2.3.3 Area-Based Methods.
In contrast to point-based interpolation, area-based methods are
volume-preserving methods. The simplest method in this category is the
overlay operation based on the geometric properties of the source and target
zones. It superimposes the target zone on the source zone to obtain the
proportion of each source zone in each target zone. The proportion then serves
27
as a weight and the values of target zones become a weighted linear function
of source zones. The major problem with the overlay method is that it assumes
homogeneity within each source zone. Source zones having homogeneous
distributions, unfortunately, seldom occur in the real world. This may well be
true of some phenomena such as rainfall or agricultural productivity, but is
harder to justify for human phenomena such as population. In addition, very
often the source zones were originally delineated for other purposes and may
not show the important distribution information for the target zones. For these
reasons, the reliability of target zone estimates is controlled mainly by the
nature and degree of the homogeneity of the source zone and by the size of the
target zone in relation to the source zone (Lam, 1983). Tobler’s (1979)
pycnophylactic interpolation is probably the most widely quoted area-based
interpolation method. This method assumes a smooth density function that
takes into account the effect of adjacent source zones while preserving its
volume.
The smooth condition intends to minimize the curvature of estimated
surface by requiring the value of any grid point to approach the averages of its
four neighbors. Other smoothing conditions may be used depending on the
type of application. The interpolation process begins by assigning the mean
density to each grid cell superimposed on the source zones, and then modifies
this by a slight amount to bring the density closer to the value required by the
smoothing condition. The volume-preserving requirement is then enforced by
either incrementing or decrementing grid densities within each source zone
after each computation. The result is a smooth population density surface. The
original pycnophylactic interpolation uses regular lattice grids as its spatial
configuration. Rase (2001) extended it to a surface representation based on an
triangular irregular network (TIN). The basic step is to generate a TIN from
the boundary network first, and then to interpolate a smooth surface by an
iterative procedure, in which the two steps of smoothing and difference
distribution are repeated until the threshold for the overall smoothness
28
measured by the relative variance is reached or the maximum number of
iterations is exceeded. Compared to the original grid-based method, the TIN-
based pycnophylactic interpolation is argued to have several advantages (Rase,
2001), including that the error resulting from converting source zone polygons
to a regular grid is avoided, and that the TIN-based method is more suitable
for fast display in real-time applications. On the other hand, the TIN model is
more difficult to implement because it requires more effort and support for the
data and program structures. Lam (1983) stated that the overlay methods will
yield better estimates if the surface is discontinuous, whereas the
pycnophylactic method gives better results when smoothness is a real property
of the surface. In cases where the surface is intermediate between
discontinuous and maximally smooth, different target equations and side
conditions should be imposed for reliable results, but such methods are yet to
be developed.
Areal Interpolation with Ancillary Information Population is related to
other information, e.g., land use and transportation networks, that can be used
to assist population interpolation. This section will review the interpolation
methods with ancillary information, particularly those can be extracted from
remotely sensed data. The dasymetric method is the most well-known method
in this category. It was originally developed by Wright (1936) out of a concern
that choropleth maps do not give a valid representation of population
distribution within enumeration units. Wright’s idea was to use knowledge of
the locality to identify areas within zones that have different population
densities, thus allowing refinement of the assumption of an even distribution
(Fisher and Langford, 1995). In his population density mapping of Cape Cod,
Wright made binary partitions iteratively to disaggregate general zones to
detailed zones of population density while making certain that the original
zone population was preserved. In the past, Wright’s dasymetric mapping of
binary partition was difficult to implement. With the development of digital
data and GIS technology, the dasymetric method became easier through use of
29
the GIS overlay process, which also provides the convenience of integrating
various types of ancillary spatial data. For example, Monmonier and Schnell
(1984) demonstrated the integration of classified residential land use classes
from Landsat satellite imagery as the ancillary information in the dasymetric
method. Wright’s dasymetric method relies on knowledge of the local areas to
determine subzone population densities. Flowerdew and Green (1989)
proposed using statistical regression analysis to estimate subzone population
densities; yet Langford et al. (1991) first applied multivariable regression
techniques to estimate dasymetric subzone population densities. Their
approach is based on the following function:
Pi = Σ Pij = Σ Aij * Dj, …………………………………… (2)
where Pi is the total population of source zone i;
Pij is the total population of land use j within source zone i (subzone
ij);
Aij is the total area of land use j within source zone I;
and Dj is the average population density of land use j.
Aij can be obtained by a GIS overlay operation of a land use map and a
source zone map. Since there are multiple source zones, multivariable
regression can be applied to estimate Dj of multiple land use types. Volume
preservation is further maintained by scaling up or down derived density
measures to fit the original total population for each source unit. Despite the
ease of implementation, the dasymetric method is still subject to the problem
of an even distribution assumption within subzones. In other words, while the
difference between subzones is recognized, differences within subzones are
ignored. For example, for single-family land use, there is the difference
between low density, medium-density, and high-density zones. To incorporate
such a consideration, one may conduct a more detailed land use classification,
and associate each land use class with a certain population density. Although
30
this approach could improve population interpolation accuracy, it requires
effective ways to classify detailed land use types and to estimate their
population densities. The easiest dasymetric mapping approach with remote
sensing–derived land use data is a binary division approach in which land use
is classified to “populated” and “unpopulated” and census populations are
simply redistributed to those populated areas; some example studies included
Holt et al. (2004), Fisher and Langford (1996), and Langford and Unwin
(1994). Furthermore, a more specific dasymetric mapping approach would
classify a number of land use classes and redistributed census populations to
these classes; some example studies include Mennis (2003), Eicher and
Brewer (2001), Yuan et al. (1997), and Langford et al. (1991). For the latter
group of studies, some ways of determining the population density ratio
between land use classes must be applied. Some studies used an empirical
sampling approach (e.g, Mennis, 2003), some used pre-defined population
density statistics (e.g., Eicher and Brewer, 2001), whereas some used
regression analysis to derive population density estimates (e.g., Yuan et al.,
1997; Langford et al., 1991). The regression analysis seems to provide a
preferred approach because of its objectivity in testing model accuracies
through statistical significance tests. Harvey (2002b; 2000) adopted an
extreme approach to deal with the homogeneity assumption within subzones
by estimating population density in the spatial unit of pixels. His method first
assigned all residential pixels within a source zone with an equal share of the
total population in the following equation:
Pij = Pi /n, i = 1, 2, …, n
……………………………………………………….(3)
Where Pij is the population initially assigned to the jth pixel in source
zone i whose total population is Pi , and n is the number of pre-classified
residential pixels in source zone i. Since there were many source zones, each
31
of which had some residential pixels with different digital values, an ordinary
least-squares regression could be conducted between the population and the
digital value of the pixels. With the regression coefficients obtained, the
population of each pixel was adjusted by the following equation: , (4) where is
the regression estimate, and. (5) The result was that the adjusted reference
population lay closer to the regression line than the initially assigned
population. If the iteration was run again with the adjusted value, the R2
would be improved. The process was repeated iteratively, and R2 continued to
increase monotonically with decreasing increments, and stopped when a
predefined threshold was reached. Harvey proved that this iterated regression
procedure is a least-square approximation to the Expectation Maximization
(EM) algorithm that was originally presented by Dempster et al. (1977) and
applied by Flowerdew and Green (1989, 1991) for combining data from two
incompatible sets of spatial zones. Harvey (2000, 2002b) argued that the pixel-
level dasymetric method has several advantages over zone-level dasymetric
methods, including: human habitation by individual residences should be
better delineated by pixels; the mathematical form of pixel-based model is
simple and relatively robust; and implementation and refinement of routine
pixel-based classifiers are easier. We separated areal interpolation methods
into two categories depending on whether ancillary information is utilized.
The methods using ancillary information, particularly the dasymetric method,
usually yield more accurate results than those without ancillary information,
assuming the ancillary information reflects the spatial distribution of the
variables being mapped. It is worth noting that methods in these two
categories can be incorporated with each other for certain purposes. For
example, Langford and Unwin (1994) applied a kernel-based smoothing
function to the result of a dasymetric method in order to create a
cartographically pleasing and informative map, so that the readers won’t see
too many pixel-level details. Also, most methods in the first category (without
ancillary information) can still make use of ancillary information when it is
32
available. For example, in the case of Tobler’s pycnophylactic interpolation, if
information about residential areas is available, one can first allocate Pij( ) adj
P ˆ ij r = + ˆ P ˆ ij r ˆ Pij P ˆ – ij ( ) j = 1 n ∑ n = -- the population of a census
unit to residential polygons within it, assuming nonresidential polygons have
no population, and then perform the smoothing interpolation operation.
2.3.4 Statistical Modeling Methods
We will review the second category of population estimation methods,
the statistical modeling methods, in this section. As reviewed previously,
theories in urban geography have demonstrated that population distribution in
an urban area is affected by morphological factors such as distance to the
central business district (CBD), distance to roads, etc. Many of the
morphological factors can be extracted from remotely sensed data.
Consequently, remote sensing has been actively explored as a means to study
population distribution. Strictly speaking, models from this group are mainly
designed to estimate an overall population count rather than population density
that is relevant to population distribution. However, since population count
and population density can be derived from each other through the size of the
area of interest, the method designed to estimate population counts can also be
used to estimate population distribution. Statistical modeling approaches for
population estimation started in the 1950s. The initial motivation was to
remedy the shortcomings of the decennial population census, such as high
cost, low frequency, labor intensity, etc. (e.g., Kraus et al., 1974), but these
approaches also have been applied to check the reliability of the census
enumeration (e.g., Clayton and Estes, 1980), and the inference of
socioeconomic characteristics such as housing value and residential quality
(e.g., Forster, 1983). The use of remote sensing in statistical modeling
approaches started in the mid-1950s, particularly with the goal of searching for
an alternative to a population census. Researchers have conducted various
33
statistical modeling methods for population estimation on different scales with
different types of remotely sensed imagery. In general, there are five
categories of approaches, based on the relationship between population and (1)
urban areas, (2) land use, (3) dwelling units, (4) image pixel characteristics,
and (5) other physical or socioeconomic characteristics (Lo, 1986; Liu, 2003).
Correlation with Urban Areas This category of methods is a general approach
based on a functional relationship between urban areas and population size.
Inspired by the biological law of allometric growth (Huxley, 1932), Nordbeck
(1965) studied the relationship between urban areas and population size of
many U.S. cities and concluded that the built-up area (A) of a settlement is
proportional to its population (P) raised to some power: A = a * Pb. (6) Tobler
(1969) was the first to use satellite imagery to study the relationship between
population and urban areas. He used images from the Gemini manned space
flight program to study populations of many cities in the world. Assuming that
if cities can be considered circular in shape, and if shape varies insignificantly
with time, Tobler found the correlation coefficients between radii and
populations of 0.87 or higher in the following function: r = a * Pb. (7) The
results of his study also indicated that the coefficient (a) and exponent (b) for
cities in the United States was comparable to those for cities in Sweden and
Canada; cities in Japan and in the Nile Delta, however, had coefficients and
exponents reflecting the dense and compact structure of settlements in Asia
and the Middle East. The availability of Landsat satellite imagery and
advancement in image processing techniques allowed researchers to efficiently
study the relationship between population and urban areas, although one of the
major difficulties involved differentiating rural/urban boundaries (Lo and
Welch, 1977). Using 1972 to 1974 Landsat MSS imagery of 10 large cities in
China with 500,000 to 2,000,000 populations, Lo and Welch (1977) found
correlation coefficients of 0.82 or higher between populations and classified
urban areas in a modified function from (6): P = a * Ab. (8) This function can
be referred as the allometric growth model (Lee, 1989; Lo, 2003), which
34
describes that the relative growth rate of population is proportional to the
relative growth rate of the residential land area. Researchers also used urban
light as an indicator for population size. Prosperie and Eyton (2000) found a
quite high R2 of 0.974 between light volumes and populations of 254 Texas
counties using DMSP (Defense Meteorological Satellite Program) imagery.
Adopting a similar approach but at a smaller scale using cities, Lo (2002)
found a correlation coefficient of 0.91 between the light volumes of 35
Chinese cities and their non-agricultural populations. He further evaluated
derived population models using data from other 18 Chinese cities and
obtained acceptable accuracies. Correlation with Land Use The second
approach for population estimation is based on correlating population counts
with different types of land use areas, which should achieve higher precision
than the first approach. The total population for an area can be calculated
according to the following function: *, (9) where P is the total estimated
population;
Aj is the area of land use j;
and Dj is the population density for land use j,
Which is to be determined through regression analysis. This basic
function is similar to that used in the dasymetric method reviewed previously,
only that the former intends to disaggregate census population by maintaining
the original census unit population count, whereas the latter intends to estimate
the total (intercensal) population for an area. P Aj j = ∑ Dj Areas for different
types of land use could be extracted from remote sensing images.
The accuracy of population estimation would largely rely on how
accurate different types of land use are classified. In Weber’s (1994) study of
population estimation for the City of Strasbourg, France, he classified six
types of urban land use from SPOT HRV XS images with a Kappa coefficient
of 0.915. Then he ran a regression analysis between population counts and
land use areas of 2,812 census units based on function (9) and obtained an R2
35
of 0.91. After applying his regression model to estimate the total population
for the city, his model estimate was 7.91% below the census population of the
city. In Lo’s (2003) study of Project ATLANTA, he classified six types of
land use from Landsat TM images and obtained a Kappa coefficient of 0.878.
The area of low density urban use class was then regressed with population
counts of 418 census tracts using a logarithmic transformed allometric growth
model. The result had an R2 of 0.68. He then applied the regression model to
estimate populations of 373 census tracts and the results had a relative error of
14.80% and a overall underestimate of 8.07%. Population densities for
different types of land use could also be determined from sample surveys or
census statistics, in addition to the regression analysis. For example, in Kraus
et al.’s (1974) study of population estimation for four California cities, four
types of urban land use were first classified. Then the authors calculated the
characteristic population densities for each type of land use from sampled
census block-level population data. Finally they estimated city populations
based on function (9). The results ranged from an underestimate of 9.17% to
an overestimate of 7.00% when compared with census populations of the four
cities. Correlation with Dwelling Units The total population of an area can be
estimated by multiplying the total number of dwelling units with the number
of persons normally living in a dwelling unit. It is also possible to categorize
dwelling units and apply a different persons-per-dwelling unit ratio to each
category. This ratio can be derived from sample surveys or calculated from
census data with the assumption that a single household occupies one dwelling
unit. The total number of dwelling units in an area may be estimated from
remote sensing images. Green (1956) was probably the first researcher to
propose using individual dwelling unit counts observed from aerial
photographs for population estimation. Porter (1956), however, was the first to
actually apply this methodology (Kraus et al., 1974), with the persons-per-
dwelling unit ratio established from ground observation in their study of
Liberia. Hsu (1971) applied the same methodology for intercensal population
36
estimation of the Atlanta area, but he derived his persons-per-dwelling unit
ratio from U.S. census tract data. Collins and El-Beik (1971) and Dueker and
Horton (1971) further identified different types of residential buildings from
aerial photographs for population estimation, with their population density
statistics calculated from census data. To obtain a more accurate persons-per-
dwelling unit ratio, Lo and Chan (1980) used a field survey methodology to
calculate the average population density for various types of housing.
Furthermore, in an effort to automate the time-consuming procedure of
counting dwelling units, Lo (1989) used a raster approach to extract residential
building density, on a grid cell by grid cell basis, from high-altitude aerial and
space photographs. He first calculated the maximum possible occurrence of
dwelling units in each grid cell with reference to the dwelling unit size. Then
the percentage of occurrence of residential buildings in each grid cell was able
to be estimated. In the past, no effective ways of automatically extracting
residential buildings existed. Researchers relied on manually identifying and
counting dwelling units from high-spatial-resolution aerial photographs, even
though visual interpretation is laborious and time consuming. With the
advance of very high spatial resolution satellite images, such as IKONOS and
QuickBirds, and the improvement of feature extraction techniques
(Haverkamp, 2004), automatic extraction of dwelling units from satellite
images has become possible. Another prospect for automatic building
extraction is the advancement of 3D object extraction techniques from LIDAR
data (Rottensteiner, 2003). With these new remote sensing data and building
extraction techniques, population estimation by dwelling unit counts may
become a viable approach. Correlation with Image Pixel Characteristics Other
than the physical characteristics extractable from remotely sensed imagery,
population density can also be directly correlated to the spectral reflectance
value of image pixels. Hsu (1973) was probably the first to suggest the idea of
using imagery pixel values to develop a multiple regression model for
population estimation (Lo, 1986). His idea, however, was not implemented
37
until Iisaka and Hegedus’s (1982) pioneering work in estimating population
distribution of Tokyo, Japan. They reported that the mean spectral values of
Landzat MSS bands 4, 6, and 7 were strongly correlated to population density.
Lo (1995) adopted a similar approach while using higher resolution imagery
of SPOT, and reported a correlation coefficient of –0.91 between population
density and the mean spectral values of SPOT band 3 for the Hong Kong area.
Webster (1996) argued that the spectral values alone cannot discriminate areas
of different population densities effectively.
Alternatively, he combined numerous spectral and textural measures
from Landsat TM imagery in a regression model and found textural measures
have more significant predictive power for housing densities than spectral
measures. Harvey (2002a, 2002b) also incorporated a variety of spectral
transformation measures, such as the band-to-band ratio and difference-to-sum
ratio, in addition to textural measures, in a series of stepwise regression
models for population estimation. Further, Harvey (2000; 2002b) developed
an innovative iterated regression procedure (reviewed previously as a
dasymetric method) to improve the predictive power of a regression model
based on pixel spectral values. There are also studies using imagery texture
analysis to categorize pixels first and then correlate pixel counts in different
categories with population density, which is similar to the approach of
inferring population through land use. For example, Chen (2002) used a
homogeneity texture measure to categorize pixels of different levels of
homogeneity and correlate the number of pixels in each category to housing
densities. Correlation with Other Physical and Socioeconomic Characteristics
Other than the mentioned physical and pixel characteristics extractable from
remotely sensed imagery, numerous other physical and socioeconomic
variables can also be incorporated for population estimation. A notable
example is the LandScan Global Population Project (Dobson et al., 2000), in
which light volume from nighttime imagery, land cover derived from various
types of remotely sensed imagery, and other information about demography,
38
topography, and transportation networks have all been combined in a model to
estimate population at a 30 × 30 second (approximately 1 × 1 km) resolution.
Many of the variables can be extracted from remotely sensed imagery. Similar
approaches have also been applied in a smaller scale. For example, Liu and
Clarke (2002) found that the total population in urban areas is correlated with
distance to the CBD, accessibility to the transportation system, slope, and the
time when the residential community was first built.
Overall, the accuracy and robustness are improved with increasing
model complexity. It is worth noting that although multivariable approaches
for population estimation tend to improve the overall accuracy compared to
methods using a single variable, the selection of variables in the model
requires guidance from theories in urban geography. Many physical and
socioeconomic variables can assist the estimation of population, yet only those
attributes that can be directly or indirectly observed and extracted from
remotely sensed imagery are applicable in the remote sensing context.
Residential areas constitute a major component of such analysis. The data are
usually of two types: (1) the structural conditions of individual residential
units; and (2) attributes reflecting the residential or neighborhood
environment. Green (1957) and Green and Monier (1959) pioneered research
using aerial photograph to study socioeconomic and demographic variables.
They cited an extensive literature to demonstrate that social values are
attached to housing and residential communities and, by extension, that
observable physical data have meaningful sociological correlations.
Regardless of whether socioeconomic or pixel characteristics are used in
statistical modeling for population estimation, all studies inferring population
from remotely sensed data have reported a consistent finding, i.e., that small-
area population estimation is often not as accurate as large-area estimation. It
may be explained that overestimation and underestimation are canceled out for
large-area population estimation and thus the overall accuracy is high (Lo,
39
1995). Nonetheless, more studies are needed before remote sensing can be
applied to population estimation on an operational basis.
2.4 SUMMARY
Of all the population estimation methods, the dasymetric method is
commonly regarded as a more accurate approach, provided that the used
ancillary information gives a truthful description of where people actually live.
Furthermore, the dasymetric method is not only more accurate, but also
relatively stable. It is robust to the variation of population density associated
with a certain type of land use, as well as the anomaly of highly urbanized but
sparsely inhabited areas (Fisher and Langford, 1996). The reason is because
the volume-preserving property preserves the population of the source unit in
the transformation to raster representation, and thus all associated errors are
inherently limited to variation within each individual source unit. The
dasymetric method used with remote sensing is also robust to imagery
classification error. Fisher and Langford (1996) reported that errors of up to
40% in the classified TM image still yield better estimates of the interpolated
populations than other regression or surface methods they tested. The reason
for the relative robustness of the dasymetric method under classification error
is due to the aggregated error within zones. Specifically, even if the
classification error is high, the frequency of pixels in different land classes
may not vary significantly within a zone. As observed by Donnay and Unwin
(2001, p. 220), “even though individual pixels may have a weak probability of
being correctly assigned to a land use category, when aggregated into a target
zone for density estimation, the relative frequencies within these target zones
do not degrade substantially.” Relevant empirical studies by Lo (1995),
Webster (1996), and Harvey (2002b) also indicated that classification errors at
the pixel level can be high without impacting the accuracy of areal population
estimates. In this review, we separated population estimation methods into
areal interpolation and statistical modeling. It is worth noting that the
40
statistical modeling approach can also be incorporated into the dasymetric
method. For example, Langford et al. (1991) described a dasymetric procedure
based on five types of land use classified from TM multispectral imagery, with
their average population densities derived from regression analysis. Yuan et al.
(1997) also applied a multivariate regression model to correlate census block-
group populations with different land use areas classified from Landsat TM
images in their dasymetric study.
Studies on population issues generally use census data as the primary
data source. The census, however, may not be applicable to the intended
purpose of these studies. This is because in many counties, including the
United States, the census population figure is actually a de jure population, in
contrast to a de facto population. A de jure population reports all usual
residents of the given area, whether or not they are physically present there at
the reference date. A de facto population, in contrast, reports all persons
physically present in the area at the reference date. The U.S. census is a de jure
census because it is based on people’s home address, rather than where they
work or travel during the day, or if people are out of town. The U.S. census is
mainly concerned with residential populations and the daytime population
distribution can be very different from that described by the census. For
example, Las Vegas has a much higher daytime population than that reported
by the census because of its high proportion of tourists. Since some
applications (e.g., emergency response) require knowledge of daytime
population whereas others (e.g., urban growth) require residential population,
it is desirable that both types of population be estimated. Unfortunately, to
date little research has attempted to model daytime population. Theoretically,
urban land use information is more related to daytime than nighttime
population because land uses such as industrial, commercial, and recreational
provide information about where people are during the day.
41
With the availability of high-spatial-resolution commercial images,
such as QuickBird and IKONOS, as well as the advancement of image
processing techniques, improvement in population estimation accuracies is
expected.
42
CHAPTER THREE
METHODOLOGY
3.1 PROJECT PLAN
The figure below shows the flow chart of the project plan.
Figure 3.1Flowchart of Project plan
In this project termed Application of GIS and Remote sensing in the
Population Study of Achara Layout Enugu, Enugu State, a well analyzed
methodology, and appropriate data structured models were carried out, using
relevant data, software and hardware. Since, maps are produced by the
combined efforts of many professions using a variety of technologies (Lo and
Yeung, 2002). Typically the execution of this project span through project
planning , data acquisition, database creation and data processing .Therefore,
the input and output specifics used in the work are as explained below:
43
3.2 PROJECT PLANNING
The planning of a survey project has a number of unique requirements
depending on its area, purpose, and overall goal. GIS technology helps fulfill
these demands by facilitating traditional tasks more efficiently and easily
accomplishing new tasks that were previously impractical or impossible
(ESRI). In this project a trip was made to Achara layout Enugu for field
reconnaissance to determine the social-impact on the study area. The GPS
coordinate was taken in order to source the spatial data for the study area, the
road networks also was observed. In this population study, critical interview
was made with the population about their dialect, culture, and other
information. These enhanced capabilities eliminate redundant efforts and
promote coordination with the execution of the project. The benefit of the
project planning is:
i. Automate tasks to increase efficiency and save money.
ii. Plan effectively for site location.
iii. Access vast amounts of publicly available geospatial data.
iv. Provide Decision support.
v. View Historic data.
vi. Integrate Survey projects in a single Database
3.3.1 DATA ACQUISITION
The key requirements for population management have been
enumerated to include, maps,census data and GIS tools (Ayeni et al, 2001). In
this project the following data were collected and used in the development of
the GIS / Remote sensing application in population study of Achara Layout,
Enugu, Enugu state. A High-resolution satellite image data (2008 Quickbird
0.5 meter in multi-spectral band) covering the study area and layout map of
the study area at scale of 1: 2000 all collected from Enugu state ministry of
land, survey and Urban Development . The data gotten from questionnaires
44
given out, were used to generate the desired products (Tables and Maps) in an
interactive GIS environment. Field (site) capturing of data were based on basic
survey principles, techniques and instruction. A handheld Garmin GPSMAP
76c receiver was employed in the traverse for point positioning. Also, A420
Cannon power shot Digital Camera was used to picture the area, so as to aid
better visual appreciation and spatial analysis of the area.
Figure 3.2 Map Of Achara Layout As Designed And Subdivided By Town
Planning Unit Of Ministry Of Land, Enugu
45
Figure 3.3 Quickbird satellite image of Enugu metropolis covering the study area
3.4 TOOLS AND COMPONENTS USED
The following tools and components were used for the project;
Hardware used
S/N NAME DESCRIPTION
1. HP Laptop &
accessories
Windows 7, Intel Pentium
2. Hand held GPS Garmin 72s
3. Digital Camera Canon
4. Printer HP Deskjet K7100 series and T1110 Series
5. Scanner Smart LF Colortrac
46
Software used
S/N NAME DESCRIPTION
1. ArcGIS 9.3 Environment
2. Microsoft Excel 2007
3. Microsoft word 2007
4. Microsoft Access 2007
3.5 DATA SOURCES
The following are the main data sources used in this project;
1. Interview: Interviews were carried out on key staff of National
Population census to ascertain how long a population data can stay
before an update is needed.
2. Enugu State Ministry of Lands and Urban Planning: A map of Achara
Layout was used. The scale used to create the map is 1:2000. The map
shows the street network, subdivision of plots and blocks etc.
3. Field work: Spatial data was also collection from field work methods
using Hand held GPS.
The data types collected were primary and secondary data. Primary data
was collected by administration of structured questionnaires. For the
distribution of questionnaires, the study area was divided into 71 blocks based
on the parcellation of the layout by blocks by the Urban and town planning
design. Each block was allocated at least 20 questionnaires bringing the total
number of administered questionnaires to 1420. A total of 1420 questionnaires
were however responded to number of building counting through the high
resolution satellite image. A GPS was also used in collecting data on the
geographic position as well as various relevant information concerning the
47
population of the area; the aspect socio-economic, cultural ethnic was also
analyzed during the field inspection.
3.6 DATA PROCESSING
3.6.1 Scanning
The Achara layout map from Enugu State Ministry of Lands and
Urban Develpoment was scanned through a large scanner. In order to be
scannable, a map should be in a very good condition with minimum text on it.
Hardcopy parcel maps are converted to digital form using a scanner. A
scanner recognizes the black or white value of each pixel location on a map.
The scanned map was saved in a storage media and later saved in a computer.
3.6.2 Data Georeferencing
The secondary data used included, layout map of the study area to the
scale 1:2000, obtained from ministry of land survey and Urban development
and the Quickbird high resolution satellite imagery was used to in extraction
of block and building through the process of vectorization using GIS
technology. The block layer and building layer were created in Arc-Catalogue
and then imported to Arc Map environment, the two raster dataset (image and
map) was also imported to the same arc map environment. Firstly the arc map
environment was reference to projected coordinate, WGS 84, UTM Zone 32 in
hemisphere North. The georeferening toolbar was turned on in other the
georeference the layout map. The layer target was set to Achara layout map
and then the add control point tool was picked to assign the right coordinate to
the four corner of the map (fig 3.3). After this process the Updating
georeference was chosen to validate the georeferenced map. Achara Layout
map was georeferenced in ArcGIS 9.3. Using the control points field survey,
points from fiducial points were chosen for purpose of Georeferencing. Four
points were added at the extreme four corners of the map.
48
To achieve this, the following were done;
1. Click the View menu, point to Toolbars, and click Georeferencing.
Figure 3.4 Selecting Georeferencing in ArcMap
2. Click the Layer dropdown arrow
3. Click the Georeferencing dropdown arrow and click Fit To Display
Figure 3.5 Georeferencing toolbar
ArcMap scales the image to fit in the current window.
Since the window is currently zoomed to the four street intersections that
match the registration marks on the image, the streets and the image are
displayed at approximately the same scale. You can see, though, that the
49
control points aren’t located exactly at the intersections. Add four links to
register the image. To make it easier, use a magnifier window. You can add
control points within the magnifier window
4. Click Window and click Magnifier
5. Click the Add Control Points button on the georeferencing
toolbar.
The cursor turns into a crosshair.
6. Drag and center the magnifier window over the registration
mark in the upper right, labeled 602, and release the mouse
button. If necessary, reposition the window so you can see both
the registration mark and the corresponding street intersection.
Note that the registration mark and intersection may be in
slightly different positions on your map.
7. Center the cursor over the registration mark and click. A green
control point is added to the image. Move the cursor away from
the control point but dont click again. A line stretches from the
control point as you move the cursor. This is the link that will
connect the other end of it to the corresponding street
intersection.
8. Center the cursor over the intersection of the street layer (you
can see the link stretch as you do this) and click.
Figure 3.6 3positioning the cross hair
50
9. Click the View Link Table button on the Georeferencing
toolbar.
For each link, the table lists the x- and y-coordinates for the source (the
scanned image) and the corresponding coordinates for the map (the streets
layer). If you make a mistake and need to delete a link, select it and click the
Delete button, which looks like the letter x.
10. Click Cancel to close the Link Table. The other points can be
added to the other corners of the map.
11. After adding the four points, Click the Georeferencing
dropdown arrow and click Update georeferencing to save the
new registration.
Figure 3.7 georefrencing the scanned map
51
3.6.3 Data Vectorization
In the vectorisation process the editor toolbar was turned on , the start editing
was chosen to start the digitizing process firstly the editor target was set to
Block and the sketch tool pencil was picked to digitize all the Block parcel
through the scanned layout map (fig 3.4) the same process of digitizing was
made to digitize all the building in the study area through satellite imagery.
The field information and the information figured on the scanned map was
used in building the attribute information for each of the extracted data. In
other side the field data was used in database design using Microsoft excel, the
Block_id was used to define the primary Key in which the database will based
when link to the spatial data in ArcGis Environment. Various maps were
produced base on the population study analysis. The query by attribute also
was used to determine some critical information based on the population study
data. The Microsoft excel was used in compilation of the statistic analysis,
multiple table, histogram and chart was therefore produced.
52
Figure 3.8 Digitizing Of Block Parcel Through Layout Map Of The Study Area
Figure 3.9Digitizing Of Building Through Satellite Imagery Of The Study Area
3.6.4 Shapefiles in ArcGIS
Shapefiles are useful for mapmaking and some kinds of analysis. A
great deal of geographic data is available in shapefile format. Shapefiles are
simpler than coverages because they do not store full topological associations
among different features and feature classes. Each shapefile stores features
belonging to a single feature class.
3.6.4.1 Features in shapefiles
Shapefiles have two types of point features: points and multipoints. They have
line features that can be simple lines or multipart polylines. They also have
area features that are simple or multipart areas called polygons.
53
Point shapes are simply single-point features such as wells or monuments.
Multipoint shapes are collections of points that all represent one feature. A
group of small islands could be represented as a single multipoint shape.
Line shapes can be simple continuous lines such as a fault line on a map. They
can also be polylines that branch such as a river. Line shapes can also have
discontinuous parts.
Polygon shapes can be simple areas such as a single island. They can also be
multipart areas such as several islands that constitute a single state. Polygon
shapes can overlap, but the shapefile does not store topological relationships
between them. The market areas of two stores could be represented as
overlapping polygon shapes.
3.6.5 Digitizing a map
1. Zoom in to the Block enclosed by street by using the Zoom in
tool on the Tools toolbar to draw a box around the Block
boundary.
Add a bookmark to use when you start digitizing the boundary.
2. Click the View menu, point to Bookmarks, and click Create.
Name the bookmark Block Building and click
OK.
3. Click the Editor Toolbar button.
4. Click Editor and click Start Editing.
5. Click project\Population Studies.mdb, and then click OK.
Now set the snapping environment so the new park boundary
will align exactly with the existing parcel boundaries.
6. Click Editor, and then click Snapping.
7. Check the box in the Vertex column for pipeline layer.
54
This will snap the editing cursor to the vertices of the parcels.
8. Close the Snapping Environment dialog box.
The snapping tolerance defines how close the cursor must be to
an object before it snaps to that object. You can change the
snapping tolerance by choosing Options from the Editor menu.
3.6.6 Database Structure for Population Census
The effective implementation of any geospatial project lies on the
proper planning and designing of spatial database. The process of designing a
database is called data modeling. According to (Kufoniyi 1997), he defined
database as the process by which the real world and their inter-relationship are
analyzed and modelled in such a way that maximum benefit are derived while
utilizing a minimum amount of data. Improper design often leads to
implementation problems. The attribute database for the population study was
modeled and developed for the purpose of interactiveness based on relational
data model. A relational database is efficient and flexible for data search, data
retrieval and creation of tabular reports (Chang, 2007). Each table in the
database was prepared, maintained and edited separately from other tables.
The tables remained separated until linked up by query or analysis. The
respective questionnaire forms were used for the basis of table creation and
database modeling. In addition some forms were created digitally using
attribute information from the scanned map, the Database Management
System capability in which platform the databases were modeled internally
and linked using the in-built table capabilities in the ArcGIS environment.
Though, external database tables created with Microsoft Excel was joined with
the spatial table of the GIS software (SQL server) which are the default
DBMS in ArcGIS for data manipulation and analysis. In the process, internal
tables the entities and attributes) were created and linked to Arc-GIS shape file
using geo-relational technology. The modeled tables were created in
consideration to linking mechanisms since the tables were properly
55
normalized (Ndukwe, 2001). The building database was modeled as the
primary table; while others were modelled as secondary tables. Unique fields
that serve as keys were inserted accordingly. Although there are few entries in
which tables that make up the database ordinarily does not necessitate
indexing. The entries in the table were indexed. This is because the system if
fully deployed is supposed to handle population data, which has complex and
large information
Figure 4.0 Process of joining table to spatial data in GIS environment
3.6.6.1 Conceptual Design
Conceptual database design involves modeling the collected information at a
high-level of abstraction without using a particular data model. The conceptual
design allows the following;
56
• Independent of DBMS.
• Allows for easy communication between end-users and
developers.
• Has a clear method to convert from high-level model to
relational model.
• Conceptual schema is a permanent description of the database
Requirements
3.6.6.2 Entity-Relationship model (ER)
An ER diagram is a diagram that helps to design databases in an efficient way.
This model is has the following importance;
i. Most popular conceptual model for database design
ii. Basis for many other models
iii. Describes the data in a system and how that data is related
iv. Describes data as entities, attributes and relationships
The following diagrams show the association between various entities;
Figure 4.1 Building ER diagram
Building
Type
Street Name
Purpose Building I.D
57
Figure 4.2 Block ER diagram
Figure 4.3 Population ER diagram
3.6.6.3 Logical Design
In this design, a review of the materials from earlier phases is made and the
system design begins. This involves identifying any additional system objects,
determining operations and data structures for all objects, validating
relationships and interactions between objects etc. The diagram shows a
logical structure and relationship among entities;
BLOCK
No of Building
Street name
Area Block-ID
Population
Gender
Density Unit-ID
Gender Ratio
58
Figure 4.4 Relationship between entities
3.6.6.4 Physical Database Design
The physical design of the database specifies the physical configuration of the database on the storage media. This includes detailed specification of data elements, data types, etc. Using ArcGIS 9.3 and Excel a database was created see figure 3.6.6.5 & figure 3.6.6.6
Building
-Building ID
-Purpose
-Street name
-Type
Block
-Block ID
-No of building
-Size
-Street Names
Fitting
-Name
-Diameter
-Type
Population
-Population ID
-Gender
-Gender ratio
-Density
Passes
through
Connects
Controls
59
Table 1 Geodatabase of extracted house from QuickBird satellite imagery of achara
Layout.
Table 2 Field Survey data of achara layout using Microsoft Exce
60
3.7 USER REQUIREMENT
There is a high level of frustration regarding access to information about
the Population system in Nigeria, since so little of the information is
automated and readily available. Paper records are cumbersome and data
maintenance is difficult. There is concern that National Population Census
office is not integrated with GIS and that it’s data may not fulfill the needs for
a complete and accurate source of Population data and information.
Essentially, the Census data records are a disparate set of records in various
methods of storage and various states of accuracy and completeness. This
creates a large gap between express user needs and the possibility of achieving
them.
Furthermore, there is a need to have information for all populates for
the purpose of planning, budgeting, and allocation of basic amenities. Not all
populates are counted in the most recently concluded Census of Nigeria.
3.8 DATABASE DESIGN FOR ACHARA LAYOUT
After the verification exercise, a database summarizing the whole exercise
was created. It was stored in a microsoft excel format.
61
Table 3 Field Survey data of achara layout using Arc GIS
62
CHAPTER FOUR
RESULTS, DISCUSSION OF RESULTS AND ANALYSIS
4.1.1 ANALYSIS
The census data and the spatial data must be integrated in order to give
an accurate perspective on the number of people living in any geographical
area so that government at various levels can successfully carry out adequate
planning and management techniques that will enhance the quality of lives of
the populace (Ayeni & al 2001). In this project All of the quantitative analysis
we performed on the dataset for the population study was done using standard
statistical procedures in ArcGIS and Microsoft excel, such as quantities
symbolization (Graduated color, Graduated symbol, Proportional Symbol and
Dot density) also the chart symbolization was used, the table and histogram
was generated. For subsequent analyses, we needed to aggregate variables or
create new ones. For instance, by issuing appropriate querying commands,
following tasks can be performed on data obtained: Size of the population,
Patterns and trends of demographic events, Planning and Location of
Infrastructures, Transportation planning and traffic routing, Map out and
control spread of certain diseases and epidemics, Manpower development
planning and Other supportive information or tasks that can be performed
through the Query processes.
4.1.2 Estimation of Population Data Per Household through satellite
image
Satellite imagery has been used with respect to population estimation.
For example, Google Earth satellite images have been used to estimate coarse
population densities at the city and village levels (Javed & al 2012). By
identifying features such as dwelling units and residential areas satellite
63
images have also been applied for the purpose of population estimation
(Alsalman & al. 2011). The “night satellite” imagery was used to estimate
population sizes according to the local densities of light sources (Cheng & al
2007).For the first time, the population of an entire city has been estimated
from space through a pioneering project to speed up medical and disaster relief
efforts. By analysing satellite images of disaster zones and famine, war or
disease-hit areas where census data is unreliable, or cities unreachable,
researchers hope to make rapid and more accurate estimates of how many
people might need help. Surveyors, still had to visit households to estimate
how many people typically lived in each kind of building, but they could then
make a city-wide estimate by counting the total number of each type of
dwelling in the satellite image, either through a computer automated analysis
or by manual counting(Andy 2013).
In this project, the field work was carried out in the study area in order the
determine the type of building appearing in the satellite imagery (story
building or bungalow) and the number of the room was estimated. The
standard family count (number of children and parent per room) was used to
estimate the population per household. The organized population census
database per household was then imported to Arc Gis 9.3 environment for
further analysis.
64
Table 4.1 showing the sample census per building in the study area
4.1.3 Estimation of Population Study Data by Block Parcel
In the oxford dictionary the block parcel is defined as the length of one
side of a piece of land or group of buildings, from the place where one street
crosses it to the next. The block has been used many year ago to estimate the
population of a particular area. In 1970 it was used as a grouping of census
block within a census track or enumeration district designated for
measurement to determine if the density of population might be high enough
for the parcel to be included in urbanization area (Frederick,2013).in this study
the database generated from the field questionnaire was link to the building of
the study area using building_id as a key for joining table, the selection by
location was used in assigning the block code number for each building. The
summary of the building population for each block was then calculated the
block area was also populated in square meter using calculate geometric in
Arc Gis 9.3 to calculate the density (see table 4.1). in Arc Gis 9.3 the Block
dataset was enhanced using graduated colors to display the spatial
65
concentration of the population by block. The map derived presents the block
parcel in five classes with a graduated color from very low to very high (very
low, low, normal, high, very high). The shallow color represents the very low
and the thick color represents the very high population per block (see fig 4.2).
the value between 125 to 400 represent the very low population , the value
between 401 to 800 represent the low population , the value between 801 to
1200 represent the normal population, the value between 1201 to 1600
represent the high population and the population between 1601 to 2000 is the
very high population per block.
Figure 4.5 MAP OF THE POPULATION CONCENTRATION OF THE STUDY
AREA PER BLOCK PARCEL
66
67
Table 4.2 showing detail of the population of the study area
per block parcel
4.1.4 Population Density
Population density according to the world bank is midyear population
divided by land area in square kilometers. Population is based on the de facto
definition of population, which counts all residents regardless of legal status or
citizenship--except for refugees not permanently settled in the country of
asylum, who are generally considered part of the population of their country of
origin. Land area is a country's total area, excluding area under inland water
bodies, national claims to continental shelf, and exclusive economic zones. In
68
most cases the definition of inland water bodies includes major rivers and
lakes. In this study the density was calculated in ArcGIS 9.3 using field
calculation. The result was enhanced using graduated symbol measured with
side. The output map displayed block with a five different graduated point size
in centre, the block with very small size represents the area with low density
while the area with very big size represents the area with very high density.
The value between 0.61 to 1.40 is the block with very low density, the value
between 1.33 to 1.40 represent the block with low density, the value between
1.40 to 1.66 represent the block with normal density, the value between 1.66
to 2.03 represent the block with high density and the value between 2.03 to
2.56 is the block with very high density (see fig 4.3 ) . in the histogram (fig
4.4) below the block CXIV and block XXXVII are the area with very high
density following by block XC and XXVIIA.
Figure 4.6 map showing detail of the population density of the study area per
block parcel
69
Figure 4.7 2006 Population census Result For Enugu State
70
Figure 4.8: 2006 Population census Result For Enugu South LGA State
4.2 COMPARATIVE ANALYSIS BETWEEN NATIONAL POPULATION CENSUS AND CENSUS FROM GIS TECHNOLOGY
The difference between observed houses from field surveys and that
counted from QuickBird image was found to be 1.62% with average
difference 772 for Achara Layout. Counting of houses was easy from the
Multi- spectral QuickBird image. Houses were easily counted from the image
because of their well-defined shape, distinguishable size, contrast with their
surroundings (concrete roofs had high brightness), widely spaced, and prior
knowledge with the area. Table 4.1 shows comparison between NPC census
2015 and Census counted from Quick Bird 2015 in Achara Layout. The table
71
shows that the difference in population census is not much is about 1.61%. this
shows that Remote sensing and GIS is strongly recommended in population
census survey.
Table 4.4 Showing Comparative analysis between Nation Population 2015 data Census and Population extracted Using GIS technology
Population from 2015 Census
Computed population from Quick Bird
Difference Percentage of the Difference
Values 48394 47622 772 1.62
72
CHAPTER FIVE
CONCLUSION AND RECOMMENDATIONS
5.1 CONCLUSION
The integration of remote sensing data with GIS technology for
population studies is vital because each has strength in certain aspects related
to population studies. Since this study utilized both remote sensing and GIS, it
can be said as integrated approach to study population in Enugu, Nigeria.
Remote sensing was used mainly to estimate the population and classify land
use types whereas GIS was used for modelling the relationship among
population variables and the whole study provides a typical approach to obtain
the geographical distribution of population in one city in the Nigeria. This
approach can be replicated for other urban areas in Nigeria. Remote sensing
and GIS technology is slowly becoming cost effective, easy to use, and a
viable technology that can produce fast, reliable, and lowcost alternative for
population estimation. Therefore, its use by census departments, especially in
highly developing countries is the need of the present time. The remote
sensing on its own provides no information relating to population metrics and
it needs to be combined with census data and ground survey in a GIS
environment.
5.2 LIMITATIONS OF USING REMOTE SENSING METHODS
FOR POPULATION STUDIES ENCOUNTERED:
Despite the positive developments in remotely sensed data (high
spatial, spectral, and temporal resolutions), there are still some limitations with
respect to the use of these data for monitoring urban areas in general (Donnay,
1999; Ehlers, 1995) and population studies in particular (Baudot, 2001). The
limitations include: Spatial resolution: The single most important technical
issue in urban remote sensing is the spatial resolution of the image data
73
(Welch, 1982). The last thirty years have seen development in the spatial
resolution of satellite images form 80 m (Landsat MSS) to 10 m (SPOT HRV)
and to 1 m (IKONOS).
The improvement in spatial resolution allows greater discrimination of
the urban fabric. However, the use of very high spatial resolution images brings
with them some major problems such as availability in panchromatic
mode and large size (storage, processing time) (Donnay et al., 2001). Although
some solutions like image fusion and image compression had been made to
minimize and solve these problems. These solutions are not commonly
available for practitioners in the census field.
5.2.1 Image Classification
The heterogonous nature (mixed pixels) of urban environment and the
possibility that identical spectral reflectance values can correspond to very
different land uses and functions poses a classification problem (Baudot, 2001).
Efforts have been made to solve these problems, for example, by coupling
automatic and semi-automatic classification, improving the results achieved
using traditional per-pixel classification algorithms by using a priori
probabilities or aposteriori processing, using of new algorithms such as soft
classifier-Bayesian probabilities-fuzzy sets, combining spectral data with
measures of urban form and texture, and using of knowledge–based methods
and artificial neural networks (Zang and Foody, 1998). However, these
techniques are experimental and not yet included in standard image processing
software (Donnay et al., 2001).
5.2.2 3D Nature of Urban Areas
Imaging 3D buildings from satellites suffers from three problems. The
first one is related to displacement of buildings from their true location (relief
displacement), the second one is related to obscuring of lower buildings by
higher ones, and the third is related to shadow (Hartl and Cheng, 1995).
Although efforts have been made to minimize these problems, up-to-date
74
images such as those from IKONOS suffer from these problems. Due to the
above limitations, remote sensing has to be used in conjunction with other
sources of information (maps, census data, field survey) and GIS to provide
better results (Mesev, 1998). In addition to that, visual interpretation has to be
used to strengthen the digital image classification.
5.3 RECOMMENDATION
1. Sensitization of essential staff of National Population Census on GIS: The understanding of the role GIS will play is important at all levels in the organization. Everything is about people. People play an important part in a successful implementation of GIS. National Population Corporation needs GIS-leaders, so called champions on different levels. GIS-champions are charismatic people, people with an intuitive sense about what direction GIS could take, people who are able to communicate this through all levels. These people are rare. Find them, or keep them. You cannot create them at will. Needless to say, there should be informal communication between these champions. Our observation is that a census mapping process is akin to a typical geographical information process. There is, therefore, justification for the integration of GIS with census mapping operations at all the stages of the process. In doing so, we go beyond the technological approach to reflect the inherent nature of census mapping activities, which is mainly geographic. Obviously, the integration can be undertaken gradually. Therefore, we advocate a paradigm shift from a traditional mapping approach, which is repeated for every census, to a digital census mapping approach, which is an up-to-date and a continuous approach. We believe that an up-to-date approach will allow census planners to rethink, inter-alia, the periodicity of census taking, the frequency of inter-censal surveys, and eventually the way to conduct census itself (some countries are already reconsidering the way to conduct a whole census countrywide every 10 years).
2. Education and training: There is need for education and training as they are essential for the introduction of GIS. If users feel uncomfortable with training or use of GIS, it will not easily be adopted. A good training plan will facilitate GIS implementation. In order to achieve the full benefits of owning a GIS, it is necessary to have a substantive expertise, GIS skills, and an understanding of basic geographic and cartographic principles.
75
3. Proper data management: A GIS without data or information is useless. Especially in a corporate approach, data management is important. There are many aspects of data management (in a GIS environment), important aspects are: co-ordination, quality, data storage, data archiving and access to data. GIS gives the possibility to analyze, integrate and summarize these data. With GIS new possibilities arise. Data can be collected, stored, and maintained once, to the benefit of all authorized users. The opportunities for information sharing are an enormous benefit and beside the quality of data, data agreements, methods and standards are important for successful implementation.
4. Provision of Hardware and software: It is anticipated a suite of software and hardware will be necessary to successfully deploy the enterprise GIS. While users may already have some desktop application of GIS, the existing inventory would need to be evaluated in terms of its ability to meet the requirements found during the needs assessment phase. Where there are gaps, new software will be required. In addition, it is anticipated that hardware purchases in the form of servers will be necessary to adequately store the centralized database.
5. GIS implementation by National Population Commission: Implementation of GIS in census data capture by National Population Census and various Government parastatals is very necessary. Although the implementation of GIS at the initial stage is expensive as implementation requires various hardware and software components as well as training of staff but since its database is flexible, it would save cost in future census mapping for the country.
76
REFERENCES
Adeniyi, P.O. (1983). An Aerial Photographic Method For Estimating Urban
Population. Photogrammetric Engineering and Remote Sensing, 49: 545-560.
Bob Booth and Andy Mitchell, (2004), Getting started with ArcGIS
Baudot, Y. (2001), “Geographical Analysis of the Population of Fast-Growing Cities in the Third World,” in Remote Sensing and Urban Analysis
Bracken, I. (1991), “A Surface Model Approach to Small Area Population Estimation,” Town Planning Review, 62(2):225-237.
Bracken, I. and D. Martin (1989), “The Generation of Spatial Population Distributions from Census Centroid Data Source,” Environment and Planning A, 21(4):537-543.
Chen, K. (2002), “An Approach to Linking Remotely Sensed Data and Areal Census Data,” International Journal of Remote Sensing, 23(1):37-48.
Clark, C. (1951), “Urban Population Densities,” Journal of the Royal Statistical Society, 114:490-496.
Clayton, C. and J. Estes, 1980, “Image Analysis as a Check on Census Enumeration Accuracy,” Photogrammetric Engineering and Remote Sensing, 46:757-764.
Collins, W. G. and A. H. A. El-Beik (1971), “Population Census with the Aid of Aerial Photographs: An Experiment in the City of Leeds,” Photogrammetric Record, 7:16-26.
Dempster, A. P., Laird, N. M., and D. B. Rubin (1977), “Maximum Likelihood from Incomplete Data via the EM Algorithm,” Journal of Royal Statistics Society Series B–Methodological, 39(1):1-38.
Dobson, J. E., Bright, E. A., Coleman, P. R., Durfee, R. C., and B.A. Worley (2000), “LandScan: A Global Population Database for Estimating Populations at Risk,” Photogrammetric Engineering and Remote Sensing, 66(7):849-857.
77
Donnay, J. P. and Unwin D. (2001), “Modeling Geographical Distributions in Urban Areas,” in Remote Sensing and Urban Analysis
Dueker, K. and Horton, F. (1971), “Toward Geographic Urban Change Detection Systems with Remote Sensing Inputs,” Technical Papers, 37th Annual Meeting, American Society of Photogrammetry, 204-218.
Eicher, C. L. and Brewer C. A. (2001), “Dasymetric Mapping and Areal Interpolation: Implementation and Evaluation,” Cartography and Geographic Information Science, 28(2): 125-138.
Fisher, P. F. and Langford M. (1995), “Modeling the Errors in Areal Interpolation between Zonal Systems by Monte Carlo Simulation,” Environment and Planning A, 27:211-224.
Fisher, P. F. and Langford M. (1996), “Modeling Sensitivity to Accuracy in Classified Imagery: A Study of Areal Interpolation by Dasymetric Mapping,” Professional Geographer, 48(3):299-309.
Flowerdew, R. and Green M. (1989), “Statistical Methods for Inference between Incompatible Zonal Systems,” in Accuracy of Spatial Databases
Flowerdew, R. and Green M. (1991, “Data Integration: Statistical Methods for Transferring Data between Zonal Systems,” in Handling Geographical Information: Methodology and Potential Applications
Forster, B.C. (1983), “Some Urban Measurements from Landsat Data,” Photogrammetric Engineering and Remote Sensing, 49:1693-1707.
Green, N. E. (1956), “Aerial Photographic Analysis of Residential Neighborhoods: An Evaluation of Data Accuracy,” Social Forces, 35:142-147.
Green, N. E. (1957), “Aerial Photographic Interpretation and the Social Structure of the City,” Photogrammetric Engineering, 23:89-96.
Green, N. E. and Monier R. B. (1959), “Aerial Photographic Interpretation of the Human Ecology of the City,” Photogrammetric Engineering, 25:770-773.
Harvey, J. T. (2000), “Small Area Population Estimation Using Satellite Imagery,” Statistics in Transition, 4(4):611-633.
78
Harvey, J. T. (2002)a, “Estimating Census District Populations from Satellite Imagery: Some Approaches and Limitations,” International Journal of Remote sensing, 23(10):2071–2095.
Harvey, J. T. (2002)b, “Population Estimation Models Based on Individual TM Pixels,” Photogrammetric Engineering and Remote Sensing, 68(11):1181-1192.
Haverkamp, D. (2004), “Automatic Building Extraction from IKONOS Imagery,” Proceedings of ASPRS 2004 Conference, Denver, Colorado, May 23-28, 2004 [available online at http://www.spaceimaging.com/techpapers/default.htm].
Holt, J. B., Lo, C. P., and Hodler T. W. (2004), “Dasymetric Estimation of Population Density and Areal Interpolation of Census Data,” Cartography and Geographic Information Science, 31(2):103-121.
Hsu, S. Y. (1971), “Population Estimation,” Photogrammetric Engineering, 37:449-454.
Hsu, S. Y. (1973), “Population Estimation from ERTS Imagery: Methodology and Evaluation,” Proceedings of the American Society of Photogrammetry 39th Annual Meeting, 583-591.
Huxley, J. (1932), Problems of Relative Growth, London, UK: Methuen, 356 p.
Kramer, C. (1958), “Population Density Patterns,” Chicago Area Transportation Study (CATS) Research News, 2:3-10.
Kraus, S. P., Senger, L. W. and Ryerson J. M. (1974), “Estimating Population from Photographically Determined Residential Land Use Types,” Remote Sensing of Environment, 3(1):35-42.
Lam, N. S. (1983), “Spatial Interpolation Methods: A Review,” The American Cartographer, 10(2):129-149.
Langford, M., Maguire, D. J., and Unwin D. J. (1991), “The Areal Interpolation Problem: Estimating Population Using Remote Sensing in a GIS Framework,” in Handling Geographical Information: Methodology and Potential Applications
79
Langford, M. and Unwin D. J. (1994), “Generating and Mapping Population-Density Surfaces within a Geographical Information-System,” Cartographic Journal, 31(1):21-26.
Lee, Y. (1989), “An Allometric Analysis of the United States Urban System: 1960-80,” Environment and Planning A, 21(4):463-476.
Lisaka, J. and Hegedus, E. (1982), “Population Estimation from Landsat Imagery,” Remote Sensing of the Environment, 12:259-272.
Liu, X. (2003), Estimation of the Spatial Distribution of Urban Population Using High Spatial Resolution Satellite Imagery, Ph.D. thesis, University of California, Santa Barbara, 175 p.
Liu, X. and Clarke K. C. (2002), “Estimation of Residential Population Using High Resolution Satellite Imagery,” Proceedings of the 3rd Symposium in Remote Sensing of Urban Areas
Lo, C. P. (1986) Applied Remote Sensing, New York, NY: Longman, 393 p.
Lo, C. P. (1989), “A Raster Approach to Population Estimation Using High-Altitude Aerial and Space Photographs,” Remote Sensing of Environment, 27(1):59-71.
Lo, C. P. (1995), “Automated Population and Dwelling Unit Estimation from High Resolution Satellite Images—a GIS Approach,” International Journal of Remote Sensing, 16(1):17-34.
Lo, C. P. (2002), “Urban Indicators of China from Radiance-Calibrated Digital DMSPOLS Night time Images,” Annals of The Association Of American Geographers, 92(2):225-240.
Lo, C. P. (2003), “Zone-Based Estimation of Population and Housing Units from Satellite-Generated Land Use/Land Cover Maps,” in Remotely Sensed Cities, Mesev, V. (Ed.), London, UK/New York, NY: Taylor & Francis, 157-180.
Lo, C. P. and Chan H. F. (1980), “Rural Population Estimation from Aerial Photographs,” Photogrammetric Engineering and Remote Sensing, 46:337-345.
Lo, C. P. and Welch R. (1977), “Chinese Urban Population Estimates,” Annals of the Association of American Geographers, 67(2):246-253.
80
Martin, D. (1989), “Mapping Population Data from Zone Centroid Locations,” Transactions of the Institute of British Geographers, 14(1):90-97.
Martin, D. (1996), “An Assessment of Surface and Zonal Models of Population,” International Journal of Geographical Information Systems, 10(8):973-989.
Martin, D. and Bracken I. (1991), “Techniques for Modeling Population-Related Raster Databases,” Environment and Planning A, 23(7):1069-1075.
Mennis, J. (2003), “Generating Surface Models of Population Using Dasymetric Mapping,” Professional Geographer, 55(1): 31-42.
Monmonier, M. S. and Schnell G. A. (1984), “Land-Use and Land-Cover Data and the Mapping of Population Density,” The International Yearbook of Cartography, 24:115-121.
Newling, B. E. (1965), “Urban Growth and Spatial Structure—Mathematical Models and Empirical Evidence,” Annals of the Association of American Geographers, 55(4):637-637.
Nordbeck, S. (1965), “The Law of Allometric Growth,” Michigan Inter-University Community of Mathematical Geographers, Discussion Paper No. 7.
Parr, J. B. (1985), “A Population-Density Approach to Regional Spatial Structure,” Urban Studies, 22(4):289-303.
Porter, P. W. (1956), Population Distribution and Land Use in Liberia, Ph.D. thesis, London School of Economics and Political Science, London, UK, 213 p.
Prosperie, L. and Eyton R. (2000), “The Relationship between Brightness Values from a Nighttime Satellite Image and Texas County Population,” Southwestern Geographer, 4:16-29.
Rase, W. (2001), “Volume-Preserving Interpolation of a Smooth Surface from Polygon-Related Data,” Journal of Geographical Systems, 3(2):199-213.
Rottensteiner, F. (2003), “Automatic Generation of High-Quality Building Models from Lidar Data,” IEEE Computer Graphics and Applications, 23(6):42-50.
81
Stewart, J. Q. and Wartnz W. (1958), “Physics of Population Distribution,” Journal of Regional Science, 1:99-113.
Sutton, P., Roberts, C., Elvidge, C., and Meij H. (1997), “A Comparison of Nighttime Satellite Imagery and Population Density for the Continental United States,” Photogrammetric Engineering and Remote Sensing, 63(11):1303-1313.
Thompson, D. (1975), “Small Area Population Estimation Using Land Use Data Derived from High Altitude Aircraft Photography,” Proceedings of the American Society of Photogrammetry 41st Annual Meeting, 673-696.
Tobler, W. R. (1969), “Satellite Confirmation of Settlement Size Coefficients,” Area, 1(3):30-34.
Tobler, W. R. (1979), “Smooth Pyncophylactic Interpolation for Geographical Regions,” Journal of the American Statistical Association, 74(367):519-530.
Tobler, W. R. (1999), “Linear Pycnophylactic Reallocation—Comment on a paper by D. Martin,” International Journal of Geographical Information Science, 13(1):85-90.
Weber, C. (1994), “Per-zone Classification of Urban Land Use Cover for Urban Population Estimation,” in Environmental Remote Sensing from Regional to Global Scales, in Foody, G. M. and P. J. Curran (Eds.), Chichester, UK: New York, NY: Wiley, 142-148.
Webster, C. J. (1996), “Population and Dwelling Unit Estimation from Space,” Third World Planning Review, 18(2):155-176.
Weiss, H. K., 1961, “The Distribution of Urban Population and an Application to a Servicing Problem,” Operations Research, 9(6):860-874.
Wright, J. K. (1936), “A Method of Mapping Densities of Population,” The Geographical Review, 26(1):103-110. Yuan, Y., Smith, R. M., and W. F. Limp, 1997, “Remodeling Census Population with Spatial Information from LandSat TM imagery,” Computers, Environment and Urban Systems, 21(3-4):245-258.
82
APPENDICES Appendix A: Map showing population concentration of the study area Appendix B: Map showing population density of the study area Appendix C: population data for Enugu State from National Population Commission