concept course on spatial analyst @ dr. a.k.m. saiful islam geo-statistical analysis dr. a.k.m....
Post on 21-Dec-2015
219 views
TRANSCRIPT
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am Geo-statistical Analysis Geo-statistical Analysis
Dr. A.K.M. Saiful IslamInstitute of Water and Flood Management (IWFM)Bangladesh University of Engineering and Technology (BUET)
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Geo-statistical Analyst of Geo-statistical Analyst of ArcGIS
This training will be on:
1. Histogram2. Normal QQ plot3. Trend Analysis4. Creating a prediction map using the geo-statistical wizard5. Semivariogram / covariance modeling6. Searching neighbor7. Creating a prediction standard error map8. Display Formats
Input Data
Groundwater well data of Dinajpur district of Bangladesh
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Steps in Geo-statistical Analyst
1. Representation of the Data
2. Explore the Data
3. Fit a model (create surface)
4. Perform Diagnostics
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Representation of Data
• Representing the data is a vital first step in assessing the validity of the data and identifying external factors that may ultimately play a role in the distribution of data.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Explore the Data
• Distribution of the data, looking for data trends, looking for global and local outliers, examining spatial autocorrelation, understanding the co-variation among multiple data sets.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Explore data
• Histogram• Q-Q plot• Trend Analysis• Semivariogram• Voronoi map• Cross covariance
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Histogram
• Show frequency distribution as a bar graph that displays how often observed values fall within certain intervals or classes
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Normal distribution
• Skewness is zero for normal distribution
Normally distributed Positively skewed
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Q-Q Plot• Normal QQ Plot is created by plotting data values with the
value of a standard normal where their cumulative distributions are equal
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Trend Analysis• The Trend Analysis tool
provides a three-dimensional perspective of the data.
• The locations of sample points are plotted on the x,y plane. Above each sample point, the value is given by the height of a stick in the z dimension.
• The unique feature of the Trend Analysis tool is that the values are then projected onto the x,z plane and the y,z plane as scatter plots.
• This can be thought of as sideways views through the three-dimensional data.
• Polynomials are then fit through the scatter plots on the projected planes.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Voronoi map• Voronoi maps are constructed from a series of polygons formed around the
location of a sample point. Voronoi polygons are created so that every location within a polygon is closer to the sample point in that polygon than any other sample point. After the polygons are created, neighbors of a sample point are defined as any other sample point whose polygon shares a border with the chosen sample point.
• For example, in the following figure, the bright green sample point is enclosed by a polygon, given as red. Every location within the red polygon is closer to the bright green sample point than any other sample point (given as small dark blue dots). The blue polygons all share a border with the red polygon, so the sample points within the blue polygons are neighbors of the bright green sample point.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Cross variance
• The Crosscovariance cloud shows the empirical crosscovariance for all pairs of locations between two datasets and plots them as a function of the distance between the two locations.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Fit a Model
• A wide variety of interpolation methods available to create surface.
• Two main groups of interpolation techniques: – 1. Deterministic– 2. Geo-statistical
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Interpolation techniques
• 1. Deterministic: is used for creating surfaces from measures points based either on extent of similarity (Inverse Distance Weighted (IDW) or the degree of smoothing (radial basis functions and polynomials)
• 2. Geo-statistical: is based on statistics and is used for more advanced prediction of surface modeling that also includes errors or uncertainty of prediction.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Deterministic Methods
• Four types:
1.1. Inverse Distance Weighted (IDW)Inverse Distance Weighted (IDW)2.2. Global PolynomialGlobal Polynomial3.3. Local PolynomialLocal Polynomial4.4. Radial Basis FunctionsRadial Basis Functions
• Can classified into two groups:
– Global uses entire data set• Global polynomial
– Local calculates prediction from measured point with specified neighbors: • IDW, local polynomials, radial basis functions
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Inverse Distance Weighted (IDW) A window of circular shape with the radius of dmax
is drawn at a point to be interpolated, so as to involve six to eight surrounding observed points.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Global polynomial interpolation• Global Polynomial interpolation fits a smooth surface that is defined by a
mathematical function (a polynomial) to the input sample points. The Global Polynomial surface changes gradually and captures coarse-scale pattern in the data.
• Conceptually, Global Polynomial interpolation is like taking a piece of paper and fitting it between the raised points (raised to the height of value). This is demonstrated in the diagram below for a set of sample points of elevation taken on a gently sloping hill (the piece of paper is magenta).
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Local Polynomial interpolation
• While Global Polynomial interpolation fits a polynomial to the entire surface, Local Polynomial interpolation fits many polynomials, each within specified overlapping neighborhoods. The search neighborhood can be defined using the search neighborhood dialog
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Radial Basis Functions (RBF) RBF methods are a series of exact interpolation techniques; that is,
the surface must go through each measured sample value. There are five different basis functions: thin-plate spline, spline with
tension, completely regularized spline, multi-quadric function, and inverse multi-quadric function.
RBF methods are a form of artificial neural networks.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Geo-statistical Methods
• Kriging and Co-kriging– Algorithm
• Ordinary -A variety of kriging which assumes that local means are not necessarily closely related to the population mean, and which therefore uses only the samples in the local neighbourhood for the estimate. Ordinary kriging is the most comrnonly used method for environmental situations.
• Simple - A variety of kriging which assumes that local means are relatively constant and equal to the population mean, which is well-known. The population mean is used as a factor in each local estimate, along with the samples in the local neighborhood. This is not usually the most appropriate method for environmental situations.
• Universal - • Indicator• Probability• Disjunctive
– Output Surfaces• Prediction and prediction standard error• Quantile • Probability and standard errors of indicators
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Kriging• Kriging is a geostatistical method for spatial
interpolation.• It can assess the quality of prediction with estimated
prediction errors.• It uses statistical models that allow a variety of map
outputs including predictions, prediction standard errors, probability, etc.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Interpolation using Kriging
5
1, )()()(ˆ
iiredyellowi redZdyellowZ
i
Kriging weights
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Semivariogram
• The semivariogram functions quantifies the assumption that things nearby tend to be more similar than things that are farther apart. Semivariogram measures the strength of statistical correlation as a function of distance.
• Semivariance: Y(h) = ½ [(Z(xi) - Z(xj)]2
• Covarience = Sill – Y(h)
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Types of semivariogram models
• Geostatistical Analyst provides the following functions to choose from to model the empirical semivariogram: 1. Circular 2. Spherical 3. Tetraspherical 4. Pentaspherical 5. Exponential 6. Gaussian 7. Rational Quadratic 8. Hole Effect 9. K-Bessel 10.J-Bessel 11.Stable
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Semi-variogram Models
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Trend
An example of a global trend can be seen in the effects of the prevailing winds on a smoke stack at a factory (below).
In the image, the higher concentrations of pollution are depicted in the warm colors (reds and yellows) and the lower concentrations in the cool colors (greens and blues).
Notice that the values of the pollutant change more slowly in the east–west direction than in the north–south direction.
This is because east–west is aligned with the wind while north–south is perpendicular to the wind.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Detrending tool
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Anisotropy
Anisotropy is a characteristic of a random process that shows higher autocorrelation in one direction than another.
The following image shows conceptually how the process might look.
Once again, the higher concentrations of pollution are depicted in the warm colors (reds and yellows) and the lower concentrations in the cool colors (greens and blues).
The random process shows undulations that are shorter in one direction than another.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Accounting for Anisotrophy
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Searching Neighbor
The points highlighted in the data view give an indicator of the weights (absolute value in percent) associated with each point located in the moving window. The weights are used to estimate the value at the unknown location which is at the center of the cross hair.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Data transformation
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Declustering method• There are two ways to decluster your data: by the cell method and by
Voronoi polygons. Samples should be taken so they are representative of the entire surface. However, many times the samples are taken where the concentration is most severe, thus skewing the view of the surface. Declustering accounts for skewed representation of the samples by weighting them appropriately so that a more accurate surface can be created.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Bi-variate normal distribution
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Output Surface
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Cross Validation
• Cross-validation uses all of the data to estimate the trend and autocorrelation models. It removes each data location, one at a time, and predicts the associated data value.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Various Surface produced using ordinary kriging
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Model comparison
• Comparison helps you determine how good the model that created a geostatistical layer is relative to another model.
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Co
nce
pt
Co
urs
e o
n S
pa
tial A
na
lyst
@ D
r. A
.K.M
. S
aifu
l Isl
am
Display Format
Filled contour GridsContours
Hill shade Combination of contoursFilled contour and hill shade