Professional Documents
Culture Documents
1.0 Overview
Analysis is often considered to be "the heart" of GIS. Through
analysis new information is gained. As a GIS stores both attribute and spatial data,
analysis can be conducted on both types of data however, it is the spatial analysis
capability that sets GIS apart from database applications.
There are a great many GIS analyses that can be conducted. For convenience sake we
often group the analyses into categories. For the purpose of this course I have
decided to create a bit of a hierarchy. This is done to reflect your lab experiences
using GIS. Our first encounter with GIS analyses is with vector data. Later we will
(hopefully) move into analysis of raster data. The focus of this module is vector
data. The breakout of vector analysis is as follows:
Attribute
Attribute Query - Select by Attribute
Arithmetic Calculation
Statistical Summary
Reclassification
Relating Tables
Spatial Join
Spatial
Spatial Query - Select by Location
Spatial Calculation
Spatial Join
Overlay
Buffer
Dissolve
Network Analysis
2.0 ATTRIBUTE DATA OPERATIONS
There are many operations that can be conducted on the attribute database (the data
tables). These can be divided into 4 categories: query (or logical), arithmetic,
statistical and reclass operations. I included a short note about relating tables as it
enhances our analytical capabilities. There is also the spatial join function, which
straddles attribute and spatial analysis.
2.1 Queries = Select by Attributes
Queries include both comparison (=, >, <, >=, <=, <>) and Boolean (AND, OR and NOT)
operators (for a simple but effective visual, check-out the Boolean Machine). These
operators are used to perform queries.
options available when you right-click on a field name: 'statistics' and 'summarize'.
'Statistics' provides a temporary pop-up table with the typical parameters:
count
minimum
maximum
sum
mean
standard deviation
Plus the data are plotted in a histogram (frequency distribution).
'Summarize' creates a an output data table. Statistics are based on unique values in a
chosen field. selected fields from these operations are placed in a summary table. In
the example below, the field Group was chosen and the statistics count, sum and
mean were calculated.
Data Table
Group
A
A
C
B
B
A
Value
100
300
50
80
70
300
Summary Table
Group
CountValu
e
A
3
B
2
C
1
SumValu
e
700
150
50
MeanValu
e
233
75
50
2.4 Reclassification
Reclassification is another operation that can be conducted on attribute
data. Reclassification results in a generalization (i.e. a simplification) of the
original data set. For instance, raw property values in a data set can be put in 3
classes: lower, middle, and upper class. We typically use the legend editor (in ArcGIS
the Symbology tab of the Layer Properties dialog box) to classify the data - altering
the legend is temporary and we can change the colouring at anytime. (If we want a
"permanent reclassification", e.g. a new map, then we would use Merge / Dissolve this is described in section 3.5 below).
2.5 Table Relations
Relating tables to each other involves joining or linking records between two tables.
This may not be considered an operation, but it does allow us to relate outside
source data to our themes to allow the features in our themes to be analyzed based
on outside data. Refer to the database lecture notes.
2.6 Spatial Join
As with relating tables, a spatial join will relate records between two tables. But the
records are not joined based on a common attribute value (usually ID);
instead records are joined based on common location (as defined by the
coordinates of the spatial features). This type of operation is a combination of spatial
and attribute; it is described in more detail in section 3.2 below.
3.4 Overlays bringing together two themes the line work of the two themes are
combined (lines are broken, new nodes and links/arcs are recognized, topology is redone, note that
sliver polygons may have to be eliminated) and the fields from both theme databases are
combined into one new database. Three common types of overlays:
Union is a complete merging of two themes where the new theme is composed of
the entire map area of both themes and all the fields from both theme data
tables
Intersect is a merging of two themes but only where they share space such that
the map area of the new theme is the area that was in common for both themes
and the attribute database is composed of all the fields from both theme data
tables
Clip is akin to pressing a cookie cutter onto a theme such that the new theme is a
miniature version of the first (a mini-me), the map area is defined by the overlay
(cookie cutter) theme and the database comes only from the input (cookie
dough) theme.
Update - features from the 'update layer' descend upon the input theme and
replace whatever was underneath. An example would be updating a timber type
map (input layer) with a cut block (update layer) where the cut block shape
supersedes the timber types it overlaps.
Erase - the polygons from the 'erase layer' descend upon the input theme and
eliminate that area. An example would be if land were expropriated from a
woodlot owner to create a park. The 'park area' would be erased from the woodlot
area.
3.5 Buffers
Buffering creates a new theme with new polygon features (geometric objects) based
on a constant measure from features in a source theme; buffers around points are
circles, around lines are corridors (snake-like with rounded ends) and around
polygons are donuts; buffers can be created based on:
a single set width (i.e. all features by 50m)
multiple widths where more than one buffer is created around each feature (i.e. a
50 and a 150m buffer created around each feature gives a bulls eye effect)
varying width based on an attribute field (i.e. width for each feature is stored in
the data table, buffer width depends on the value in this field)
3.6 Dissolve
With dissolve, also known as merge polygons, boundaries between adjacent polygons
with the same attribute value (i.e. class = poor) are dissolved and the two (or more)
polygons are merged into one larger polygon; a new map layer (theme) results with
the generalized data. This is the spatial equivalent to reclassification of attribute
data.
3.7 Network Routing as the name implies, this type of analysis assesses movement
through a network. Consider the difference between the shortest route and the
fastest route. During the middle of the night the shortest route is likely the fastest
route, however, during rush hour I would consider traffic and use the fastest route
(which may have a longer distance). The network is modeled using lines (arcs) and
intersections (nodes). Arc-node topology provides information regarding
connectivity. The attribute database would provide additional information regarding
impedance to flow (or movement). Examples would include speed limit and traffic
loads at different times of the day. There is a cost to making turns at intersections
i.e. you have to slow down rather than use just two wheels to make the turn. Oneway streets would provide for an absolute barrier. As well, making a turn off of an
overpass onto a highway below would be prohibited. Other routing examples include
most efficient route (for making several stops or deliveries) and location-allocation
(where school catchment areas can be determined based on road network and not
just a straight-line distance).
Quickest route to Vegas!
Shortest route to Vegas!
Note the difference between the two routes.
3.8 Summary
The results of these spatial operations take on different forms: item 3.1 is a spatial
query and merely results in a selection of map features that meet the spatial
criteria/relation, items 3.2 and 3.3 result in additions/changes to the database
tables, items 3.4 - 3.6 generally result in a new theme with line work and a data table
based on the spatial operation, and . Item 3.7 (networks) usually result in a selection
of lines within the network that represent the best route.