Professional Documents
Culture Documents
Tag Reader
Source: www.belgravium.com
shorter paths?
What locations are common to the paths of a set of
defective auto-parts?
Identify containers at a port that have deviated from their
historic paths
Data mining
Find trends, outliers, frequent, sequential, flow
patterns, …
10 pallets store 1
(1000 cases)
Dist. Center 1 shelf 2
store 2
Dist. Center2 …
Factory
…
10 packs
… (12 sodas)
20 cases
(1000 packs)
Store View:
Transportation View:
series of locations
e.g., what is the average time that milk stays at the
location
If using record transitions: difficult to answer queries, lots of
intersections needed
Map Table: (GID, <GID1,..,GIDn>)
Links together stages that belong to the same path. Provides
additional: compression and query processing efficiency
High level GID points to lower level GIDs
manufacturer, price
r1 l1 t1 t10 r1,r2,r3
g1 l1 t1 t10
r2 l1 t1 t10 r3
g1.2 l4 t15 t20
r2 l3 t20 t30
Map Table
r3 l1 t1 10
gid gids
g1.1 r1,r2
g1.2 r3
Stay Grouping
Retrieve stay record with location = l
IO Cost: 1
l1 n 1
n 3
l2 l3 l4
l5 l6 l7 l8 l9 l10 n 6
0.0 0.1
Assign to each GID a unique
l1 l2
identifier that encodes the path
traversed by the items that it 0.0.0 0.1.0
0.1.1
points to l3 l4
Path-dependent name: Makes
0.0.0.0
0.1.0.1
it easy to detect if locations
form a path l5 l6
0.0.0
l3 0.1.0 l3 0.1.1 l4 0.1 l2 t1 t8 3
t20,t30: 3 t20,t30: 3 t10,t20: 2
{r8,r9} 0.1.0.1 l6 t35 t50 1
0.1.0.0 0.1.0.1
t40,t60: 2 t35,t50: 1
0.1.1 l4 t10 t20 2
0.0.0.0
t40,t60: 3 l5 l5 l6
{r1,r2,r3} {r5,r6} {r7}
Size (MBytes)
Cleansed data size 250
200
map
150
P=1000, B=(500,150,40,8,1), k = 5 100
50
Lossless compression, cuboid is at 0
the same level of abstraction as 0.1 0.5 1 5 10
cleansed RFID database Input Stay Records (m illions)
25 map
P=1000, N = 1,000,000, k= 5 20
15
Map gives significant benefits for bulky 10
data 5
0
For data where items move individually a b c d e
we are better off using tag lists Path Bulkiness
12
Size (MBytes) 10 nomap
8 map
6
4
2
0
1 2 3 4
Abstraction Level
1,800
100K tags
1,600
200K tags
1,400
Time (seconds)
400K tags
1,200
1,000 800K tags
800 1600K tags
600
400
200
0
level 0 level 1 level 2 level 3
Aggregation Level
Construction Time
P=1000, B=(500,150,40,8,1), k = 5, N=1,000,000
Savings by constructing from lower level cuboid 50% to 80%
1,000 nomap
P=1000, B=(500,150,40,8,1), k = 5 map
100
Speed up due to stay table 1 order of
10
magnitude
1
Speed up due to stay table and map 1 2 3 4 5
table 2 orders of magnitude Input Stay Records (m illions)
A B C D E A B C D E
AB AC AD AE BC BD BE CD CE DE AB AC AD AE BC BD BE CD CE DE
ABC ABD ABE ACD ACE ADE BCD BCE BDE CDE ABC ABD ABE ACD ACE ADE BCD BCE BDE CDE
ABCD ABCE ABDE ACDE BCDE ABCD ABCE ABDE ACDE BCDE
ABCD ABCD
E E
records, …
Study the correlation with the characteristics of the
projection
Depth, bushiness, tree size, number of leaves, fan-
out/in, …
Result ─ No rule found with the above parameters for the
projection mining time
sample size.
e.g. Dataset —pumsb
1% random sampling
Becomes accurate when
10 10 10 10
1
1 1 1
1 2 4 8 16 32 64
1 2 4 8 16 32 64 1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor# Processor# Processor#
optimal optimal
Par-FP Par-FP
optimal
Par-FP
10 10 10
1 1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor# Processor#
1
1 1 1
1 2 4 8 16 32 64
1 2 4 8 16 32 64 1 2 4 8 16 32 64 1 2 4 8 16 32 64
T30I0.2D1K T40I10D100K T50I5D500K
100 100 100
optimal
optimal optimal
one-level
one-level
one-level
multi-level multi-level
multi-level
10 10 10
1 1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64 1 2 4 8 16 32 64
optimal
optimal optimal
Par-Span
Par-Span
Par-Span
10 10 10
1 1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor# Processor#
C100N20T2.5S10I1.25 C200N10T2.5S10I1.25
100 100
optimal
Par-Span optimal
Par-Span
10 10
1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor#
optimal optimal
Par-CSP Par-CSP
10 10
1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor#
C200S25N9 Gazelle
100 100
optimal optimal
Par-CSP Par-CSP
10 10
1 1
1 2 4 8 16 32 64 1 2 4 8 16 32 64
Processor# Processor#
25
20
15
10