Professional Documents
Culture Documents
Analysis of Variance
and
Covariance
16-1
Chapter Outline
1)
Overview
2)
2)
3)
4)
i.
2007
Variables
ii.
iii.
Measurement of Effects
iv.
Significance Testing
v.
Interpretation of Results
16-2
Chapter Outline
5) Illustrative Data
Illustrative Applications of OneWay Analysis of Variance
6)
Assumptions in Analysis of
Variance
7)
8)
9)
Analysis of Covariance
10)
Issues in Interpretation
11)
2007
i.
Interactions
ii.
iii.
Multiple Comparisons
16-3
Chapter Outline
12) Nonmetric Analysis of Variance
13) Multivariate Analysis of Variance
14) Summary
2007
16-4
Relationship Among
Techniques
2007
16-5
Relationship Among
Techniques
2007
16-6
16.1
One Variable
Independent
One or More
Independent
Variables
Binary
Categorical:
Factorial
Categorical
and Interval
Interval
t Test
Analysis of
Variance
Analysis of
Covariance
Regression
2007
One Factor
More than
One Factor
One-Way Analysis
of Variance
N-Way Analysis
of Variance
16-7
One-Way Analysis of
Variance
Marketing researchers are often interested
in examining the differences in the mean
values of the dependent variable for
several categories of a single independent
variable or factor. For example:
2007
16-8
16-9
2007
16-10
2007
16-11
2007
16-12
SS x = n (Y j Y )2
j =1
SS error=
j
2007
(Y ij Y j )2
Yi = individual observation
Yj = mean for category j
= mean over the whole sample, or grand
Y mean
Yij = i th observation in the j th category
16-13
Within
Categor
y
Variatio
n
=SSwithin
Categor
y Mean
2007
Sample
X1
X2
X
Total
Categories
X3
Xc
Y1
Y1
Y1
Y1
Y1
Y2
:
:
Yn
Y2
Y2
Y2
Yn
Yn
Yn
Y2
:
:
YN
Total
Variati
on
=SSy
Y1
Y2
Y3
Yc
Y
Between Category Variation = SSbetween
16-14
Conducting One-Way
Analysis
ofIn Variance
analysis of variance, we estimate two measures
of variation: within groups (SSwithin) and between
groups (SSbetween). Thus, by comparing the Y
variance estimates based on between-group and
within-group variation, we can test the null
hypothesis.
2
= SSx/SSy = (SSy - SSerror)/SSy
2007
16-15
H0: 1 = 2 = 3 = ........... = c
Under the null hypothesis, SSx and SSerror come from the same
source of variation. In other words, the estimate of the
population variance of Y,
y
= SS
x/(c - 1)
= Mean square due to X
= MSx
or
y
= SS
error/(N - c)
= Mean square due to error
= MSerror
2007
16-16
SS x /(c1)
= MS x
F=
SS error/(Nc)
MS error
2007
16-17
2007
16-18
2007
16-19
2007
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Coupon Level
In-Store Prom otion
Sales Clientel Rating
1.00
1.00 10.00
9.00
1.00
1.00
9.00
10.00
1.00
1.00 10.00
8.00
1.00
1.00
8.00
4.00
1.00
1.00
9.00
6.00
1.00
2.00
8.00
8.00
1.00
2.00
8.00
4.00
1.00
2.00
7.00
10.00
1.00
2.00
9.00
6.00
1.00
2.00
6.00
9.00
1.00
3.00
5.00
8.00
1.00
3.00
7.00
9.00
1.00
3.00
6.00
6.00
1.00
3.00
4.00
10.00
1.00
3.00
5.00
4.00
2.00
1.00
8.00
10.00
2.00
1.00
9.00
6.00
2.00
1.00
7.00
8.00
2.00
1.00
7.00
4.00
2.00
1.00
6.00
9.00
2.00
2.00
4.00
6.00
2.00
2.00
5.00
8.00
2.00
2.00
5.00
10.00
2.00
2.00
6.00
4.00
2.00
2.00
4.00
9.00
2.00
3.00
2.00
4.00
2.00
3.00
3.00
6.00
2.00
3.00
2.00
10.00
2.00
3.00
1.00
9.00
2.00
3.00
2.00
8.00
16-20
Column Totals
Category means:Y
Y
Grand mean,
2007
83
83/10
= 8.3
62
37
62/10
37/10
= 6.2
= 3.7
= (83 + 62 + 37)/30 = 6.067
16-21
SSy
= (10-6.067)2 + (9-6.067)2 + (10-6.067)2 + (8-6.067)2 + (96.067)2
+ (8-6.067)2 + (9-6.067)2 + (7-6.067)2 + (7-6.067)2 + (6-6.067)2
+ (8-6.067)2 + (8-6.067)2 + (7-6.067)2 + (9-6.067)2 + (6-6.067)2
(4-6.067)2 + (5-6.067)2 + (5-6.067)2 + (6-6.067)2 + (4-6.067)2
+ (5-6.067)2 + (7-6.067)2 + (6-6.067)2 + (4-6.067)2 + (5-6.067)2
+ (2-6.067)2 + (3-6.067)2 + (2-6.067)2 + (1-6.067)2 + (2-6.067)2
2007
16-22
SSerror
= (10-8.3)2 + (9-8.3)2 + (10-8.3)2 + (8-8.3)2
+ (9-8.3)2
+ (8-8.3)2 + (9-8.3)2 + (7-8.3)2 + (7-8.3)2 + (6-8.3)2
+ (8-6.2)2 + (8-6.2)2 + (7-6.2)2 + (9-6.2)2 + (6-6.2)2
+ (4-6.2)2 + (5-6.2)2 + (5-6.2)2 + (6-6.2)2 + (4-6.2)2
+ (5-3.7)2 + (7-3.7)2 + (6-3.7)2 + (4-3.7)2 + (5-3.7)2
+ (2-3.7)2 + (3-3.7)2 + (2-3.7)2 + (1-3.7)2 + (2-3.7)2
16-23
SS x /(c1)
MS X
F=
=
SS error/(Nc) MS error
2007
106.067/(31)
F=
79.800/(303)
= 17.944
16-24
2007
16-25
Table
16.4
Source of
prob.
Variation
Sum of
df
Between groups
(Promotion)
Within groups
(Error)
TOTAL
106.067
53.033
79.800
27
2.956
185.867
29
6.409
squares
Mean
F ratio
17.944
0.000
square
Cell means
Level of
Promotion
High (1)
Medium (2)
Low (3)
Count
Mean
10
10
10
8.300
6.200
3.700
TOTAL
30
6.067
2007
16-26
Assumptions in Analysis of
Variance
The salient assumptions in analysis of
variance can be summarized as follows:
1.
2.
3.
2007
16-27
2007
16-28
N-Way Analysis of
Variance
Consider the simple case of two factors X1 and X2 having
categories c1 and c2. The total variation in this case is
partitioned as follows:
or
The strengthof the joint effect of two factors, called the overall
effect, or multiple 2, is measured as follows:
multiple
2007
16-29
N-Way Analysis of
Variance
The significance of the overall effect may be tested by an
F test, as follows
F=
SS x 1, x 2, x 1x 2/dfn
SS error/dfd
MS , ,
= x 1 x 2 x 1x 2
MS error
where
dfn
=
=
dfd
=
MS
2007
=
degrees of freedom for the numerator
(c1 - 1) + (c2 - 1) + (c1 - 1) (c2 - 1)
c1c2 - 1
=
degrees of freedom for the denominator
N - c1c2
=
mean square
16-30
SS /df
F=
SS error/dfd
MS x 1x 2
=
MS error
Where
dfn = (c1 - 1) (c2 - 1)
2007
dfd = N - c1c2
16-31
where
dfn = c1 - 1
dfd = N - c1c2
2007
16-32
2007
Sum of
squares
df
Mean
square
Sig. of
F
106.067
53.333
159.400
3.267
2
1
3
2
53.033
53.333
53.133
1.633
54.862
55.172
54.966
1.690
0.000
0.000
0.000
0.226
162.667 5
23.200 24
185.867 29
32.533
0.967
6.409
33.655
0.000
2
0.557
0.280
16-33
Coupon
Yes
No
Yes
No
Yes
No
Count
5
5
5
5
5
5
30
Factor Level
Means
Promotion
High
Medium
Low
Coupon
Yes
No
2007
Mean
9.200
7.400
7.600
4.800
5.400
2.000
Grand Mean
Count
10
10
10
15
15
30
Mean
8.300
6.200
3.700
7.400
4.733
6.067
16-34
Analysis of
Covariance
Suppose that we wanted to determine the effect of instore promotion and couponing on sales while controlling
for the affect of clientele. The results are shown in Table
16.6.
2007
16-35
Analysis of Covariance
Table 16.6
Sum of
Sig.
Source of Variation
Mean
Squares
df
Square
of F
Covariance
0.363
Clientele
0.838
0.838
0.862
Main effects
Promotion
Coupon
Combined
106.067
53.333
159.400
3.267
163.505
2-Way Interaction
Promotion* Coupon
Model
Residual (Error)
TOTAL
Covariate
Clientele
2007
1.633
1.680 0.208
22.362
23
0.972
185.867
29
6.409
Raw Coefficient
-0.078
16-36
Issues in
Interpretation
2007
16-37
A Classification of Interaction
Effects
Fig. 16.3
Possible Interaction Effects
No Interaction
(Case 1)
Interaction
Ordinal
(Case 2)
Disordinal
Noncrossover
(Case 3)
2007
Crossover
(Case 4)
16-38
Patterns of
Interaction
Fig.
16.4
Case 1: No Interaction
X
2
X2
Y
21
X
X
1
12 1
1 3: Disordinal
3
Case
Interaction: Noncrossover
X
2
Y
X2
21
Case 2: Ordinal
X
Interaction
2
2
X
21
X
X
1
12 1
1
3
Case
4: Disordinal
Interaction: Crossover
X
2
2
X
21
2007
12
X
1
12
X
1
16-39
Issues in
Interpretation
2
significant effects. In
Table
16.5,
associated with the
level of in-store promotion is calculated as follows:
2
p =
106.067 (2 x 0.967)
185.867 + 0.967
104.133
186.834
= 0.557
=
2007
16-40
Issues in
Interpretation
2 53.333 (1 x 0.967)
c =
185.867 + 0.967
52.366
186.834
= 0.280
=
2007
16-41
Issues in
Interpretation
Multiple Comparisons
2007
16-42
Issues in Interpretation
Multiple Comparisons
2007
16-43
Repeated Measures
ANOVA
One way of controlling the differences between
subjects is by observing each subject under
each experimental condition (see Table 16.7).
Since repeated measurements are obtained
from each respondent, this design is referred
to as within-subjects design or repeated
measures analysis of variance. Repeated
measures analysis of variance may be thought
of as an extension of the paired-samples t test
to the case of more than two related samples.
2007
16-44
Between
People
Variation
=SSbetween
people
Category
Mean
2007
Sample
X1
X2
X3
Xc
Y11
Y12
Y13
Y1c
Y1
Y21
Y22
Y23
Y2c
Y2
Yn1
Yn2
Yn3
Ync
YN
Y1
Y2
Y3
Yc
Total
Variation
=SSy
16-45
Repeated Measures
ANOVA
2007(c - 1) (n -1).
16-46
Repeated Measures
ANOVA
Thus,
SS x /(c1)
MS x
=
SS error/(n1)(c1) MS error
F=
2007
16-47
Nonmetric Analysis of
Variance
2007
16-48
The Kruskal-Wallis test is more powerful than the ksample median test as it uses the rank value of each
case, not merely its location relative to the median.
However, if there are a large number of tied rankings in
the data, the k-sample median test may be a better
choice.
2007
16-49
Multivariate Analysis of
Variance
2007
16-50
SPSS Windows
One-way ANOVA can be efficiently performed
using the program COMPARE MEANS and
then One-way ANOVA. To select this
procedure using SPSS for Windows click:
Analyze>Compare Means>One-Way ANOVA
2007
16-51
2.
3.
4.
5.
Click OPTIONS
6.
Click Descriptive.
7.
Click CONTINUE.
8.
Click OK.
2007
16-52
2.
3.
4.
5.
6.
Click OK.
2007
16-53