Professional Documents
Culture Documents
1
1 2 3
4 5 6
N
7 8 9
10 11 12
Block
1 (B) 2 (A) 3 (C)
1
4 (A) 5 (B) 6 (C)
2
N
7 (A) 8 (C) 9 (B)
3
4 10 (A) 11 (C) 12 (B)
The field can be divided into four blocks of three plots each, and the soil in
each of these blocks will be uniform.
RCBD: INDEPENDENT RANDOMIZATIONS IN EACH BLOCK!
2
6. 2. 3. Statistical model
The new model is Yij = + i + j + ij.
The sum of squares equation becomes:
t r t r t r
(Y
i 1 j 1
ij Y .. ) 2 r (Y i . Y .. ) 2 t (Y . j Y .. ) 2 (Yij Y i . Y . j Y .. ) 2
i 1 j 1 i 1 j 1
The RCBD design has r - 1 fewer degrees of freedom than the CRD.
If blocks were no different, the MSE for the CRD would be smaller than
the MSE for the RCBD and the CRD would be more powerful.
If there was a substantial difference between blocks, the MSE for the
RCBD would be smaller than the MSE for the CRD and the RCBD
would be more powerful.
3
Obviously one should only use the RCBD when it is appropriate, but how
can this be determined? The concept of efficiency, discussed in section 6.
3., answers this.
Example: An experiment was Ranch
conducted to investigate the Trtmt 1 2 3 4
effect of estrogen on weight
gain in sheep. The treatments M Est0
are combinations of sex of
sheep (M, F) and level of M Est3
estrogen (Est0, Est3). The
sheep are blocked by ranch, F Est0
with one replication of each
treatment level at each ranch. F Est3
RCBD. Effect of estrogen on weight gains. Blocks are 4 different ranches.
Block Treatment
Treatment I II III IV Total Mean
F-S0 47 52 62 51 212 53
M-S0 50 54 67 57 228 57
F-S3 57 53 69 57 236 59
M-S3 54 65 74 59 252 63
Block Mean 52 56 68 56 58
Differences among blocks do not result from treatments but from other
differences associated with the blocks.
4
This component of the total sum of squares can be removed and the
experimental error reduced accordingly.
Compare the SSerror in Tables 6.2 and 6.3
6. 3. Relative efficiency: When to block? [ST&D p. 221, Topic 1, 1.4.4.6]
MST SSE
F MSE Fcrit F , df trt , df e
MSE df e
1 df MSE 1 1
I (when MSE df<20 a correction factor is used)
2 df MSE 3 MSE
df MSE1 1 1
If RE is >1, design 1 provides
I1 df MSE1 3 MSE1 (df MSE1 1)(df MSE 2 3) MSEmore information and is more
RE1:2 2
I 2 df MSE 2 1 1 (df MSE 2 1)( df MSE1 3) MSEefficient.
1
df MSE 2 3 MSE2
The main complication is how to estimate MSE for the alternative design.
To obtain this formula the TSS of the 2 designs are assumed equal, and then the
MS are replaced by the variance components of the expected MS (next page)
5.51 replications
( f rcbd 1)(off crd
theCRD produce
3) MSE the(9same
1)(amount of .information
12 3)44 62 as one replication of the RCBD.
RERCBD crd
5.51
to CRD
( f crd 1)( f RCBD 3) MSERCBD (12 1)(9 3)7.78
5
Derivation of the expected MSECRD
ft+fe fb
fb+ft+fe
6
Power comparison between CRD and RCBD designs
1. Not significant differences among blocks.
>0
2 Significant differences among blocks.
α/2
>0
7
6.4. Assumptions of the model
Model for the RCBD with a single replication per block-treatment combination:
Yij = + i + j + ij
1. The residuals (ij) are independent, homogeneous, and normally distributed.
2. The variance within each treatment levels is homogeneous across all
treatment levels.
3. The main effects are additive.
Recall that experimental error is defined as the variation among experimental units that
are treated alike.
Ranch
Trtmt 1 2 3 4
M Est0
M Est3
F Est0
F Est3
With only one replication per cell, the interaction i*j cannot be included in
the model (there will be 0 df for error). Therefore, the residuals are the
combined effects of experimental error and treatment*block interactions:
8
9
Example: Consider the hypothetical effects of two factors (A, B) on some
response variable (assume =0):
Purely additive
Factor A
Factor B 1 = 1 2 = 2 3 = 3
1 = 1 2 3 4
2 = 5 6 7 8
Purely multiplicative
Factor A
Factor B 1 = 1 2 = 2 3 = 3
1 = 1 1 2 3
12
4 8
2 = 5 5 10 15
10
6. 4. 1. Tukey’s test for non-additivity [ST&D 395]
Under our linear model, each observation is characterized as:
y ij i j ij
SO, if the observed and predicted values obey a linear relationship, then the
non-random Interaction Effects buried in the error term are sufficiently small
to uphold our assumption of additivity of main effects.
11
This test is easily implemented using SAS:
Data Lambs;
Input Sex_Est $ @@;
Do Ranch = 1 to 4;
Input Gain @@;
Output;
End;
Cards;
F0 47 52 62 51
M0 50 54 67 57
F3 57 53 69 57
M3 54 65 74 59
;
Proc GLM;
Class Sex_Est Ranch;
Model Gain = Ranch Sex_Est;
Output out = LambsPR p = Pred r = Res;
Proc GLM; * This is the Tukey 1 df test;
Class Sex_Est Ranch;
Model Gain = Ranch Sex_Est Pred*Pred;
Run;Quit;
1.- The second Proc GLM adds an additional variable that is the square of the
predicted values: Pred*Pred, which is used in the non-additivity test.
2.- If this quadratic component is significant the relationship is not linear!
3.- Pred*Pred is not in the Class statement, and is last term in the
Model.
Since P= 0.5395, NS, we do not reject the null hypothesis of additive effects
This test is necessary ONLY when there is one observation per
block-treatment combination.
If there are two or more replications per block-treatment
combination, the block*treatment interaction can be tested
directly in an exploratory model:
Model Gain = Ranch Sex_Est Ranch*Sex_Est;
12
Example Table 6.5. Yield of penicillin in four treatments A, B, C, D.
Blocks are different stocks of a raw material. The numbers below each
observation (O) are the predicted (P: Grand Mean + Treatment effect +
Block effect) and residual (R) values.
Treatment Block Block
Block A B C D Mean Effect
O: 89 O: 88 O: 97 O: 94
Stock 1 92 +6
P: 90 P: 91 P: 95 P: 92
R: -1 R: -3 R: 2 R: 2
Stock 2 O: 84 O: 77 O: 92 O: 79 83 3
P: 81 P: 82 P: 86 P: 83
R: 3 R: -5 R: 6 R: -4
Stock 3 O: 81 O: 87 O: 87 O: 85 85 1
P: 83 P: 84 P: 88 P: 85
R: -2 R: 3 R: -1 R: 0
Stock 4 O: 87 O: 92 O: 89 O: 84 88 2
P: 86 P: 87 P: 91 P: 88
R: 1 R: 5 R: -2 R: -4
Stock 5 O: 79 O: 81 O: 80 O: 88 82 4
P: 80 P: 81 P: 85 P: 82
R: -1 R: 0 R: -5 R: 6
Treat. 84 85 89 86 Mean=86
Treat. 2 1 3 0
13
The SAS code for such an analysis:
Data Penicillin;
Do Block = 1 to 4;
Do Trtmt = 1 to 4;
Input Yield @@;
Output;
End;
End;
Cards;
89 88 97 94
84 77 92 79
81 87 87 85
87 92 89 84
79 81 80 88
;
Proc GLM Data = Penicillin;
Class Block Trtmt;
Model Yield = Block Trtmt;
Output out = PenPR p = Pred r = Res;
Proc Univariate Data = PenPR normal; * Testing for normality of
residuals;
Var Res;
Proc GLM Data = Penicillin; * Testing for variance homogeneity (1 way
ANOVA);
Class Trtmt;
Model Yield = Trtmt;
Means Trtmt / hovtest = Levene;
Proc Plot Data = PenPR; * Generating a plot of predicted vs. residual
values;
Plot Res*Pred = Trtmt;
Proc GLM Data = PenPR; * Testing for nonadditivity;
Class Block Trtmt;
Model Yield = Block Trtmt Pred*Pred;
Run;Quit;
Normality
Test --Statistic--- -----p Value------
Shapiro-Wilk W 0.967406 Pr < W 0.6994 NS
14
Discrepancies between the tentative model and the data can be detected by
studying residuals.
These residuals are the quantities remaining after the systematic
contributions associated with the assumed model are removed.
Sometimes the plot of the residuals versus the predicted values shows a
curvilinear relationship. This appearance suggests nonadditivity between
the block and the treatment effects
Other times the plot of the residuals versus the predicted shows a funnellike
appearance. This indicates that the variance increases as the value of the
response increases (a situation that is common when the variance is a
constant percentage of the mean).
15
EXAMPLE LAMBS NESTED
data lambs;
input sex_est $ block animal gain @@;
cards;
f0 1 1 46 f0 2 1 51 f0 3 1 61 f0 4 1 50 f0 m0 f3 m3
m0 1 2 49 m0 2 2 53 m0 3 2 66 m0 4 2 56 Block 1
f3 1 3 56 f3 2 3 52 f3 3 3 68 f3 4 3 56
m3 1 4 53 m3 2 4 64 m3 3 4 73 m3 4 4 58 Block 2
f0 1 1 48 f0 2 1 53 f0 3 1 62 f0 4 1 52 Block 3
m0 1 2 51 m0 2 2 55 m0 3 2 68 m0 4 2 58
f3 1 3 58 f3 2 3 54 f3 3 3 70 f3 4 3 58 Block 4
m3 1 4 55 m3 2 4 66 m3 3 4 75 m3 4 4 60
2 measurements
proc glm data=lambs order=data;
*Without order=data, SAS reads alphabetically f0 f3 m0 m3;
*With order=data, SAS reads f0 m0 f3 m3: Contrast are different!;
class block sex_est animal;
model gain= block sex_est animal(block*sex_est);
random animal(block*sex_est);
test h=sex_est e=animal(block*sex_est);
run;
quit;
16
Output LAMB NESTED
ANOVA Dependent Variable: gain
Source DF SS MS F Value Pr > F
Model 15 1700.5 113.4 59.47 <.0001
Error 16 30.5 1.9
Corrected Total 31 1731.0
17
The calculation of the variance comp. is the objective of the nested design
This % contribution can be used to optimize the allocation of resources.
18
EXAMPLE LAMBS RCBD with 2 reps per cell
data lambs;
input sex_est $ block gain @@;
cards;
f0 1 46 f0 2 51 f0 3 61 f0 4 50 f0 m0 f3 m3
m0 1 49 m0 2 53 m0 3 66 m0 4 56 Block 1
f3 1 56 f3 2 52 f3 3 68 f3 4 56
m3 1 53 m3 2 64 m3 3 73 m3 4 58 Block 2
f0 1 48 f0 2 53 f0 3 62 f0 4 52
m0 1 51 m0 2 55 m0 3 68 m0 4 58
Block 3
f3 1 58 f3 2 54 f3 3 70 f3 4 58
m3 1 55 m3 2 66 m3 3 75 m3 4 60 Block 4
2
proc glm data=lambs order=data;
class block sex_est;
model gain= block sex_est block*sex_est; * Exploratory model;
19
Output LAMB Reps
ANOVA Dependent Variable: gain
Exploratory
Source DF SS MS F Value Pr > F
Model 15 1700.5 113.4 59.47 <.0001
Error 16 30.5 1.9
Corrected Total 31 1731.0
Source DF SS MS F Value Pr > F
block 3 1132.1 377.4 198.0 <.0001
sex_est 3 426.1 142.0 74.5 <.0001
block*sex_est 9 142.3 15.8 8.3 0.0002
Final
Source DF SS MS F Value Pr > F
Model 6 1558.2 259.7 37.6 <.0001
Error 25 172.8 6.9
Corrected Total 31 1731.0
Source DF SS MS F Value Pr > F
block 3 1132.1 377.4 54.6 <.0001
sex_est 3 426.1 142.0 20.6 <.0001
Still significant in spite of large block*sex_est interaction
Sum of Mean
Source DF Squares Square F Value Pr > F
sex_est 3 3181.5 1060.5 0.54 0.6567 OK
Error 28 54660.5 1952.2
20
RCBD 1 rep/cell RCBD 1 rep/cell with subsamples
T1 T2 T1 T2
B1 B1
B2 B2
T1 T2 T1 T2
B1 B1
B2 B2
ANOVA: ANOVA:
Class Block Trtmt; Class Bl Trt Pot;
Model Y = Block Trtmt; Model Y = Bl Trt Pot(Bl*Trt);
Random Pot(Bl*Trt);
Test h = Trt e = Pot(Bl*Trt);
21