
Leanne Erika L. Tan
201379280
IE 230
Regression Analysis Problem Set
1 Draw a random sample of 40 observations from the set of 50 observations given.
Indicate the corresponding observation numbers not drawn. Describe briefly how
you performed the random sampling.
The observations not drawn are numbers 50, 11, 48, 44, 21, 29, 22, 3, 40 and 1.
The sample was drawn in Excel using a random-number function. The INDEX and RANK
functions were then used to determine the corresponding observation number, and
the formula was copied down to select 40 observations.
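
The same kind of draw can be sketched outside Excel. The snippet below is a minimal Python illustration, not the procedure actually used for this problem set: the function name `draw_sample` and the seed value are hypothetical, and a different seed gives a different set of excluded observations.

```python
import random


def draw_sample(population_size=50, sample_size=40, seed=None):
    """Draw observation numbers without replacement (hypothetical helper)."""
    rng = random.Random(seed)
    observations = list(range(1, population_size + 1))
    # random.sample selects without replacement, so no observation repeats
    drawn = sorted(rng.sample(observations, sample_size))
    not_drawn = sorted(set(observations) - set(drawn))
    return drawn, not_drawn


# The seed is an arbitrary choice so that the draw can be reproduced
drawn, not_drawn = draw_sample(seed=230)
print("Not drawn:", not_drawn)
```

Fixing the seed makes the draw repeatable, which the Excel random function does not do by default because it recalculates on every change to the sheet.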
2 Compute the descriptive statistics (mean, median, lowest and highest, range, etc.)
of the quantitative variables. Output individual scatter plots with PROD as the
dependent variable. Comment on the plots generated.
Descriptive Statistics: PROD, COST FERT, TYPE FERT, COST CHEM, TYPE CHEM, ...

            Total
Variable    Count   N  N*  CumN  Percent  CumPct    Mean  SE Mean  TrMean
PROD           40  40   0    40      100     100   13213      877   13159
COST FERT      40  40   0    40      100     100    3507      267    3499
TYPE FERT      40  40   0    40      100     100  0.6500   0.0764  0.6667
COST CHEM      40  40   0    40      100     100  1142.8     89.5  1118.0
TYPE CHEM      40  40   0    40      100     100  0.6750   0.0750  0.6944
LABOR COST     40  40   0    40      100     100     800      131     727
OTHER COST     40  40   0    40      100     100   123.1     13.3   114.7
EXP            40  40   0    40      100     100   23.98     1.49   23.86
ELEM LEVEL     40  40   0    40      100     100  0.5750   0.0792  0.5833
HIGH LEVEL     40  40   0    40      100     100  0.1500   0.0572  0.1111
LOAN           40  40   0    40      100     100   12233     1108   11647

Variable     StDev  Variance  CoefVar       Sum  Sum of Squares  Minimum      Q1
PROD          5544  30735718    41.96    528527      8182206933     4800    8000
COST FERT     1686   2843671    48.08    140280       602867149      683    2229
TYPE FERT   0.4830    0.2333    74.31   26.0000         26.0000   0.0000  0.0000
COST CHEM    565.9  320242.0    49.52   45712.5      64730347.8    248.0   804.0
TYPE CHEM   0.4743    0.2250    70.27   27.0000         27.0000   0.0000  0.0000
LABOR COST     828    685617   103.56     31982        52310996       40     155
OTHER COST    84.1    7077.4    68.36    4922.8        881867.2     28.3    59.0
EXP           9.41     88.64    39.27    959.00        26449.00    10.00   15.25
ELEM LEVEL  0.5006    0.2506    87.07   23.0000         23.0000   0.0000  0.0000
HIGH LEVEL  0.3616    0.1308   241.08    6.0000          6.0000   0.0000  0.0000
LOAN          7005  49069878    57.26    489305      7899209825     3000    7175

                                                                          N for
Variable    Median      Q3  Maximum   Range     IQR  Mode                  Mode
PROD         12800   18000    24000   19200   10000  5000                     5
COST FERT     3303    4766     6460    5777    2538  2736                     2
TYPE FERT   1.0000  1.0000   1.0000  1.0000  1.0000  1                       26
COST CHEM    993.2  1439.3   2540.0  2292.0   635.3  900, 1100, 2160
TYPE CHEM   1.0000  1.0000   1.0000  1.0000  1.0000  1                       27
LABOR COST     459    1235     3061    3021    1080  *                        0
OTHER COST   112.0   178.8    472.0   443.7   119.8  *                        0
EXP          25.00   30.00    40.00   30.00   14.75  30                      11
ELEM LEVEL  1.0000  1.0000   1.0000  1.0000  1.0000  1                       23
HIGH LEVEL  0.0000  0.0000   1.0000  1.0000  0.0000  0                       34
LOAN         10000   15000    32000   29000    7825  10000                    7

Variable    Skewness  Kurtosis      MSSD
PROD           -0.14     -1.16  16462693
COST FERT       0.35     -0.93   1510475
TYPE FERT      -0.65     -1.66    0.1538
COST CHEM       0.94      0.16  301475.3
TYPE CHEM      -0.78     -1.47    0.2692
LABOR COST      1.18      0.46    147243
OTHER COST      1.93      6.29    7646.7
EXP             0.01     -1.27     48.79
ELEM LEVEL     -0.32     -2.00    0.2308
HIGH LEVEL      2.04      2.26    0.1154
LOAN            1.18      1.28  51992960
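
Several columns in the output above are derived from others, so they can be cross-checked from the reported figures alone. The sketch below recomputes SE Mean, CoefVar, Range and IQR for two of the variables. It uses only the rounded values printed above (the raw data are not reproduced here), so small rounding differences are expected; the dictionary layout and names are illustrative choices.

```python
import math

N = 40  # sample size reported in the output

# Rounded values copied from the descriptive statistics output
reported = {
    "PROD": {"mean": 13213, "stdev": 5544, "minimum": 4800,
             "maximum": 24000, "q1": 8000, "q3": 18000},
    "LABOR COST": {"mean": 800, "stdev": 828, "minimum": 40,
                   "maximum": 3061, "q1": 155, "q3": 1235},
}

derived = {}
for name, v in reported.items():
    derived[name] = {
        "se_mean": v["stdev"] / math.sqrt(N),       # SE Mean = StDev / sqrt(N)
        "coef_var": 100 * v["stdev"] / v["mean"],   # CoefVar in percent
        "range": v["maximum"] - v["minimum"],       # Range = Maximum - Minimum
        "iqr": v["q3"] - v["q1"],                   # IQR = Q3 - Q1
    }

for name, d in derived.items():
    print(name, {k: round(x, 2) for k, x in d.items()})
```

A coefficient of variation above 100% for LABOR COST means its standard deviation exceeds its mean, which is consistent with the positive skewness reported for that variable.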

3 Perform regression and estimate parameters using only the quantitative variables
in the model. Do not include the qualitative variables yet.
4 Conduct test on the full model and on the individual parameters. Exclude an
estimate from the model if it is not significant. If there are several estimates which
are not significant, select only the one with the largest p-value and re-run the
regression. Continue until all remaining estimates are significant. Write the final
model.
Test on individual parameters:
Regression Analysis: PROD versus COST FERT

The regression equation is
PROD = 7319 + 1.68 COST FERT

Predictor    Coef  SE Coef     T      P
Constant     7319     1780  4.11  0.000
COST FERT  1.6806   0.4584  3.67  0.001

S = 4827.15   R-Sq = 26.1%   R-Sq(adj) = 24.2%
PRESS = 970391350   R-Sq(pred) = 19.05%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       1   313238900  313238900  13.44  0.001
Residual Error  38   885454104   23301424
Total           39  1198693004

Unusual Observations
Obs  COST FERT   PROD   Fit  SE Fit  Residual  St Resid
 18       1500  24000  9840    1195     14160     3.03R

R denotes an observation with a large standardized residual.
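
The test statistics in a simple regression output like the one above are linked by a few identities, which gives a quick way to check the figures. The following sketch recomputes T, F, R-Sq and S for PROD versus COST FERT from the reported coefficient, standard error and sums of squares. The variable names are illustrative, and the inputs are the rounded values printed by Minitab.

```python
import math

# Values copied from the PROD versus COST FERT output
coef, se_coef = 1.6806, 0.4584
ss_reg, ss_err, ss_total = 313238900, 885454104, 1198693004
df_reg, df_err = 1, 38

t_stat = coef / se_coef          # T = Coef / SE Coef
ms_reg = ss_reg / df_reg         # mean square for regression
ms_err = ss_err / df_err         # mean square error
f_stat = ms_reg / ms_err         # F = MSR / MSE
r_sq = ss_reg / ss_total         # R-Sq = SSR / SST
s = math.sqrt(ms_err)            # S = sqrt(MSE)

print(f"T = {t_stat:.2f}, F = {f_stat:.2f}, "
      f"R-Sq = {100 * r_sq:.1f}%, S = {s:.2f}")
```

With a single predictor, F equals the square of T up to rounding, which is why the F-test and the t-test report the same p-value in each of the one-variable regressions.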

Regression Analysis: PROD versus TYPE FERT

The regression equation is
PROD = 9355 + 5936 TYPE FERT

Predictor  Coef  SE Coef     T      P
Constant   9355     1285  7.28  0.000
TYPE FERT  5936     1593  3.73  0.001

S = 4806.91   R-Sq = 26.7%   R-Sq(adj) = 24.8%
PRESS = 985857607   R-Sq(pred) = 17.76%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       1   320649696  320649696  13.88  0.001
Residual Error  38   878043307   23106403
Total           39  1198693004

Unusual Observations
Obs  TYPE FERT   PROD   Fit  SE Fit  Residual  St Resid
 39       0.00  18750  9355    1285      9395     2.03R
 40       0.00  21000  9355    1285     11645     2.51R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus COST CHEM

The regression equation is
PROD = 13000 + 0.19 COST CHEM

Predictor   Coef  SE Coef     T      P
Constant   13000     2021  6.43  0.000
COST CHEM  0.186    1.589  0.12  0.907

S = 5615.44   R-Sq = 0.0%   R-Sq(adj) = 0.0%
PRESS = 1321777628   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1      433935    433935  0.01  0.907
Residual Error  38  1198259069  31533133
Total           39  1198693004

Unusual Observations
Obs  COST CHEM   PROD    Fit  SE Fit  Residual  St Resid
 18        248  24000  13046    1676     10954     2.04R
 31       2540  18500  13474    2391      5026     0.99 X

R denotes an observation with a large standardized residual.
X denotes an observation whose X value gives it large leverage.

Regression Analysis: PROD versus TYPE CHEM

The regression equation is
PROD = 13210 + 4 TYPE CHEM

Predictor   Coef  SE Coef     T      P
Constant   13210     1558  8.48  0.000
TYPE CHEM      4     1896  0.00  0.998

S = 5616.45   R-Sq = 0.0%   R-Sq(adj) = 0.0%
PRESS = 1338785926   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1         155       155  0.00  0.998
Residual Error  38  1198692849  31544549
Total           39  1198693004

Regression Analysis: PROD versus LABOR COST

The regression equation is
PROD = 9839 + 4.22 LABOR COST

Predictor     Coef  SE Coef      T      P
Constant    9839.2    964.4  10.20  0.000
LABOR COST  4.2198   0.8433   5.00  0.000

S = 4360.61   R-Sq = 39.7%   R-Sq(adj) = 38.1%
PRESS = 796251183   R-Sq(pred) = 33.57%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       1   476126222  476126222  25.04  0.000
Residual Error  38   722566782   19014915
Total           39  1198693004

Unusual Observations
Obs  LABOR COST   PROD    Fit  SE Fit  Residual  St Resid
 18         564  24000  12219     718     11781     2.74R
 27        2664  18000  21079    1717     -3079    -0.77 X
 38        3061  17500  22757    2028     -5257    -1.36 X

R denotes an observation with a large standardized residual.
X denotes an observation whose X value gives it large leverage.

Regression Analysis: PROD versus OTHER COST

The regression equation is
PROD = 11980 + 10.0 OTHER COST

Predictor    Coef  SE Coef     T      P
Constant    11980     1569  7.64  0.000
OTHER COST  10.02    10.57  0.95  0.349

S = 5551.17   R-Sq = 2.3%   R-Sq(adj) = 0.0%
PRESS = 1450594570   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1    27703296  27703296  0.90  0.349
Residual Error  38  1170989708  30815519
Total           39  1198693004

Unusual Observations
Obs  OTHER COST  PROD    Fit  SE Fit  Residual  St Resid
 15         472  8000  16709    3790     -8709   -2.15RX

R denotes an observation with a large standardized residual.
X denotes an observation whose X value gives it large leverage.

Regression Analysis: PROD versus EXP

The regression equation is
PROD = 6266 + 290 EXP

Predictor    Coef  SE Coef     T      P
Constant     6266     2138  2.93  0.006
EXP        289.78    83.16  3.48  0.001

S = 4889.28   R-Sq = 24.2%   R-Sq(adj) = 22.2%
PRESS = 1006444645   R-Sq(pred) = 16.04%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       1   290299791  290299791  12.14  0.001
Residual Error  38   908393213   23905085
Total           39  1198693004

Unusual Observations
Obs   EXP   PROD   Fit  SE Fit  Residual  St Resid
 40  10.0  21000  9163    1396     11837     2.53R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus ELEM LEVEL

The regression equation is
PROD = 13734 - 906 ELEM LEVEL

Predictor    Coef  SE Coef      T      P
Constant    13734     1358  10.12  0.000
ELEM LEVEL   -906     1790  -0.51  0.616

S = 5597.61   R-Sq = 0.7%   R-Sq(adj) = 0.0%
PRESS = 1319832730   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1     8029650   8029650  0.26  0.616
Residual Error  38  1190663354  31333246
Total           39  1198693004

Unusual Observations
Obs  ELEM LEVEL   PROD    Fit  SE Fit  Residual  St Resid
 18        1.00  24000  12828    1167     11172     2.04R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus HIGH LEVEL

The regression equation is
PROD = 13621 - 2721 HIGH LEVEL

Predictor      Coef  SE Coef      T      P
Constant    13621.4    947.9  14.37  0.000
HIGH LEVEL    -2721     2448  -1.11  0.273

S = 5527.26   R-Sq = 3.2%   R-Sq(adj) = 0.6%
PRESS = 1300072431   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1    37770022  37770022  1.24  0.273
Residual Error  38  1160922982  30550605
Total           39  1198693004

Unusual Observations
Obs  HIGH LEVEL   PROD    Fit  SE Fit  Residual  St Resid
 12        1.00  12600  10900    2256      1700     0.34 X
 15        1.00   8000  10900    2256     -2900    -0.57 X
 16        1.00   6000  10900    2256     -4900    -0.97 X
 22        1.00  13000  10900    2256      2100     0.42 X
 25        1.00   4800  10900    2256     -6100    -1.21 X
 40        1.00  21000  10900    2256     10100     2.00RX

R denotes an observation with a large standardized residual.
X denotes an observation whose X value gives it large leverage.

Regression Analysis: PROD versus LOAN

The regression equation is
PROD = 10935 + 0.186 LOAN

Predictor    Coef  SE Coef     T      P
Constant    10935     1754  6.24  0.000
LOAN       0.1862   0.1248  1.49  0.144

S = 5458.76   R-Sq = 5.5%   R-Sq(adj) = 3.1%
PRESS = 1246316213   R-Sq(pred) = 0.00%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       1    66367279  66367279  2.23  0.144
Residual Error  38  1132325725  29798045
Total           39  1198693004

Unusual Observations
Obs   LOAN   PROD    Fit  SE Fit  Residual  St Resid
 22  31000  13000  16708    2496     -3708    -0.76 X
 29  32000  19600  16894    2613      2706     0.56 X

X denotes an observation whose X value gives it large leverage.

Full Model:
Regression Analysis: PROD versus COST FERT, TYPE FERT, ...

The regression equation is
PROD = 6120 + 0.274 COST FERT + 5972 TYPE FERT - 1.41 COST CHEM - 1456 TYPE CHEM
+ 3.80 LABOR COST + 1.99 OTHER COST + 50.9 EXP - 1164 ELEM LEVEL
+ 1766 HIGH LEVEL + 0.0606 LOAN

Predictor      Coef  SE Coef      T      P
Constant       6120     2883   2.12  0.042
COST FERT    0.2744   0.5369   0.51  0.613
TYPE FERT      5972     1310   4.56  0.000
COST CHEM    -1.406    1.148  -1.22  0.231
TYPE CHEM     -1456     1251  -1.16  0.254
LABOR COST    3.797    1.031   3.68  0.001
OTHER COST    1.992    7.128   0.28  0.782
EXP           50.93    75.85   0.67  0.507
ELEM LEVEL    -1164     1381  -0.84  0.406
HIGH LEVEL     1766     2005   0.88  0.385
LOAN        0.06057  0.08397   0.72  0.477

S = 3403.74   R-Sq = 72.0%   R-Sq(adj) = 62.3%
PRESS = 643139707   R-Sq(pred) = 46.35%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression      10   862714254  86271425  7.45  0.000
Residual Error  29   335978750  11585474
Total           39  1198693004

Source      DF     Seq SS
COST FERT    1  313238900
TYPE FERT    1  259617883
COST CHEM    1   70350045
TYPE CHEM    1   10557017
LABOR COST   1  164864264
OTHER COST   1    5557800
EXP          1    1859277
ELEM LEVEL   1   22045316
HIGH LEVEL   1    8596654
LOAN         1    6027098

Unusual Observations
Obs  COST FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1500  24000  16123    1637      7877     2.64R
 40       3550  21000  13471    2059      7529     2.78R

R denotes an observation with a large standardized residual.
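
The elimination rule from item 4 (drop the single estimate with the largest p-value, then re-run) can be expressed as a short selection step. The sketch below applies it to the p-values of the full model above. The significance level of 0.05 is an assumption, since the problem set does not state one, and `next_to_drop` is a hypothetical helper name. The constant is left out of the candidates, as the intercept is kept in every model here.

```python
ALPHA = 0.05  # assumed significance level; not stated in the problem set

# P-values of the individual estimates in the full model (constant excluded)
full_model_p_values = {
    "COST FERT": 0.613, "TYPE FERT": 0.000, "COST CHEM": 0.231,
    "TYPE CHEM": 0.254, "LABOR COST": 0.001, "OTHER COST": 0.782,
    "EXP": 0.507, "ELEM LEVEL": 0.406, "HIGH LEVEL": 0.385, "LOAN": 0.477,
}


def next_to_drop(p_values, alpha=ALPHA):
    """Return the predictor with the largest p-value above alpha, or None."""
    candidates = {name: p for name, p in p_values.items() if p > alpha}
    if not candidates:
        return None  # every remaining estimate is significant: stop
    return max(candidates, key=candidates.get)


dropped = next_to_drop(full_model_p_values)
print("Drop first:", dropped)
```

Only one predictor is removed per step because the p-values of the remaining estimates change once the model is re-fitted, so the ranking has to be recomputed from each new output.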

Regression Analysis: PROD versus COST FERT, TYPE FERT, ...

The regression equation is
PROD = 6391 + 0.263 COST FERT + 5945 TYPE FERT - 1.41 COST CHEM - 1449 TYPE CHEM
+ 3.88 LABOR COST + 49.7 EXP - 1211 ELEM LEVEL + 1844 HIGH LEVEL
+ 0.0611 LOAN

Predictor      Coef  SE Coef      T      P
Constant       6391     2673   2.39  0.023
COST FERT    0.2627   0.5270   0.50  0.622
TYPE FERT      5945     1287   4.62  0.000
COST CHEM    -1.409    1.130  -1.25  0.222
TYPE CHEM     -1449     1231  -1.18  0.249
LABOR COST   3.8845   0.9669   4.02  0.000
EXP           49.73    74.56   0.67  0.510
ELEM LEVEL    -1211     1349  -0.90  0.377
HIGH LEVEL     1844     1954   0.94  0.353
LOAN        0.06107  0.08265   0.74  0.466

S = 3351.04   R-Sq = 71.9%   R-Sq(adj) = 63.5%
PRESS = 621094278   R-Sq(pred) = 48.19%

Analysis of Variance
Source          DF          SS        MS     F      P
Regression       9   861809067  95756563  8.53  0.000
Residual Error  30   336883937  11229465
Total           39  1198693004

Source      DF     Seq SS
COST FERT    1  313238900
TYPE FERT    1  259617883
COST CHEM    1   70350045
TYPE CHEM    1   10557017
LABOR COST   1  164864264
EXP          1    1266015
ELEM LEVEL   1   26170119
HIGH LEVEL   1    9614742
LOAN         1    6130083

Unusual Observations
Obs  COST FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1500  24000  16073    1602      7927     2.69R
 40       3550  21000  13563    2002      7437     2.77R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, COST CHEM, ...

The regression equation is
PROD = 6596 + 5940 TYPE FERT - 1.16 COST CHEM - 1401 TYPE CHEM + 4.18 LABOR COST
+ 60.1 EXP - 1405 ELEM LEVEL + 1883 HIGH LEVEL + 0.0628 LOAN

Predictor      Coef  SE Coef      T      P
Constant       6596     2609   2.53  0.017
TYPE FERT      5940     1271   4.67  0.000
COST CHEM    -1.161    1.002  -1.16  0.255
TYPE CHEM     -1401     1213  -1.16  0.257
LABOR COST   4.1834   0.7494   5.58  0.000
EXP           60.10    70.72   0.85  0.402
ELEM LEVEL    -1405     1276  -1.10  0.279
HIGH LEVEL     1883     1929   0.98  0.337
LOAN        0.06279  0.08158   0.77  0.447

S = 3310.18   R-Sq = 71.7%   R-Sq(adj) = 64.4%
PRESS = 602645516   R-Sq(pred) = 49.72%

Analysis of Variance
Source          DF          SS         MS     F      P
Regression       8   859017902  107377238  9.80  0.000
Residual Error  31   339675102   10957261
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
COST CHEM    1    5673547
TYPE CHEM    1     311077
LABOR COST   1  472094611
EXP          1    5557292
ELEM LEVEL   1   38175591
HIGH LEVEL   1   10063629
LOAN         1    6492459

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  16261    1538      7739     2.64R
 40       0.00  21000  13606    1975      7394     2.78R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, COST CHEM, ...

The regression equation is
PROD = 7116 + 6209 TYPE FERT - 1.07 COST CHEM - 1323 TYPE CHEM + 4.24 LABOR COST
+ 54.3 EXP - 1389 ELEM LEVEL + 1848 HIGH LEVEL

Predictor      Coef  SE Coef      T      P
Constant       7116     2504   2.84  0.008
TYPE FERT      6209     1214   5.11  0.000
COST CHEM   -1.0657   0.9877  -1.08  0.289
TYPE CHEM     -1323     1201  -1.10  0.279
LABOR COST   4.2415   0.7408   5.73  0.000
EXP           54.34    69.87   0.78  0.442
ELEM LEVEL    -1389     1268  -1.10  0.282
HIGH LEVEL     1848     1916   0.96  0.342

S = 3289.03   R-Sq = 71.1%   R-Sq(adj) = 64.8%
PRESS = 577972052   R-Sq(pred) = 51.78%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       7   852525443  121789349  11.26  0.000
Residual Error  32   346167561   10817736
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
COST CHEM    1    5673547
TYPE CHEM    1     311077
LABOR COST   1  472094611
EXP          1    5557292
ELEM LEVEL   1   38175591
HIGH LEVEL   1   10063629

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  15694    1342      8306     2.77R
 27       1.00  18000  24011    1524     -6011    -2.06R
 40       0.00  21000  13759    1953      7241     2.74R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, COST CHEM, ...

The regression equation is
PROD = 8477 + 6469 TYPE FERT - 1.14 COST CHEM - 1575 TYPE CHEM + 4.48 LABOR COST
- 1587 ELEM LEVEL + 1476 HIGH LEVEL

Predictor      Coef  SE Coef      T      P
Constant       8477     1781   4.76  0.000
TYPE FERT      6469     1160   5.58  0.000
COST CHEM   -1.1351   0.9778  -1.16  0.254
TYPE CHEM     -1575     1149  -1.37  0.180
LABOR COST   4.4817   0.6693   6.70  0.000
ELEM LEVEL    -1587     1235  -1.29  0.208
HIGH LEVEL     1476     1844   0.80  0.429

S = 3269.28   R-Sq = 70.6%   R-Sq(adj) = 65.2%
PRESS = 544797820   R-Sq(pred) = 54.55%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       6   845982018  140997003  13.19  0.000
Residual Error  33   352710986   10688212
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
COST CHEM    1    5673547
TYPE CHEM    1     311077
LABOR COST   1  472094611
ELEM LEVEL   1   40409379
HIGH LEVEL   1    6843708

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  15604    1329      8396     2.81R
 27       1.00  18000  24041    1514     -6041    -2.08R
 40       0.00  21000  14436    1738      6564     2.37R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, COST CHEM, ...

The regression equation is
PROD = 9117 + 6193 TYPE FERT - 0.947 COST CHEM - 1648 TYPE CHEM
+ 4.33 LABOR COST - 2080 ELEM LEVEL

Predictor      Coef  SE Coef      T      P
Constant       9117     1582   5.76  0.000
TYPE FERT      6193     1102   5.62  0.000
COST CHEM   -0.9466   0.9440  -1.00  0.323
TYPE CHEM     -1648     1140  -1.45  0.157
LABOR COST   4.3290   0.6381   6.78  0.000
ELEM LEVEL    -2080     1064  -1.95  0.059

S = 3251.94   R-Sq = 70.0%   R-Sq(adj) = 65.6%
PRESS = 517128644   R-Sq(pred) = 56.86%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       5   839138310  167827662  15.87  0.000
Residual Error  34   359554694   10575138
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
COST CHEM    1    5673547
TYPE CHEM    1     311077
LABOR COST   1  472094611
ELEM LEVEL   1   40409379

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  15436    1305      8564     2.88R
 27       1.00  18000  24136    1501     -6136    -2.13R
 40       0.00  21000  13594    1375      7406     2.51R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, TYPE CHEM, ...

The regression equation is
PROD = 8275 + 6016 TYPE FERT - 1798 TYPE CHEM + 4.30 LABOR COST
- 2080 ELEM LEVEL

Predictor     Coef  SE Coef      T      P
Constant      8275     1341   6.17  0.000
TYPE FERT     6016     1088   5.53  0.000
TYPE CHEM    -1798     1130  -1.59  0.120
LABOR COST  4.3002   0.6375   6.75  0.000
ELEM LEVEL   -2080     1064  -1.95  0.059

S = 3252.20   R-Sq = 69.1%   R-Sq(adj) = 65.6%
PRESS = 495756677   R-Sq(pred) = 58.64%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       4   828504830  207126208  19.58  0.000
Residual Error  35   370188174   10576805
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
TYPE CHEM    1     792878
LABOR COST   1  466673176
ELEM LEVEL   1   40389079

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  14636    1033      9364     3.04R
 27       1.00  18000  23946    1489     -5946    -2.06R
 40       0.00  21000  13639    1375      7361     2.50R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, LABOR COST, ELEM LEVEL

The regression equation is
PROD = 7089 + 5901 TYPE FERT + 4.14 LABOR COST - 1778 ELEM LEVEL

Predictor     Coef  SE Coef      T      P
Constant      7089     1139   6.22  0.000
TYPE FERT     5901     1108   5.32  0.000
LABOR COST  4.1406   0.6429   6.44  0.000
ELEM LEVEL   -1778     1069  -1.66  0.105

S = 3320.76   R-Sq = 66.9%   R-Sq(adj) = 64.1%
PRESS = 487537954   R-Sq(pred) = 59.33%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       3   801705802  267235267  24.23  0.000
Residual Error  36   396987202   11027422
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
LABOR COST   1  450577679
ELEM LEVEL   1   30478427

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 18       1.00  24000  13548     790     10452     3.24R
 27       1.00  18000  24019    1520     -6019    -2.04R
 40       0.00  21000  12255    1087      8745     2.79R

R denotes an observation with a large standardized residual.

Regression Analysis: PROD versus TYPE FERT, LABOR COST

The regression equation is
PROD = 6225 + 5698 TYPE FERT + 4.11 LABOR COST

Predictor     Coef  SE Coef     T      P
Constant      6225     1037  6.00  0.000
TYPE FERT     5698     1127  5.05  0.000
LABOR COST  4.1073   0.6577  6.25  0.000

S = 3398.99   R-Sq = 64.3%   R-Sq(adj) = 62.4%
PRESS = 496694704   R-Sq(pred) = 58.56%

Analysis of Variance
Source          DF          SS         MS      F      P
Regression       2   771227375  385613688  33.38  0.000
Residual Error  37   427465629   11553125
Total           39  1198693004

Source      DF     Seq SS
TYPE FERT    1  320649696
LABOR COST   1  450577679

Unusual Observations
Obs  TYPE FERT   PROD    Fit  SE Fit  Residual  St Resid
 14       1.00   6000  12943     765     -6943    -2.10R
 18       1.00  24000  14240     688      9760     2.93R
 38       0.00  17500  18799    1764     -1299    -0.45 X
 40       0.00  21000  11350     963      9650     2.96R

R denotes an observation with a large standardized residual.
X denotes an observation whose X value gives it large leverage.

Final Model: PROD = 6225 + 5698 TYPE FERT + 4.11 LABOR COST
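
As an illustration of how the final model is used, the sketch below evaluates it for observation 18, for which the output reports TYPE FERT = 1, LABOR COST = 564 and PROD = 24000. The coefficients are the rounded values from the Coef column, so the result should match the reported Fit only approximately; `predict_prod` is a hypothetical helper name.

```python
# Coefficients copied from the final regression output (rounded as printed)
B0, B_TYPE_FERT, B_LABOR_COST = 6225, 5698, 4.1073


def predict_prod(type_fert, labor_cost):
    """Fitted PROD from the final two-predictor model."""
    return B0 + B_TYPE_FERT * type_fert + B_LABOR_COST * labor_cost


# Observation 18: TYPE FERT = 1, LABOR COST = 564, observed PROD = 24000
fit_18 = predict_prod(type_fert=1, labor_cost=564)
residual_18 = 24000 - fit_18
print(f"Fit = {fit_18:.0f}, Residual = {residual_18:.0f}")
```

Because TYPE FERT is a 0/1 indicator, its coefficient shifts the intercept: the fitted line for TYPE FERT = 1 lies 5698 units above the line for TYPE FERT = 0 at every level of LABOR COST.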
5 Once the final model is obtained, determine R2, R2(adj), s2(b) and the confidence
interval for the estimates.

Final Model: PROD = 6225 + 5698 TYPE FERT + 4.11 LABOR COST
R2 = 64.3%
R2 (adj) = 62.4%
S2(b) =
Confidence Interval =
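
The quantities asked for in item 5 can be obtained from the final regression output: s2(b) is the square of each SE Coef, and a confidence interval is the estimate plus or minus a t critical value times its standard error. The sketch below works this out under two assumptions that are not stated in the problem set: a 95% confidence level, and the tabulated value t(0.975, 37) = 2.026 for the 37 residual degrees of freedom. Inputs are the rounded values printed by Minitab.

```python
# Assumed: 95% confidence level, t(0.975, df=37) taken from a t-table
T_CRIT = 2.026

# (Coef, SE Coef) copied from the final regression output
estimates = {
    "Constant": (6225, 1037),
    "TYPE FERT": (5698, 1127),
    "LABOR COST": (4.1073, 0.6577),
}
ss_reg, ss_total = 771227375, 1198693004
n, p = 40, 3  # observations; estimated parameters including the constant

r_sq = ss_reg / ss_total
r_sq_adj = 1 - (1 - r_sq) * (n - 1) / (n - p)

variances = {}
intervals = {}
for name, (coef, se) in estimates.items():
    variances[name] = se ** 2                                  # s2(b)
    intervals[name] = (coef - T_CRIT * se, coef + T_CRIT * se)

print(f"R2 = {100 * r_sq:.1f}%, R2(adj) = {100 * r_sq_adj:.1f}%")
for name in estimates:
    low, high = intervals[name]
    print(f"{name}: s2(b) = {variances[name]:.4g}, CI = ({low:.4g}, {high:.4g})")
```

None of the three intervals contains zero, which agrees with the p-values of 0.000 reported for all three estimates.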
6 Determine if multicollinearity is present.
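
One common diagnostic for this is the variance inflation factor (VIF). No VIF output appears in the session above, so the sketch below backs it out from figures that are reported, using the identity s2(b_j) = MSE / (S_jj (1 - R_j^2)), where S_jj = (n - 1) times the sample variance of predictor j, which gives VIF_j = s2(b_j) S_jj / MSE. This is an illustrative calculation from rounded inputs, not a Minitab result, and the rule of thumb mentioned in the comment is a convention rather than something stated in the problem set.

```python
n = 40
mse = 11553125  # MS for Residual Error in the final model

# (SE Coef from the final model, sample variance from the descriptive statistics)
predictors = {
    "TYPE FERT": (1127, 0.2333),
    "LABOR COST": (0.6577, 685617),
}

vif = {}
for name, (se_coef, variance) in predictors.items():
    s_jj = (n - 1) * variance               # corrected sum of squares of x_j
    vif[name] = se_coef ** 2 * s_jj / mse   # VIF_j = s2(b_j) * S_jj / MSE

for name, value in vif.items():
    # Values close to 1 indicate little inflation; 5 or 10 are common cutoffs
    print(f"VIF({name}) = {value:.3f}")
```

Both values come out very close to 1, which suggests that TYPE FERT and LABOR COST are nearly uncorrelated in this sample and that multicollinearity is not a concern in the final model.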
