Professional Documents
Culture Documents
Factor Analysis
Factor analysis is a general name denoting a class of procedures primarily
used for data reduction and summarization.
Factor analysis is an interdependence technique in that an entire set of
interdependent relationships is examined without making the distinction
between dependent and independent variables.
Factor analysis is used in the following circumstances:
To identify underlying dimensions, or factors, that explain the
correlations among a set of variables.
To identify a new, smaller, set of uncorrelated variables to replace the
original set of correlated variables in subsequent multivariate analysis
(regression or discriminant analysis).
To identify a smaller set of salient variables from a larger set for use in
subsequent multivariate analysis.
Xi
Aij
F
Vi
Ui
m
=
=
i th standardized variable
standardized multiple regression coefficient of
variable i on common factor j
=
common factor
=
standardized regression coefficient of variable i on
unique factor i
=
the unique factor for variable i
=
number of common factors
=
=
=
estimate of i th factor
weight or factor score coefficient
number of variables
Factor scores. Factor scores are composite scores estimated for each
respondent on the derived factors.
Kaiser-Meyer-Olkin (KMO) measure of sampling adequacy. The KaiserMeyer-Olkin (KMO) measure of sampling adequacy is an index used to examine
the appropriateness of factor analysis. High values (between 0.5 and 1.0)
indicate factor analysis is appropriate. Values below 0.5 imply that factor
analysis may not be appropriate.
Scree plot. A scree plot is a plot of the Eigenvalues against the number of
factors in order of extraction.
Calculation of
Factor Scores
Determination of Model Fit
Selection of
Surrogate Variables
S
D
A
S
A
6 7
1 2 3 4 5
6 7
1 2 3 4 5
6 7
1 2 3 4 5
6 7
1 2 3 4 5
6 7
6 7
RESPONDENT
NUMBER
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
V1
7.00
1.00
6.00
4.00
1.00
6.00
5.00
6.00
3.00
2.00
6.00
2.00
7.00
4.00
1.00
6.00
5.00
7.00
2.00
3.00
1.00
5.00
2.00
4.00
6.00
3.00
4.00
3.00
4.00
2.00
V2
3.00
3.00
2.00
5.00
2.00
3.00
3.00
4.00
4.00
6.00
4.00
3.00
2.00
6.00
3.00
4.00
3.00
3.00
4.00
5.00
3.00
4.00
2.00
6.00
5.00
5.00
4.00
7.00
6.00
3.00
V3
6.00
2.00
7.00
4.00
2.00
6.00
6.00
7.00
2.00
2.00
7.00
1.00
6.00
4.00
2.00
6.00
6.00
7.00
3.00
3.00
2.00
5.00
1.00
4.00
4.00
4.00
7.00
2.00
3.00
2.00
V4
4.00
4.00
4.00
6.00
3.00
4.00
3.00
4.00
3.00
6.00
3.00
4.00
4.00
5.00
2.00
3.00
3.00
4.00
3.00
6.00
3.00
4.00
5.00
6.00
2.00
6.00
2.00
6.00
7.00
4.00
V5
2.00
5.00
1.00
2.00
6.00
2.00
4.00
1.00
6.00
7.00
2.00
5.00
1.00
3.00
6.00
3.00
3.00
1.00
6.00
4.00
5.00
2.00
4.00
4.00
1.00
4.00
2.00
4.00
2.00
7.00
V6
4.00
4.00
3.00
5.00
2.00
4.00
3.00
4.00
3.00
6.00
3.00
4.00
3.00
6.00
4.00
4.00
4.00
4.00
3.00
6.00
3.00
4.00
4.00
7.00
4.00
7.00
5.00
3.00
7.00
2.00
Note: Small values of the KMO statistic indicate that the correlations
between pairs of variables cannot be explained by other variables and that
factor analysis may not be appropriate.
Correlation Matrix
Variables
V1
V2
V3
V4
V5
V6
V1
1.000
-0.530
0.873
-0.086
-0.858
0.004
V2
0.530
1.000
-0.155
0.572
0.020
0.640
V3
0.873
-0.155
1.000
-0.248
-0.778
-0.018
V4
-0.086
-0.572
-0.248
1.000
-0.007
0.640
V5
0.858
0.020
--0.778
-0.007
1.000
-0.136
V6
0.004
0.640
-0.018
0.640
0.136
1.000
Correlation Matrix
Variables
V1
V2
V3
V4
V5
V1
1.000
V2
-0.530
1.000
V3
0.873
-0.155
1.000
V4
-0.086
0.572
-0.248
1.000
V5
-0.858
0.020
-0.778
-0.007
1.000
V6
0.004
0.640
-0.018
0.640
-0.136
V6
1.000
In common factor analysis, the factors are estimated based only on the
common variance.
Communalities are inserted in the diagonal of the correlation matrix.
This method is appropriate when the primary concern is to identify the underlying
dimensions and the common variance is of interest.
This method is also known as principal axis factoring.
What is meant by total variance in the data set? To understand the meaning of
total
What is a communality?
A communality refers to the percent of variance in an
observed variable that is accounted for by the retained
components (or factors).
A given variable will display a large communality if it
loads heavily on at least one of the studys retained
components.
Although communalities are computed in both
procedures, the concept of variable communality is
more relevant in a factor analysis than in principal
component analysis.
Initial
1.000
1.000
1.000
1.000
1.000
1.000
Extraction
0.926
0.723
0.894
0.739
0.878
0.790
(2.731/6)100=45.52
Eigen value
2.731
2.218
0.442
0.341
0.183
0.085
% of variance
45.520
36.969
7.360
5.688
3.044
1.420
6.0
decreasing
Cumulat. %
45.520
82.488
89.848
95.536
98.580
100.000
Eigen value
2.731
2.218
% of variance
45.520
36.969
Cumulat. %
45.520
82.488
Factor Matrix
Variables
V1
V2
V3
V4
V5
V6
Factor 1
0.928
-0.301
0.936
-0.342
-0.869
-0.177
Factor 2
0.253
0.795
0.131
0.789
-0.351
0.871
Eigenvalue
2.688
2.261
% of variance
44.802
37.687
Cumulat. %
44.802
82.488
Factor 1
0.962
-0.057
0.934
-0.098
-0.933
0.083
Factor 2
-0.027
0.848
-0.146
0.845
-0.084
0.885
Factor 1
0.358
-0.001
0.345
-0.017
-0.350
0.052
Factor 2
0.011
0.375
-0.043
0.377
-0.059
0.395
V1
V2
V3
V4
V5
V6
0.926
0.024 -0.029
0.031
0.038 -0.053
-0.078
0.723
0.022 -0.158
0.038 -0.105
0.902 -0.177
0.894 -0.031
0.081
0.033
-0.117
0.730 -0.217
0.739 -0.027 -0.107
-0.895 -0.018 -0.859
0.020
0.878
0.016
0.057
0.746 -0.051
0.748 -0.152
0.790
A Priori Determination.
Because of prior knowledge, the researcher may knows how many factors
to expect and thus can specify the number of factors to be extracted
beforehand. ( review of literature /judgment/ focus group )
If the number of variables is less than 20, this approach will result in a
conservative number of factors.
Scree Plot
3.0
2.5
Eigenvalue
2.0
1.5
1.0
0.5
0.0
1
3
4
5
Component Number
Out put
Important out put of the factor analysis is Factor Matrix, also called
as factor pattern matrix
it contains the coefficients used to express the standardized
variables in terms of the factors
these coefficients and factor loadings represents the
correlations between the factors and variables
Coefficients with a large absolute value indicates that the
factor & variable are closely related
coefficients of factor matrix is used to interpret the factor
2
X
Variable
2
3
2
X
X
X
2. The rotation is called oblique rotation when the axes are not
maintained at right angles, and the factors are correlated.
Sometimes, allowing for correlations among factors can simplify the factor pattern matrix.
Oblique rotation should be used when factors in the population are likely to be strongly
correlated.
Component
2
V1
V6
V2
V4
V2
1.0
0.0
V3
V1
V5
Component 2
0.5
-0.5
-1.0
1.0
0.5
0.0
-0.5
-1.0
0.962
-5.72E-02
-2.66E-02
0.848
V3
0.934
-0.146
V4
-9.83E-02 0.854
V5
-0.933
V6
8.337E-02 0.885
-8.40E-02
By examining the factor matrix, one could select for each factor
the variable with the highest loading on that factor.
That variable could then be used as a surrogate variable for the
associated factor.
However, the choice is not as easy if two or more variables have
similarly high loadings.
In such a case, the choice between these variables should be based on
theoretical and measurement considerations.
SPSS Windows
To select this procedures using SPSS for Windows click:
Analyze>Data Reduction>Factor
Factor Analysis-result
Result of un rotated factor analysis
Data is - anxiety about SPSS
Factor Analysis-result
Result of un rotated factor analysis
KMO and Bartlett's Test
Kaiser-Meyer-Olkin Measure of Sampling Adequacy.
0.9302
Approx. ChiSquare
df
Sig.
19334
253
0
Initial Eigenvalues
Extraction Sums of Squared Loadings
% of
Varianc Cumul
Total e
ative % Total
% of Variance
Cumulative %
1 7.29 31.696 31.696
7.29
31.6958568
31.6958568
2 1.739 7.5601 39.256 1.739
7.560124986
39.25598178
3 1.317 5.725 44.981 1.317
5.725006643
44.98098843
4 1.227 5.3356 50.317 1.227
5.335644146
50.31663257
5 0.988 4.2951 54.612
6 0.895 3.8927 58.504
7 0.806 3.5024 62.007
8 0.783 3.4036 65.41
9 0.751 3.2651 68.676
10 0.717 3.1172 71.793
11 0.684 2.9721 74.765
12 0.67 2.9109 77.676
13 0.612 2.6609 80.337
14 0.578 2.5119 82.849
15 0.549 2.3878 85.236
16 0.523 2.2746 87.511
17 0.508 2.2104 89.721
18 0.456 1.9823 91.704
19 0.424 1.8426 93.546
20 0.408 1.773 95.319
21 0.379 1.6499 96.969
22 0.364 1.5827 98.552
Items
Loadings
I have little experience of computers
0.80
All computers hate me
0.64
Computers are useful only for playing games
0.55
I worry that I will cause irreparable damage
because of my in-competenece with computers 0.65
Computers have minds of their own and
deliberately go wrong whenever I use them
0.58
Computers are out to get me
0.46
SPSS always crashes when I try to use it
0.68
Reliability Statistics
Cronbach's Alpha
.674
Factor-2
Items
Statistics makes me cry
Standard deviations excite me
I dream that Pearson is attacking me with
correlation coefficients
I don't understand statistics
People try to tell you that SPSS makes
statistics easier to understand but it doesn't
I weep openly at the mention of central
tendency
I can't sleep for thoughts of eigen vectors
I wake up under my duvet thinking that I am
trapped under a normal distribution
Reliability Statistics
Cronbach's Alpha
.605
Loadings
0.50
0.57
0.52
0.43
0.52
0.51
0.68
0.66
Factor-3
Items
Loadings
0.83
0.75
0.75
Reliability Statistics
Cronbach's Alpha
.819
Factor-4
Items
My friends will think I'm stupid for not being able to cope
with SPSS
Loadings
0.54
0.65
0.43
0.65
0.59
Reliability Statistics
Cronbach's Alpha
.570
Factor Analysis-result
Result of rotated factor analysis
Run the factor analysis with out considering the
variables having extraction value less than .4
Items
Loadings
0.80
0.68
0.07
0.74
Reliability Statistics
Cronbach's Alpha = .710
0.53
0.68
0.62
Factor-2
Items
Loadings
0.44
0.58
0.51
0.70
Reliability Statistics
Cronbach's Alpha = .391
0.46
0.62
Factor-3
Items
Loadings
0.85
0.76
0.76
Reliability Statistics
Cronbach's Alpha =
.819
Factor-4
Items
My friends will think I'm stupid for not being able to cope
with SPSS
0.51
0.67
0.64
Reliability Statistics
Cronbach's Alpha = .409
Loadings