Professional Documents
Culture Documents
2. ordinal scales: the categories are ordered much like the ordinal
number, first, second,. It does not make sense to talk of distance or spacing between first and second nor to compare spacings between pairs of response categories.
Note:
In applications, the distinction between nomial or ordinal scales is usually but not always clear. For example, hair color and eye color can be ordered to a large extent on the grey-scale from light to dark and are therefore ordinal. However, unless there is a clear connection with electromagnetic spectrum or a grey-scale, colors are best regarded as nomial.
Note:
It is essential the same conclusion can be arrived even though the number or choice of response categories has been changed. As a consequence, if a new category is formed by combining adjacent categories of the old scale, the form of the conclusions should be unaffected. This is an important non-mathematical point that is difficult to make mathematically rigorous. This point lead fair directly to models based on the cumulative probabilities than the category probabilities
rj
rather
1. logistic scale:
It is the simplest model. The form is
r j (x ) log = ( ) 1 r x j
x .
This model is also known as the proportional-odds model since the ratio of the odds is
1 r j (x 2
1 r j (x 1 ) = exp [ ( x 1 x 2 ) r j (x 2 )
r j (x 1 )
1 r j (1 ) = e r j (0 ) . 1 r j (0 )
2. complementary log-log scale:
The form is
r j (1 )
log { log 1 r j ( x ) } =
x .
Note:
The model based on logistic scale may be derived from the notion of a tolerance distribution or an underlying unobserved continuous random variable Z, Z = x +
is distributed as logistic
j 1
< Z j , then Y = y
r j ( x ) = P (Y j ) = P (Z = P ( = exp ( = P (Z x
j j j
x x
j
1 + exp (
) )
) )
)
j
r j (x ) log = ( ) 1 r x j
Note:
It is sometimes claimed that the models based on logistic scale and complementary log-log scale and related models are appropriate only if there exists a latent variable Z. This claim seems to be too strong and, in any case, the existence of Z is usually unverifiable in practice.
Note:
r j (x ) j x log = ( ) 1 r x exp ( x ) , j
where
plays the role of linear predictor for the mean and in the
denominator
dispersion or variance. if
1 1 j
= exp (
).
If
<1
increasing in
= 0
) against the
Note:
Models in which the k-1 regression lines are not parallel can be specified by
r j (x ) log = ( ) 1 r x j
arbitrarily. 2. It does not normally make sense to form a new category by amalgamating adjacent categories. 3. Attached to the jth category is a cardinal number or score,
sj,
such that the difference between scores is a measure of distance between or separation of categories.
Note:
Genuine interval scales having these 3 properties are rare in practice because, although properties 1 and 2 may be satisfied, it is rare to find a response scale having well determined cardinal scores attached to the categories. There are 3 options for model construction.
1.
r j (x ) s j + s j +1 = + log 0 1 ( ) 1 2 r x j
where c j =
x x (c j c )
s j + s j +1 2
s j + s j +1 = log c it or j . 2
2.
The probability
(x i ) =
exp j ( x i )
k j =1 j
exp [ (x )] ,
i
where
j ( x i ) = j + ( x i )s j + i .
Note:
6
The relative odds for category j over category k in the above model are
j (x ) = exp ( j k + ( x )(s j s k )) k (x )
Thus, the relative odds are increased multiplicatively by the factor
exp
3.
(s
sk
x .
(x )s
j =1 j i
= xi
sj
weights.
(x )s
j =1 j i
expected score is
Si =
s
j =1
y ij
.
mi
If there are only two treatment groups, with observed counts we may use the standardized difference as test statistic
{y , y }
1j 2j
T =
S1 S 2
2 k 1 k 1 2 ~ ~ j s j j s j + m m2 j =1 j =1 1
~ where j = m1 + m2 .
y1 j + y2 j
(x i ) =
exp j ( x i )
k j =1 j
exp [ (x )] ,
i
where
j ( x i ) = j ( x 0 ) + ( x i x 0 ) j + i .
Note:
The relative odds for category j over category k in the above model are
j (x ) j (x 0 ) = exp [( x x 0 )( j k )] k (x ) k (x 0 )
Thus, the relative odds are increased multiplicatively by the factor
(x 0 ) exp ( ( ) x k 0
j
x .
To test the hypothesis, 80 cows were assigned at random to one of the two diets. More cows become pregnant at first insemination but a few require a second or third insemination. The response variable is the pregnancy rate. The response, probability and odds are summarized in the following table: Insemination Response Probability Odds Y1 | m 1 First 1
1 r1
Second Third
Y2 | m y1
2 3
1 r1 1 r2
2 3
1 r2 1 r3
Y3 | m y1 y 2
g (
)=
+ x
2
2 g 1 r 1 3 g 1 r 2
j 1 r
= =
=
+ x + x
log
+ x
1 , 2 ,K, k 1