The frequency distribution summarizes the given mass of data, but for practical purposes there is usually
a need for further condensation, particularly when we want to compare two or more different
distributions. We may even reduce the entire distribution to one number which represents the
distribution. We calculate Measures of central tendency for this purpose. These measures summarize
the given mass of data in much more concise fashion than a frequency distribution. Frequency
distribution has too many details while an average reduces the large number of observations to one
figure.
The term average is used very often, e.g., average Indian, average marks, or average size.
Sometimes it means typical or usual, as in average Indian. It may also refer to the result of a specific
process of calculation, as in average marks of students.
Average is used to reduce two or more aggregates to a common denominator, in order to make
comparisons. It can be used to compare the totals for time periods of different lengths. For example,
suppose we have the figures of production for the months of January and February 1985: the production
for the month of January is 4000 units while for the month of February it is 3640 units. We cannot compare
the two figures, 4000 and 3640 units, directly. The reason is, January has 31 days while February has 28
days. Here we find the average daily production by dividing the total by the number of days. The average
daily production of January is 4000/31 = 129.03 units while the average daily production of February is
3640/28 = 130 units, i.e., there is no significant difference between the production rate for the two
months. Though the total production in February is less, the daily production rate is almost the same.
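The comparison above can be checked with a few lines of Python. This is only an illustrative sketch; the function name average_daily_production is a hypothetical helper and not part of the original notes.

```python
def average_daily_production(total_units, days):
    """Return production per day for a period of the given length."""
    return total_units / days

# Figures from the text: January 1985 (31 days) and February 1985 (28 days).
january = average_daily_production(4000, 31)
february = average_daily_production(3640, 28)

print(round(january, 2))   # daily rate for January
print(round(february, 2))  # daily rate for February
```

Dividing by the number of days puts both months on a common footing, which is the sense in which an average acts as a common denominator.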
The number of deaths due to traffic accidents in two different periods should not be compared directly.
The number should be compared with the total population and deaths per thousand should be calculated.
The number of accidents is affected by the number of vehicles on the road and therefore we can also
compare the number of accidents per 100 vehicles.
Averages are also used as a measure of typical size. It gives one figure that is typical of all the
observations that are essentially different. If the items are scattered, the measure will not be very
satisfactory while for homogeneous data the average will be a good representative of the data. But it is
necessary to have this kind of summary statement for many statistical data.
There are five averages which are conceptually different, and each of them is, from some point of view, a
central value of the distribution. The averages are also referred to as measures of central tendency
because they are used to describe a magnitude near the centre of a distribution about which the values
cluster.
If we have the distribution of marks of students, very few students will get marks like 4, 5, 8, ... and
similarly there will be a small number of students getting above 80. Most of the students will have marks
between 40 and 60 and the average will be somewhere within these limits, that is, average is a central
figure. Averages are also known as measures of location.
Each of these averages has its own advantages and disadvantages. But there are certain characteristics,
which make the average a good representative of the given data.
DESIDERATA FOR SATISFACTORY AVERAGE
1. An average should be rigidly defined; otherwise its value will be affected by the bias of the
person who calculates it. It cannot be a good representative if it is not a fixed value.
3. It should be easy to calculate and easy to understand. If the calculation requires a tedious
mathematical process, it will not be understood by many and its use will be limited.
4. It should be capable of further algebraic treatment. This makes the average more useful.
5. It should not be affected much by sampling fluctuations. If two independent samples are taken
from the same population, the average should not differ significantly.
It should also be remembered that the average should be expressed in the same unit as the series given,
e.g., if we have the heights of 50 children in cms and the average is 130, it should be written as 130 cms.
If the incomes of 100 families are given in (00 Rs.) and the average is 30, the average should be
expressed as 30 (100 Rs.) or Rs.3000/-. Now we consider the types of averages.
ARITHMETIC MEAN:
Arithmetic mean is defined as the sum of all the observations in the distribution divided by the number of
observations, i.e., if a variable x takes the values x1 , x2 , x3 ,....., xn , its arithmetic mean is defined as
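$$\bar{x} = \frac{x_1 + x_2 + x_3 + \cdots + x_n}{n}$$
i.e.,
$$\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$$
In the case of a frequency distribution, where the values x1 , x2 , x3 ,....., xn occur with frequencies
f1 , f2 , f3 ,....., fn ,
$$\bar{x} = \frac{f_1 x_1 + f_2 x_2 + f_3 x_3 + \cdots + f_n x_n}{f_1 + f_2 + f_3 + \cdots + f_n} = \frac{\sum_{i=1}^{n} f_i x_i}{\sum_{i=1}^{n} f_i}$$
Short-cut Method:
$$\bar{x} = A + \frac{\sum f_i d_i}{\sum f_i} \times c$$
where $d_i = \frac{x_i - A}{c}$, A is an assumed mean and c is the length of the class interval.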
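As a rough illustration, the direct formula (sum of f times x divided by sum of f) and the short-cut method (assumed mean A plus c times the mean of the step deviations d = (x - A)/c) can be sketched in Python. The function names grouped_mean and shortcut_mean are hypothetical, and each class is assumed to be represented by its mid-point. The data are the daily wages distribution from the exercises (classes 20-40 to 120-140).

```python
def grouped_mean(midpoints, freqs):
    """Direct method: sum(f * x) / sum(f)."""
    return sum(f * x for f, x in zip(freqs, midpoints)) / sum(freqs)

def shortcut_mean(midpoints, freqs, assumed_mean, width):
    """Short-cut method: A + c * sum(f * d) / sum(f), with d = (x - A) / c."""
    deviations = [(x - assumed_mean) / width for x in midpoints]
    weighted = sum(f * d for f, d in zip(freqs, deviations))
    return assumed_mean + width * weighted / sum(freqs)

# Mid-points of the classes 20-40, 40-60, ..., 120-140 and their frequencies.
midpoints = [30, 50, 70, 90, 110, 130]
freqs = [7, 12, 16, 13, 13, 4]

direct = grouped_mean(midpoints, freqs)
shortcut = shortcut_mean(midpoints, freqs, assumed_mean=70, width=20)
print(round(direct, 2), round(shortcut, 2))
```

Both methods give the same value whatever assumed mean is chosen; the short-cut method only makes the hand calculation lighter.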
Exercise:
1.
b) 12.4
c) 1418
d) 34.3 ]
2.
x:    12    14    16    18    20    22
f:     5    10    15    12     8     3
[ Answer: 16.6415 ]
3.
x:     6     7     8     9    10    11
f:    32    40    52    40    32    25
[ Answer: 8.3394 ~ 8 ]
4.
x:     5     6     7     8     9    10    11
f:    11    15    20    16    12     9     4
[ Answer: 7.5287 ]
5.
The following data represents frequency distribution of weights of children, find its arithmetic mean .
Wt. in Kgs.:        11    12    13    14    15    16
No. of Children:     7    11    15    13     9     4
[ Answer: 13.305 kgs ]
6.
The following data represents distribution of marks (out of 10) for a class of students. Find the
arithmetic mean.
Marks:               0     1     2     3     4     5     6     7     8     9    10
No. of Students:     2     4     5     7    11    15    13    10     7     3     1
[ Answer: 5.06 ]
7.
Class Interval:     0-10    10-20    20-30    30-40    40-50
Frequency:             6       11       15        8        3
[ Answer: 22.91 ]
8.
Calculate the arithmetic mean for the following data giving daily wages of workers.
Wages in Rs.:      20-40    40-60    60-80    80-100    100-120    120-140
No. of workers:        7       12       16        13         13          4
[ Answer: 77.69 ]
9.
No. of persons:       15       33       54       80       97      100
[ Answer: 27.1 ]
10.
Class Interval:     5-15    15-25    25-35    35-45    45-55
Frequency:             3        8       13       10        5
[ Answer: 31.54 ]
11.
Calculate the arithmetic mean for the following data representing monthly salary of a group of
employees.
Salary in Rs.:     700-800    800-900    900-1100    1100-1500    1500-1600
No. of persons:         32         43          55           22           18
[ Answer: Rs.1012.06 ]
12.
Class Interval:    20-30    30-50    50-70    70-90    90-100
Frequency:             9       14       20       12         5
[ Answer: 57 ]
13.
Class Interval:     5-10    10-15    15-25    25-35    35-45    45-50
Frequency:             2        8       12       15       11        5
[ Answer: Rs.27.97 ]
14.
The following data represents yield per acre (in kgs.) for a number of farms. Find the arithmetic
mean.
Yield per acre:    700-750    750-800    800-850    850-900    900-950    950-1000
No. of farms:           32         43         55         22         17          18
[ Answer: 825.8 kgs. ]
15.
The following is the distribution of heights in cms of 50 students. Find the mean.
Height in cms:      140-145    145-150    150-155    155-160    160-165
No. of students:          7         10         15         13          5
[ Answer: 152.4 cms ]
16.
Class Interval:    148-152    152-156    156-160    160-164    164-168    168-172    172-176
Frequency:               3          5          9         15         10          6          2
17.
The following data represents the distribution of balance amounts in bank accounts at the end of
March 2002. Find the average balance amount.
Amount in Rs.:      500-599    600-699    700-799    800-899    900-999    1000-1099    1100-1199    1200-1299
No. of accounts:         25         42         55         70         62           50           35           11
[ Answer: Rs.877.21 ]
18.
Class Interval:    100-399    400-699    700-999    1000-1299    1300-1599    1600-1899
Frequency:              12         20         25           35           15            8
[ Answer: Rs.966.89 ]
19.
Class Interval:    50-99    100-149    150-199    200-249    250-299    300-349    350-399
Frequency:             4          9         11         15         12          8          2
[ Answer: 218.76 ]
20.
The following data represents salary of employees in an office. Find the average salary.
Salary in Rs.:        900-1000    1000-1200    1200-1400    1400-1600    1600-1800    1800-1900    1900-2000
No. of Employees:            4           11           19           22           18            9            3
[ Answer: Rs.1473.26 ]
21.
If the mean for the following data is Rs.56/-, find the missing frequency.
Wages in Rs.:      30-40    40-50    50-60    60-70    70-80    80-90
No. of persons:       10       20       40        ?        8        6
[ Answer: 16 ]
22.
If the average marks of students are 26.75, find the number of students belonging to the class
interval 10-20.
Marks:              0-10    10-20    20-30    30-40    40-50
No. of students:       3        ?       15       10        5
[ Answer: 7 ]
23.
If the average wages of workers are Rs.73.25, find the number of workers with wage between Rs.80
and Rs.100.
Wages in Rs.:      20-40    40-60    60-80    80-100    100-120    120-140
No. of persons:       10       18       22         ?         11          5
[ Answer: 14 ]
24.
If the mean value for the following data is 33, find the missing frequency.
Marks:              0-10    10-20    20-30    30-40    40-50    50-60
No. of students:       5       10       25       30        ?       10
[ Answer: 20 ]
25.
Find the missing frequencies if the mean is 21.9 and the total of frequencies is 75.
Class Interval:    0-5    5-10    10-15    15-20    20-25    25-30    30-35    35-40
Frequencies:         2       5        7        ?        ?       16        8        3
[ Answer: 13 and 21 ]
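Combined mean of two or three groups with sizes n1 , n2 , n3 and means x̄1 , x̄2 , x̄3 :
$$\bar{x}_{12} = \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2}{n_1 + n_2}$$
$$\bar{x}_{123} = \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2 + n_3 \bar{x}_3}{n_1 + n_2 + n_3}$$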
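The combined mean is the size-weighted average of the group means, that is, the sum of n times the group mean divided by the sum of n. A minimal Python sketch follows; the function name combined_mean is hypothetical, and the figures are taken from problems 1 and 6 below.

```python
def combined_mean(sizes, means):
    """Mean of the pooled data, given each group's size and mean."""
    total = sum(sizes)
    return sum(n * m for n, m in zip(sizes, means)) / total

# Problem 1: 100 students averaging 60 and 50 students averaging 90.
two_groups = combined_mean([100, 50], [60, 90])

# Problem 6: groups of 70, 80 and 50 workers with mean weights 60, 57 and 62 kgs.
three_groups = combined_mean([70, 80, 50], [60, 57, 62])

print(two_groups, round(three_groups, 1))
```

The same relation can be rearranged to recover an unknown group mean or group size, which is what most of the problems in this set ask for.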
PROBLEMS:
1.
The average marks of a group of 100 students in Accountancy are 60 and for another group of 50
students, the average marks are 90. Find the average marks of the combined group of 150
students. [ 70 marks ]
2.
The average daily wages for 90 workers in a factory is Rs.59/-, the average wages for 50 male
workers out of them is Rs.63/-. Find the average wages for the remaining female workers.
[ Rs.54/- ]
3.
The average marks of a class of students are 76. The average marks of boys and girls are 69 and
83 respectively. If there are 100 boys in the class find the number of girls in the class. [ No. of
girls = 100 ]
4.
The mean daily wages of a group of employees are Rs.180/- The mean daily wages of men and
women are Rs.186/- and Rs.175/- respectively. Find the ratio of men and women in the group.
[5:6]
5.
If the average marks in a certain test of boys and girls in a class are 80 and 85 respectively and if
the average marks for the entire class are 83.75, find the percentage of boys in the class. [ 25% ]
6.
The mean weight of a group of 70 workers is 60 kgs. The second group consists of 80 workers
with average weight 57 kgs and there are 50 workers in the third group with average weight 62
kgs. Find the average weight of the combined group of 200 workers. [ 59.3 kgs ]
7.
The mean marks of 100 boys in a class are 45. The mean marks of the entire class of 150
students are 50. Find the mean marks of the remaining group of girls. [ 60 ]
8.
There are three groups in a class of 100 students. The first contains 25 students with average
pocket money Rs.62/-, the second group consists of 50 students with average pocket money
Rs.55/-. Find the average pocket money of the students from the third group if the average for the
entire class is Rs.58/-. [ Rs.60/- ]
9.
The average monthly salary of employees of a firm is Rs.5200/-. The average salaries of gents
and ladies from the firm are Rs.6000/- and Rs.4800/- . Find the percentage of gents and ladies in
the firm. [ 1 : 2 ]
10.
A garment factory makes both men's and women's shirts. The average profit of the factory is 8%
of sales. Average profit on men's shirts is 10%. Women's shirts form 60% of the total sales. What
is the average profit on sales of women's shirts? [ 6.67% ]
11.
There are men, women and children working in a factory. The total number of workers is 500.
The average daily wages of 250 male workers is Rs.100/-. The average daily wages of 150
women workers is Rs.80/-. What is the average daily wages of children working in that factory,
given that the average daily wages of all the 500 workers taken together is Rs.82/- [ Rs.40/- ]
12.
The sum of the deviations of a certain number of observations measured from 4 is 72 and the sum
of the deviations of the same observations from 7 is -3. Find the number of observations and
their mean. [ 25 & 6.88 ]
13.
The mean of a certain number of observations is 40. If two more observations with values 50 and
64 are added to the data, the mean rises to 42. Find the number of items in the original data.
[ 15 ]
14.
The mean weight of 98 students as calculated from a frequency distribution is found to be 50 kgs.
It is later discovered that the frequency of the class interval 30-40 was wrongly taken as 8
instead of 10. Calculate the correct arithmetic mean. [ 49.7 kgs ]
15.
The mean monthly salaries paid to all 77 employees in a company was Rs.78/-. The mean
monthly salaries of 32 of them was Rs.75/- and that of the other 25 was Rs.82/-. What was the
mean salary of the remaining? [ Rs.77.80 ]
MEDIAN
Definition:
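If x1 , x2 , x3 ,....., xn are n observations arranged either in ascending order or in descending order and if
i.
n is ODD, then there will be only one middle term and the value of the middle term is the median.
i.e., Median = value of the $\left(\frac{n+1}{2}\right)$th term.
ii.
n is EVEN, then there will be two middle terms, and the average of the values of the two middle terms
is the median. i.e., Median = average of the values of the $\left(\frac{n}{2}\right)$th and $\left(\frac{n}{2}+1\right)$th terms.
In the case of a frequency distribution, it is calculated as
$$\text{Median} = l_1 + \frac{l_2 - l_1}{f}\left(\frac{n}{2} - pcf\right)$$
Where l1 and l2 are the lower and upper limits of the median class, f is the frequency of the median
class, n is the total frequency and pcf is the cumulative frequency of the class preceding the median class.
Note:
i.
The first class whose cumulative frequency exceeds n/2 (the point at which n/2 becomes less than the
cumulative frequency) determines the median class.
ii.
To calculate the median the class intervals must be continuous. If they are not continuous then we
have to make them continuous by subtracting 0.5 from the lower limit and adding 0.5 to the upper
limit.
iii.
The value of median thus calculated must be a value in the C.I. of the median class.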
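The grouped-data calculation can be sketched in Python as follows. This is an illustration under stated assumptions: the median is taken as l1 + (l2 - l1)/f x (n/2 - pcf), where pcf is the cumulative frequency before the median class, the classes are already continuous, and the function name grouped_median is hypothetical. The data are the heights of 45 students from problem 3 below.

```python
def grouped_median(classes):
    """classes: list of (lower_limit, upper_limit, frequency), in ascending order."""
    n = sum(f for _, _, f in classes)
    half = n / 2
    cumulative = 0  # cumulative frequency of the classes already passed (pcf)
    for lower, upper, f in classes:
        # The median class is the first one whose cumulative frequency exceeds n/2.
        if cumulative + f > half:
            return lower + (upper - lower) / f * (half - cumulative)
        cumulative += f
    raise ValueError("the distribution has no observations")

heights = [
    (158, 162, 3), (162, 166, 7), (166, 170, 12),
    (170, 174, 15), (174, 178, 6), (178, 182, 2),
]
median_height = grouped_median(heights)
print(round(median_height, 2))
```

Replacing n/2 by n/4 or 3n/4 in the same interpolation gives the first and third quartiles asked for in some of the problems.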
PROBLEMS:
1.
ii. [ 13.5 ]
2.
x:    15    20    25    30    35    40    45    50
f:     7    15    19    23    20    15     8     5
[ Answer: 30 ]
3.
Find the median for the following data representing heights of 45 students.
Ht. in cms:         158-162    162-166    166-170    170-174    174-178    178-182
No. of students:          3          7         12         15          6          2
[ Answer: 170.13 cms ]
4.
Class Interval:    Less than 35    35-40    40-45    45-50    Over 50
Frequency:                   24       62       99       18         15
[ Rs.41.16 ]
5.
Class Interval:    Below 35    35-50    50-65    65-80    Above 80
Frequency:               20       18       32       18          12
[ 55.625 years ]
6.
Find the median and the two quartiles for the following data.
Rainfall in cms:    20-25    25-30    30-35    35-40    40-45    45-50    50-55
No. of Years:           2        5        8       12       10        7        6
7.
For the following distribution of weights of 60 students, find the three quartiles.
Weights in Kgs:     30-34    35-39    40-44    45-49    50-54    55-59    60-64
No. of students:        3        5       12       18       14        6        2
8.
Commission in Rs.:    100-140    140-180    180-200    200-220    220-240    240-260    260-300
No. of Salesmen:           14         45         52         80         32         23         24
[ Median = 206, Q1 = 183.27 and Q3 = 227.19 ]
9.
The median marks of 100 students in Accountancy are 56. It was later found that marks of one
student were wrongly considered as 76 instead of 67. What would be the correct median?
[ Median is unaltered ]
10.
In a batch of 25 students, 10 students failed in a test, by obtaining less than 35 marks. Those who
passed the test got 40, 45, 57, 60, 49, 52, 75, 72, 80, 87, 55, 58, 65,42 and 60 marks. What was
the median of the marks of all the 25 students? [ 45 ]
11.
In a group of 25 children the median height is 164 cms and the heights of the tallest and shortest
boy in the group are 170 cms and 154 cms respectively. To this group 4 children are added with
the heights 152, 150, 174, and 171 cms. Find the median height of the new group of 29
children. [ 164 cms. ]
12.
If the median height for the following distribution is 162.5 cms, find the missing frequency.
Height in cms:      150-155    155-160    160-165    165-170    170-175    175-180
No. of students:          3          6          8          ?          3          1
[ Answer: 5 ]
13.
40-49
8
50-59
60-69
5
70-79
3
[ Answer: 7 ]
14.
If the median for the following distribution is Rs.26.25, find the missing frequency.
Wages in Rs.:
12.5 17.5
17.5 22.5
22.5 27.5
27.5 32.5
32.5 37.5
37.5 42.5
42.5 47.5
47.5 52.5
52.5 57.5
15.
No. of persons
2
22
10
3
4
6
1
1
[ 16 ]
If the median marks in History for a group of students are 27, find the number of students getting
marks between 30 and 40.
Marks:              0-10    10-20    20-30    30-40    40-50
No. of Students:       5        5       10        ?        3
[ Answer: 11 ]
16.
The following data represents the weekly wages in Rs. of a group of workers. If the median is
Rs.114, find the missing frequency.
Weekly wages in Rs.:    60-75    75-90    90-105    105-120    120-135    135-150
No. of workers:             3        3         6          5          ?          6
[ Answer: 7 ]
17.
If the first and the third quartiles for the following distribution are given to be 23.125 and 43.5
respectively, find the missing frequencies.
Weekly wages in Rs.:    0-10    10-20    20-30    30-40    40-50    50-60
No. of workers:            5        ?       20       30        ?       10
[ Answer: 15 & 25 ]
18.
Find the missing frequencies given that the first quartile is 320 and the third quartile is 550.
Weekly wages in
Rs.
No. of workers:
100-200
200-300
300-400
400-500
500-600
600-700
10
20
16
[ Answer: 15 & 12 ]
19.
If the median of the following distribution is 146 and the total of the frequencies is 230, find the
missing frequencies.
C.I.:
110-120
120-130
130-140
140-150
150-160
160-170
170-180
Frequency
:
12
34
65
46
18
[ Answer: 30 & 25 ]
20.
Class Interval:    10-20    20-30    30-40    40-50    50-60    60-70    70-80
Frequency:            13       30        ?       65        ?       25       18
You are given that the median value is 46 and the total number of frequencies is 230. Also
calculate the mean of the completed data.
[ Answer: 33 & 46 and the mean value = 45.7826 ]
MODE
Mode is that value of the variable, which characterizes more items than any other value. It is the value of
greatest frequency or more precisely greatest frequency density. Mode cannot be calculated unless the
data are converted in the form of a discrete or a continuous distribution. In some distributions it is
difficult to get the exact value of mode as observations may concentrate around two or more values. In
such cases the distribution is bimodal, trimodal or multimodal.
Mode is a measure which should be used with caution, only when the person believes that it has
relevance. The mode can occur at an extreme value, in which case it will be a poor measure of central
tendency.
Example:
Find the mode of the following data:
21, 44, 31, 21, 57, 36, 21, 44, 45, 21
On observing the given data, we see that the value 21 occurs 4 times which is the maximum. Hence mode
= 21.
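For ungrouped data the mode can be found by counting how often each value occurs. The short Python sketch below uses the standard library's collections.Counter on the data of this example; the variable names are illustrative only.

```python
from collections import Counter

data = [21, 44, 31, 21, 57, 36, 21, 44, 45, 21]

counts = Counter(data)            # value -> number of occurrences
highest = max(counts.values())    # the greatest frequency
# Keep every value that attains the greatest frequency, so that
# bimodal or multimodal data are reported as such.
modes = [value for value, count in counts.items() if count == highest]

print(modes, highest)
```

If the list of modes has more than one entry, the data are bimodal or multimodal in the sense described above.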
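In the case of frequency distribution, Mode is calculated using the following formula:
$$\text{Mode} = l_1 + \frac{d_1}{d_1 + d_2} \times c$$
Where l1 is the lower limit of the modal class, d1 is the difference between the frequency of the modal
class and that of the preceding class, d2 is the difference between the frequency of the modal class and
that of the following class, and c is the length of the class interval.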
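A hedged Python sketch of the grouped-data mode follows. It assumes the usual reading of the symbols: mode = l1 + d1/(d1 + d2) x c, with d1 the modal frequency minus the preceding frequency, d2 the modal frequency minus the following frequency, and c the class length. It also assumes equal class intervals and a single modal class. The function name grouped_mode is hypothetical. The data are those of problem 1 below, whose stated answer is Rs.706.12.

```python
def grouped_mode(classes):
    """classes: list of (lower_limit, upper_limit, frequency) with equal widths."""
    freqs = [f for _, _, f in classes]
    i = max(range(len(freqs)), key=freqs.__getitem__)  # index of the modal class
    lower, upper, modal = classes[i]
    before = freqs[i - 1] if i > 0 else 0              # frequency of preceding class
    after = freqs[i + 1] if i < len(freqs) - 1 else 0  # frequency of following class
    d1 = modal - before
    d2 = modal - after
    # d1 + d2 is zero only when the neighbouring classes have the same
    # frequency as the modal class; the formula does not apply then.
    return lower + d1 / (d1 + d2) * (upper - lower)

distribution = [
    (200, 400, 16), (400, 600, 34), (600, 800, 60),
    (800, 1000, 37), (1000, 1200, 13),
]
mode_value = grouped_mode(distribution)
print(round(mode_value, 2))
```

The interpolation places the mode nearer to whichever neighbouring class has the higher frequency.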
PROBLEMS:
1.
Class Interval:    200-400    400-600    600-800    800-1000    1000-1200
Frequency:              16         34         60          37           13
[ Answer: Rs.706.12 ]
2.
The following data gives the consumption of electricity. Calculate the value of mode.
No. of Units:        0-100    100-200    200-300    300-400    400-500    500-600
No. of consumers:        9         18         35         32         28         10
3.
Class Interval:    10-30    30-50    50-70    70-90    90-110    110-130
Frequency:             4       10       14       12         8          6
[ Answer: 63.3 ]
4.
Class Interval:    10-15    15-20    20-25    25-30    30-35    35-40
Frequency:             3        5       15       20        8        7
5.
If the mode for the following distribution is 130, find the missing frequency.
Class Interval:    60-75    75-90    90-105    105-120    120-135    135-150
Frequency:             3        3         6          ?          7          6
[ Answer: 5 ]
6.
If the mode of the following data is 750 and the total of the frequencies is 186, find the missing
frequencies.
Life in hrs.:    200-400    400-600    600-800    800-1000    1000-1200    1200-1400    1400-1600
No. of bulbs:         10          ?         50          45           30            ?            5
[ Answer: 35 & 11 ]
7.
Prove that the value of median lies between mean and mode using the following data.
Age (below):       10    20    30    40    50     60
No. of persons:    11    35    50    79    89    100
Find the mean, median and mode for the following data.
Class Interval:    60-75    75-90    90-105    105-120    120-135    135-150
Frequency:             3        3         6          5          7          6
Find the mean, median and mode for the following data.
Class Interval:    10-30    30-50    50-70    70-90    90-110    110-130
Frequency:             4       10       14       12         8          6
If the median and mode of the following distribution are 33.5 and 34 respectively, find the
missing frequencies.
Wages in Rs.:      0-10    10-20    20-30    30-40    40-50    50-60    60-70    Total
No. of Workers:       4       16        ?        ?        ?        6        4      230
Given that the mean of wages is Rs.418.75 and the mode is Rs.362.50, find the missing
frequencies. Hence calculate the median of the completed data.
Wages in Rs.:      100-200    200-300    300-400    400-500    500-600    600-700
No. of Workers:          5         12          ?          ?         14         11
Find the missing frequencies for the following data, given that the modal marks
are 53.25 and median is 52.5. Find the arithmetic mean of the completed data.
Marks:              20-29    30-39    40-49    50-59    60-69    70-79    80-89
No. of Students:       10       18       25        ?       15       12        ?
Find the missing frequencies for the following data given that the mode of the distribution is 44
and the median is 45.8
Age in years:      10-20    20-30    30-40    40-50    50-60    60-70    70-80    80-90
No. of persons:       10       10        ?       50       29       15        ?       10
[ Answer: 36 and 10 ]
14.
Find the missing frequencies if the mode of the following distribution is given to be 95 and
arithmetic mean 96.
Weekly Expenditure:    50-70    70-90    90-110    110-130    130-150
No. of Families:           ?       60        70          ?         10
[ Answer: 20 and 40 ]
15.
The following data gives the distribution of marks of some students. The arithmetic mean of
marks is 78 and the mode is 75. Find the missing frequencies.
Marks:
No. of
students:
10-30
30-50
50-70
70-90
90-110
110-130
130-150
25
30
10
[ Answer: 10 and 15 ]
16.
The median age of the following distribution is 44 years. The modal age is 43 years. Two of the
frequencies however are missing. Find those frequencies given the following data.
Age in years:      25-30    30-35    35-40    40-45    45-50    50-55    55-60
No. of persons:        8        ?       24       30        ?       20       14
[ Answer: 10 and 26 ]
17.
Find the missing frequencies given that the median and mode of the distribution are 1504 and
1500 respectively.
Life in hours:    950-1150    1150-1350    1350-1550    1550-1750    1750-1950    1950-2150
No. of bulbs:            ?           43          100            ?           23           13
[ Answer: 20 and 81 ]
18.
The first and the third quartiles of the following data are given to be 12.5 marks and 25 marks
respectively. Find the missing frequencies.
Marks:        0-5    5-10    10-15    15-20    20-25    25-30    30-35    35-40    Total
Frequency:      4       8        ?       19        ?       10        5        ?       72
19.
Find the missing frequencies given that the mode is 4400 hours and arithmetic mean is 4100
hours.
Life in hours:    1000-2000    2000-3000    3000-4000    4000-5000    5000-6000    6000-7000    7000-8000
No. of bulbs:           100            ?          200            ?          150           50           50
If the arithmetic mean for the following frequency distribution is 54 years, find the missing
frequency and also calculate its mode and median.
Age in years:      0-20    20-40    40-60    60-80    80-100
No. of persons:       4        5        ?       11         5