REPRESENTATIVE VALUES
You might be aware of the term average and would have come across statements involving the term ‘average’ in your day-to-day life:
Isha spends on an average of about 5 hours daily for her studies.
The average temperature at this time of the year is about 40 degree celsius.
The average age of pupils in my class is 12 years.
The average attendance of students in a school during its final examination was 98 per cent
Many more of such statements could be there. Think about the statements given above. Do you think that the child in the first statement studies exactly for 5 hours daily?
Or, is the temperature of the given place during that particular time always 40 degrees?
Or, is the age of each pupil in that class 12 years?
Obviously not. Then what do these statements tell you?
Then what do these statements tell you?
By average we understand that Isha, usually, studies for 5 hours. On some days, she may study for less number of hours and on the other days she may study longer.
Similarly, the average temperature of 40 degree celsius, means that, very often, the temperature at this time of the year is around 40 degree celsius. Sometimes, it may be less than 40 degree celsius and at other times, it may be more than 40°C.
Thus, we realise that average is a number that represents or shows the central tendency of a group of observations or data. Since average lies between the highest and the lowest value of the given data so, we say average is a measure of the central tendency of the group of data. Different forms of data need different forms of representative or central value to describe it. One of these representative values is the “Arithmetic mean”. You will learn about the other representative values in the later part of the chapter.
ARITHMETIC MEAN
The most common representative value of a group of data is the arithmetic mean or the mean. To understand this in a better way, let us look at the following example: Two vessels contain 20 litres and 60 litres of milk respectively. What is the amount that each vessel would have, if both share the milk equally? When we ask this question we are seeking the arithmetic mean.
In the above case, the average or the arithmetic mean would be
\(\frac{\text{Total quantity of milk }}{\text{Number of vessels }}=\text{ }\frac{\text{2}0+\text{ 6}0}{2}\text{ litres }=\text{ 4}0\text{ litres}.\)
Thus, each vessels would have 40 litres of milk.
The average or Arithmetic Mean (A.M.) or simply mean is defined as follows:
\(\text{mean }=\frac{\text{ Sum of all observations}}{\text{number of observations}}\text{ }\)
Consider these examples.
EXAMPLE :Ashish studies for 4 hours, 5 hours and 3 hours respectively on three consecutive days. How many hours does he study daily on an average?
SOLUTION :The average study time of Ashish would be
\(\frac{\text{Total number of study hours}}{\text{Number of days for which he studied }}\text{ }=\text{ }\frac{\text{4 +5+ 3}}{3}\text{ hours }=\text{ 4 hours per day}\)
Thus, we can say that Ashish studies for 4 hours daily on an average.
EXAMPLE 2:A batsman scored the following number of runs in six innings:
36, 35, 50, 46, 60, 55
Calculate the mean runs scored by him in an inning.
SOLUTION :Total runs = 36 + 35 + 50 + 46 + 60 + 55 = 282.
To find the mean, we find the sum of all the observations and divide it by the number of observations. Therefore, in this case, mean = 282 6 = 47. Thus, the mean runs scored in an inning are 47
Range
The difference between the highest and the lowest observation gives us an idea of the spread of the observations. This can be found by subtracting the lowest observation from the highest observation. We call the result the range of the observation.
Look at the following example:
EXAMPLE The ages in years of 10 teachers of a school are: 32, 41, 28, 54, 35, 26, 23, 33, 38, 40
(i) What is the age of the oldest teacher and that of the youngest teacher?
(ii) What is the range of the ages of the teachers?
(iii) What is the mean age of these teachers?
SOLUTION (i) Arranging the ages in ascending order, we get:
23, 26, 28, 32, 33, 35, 38, 40, 41, 54
We find that the age of the oldest teacher is 54 years and the age of the youngest teacher is 23 years.
(ii) Range of the ages of the teachers = (54 – 23) years = 31 years
(iii) Mean age of the teachers
\(\begin{align} & =\text{ }\frac{\text{23 +26 +28+ 32+ 33+ 35+ 38+ 4}0+\text{ 41+ 54}}{10}\text{ years} \\ & \text{=}\frac{350}{10}\text{years}=35\text{years} \\ \end{align}\)
MODE
As we have said Mean is not the only measure of central tendency or the only form of representative value. For different requirements from a data, other measures of central tendencies are used
Look at the following example
To find out the weekly demand for different sizes of shirt, a shopkeeper kept records of sales of sizes 90 cm, 95 cm, 100 cm, 105 cm, 110 cm. Following is the record for a week:
If he found the mean number of shirts sold, do you think that he would be able to decide which shirt sizes to keep in stock?
\(\text{Mean of total shirts sold }=\text{ }\frac{\text{Total number of shirts sold}}{\text{Number of different sizes of shirts}}\text{ }=\frac{\text{1}0\text{5}}{5}\text{ =21}\)
Should he obtain 21 shirts of each size? If he does so, will he be able to cater to the needs of the customers?
The shopkeeper, on looking at the record, decides to procure shirts of sizes 95 cm, 100 cm, 105 cm. He decided to postpone the procurement of the shirts of other sizes because of their small number of buyers.
Look at another example
The owner of a readymade dress shop says, “The most popular size of dress I sell is the size 90 cm.
Observe that here also, the owner is concerned about the number of shirts of different sizes sold. She is however looking at the shirt size that is sold the most. This is another representative value for the data. The highest occuring event is the sale of size 90 cm.This representative value is called the mode of the data.
The mode of a set of observations is the observation that occurs most often.
EXAMPLE : Find the mode of the given set of numbers:
1, 1, 2, 4, 3, 2, 1, 2, 2, 4
SOLUTION: Arranging the numbers with same values together, we get
1, 1, 1, 2, 2, 2, 2, 3, 4, 4
Mode of this data is 2 because it occurs more frequently than other observations.
Mode of Large Data
Putting the same observations together and counting them is not easy if the number of observations is large. In such cases we tabulate the data. Tabulation can begin by putting tally marks and finding the frequency, as you did in your previous class. Look at the following example:
EXAMPLE : Following are the margins of victory in the football matches of a league.
1, 3, 2, 5, 1, 4, 6, 2, 5, 2, 2, 2, 4, 1, 2, 3, 1, 1, 2, 3, 2, 6, 4, 3, 2, 1, 1, 4, 2, 1, 5, 3, 3, 2, 3, 2, 4, 2, 1, 2
Find the mode of this data.
SOLUTION Let us put the data in a tabular form:
Looking at the table, we can quickly say that 2 is the ‘mode’ since 2 has occured the highest number of times. Thus, most of the matches have been won with a victory margin of 2 goals.
EXAMPLE :Find the mode of the numbers:
2, 2, 2, 3, 3, 4, 5, 5, 5, 6, 6, 8
SOLUTION Here, 2 and 5 both occur three times. Therefore, they both are modes of the data.
Whereas mean gives us the average of all observations of the data, the mode gives that observation which occurs most frequently in the data. Let us consider the following examples:
(a) You have to decide upon the number of chapattis needed for 25 people called for a feast.
(b) A shopkeeper selling shirts has decided to replenish her stock.
(c) We need to find the height of the door needed in our house.
(d) When going on a picnic, if only one fruit can be bought for everyone, which is the fruit that we would get. In which of these situations can we use the mode as a good estimate?
Consider the first statement. Suppose the number of chapattis needed by each person is
2, 3, 2, 3, 2, 1, 2, 3, 2, 2, 4, 2, 2, 3, 2, 4, 4, 2, 3, 2, 4, 2, 4, 3, 5
The mode of the data is 2 chapattis. If we use mode as the representative value for this data, then we need 50 chapattis only, 2 for each of the 25 persons. However the total number would clearly be inadequate. Would mean be an appropriate representative value? For the third statement the height of the door is related to the height of the persons using that door. Suppose there are 5 children and 4 adults using the door and the height of each of 5 children is around 135 cm. The mode for the heights is 135 cm. Should we get a door that is 144 cm high? Would all the adults be able to go through that door? It is clear that mode is not the appropriate representative value for this data. Would mean be an appropriate representative value here?
Why not? Which representative value of height should be used to decide the doorheight?
Similarly analyse the rest of the statements and find the representative value useful for that issue.
MEDIAN
We have seen that in some situations, arithmetic mean is an appropriate measure of central tendency whereas in some other situations, mode is the appropriate measure of central tendency. Let us now look at another example. Consider a group of 17 students with the following heights (in cm):
106, 110, 123, 125, 117, 120, 112, 115, 110, 120, 115, 102, 115, 115, 109, 115, 101.
The games teacher wants to divide the class into two groups so that each group has equal number of students, one group has students with height lesser than a particular height and the other group has students with heights greater than the particular height. How would she do that? Let us see the various options she has:
(i) She can find the mean. The mean is
\(\begin{align} & =\frac{10\text{6 +11}0\text{ +123+ 125+ 117+ 12}0\text{ +112 +115+ 11}0\text{ +12}0\text{ +115+ 1}0\text{2 +115 +115+ 1}0\text{ 9 +115 +1}0\text{1}}{17} \\ & =\frac{1930}{17}=113.5 \\ \end{align}\)
So, if the teacher divides the students into two groups on the basis of this mean height, such that one group has students of height less than the mean height and the other group has students with height more than the mean height, then the groups would be of unequal size. They would have 7 and 10 members respectively
(ii) The second option for her is to find mode. The observation with highest frequency is 115 cm, which would be taken as mode.
There are 7 children below the mode and 10 children at the mode and above the mode. Therefore, we cannot divide the group into equal parts. Let us therefore think of an alternative representative value or measure of central tendency. For doing this we again look at the given heights (in cm) of students and arrange them in ascending order. We have the following observations:
101, 102, 106, 109, 110, 110, 112, 115, 115, 115, 115, 115, 117, 120, 120, 123, 125
The middle value in this data is 115 because this value divides the students into two equal groups of 8 students each. This value is called as Median. Median refers to the value which lies in the middle of the data (when arranged in an increasing or decreasing order) with half of the observations above it and the other half below it. The games teacher decides to keep the middle student as a refree in the game. Here, we consider only those cases where number of observations is odd.
Thus, in a given data, arranged in ascending or descending order, the median gives us the middle observation.
Note that in general, we may not get the same value for median and mode. Thus we realise that mean, mode and median are the numbers that are the representative values of a group of observations or data. They lie between the minimum and maximum values of the data. They are also called the measures of the central tendency.
EXAMPLE :Find the median of the data: 24, 36, 46, 17, 18, 25, 35
SOLUTION We arrange the data in ascending order, we get 17, 18, 24, 25, 35, 36, 46
Median is the middle observation. Therefore 25 is the median.
USE OF BAR GRAPHS WITH A DIFFERENT PURPOSE
We have seen last year how information collected could be first arranged in a frequency distribution table and then this information could be put as a visual representation in the form of pictographs or bar graphs. You can look at the bar graphs and make deductions about the data. You can also get information based on these bar graphs. For example, you can say that the mode is the longest bar if the bar represents the frequency.
Choosing a Scale
We know that a bar graph is a representation of numbers using bars of uniform width and the lengths of the bars depend upon the frequency and the scale you have chosen. For example, in a bar graph where numbers in units are to be shown, the graph represents one unit length for one observation and if it has to show numbers in tens or hundreds, one unit length can represent 10 or 100 observations. Consider the following examples:
EXAMPLE :Two hundred students of 6th and 7th classes were asked to name their favourite colour so as to decide upon what should be the colour of their school building. The results are shown in the following table. Represent the given data on a bar graph.
Answer the following questions with the help of the bar graph:
(i) Which is the most preferred colour and which is the least preferred?
(ii) How many colours are there in all? What are they?
SOLUTION Choose a suitable scale as follows: Start the scale at 0. The greatest value in the data is 55, so end the scale at a value greater than 55, such as 60. Use equal divisions along the axes, such as increments of 10. You know that all the bars would lie between 0 and 60. We choose the scale such that the length between 0 and 60 is neither too long nor too small. Here we take 1 unit for 10 students. We then draw and label the graph as shown. From the bar graph we conclude that
(i) Blue is the most preferred colour (Because the bar representing Blue is the tallest).
(ii) Green is the least preferred colour. (Because the bar representing Green is the shortest).
(iii) There are five colours. They are Red, Green, Blue, Yellow and Orange. (These are observed on the horizontal line)
EXAMPLE:Following data gives total marks (out of 600) obtained by six children of a particular class. Represent the data on a bar graph.
SOLUTION
(i) To choose an appropriate scale we make equal divisions taking increments of 100. Thus 1 unit will represent 100 marks. (What would be the difficulty if we choose one unit to represent 10 marks?)
(ii) Now represent the data on the bar graph
Drawing double bar graph
Consider the following two collections of data giving the average daily hours of sunshine in two cities Aberdeen and Margate for all the twelve months of the year. These cities are near the south pole and hence have only a few hours of sunshine each day
By drawing individual bar graphs you could answer questions like
(i) In which month does each city has maximum sunlight? or
(ii) In which months does each city has minimum sunlight? However, to answer questions like “In a particular month, which city has more sunshine hours”, we need to compare the average hours of sunshine of both the cities. To do this we will learn to draw what is called a double bar graph giving the information of both cities side-by-side. This bar graph (Fig 3.1) shows the average sunshine of both the cities.
For each month we have two bars, the heights of which give the average hours of sunshine in each city. From this we can infer that except for the month of April, there is always more sunshine in Margate than in Aberdeen. You could put together a similiar bar graph for your area or for your city. Let us look at another example more related to us.
EXAMPLE : A mathematics teacher wants to see, whether the new technique of teaching she applied after quarterly test was effective or not. She takes the scores of the 5 weakest children in the quarterly test (out of 25) and in the half yearly test (out of 25):
SOLUTION: She draws the adjoining double bar graph and finds a marked improvement in most of the students, the teacher decides that she should continue to use the new technique of teaching. Can you think of a few more situations where you could use double bar graphs?