Measures of central tendency are statistical measures used to describe the central or typical value of a dataset.
Get the full solved assignment PDF of MHH-103/As-8 of 2024 session now.
They provide a summary of the distribution of data by identifying a single value around which the data tends to cluster. The three main measures of central tendency are the mean, median, and mode. Each measure has its own characteristics and applications in data analysis. Let’s explore each measure in detail:
- Mean:
The mean, also known as the average, is calculated by summing up all the values in a dataset and then dividing by the total number of values. Mathematically, the mean ( \bar{x} ) of a dataset ( x_1, x_2, …, x_n ) is given by:
[ \bar{x} = \frac{\sum_{i=1}^{n} x_i}{n} ]
- Characteristics:
- The mean is sensitive to outliers, as it takes into account the magnitude of each value in the dataset.
- It is affected by extreme values, skewing the value towards the tails of the distribution.
- The mean is suitable for data that follows a symmetric distribution or a normal distribution.
- Example: Consider a dataset of exam scores: 70, 75, 80, 85, and 95. The mean score is calculated as:
[ \bar{x} = \frac{70 + 75 + 80 + 85 + 95}{5} = \frac{405}{5} = 81 ]
- Median:
The median is the middle value in a sorted dataset. If the dataset has an odd number of values, the median is the middle value. If the dataset has an even number of values, the median is the average of the two middle values. The median divides the dataset into two equal halves.
- Characteristics:
- The median is less affected by outliers compared to the mean, making it a robust measure of central tendency.
- It is suitable for skewed distributions or datasets with extreme values.
- The median is not influenced by the magnitude of values, only their relative position in the dataset.
- Example: Using the same dataset of exam scores: 70, 75, 80, 85, and 95. Since the dataset has an odd number of values, the median is the middle value, which is 80.
- Mode:
The mode is the value that appears most frequently in a dataset. A dataset may have one mode (unimodal), two modes (bimodal), or more than two modes (multimodal). If all values occur with the same frequency, the dataset is considered to have no mode.
- Characteristics:
- The mode is not affected by outliers or extreme values.
- It is suitable for categorical or nominal data, as well as discrete data with repeating values.
- Example: Consider a dataset of exam scores: 70, 75, 80, 80, 85, and 95. The mode score is 80, as it appears more frequently than any other value in the dataset.
In summary, measures of central tendency provide valuable insights into the typical or central value of a dataset. While the mean reflects the arithmetic average of the data, the median represents the middle value, and the mode identifies the most frequently occurring value. The choice of which measure to use depends on the nature of the data and the underlying distribution.