Psychology · Research Methods in Psychology

Data Analysis: Descriptive Statistics

Lesson 6 55 min

Data Analysis: Descriptive Statistics

7 min read

AI Explain — Ask anything

AI Illustrate — Make it visual

Why This Matters

# Data Analysis: Descriptive Statistics ## Summary This lesson covers fundamental descriptive statistics used to summarise and present psychological research data, including measures of central tendency (mean, median, mode), measures of dispersion (range, standard deviation), and graphical representations (bar charts, histograms, scattergrams). Students learn to calculate these statistics, interpret their meaning in research contexts, and evaluate their appropriateness for different data types and distributions. This topic is essential for both AS and A-Level examinations, appearing in research methods questions where candidates must demonstrate the ability to analyse given data sets, select appropriate statistical measures, and justify their choices with reference to data characteristics such as skewness and outliers.

Key Words to Know

Descriptive Statistics — Statistical methods used to summarise, organise, and describe the characteristics of a data set.

Measures of Central Tendency — Statistics that represent the typical or central value of a data set (mean, median, mode).

Measures of Dispersion — Statistics that describe the spread or variability of a data set (range, standard deviation).

Mean — The arithmetic average of all values in a data set, calculated by summing all values and dividing by the number of values.

Median — The middle value in a data set when the values are arranged in ascending or descending order.

Mode — The most frequently occurring value in a data set.

Range — The difference between the highest and lowest values in a data set.

Standard Deviation — A measure of the average amount of variability or dispersion around the mean.

Core Concepts & Theory

Descriptive statistics summarize and organize data to make patterns visible without drawing conclusions beyond the sample. They form the foundation of quantitative analysis in psychological research.

Measures of Central Tendency describe the 'typical' value:

• Mean (x̄) = Σx/n (sum of all values divided by number of values). Most sensitive to extreme scores; used with interval/ratio data.

• Median = middle value when data arranged in order. Robust against outliers; suitable for ordinal, interval, or ratio data.

• Mode = most frequently occurring value. Only measure for nominal data; can be used with all data types.

Measures of Dispersion indicate data spread:

• Range = highest value - lowest value (+ 1 for discrete data). Simple but affected by extreme values.

• Standard Deviation (SD) measures average distance of scores from the mean: σ = √[Σ(x-x̄)²/n]. Shows data clustering; larger SD indicates greater variability.

Key Distinctions: Population parameters use σ (sigma), while sample statistics use s with (n-1) denominator for unbiased estimation.

Normal Distribution: Bell-shaped curve where mean = median = mode. Approximately 68% of data falls within ±1 SD, 95% within ±2 SD, and 99.7% within ±3 SD.

Cambridge Definition: Descriptive statistics are mathematical techniques for organizing, summarizing, and presenting quantitative data in meaningful ways.

Data Types Matter: Nominal (categories), ordinal (ranked), interval (equal intervals, no true zero), ratio (equal intervals with true zero) determine appropriate statistical measures.

Detailed Explanation with Real-World Examples

Think of descriptive statistics as a psychological snapshot that captures essential features of data without the full detail.

Real-World Application: Sleep Study

Imagine researching whether blue light affects sleep duration. You collect hours slept from 30 participants:

Without descriptive statistics, you'd have 30 individual numbers—overwhelming and meaningless. With them, you might report: "Mean sleep = 6.8 hours (SD = 1.2), indicating most participants slept between 5.6-8.0 hours."

The Restaurant Analogy:

• Mean is like average bill per table—useful but distorted if one group orders champagne (outlier) • Median is the middle bill—better represents typical spending • Mode is the most common order—shows what's genuinely popular • Standard deviation shows whether all tables spend similarly or wildly differently

Clinical Psychology Example

When evaluating depression treatment, researchers measure symptom scores before/after therapy:

Mean change shows overall effectiveness
SD reveals consistency—low SD means treatment works similarly for most; high SD suggests it helps some greatly, others minimally
Median helps when few participants show dramatic improvement (positive skew)

Memory Research Application

Studying recall accuracy: Mode identifies most common error type, mean shows average performance, SD indicates individual differences. A bimodal distribution (two modes) might reveal two distinct participant groups—perhaps different learning strategies.

Professional Context: NHS psychologists use descriptive statistics in clinical audits to track patient outcomes, comparing their service against national benchmarks.

Descriptive statistics transform raw data into actionable insights, enabling psychologists to identify patterns, compare groups, and communicate findings effectively to stakeholders who lack statistical training.

Worked Examples & Step-by-Step Solutions

Example 1: Complete Analysis (6 marks)

Question: Ten participants completed a memory test. Scores: 12, 15, 18, 18, 20, 22, 24, 24, 24, 28. Calculate mean, median, mode, range, and standard deviation.

SOLUTION:

Mean: Σx/n = (12+15+18+18+20+22+24+24+24+28)/10 = 205/10 = 20.5

Median: Already ordered. Two middle values (5th & 6th) = (20+22)/2 = 21

Mode: 24 appears three times (most frequent) = 24

Range: 28 - 12 = 16 (or 17 for discrete data)

Standard Deviation:

Calculate deviations: (12-20.5)², (15-20.5)², etc.
Squared deviations: 72.25, 30.25, 6.25, 6.25, 0.25, 2.25, 12.25, 12.25, 12.25, 56.25
Σ(x-x̄)² = 210.5
σ = √(210.5/10) = √21.05 = 4.59

Examiner Note: Show all working; marks awarded for process, not just final answer.

Example 2: Comparison Question (4 marks)

Question: Group A (control): Mean = 15, SD = 2.1. Group B (experimental): Mean = 22, SD = 8.3. What do these statistics suggest about the intervention's effect?

SOLUTION:

The experimental group shows higher mean performance (22 vs 15), suggesting the intervention improved scores by 7 points on average. However, the much larger SD (8.3 vs 2.1) indicates greater variability—the intervention helped some participants considerably but may have been less effective for others. The control group's low SD shows consistent performance, while the experimental group's high SD suggests individual differences in response to treatment.

Examiner Note: Always interpret both central tendency and dispersion; discuss implications for the research question.

Common Exam Mistakes & How to Avoid Them

Mistake 1: Confusing Mean, Median, and Mode

Why it happens: Students memorize definitions without understanding w...

This section is locked

Cambridge Exam Technique & Mark Scheme Tips

Command Word Mastery:

• Calculate (2-3 marks): Show working in stages. Even if final answer is wrong, method ma...

This section is locked

2 more sections locked

Upgrade to Starter to unlock all study notes, audio listening, and more.

Exam Tips

1.Be able to *define and calculate* the mean, median, mode, range, and standard deviation from a given set of raw data.
2.Understand the *strengths and weaknesses* of each measure of central tendency and dispersion, and when it is appropriate to use each one (e.g., mean is affected by outliers, median is not).
3.Practice interpreting what a given mean, median, mode, range, or standard deviation tells you about a dataset in a psychological context.
4.Remember that descriptive statistics *summarise* data; they do not allow for generalisation or hypothesis testing (that's inferential statistics).

Bài học tiếp theo

Data Analysis: Inferential Statistics