Secondary Statistics Resources

We have been delighted to work with the Education Policy Advisory Group of the Royal Statistical Society to produce some secondary mathematics resources which use real climate data. James Nicholson led the development of these resources.

This resource aims to provide mathematics teachers a way of tying together the various graphical representations in GCSE Maths and Statistics, and Scottish National 5 Maths examinations. It uses real large data sets from the Met Office.

The emphasis is on teaching the statistical ideas, using a meaningful context more than trying to teach meteorology. These notes aim to support maths teachers with enough contextual information about the data to have confidence to use the resource with their classes. They also provide guidance on appropriate responses to the questions posed.

The ‘daily maximum temperatures’ are calculated as the average of the daily maximum temperatures recorded in each of the stations used. There are 540 stations across the 10 regions in the UK which feed into the data set used in these materials.

The materials use 4 regions spanning the geography of the UK – Northern Scotland, Northern Ireland, North West England & North Wales, South East and Central Southern England. We have constructed boxplots of 5 data sets all using maximum daily temperatures. There are obviously a huge number of similar potential data sets available for that weather variable, and for each of the other variables the Met Office have.

We have shown January and Winter average temperatures along with July, Summer and Annual average temperatures: we have forced a common scale to enable comparisons to be done visually without having to employ mental gymnastics in adjusting for the effects of different scales.

The resources can be used in different ways: they could be used while teaching individual topics, to supplement textbook and other resources with some real world data. However, they are particularly suited to being used together to revise statistical representations, and the relationships between the different representations.

NOTES:

  • Autograph has been used to produce the Boxplots and Histograms for these resources. Autograph is free, so if you want to use different regions, it is not too difficult to import the data from the Excel data file with this resource into Autograph to generate the graphs. The Excel file has the summary statistics (5 quartiles plus mean and standard deviation) for each month, season and year at the bottom of the relevant column. The ten regions are each displayed in named sheets in the workbook.
  • There are some suggested skills statements in each section. These relate to the questions already contained in the section, but you may want to expand the questions you ask in your classroom depending on time, and capabilities of the class.
  • GCSE Maths (Higher Tier only) has boxplots, but does not address outliers in the graph, despite outliers themselves being required at Foundation level. They are required graphically in GCSE Statistics with boxplots. The bulk of the resource does not include outliers, but there is an optional extension where the boxplots do display outliers and ask pupils to think about the value of having the individual outlier values visible.

The student sheets below can be downloaded and edited.

Excel spreadsheet with source data

1) Autoscaling Issues: Annual Electricity Data

Learning Objectives
  1. To be able to evaluate which graphical representation is most effective in displaying energy data.
  2. To be able to analyse annual electricity generation data and to explain the significant changes in energy source contributions.
  3. To be able to differentiate how the use of different vertical scales affects the interpretation of graphs.

Despite technology offering multiple options to display tabular data in different formats, we spend almost no time in discussing / evaluating which displays show most clearly what you (the resource creator) want the reader (the audience) to focus on.

This is very much a classroom teaching activity as presented here.

The context can prompt discussions about why there has been such a shift from coal to wind and solar over this period – and what the future looks like.

Student Worksheet

2) What do Different Graph Types Show Best: Monthly Electricity Data

Learning Objectives
  1. To be able to interpret monthly electricity generation data and propose reasons why the contributions from different sources vary monthly.
  2. To be able to critique time series data to highlight seasonal variations and short-term trends

Despite technology offering multiple options to display tabular data in different formats, we spend almost no time in discussing / evaluating which displays show most clearly what you (the resource creator) want the reader (the audience) to focus on.

All these displays show the same data (pretty much), but are not all equally easy to see a particular story in.

This is very much a classroom teaching activity as presented here.

Student worksheet

3) Resources Using Graphs of Maximum Temperatures in the UK since 1884

There are 540 stations across the 10 regions in the UK which feed into the data set used in these materials.
The materials use 4 regions spanning the geography of the UK – Northern Scotland, Northern Ireland, North West England & North Wales, South East and Central Southern England. The ‘daily maximum temperatures’ are calculated as the average of the daily maximum temperatures recorded in each of the stations used.

We have constructed boxplots of 5 data sets all using maximum daily temperatures. There are obviously a huge number of similar potential data sets available for that weather variable, and for each of the other variables the Met Office have.

We have shown January and Winter average temperatures along with July, Summer and Annual average temperatures: we have forced a common scale to enable comparisons to be done visually without having to employ mental gymnastics in adjusting for the effects of different scales.

These three resources are designed to be taught consecutively.

Skills statements:

  • Students should be able to compare 2 or more boxplots using data as evidence.
  • Students should be able to make contextualised comments as to what the boxplots show. E.g higher median means a warmer temperature.

Student Worksheet 1 – Boxplots

Learning Objectives
  1. To be able to compare the centre and spread of regional maximum temperature distributions across different timescales (monthly vs. seasonal) using boxplots.
  2. To be able to differentiate between the variability seen in temperatures averaged over shorter periods (e.g., January) versus longer periods (e.g., Winter/Annual).
  3. To be able to evaluate the usefulness of displaying outliers separately in boxplots for identifying and discussing potentially damaging extreme weather events.

More detailed comments on what the graphs show:

  • For Jan / Winter the averages (centres) for N Ireland and Eng SE & Central S are very similar, but N Ireland has considerably less spread. Averages for Eng NW & N Wales are lower, and Scotland N is lower again. Scotland N & N Ireland have similar spreads, while Eng NW & N Wales has more variation, and Eng SE & Central S has more again.
  • The story for July / Summer is very similar except that now the average in N Ireland is now similar to Eng NW and N Wales rather than Eng SE & Central S.
  • In the extension question – the summer boxplots for N Ireland and Scotland N are roughly 100 to the right from the corresponding winter boxplots, but the Eng NW and N Wales and the Eng SE & Central S boxplots are about 150 to the right.

Commentary – Boxplots with Outliers

For Jan and winter data the only outliers are unusually cold years for that region, while for July and summer the only outliers are unusually hot years for that region.

While Eng SE & Central S has the highest median Jan and winter temperatures (and indeed highest max, highest UQ and highest LQ) it also has the most extreme cold Jan and winter temperatures recorded.

The only extra information with these diagrams is the detail of any outliers, so there is limited extra insight available.

Student Worksheet 2 – Histograms

Learning Objectives
  1. To be able to interpret regional differences in July average maximum temperature data by analyzing frequency distributions shown in histograms.
  2. To be able to contrast the utility of histograms (showing distribution detail) with that of boxplots (showing high-level summary measures like centre and spread) when presented with the same temperature data.

Commentary:

(these descriptions are not the only way to answer the questions)

  • (i) Eng SE + S Central    (ii)  Scotland N
  • The temperatures in N Ireland have less variability; the top end of the Eng NW + N Wales data is hotter than for N Ireland, and the median will be higher for Eng NW + N Wales.
  • These two regions have similar variability, but Eng SE + Central S is (about) 4 – 5 degrees hotter on average that Eng NW + N Wales.
  • The big advantage of using the same scales for all 4 graphs is that you are forced to compare like with like, where if the scales are different you really have to work hard in order to compare like with like. If you wanted to only look at Scotland N then scaling it so that you lost all that unused space in the right half would be good.
  •  (i) No    (ii)  No – because histograms plot the frequency density of the data, changing the intervals does not affect the visual shape dramatically – details change only a little bit (unless it is a very unusual data set), which is the strength of the display – a bad actor can’t pick intervals to distort the impression given by the data. Since these data sets have 141 values in each, it is not surprising that there are not substantive differences by changing the intervals.
  •  Boxplots are 5 number summaries of a data set, so quick comparisons favour using boxplots. The histograms provide much more detail and provide the capacity for more detailed comparisons when required.The statistical issue in e) here is that where data is plentiful, smaller intervals gives more detail that is reasonably stable. Where data is scarce, it is more subject to the vagaries of randomness, and it is tempting for the user to over-interpret what the data is saying i.e. to look for an explanation for those scarce data points appearing in the particular place they did.

    Students don’t often meet situations where they are asked what representation is best – but once they leave education, if they are writing any report using data (in any discipline) the software will do the donkey work – but they need to have the skills to decide which representation to use, and issues like whether to allow autoscaling (usually the default) or to force equal scales to enable like-for-like comparisons to be made easily.

Student Worksheet 3 – Time Series

Learning Objectives
  1. To be able to analyze time series graphs of regional maximum temperatures to observe long-term trends, identify variability, and locate the central value of the data.
  2. To be able to evaluate the advantages and disadvantages of using smoothed time series data (e.g., 3-year, 5-year, or 15-year averages) compared to annual data when trying to identify underlying long-term changes in temperature behaviour.

Where the maths curriculum deals with time series it has a primary focus on calculations – typically of moving averages in a context where some cyclical pattern (‘seasonality’ even if cycle is weekly or daily) is present, and the behaviour can be modelled by season + trend + variation. However, time series data are critical to understanding a wide range of scientific, historical, and social science phenomena.

This section is intended as an extension to show the timelines of the data because that is a very important context in terms of weather measurements – for example, if the same maximum temperatures were recorded, but they occurred in strictly ascending order then the boxplots would be exactly the same. However, a hugely different interpretation would be appropriate. The amount of (chaotic) variation from year to year makes it much more difficult to discern long term trends. If you want more information on this, have a look at https://www.metlink.org/blog/weather-climate-and-chaos-theory/.

Again, the axes here are forced to be consistent with one another to facilitate accurate comparison visually. However, apart from ensuring the vertical axis always shows zero or a broken scale, the scale for different times of year (in the following panels) will be allowed to vary, because we are looking at the stories in 4 time series and the corresponding boxplots.

Again, the axes here are forced to be consistent with one another to facilitate accurate comparison visually. However, apart from ensuring the vertical axis always includes zero, the scale for different times of year (in the following panels) will be allowed to vary, because we are looking at the stories in 4 time series and the corresponding boxplots.

There will be quickly diminishing returns on the time taken – you could treat the following as two pairs (Jan + winter, and July + summer) where the vertical scales are the same in terms of numbers – to allow direct comparisons. The stories in each pair are very similar, so looking at one pair only, or looking very quickly at the second pair – asking ‘do we see a similar story here’ is likely to be sufficient.

Note all 5 panels have a range of 16 degrees, so variability can have visual ‘like for like’ comparisons.

Commentary:

Comparing the annual temperatures in the time series, and in the boxplots you can see in both that there is broadly similar amounts of variation in the 4 regions, but centred round different temperatures, and a feeling that it trends up a bit over time – though this is hard to be sure of because of the amount of (chaotic*) variation from year to year. If the data values had occurred in strictly ascending order over the 141 years the ‘time series’ would have never fallen as you move from 1884 to 2024 – so it would look very very different – but the boxplots would have been identical so the time line is an extremely important component of trying to understand weather and climate – and what is happening with climate change.

When you look at the January and Winter data, there is substantially more variation in the Eng SE + Central S region than in the other 3 and this corresponds to its time series showing more extreme fluctuations than the other time series.

* Chaotic variation is what meteorologists call this – mathematicians & statisticians would refer to it as ‘random variation’ but like many other phenomenon we use random to describe it only reflects that we do not understand the process well enough yet to be able to predict outcomes e.g. turbulence around a Formula 1 car.

‘Chaotic variation’ captures that aspect of the variation much better than ‘random variation’.

Gentle extensions of time series and multiple representations

One of the difficulties in dealing with time series when there is a lot of variation, as here, is to try to identify if there is any long term change underlying the process. Meteorologists describe the behaviour in temperatures as ‘chaotic variation’ – which is a very good descriptor of what it looks like. There are some things we can do to try to make it easier to identify any long term changes. One is to smooth the data by taking an average of a number of years – but how many years is best?

Using Multiple Representations

One thing that mathematicians and statisticians do to try to get fuller understanding of a problem is to look at multiple representations.

This section is very short to do with a class and its purpose is just to show how accessible the stories in data can be – without complicated statistical techniques, but using the simple graphs they know to visualise how the data behaves, and to show the power of using more than one representation to develop a fuller understanding of the stories in the data.

One of the difficulties in dealing with time series when there is a lot of variation, as here, is to try to explain every movement.  There is a narrative about losing detail if you take too long a period – including not getting the next 15 year average until 2033 (the next 25 year average is actually available at the same time), where the next 3 year average is available in 2027, and next 5 year would be available in 2028. There is no ‘right answer’ to a best time period – there is a trade off between the detail of the ‘chaotic variation’ most evident in the single year data, and seeing an upward trend in the data which is more evident as the time period increases .

The table below shows the highest 20 average annual maximum temperatures in this region between 1884 and 2024. There are 19 which are above 15°C of which 12 are this century.

Other noteworthy observations from this table: All of the top five were recorded since 2014, and 8 of the top 10 were recorded since 2003.

Yearannual
202216.075
201415.63333
202015.63333
202315.625
201815.41667
198915.3
201115.3
200315.28333
192115.275
202415.2
194915.19167
200615.19167
199015.175
201915.13333
200715.075
201715.075
199515.075
195915.01667
199914.93333

Note of caution: this table, and the graphs are for the highest average maximum temperatures – news reports on ‘highest annual temperatures’ are normally based on the mean temperature (average of maximum and minimum daily temperatures), so the rank order doesn’t match exactly to this table.

Student Worksheet 4 – Scatter graphs

Learning Objectives

  1. To be able to determine the strength and type of the relationship between two temperature variables using scatter graphs.
  2. To be able to evaluate the usefulness of one temperature variable as a predictor for another based on the correlation in a graph.

Before showing any data, or any maths questions, ask the pupils to reflect on the following question for a minute, and then discuss it for a couple of minutes with your neighbour:

  • Do you think that the spring maximum temperature would be a good predictor of the autumn maximum temperature?

Commentary: 

Note:  It is important in looking at these activities that association (correlation) should not be confused with causation.

Q1 – Spring & Autumn data for N Ireland in 2005 – 2024 and 1885 – 1904

Neither scatter diagram shows much correlation so for these periods the spring temperature does not give you any substantive indication of what the autumn temperature will be.

The temperatures in the 2005 – 2024 period seem to be about 1 to 1.5 degrees warmer, on average, than the temperatures in 1885 – 1904 in the same season.

Q2 – Summer and Winter data for Eng SE + Central S in 2005 – 2024 and 1885 – 1904

The story is very similar to what was seen in the other two seasons, in a different region.

Q3 – Here there is fairly strong correlation between the two temperatures, which means that knowledge of the value of one would give you a reasonable prediction of what the other one is – not that there is a causal effect, but both are the result of the prevailing meteorological conditions over the UK.

However – there are systematic differences between the weather in the two regions due their geographical characteristics – and part b) draws attention to this – a line of best fit to the data (by eye) would not be too far away from parallel to the equal temperature line shown, but roughly 5 degrees lower. It wouldn’t change my view that it is a reasonably good predictor (because of the strong correlation) but it would help me to identify how I would make the prediction.

Royal Statistical Society logo
Climate education quality mark October 2025

Averages, Distributions and Range

The 30th United Nations Climate Change Conference, or Conference of the Parties of the UNFCCC, COP30, will be taking place in Brazil this year.

So, let’s take the opportunity to use ERA explorer data to look at the difference between climate and weather and why we usually use a 30-year averaging period to define the climate of a particular place.

Specifically, students will

  • Look at the mean, mode and median of a 30-year data set and critically consider which is the most appropriate average 
  • Look at the difference between 10, 20 and 30-year means and critically consider which is the most appropriate definition of the climate. 

Worksheet

PowerPoint

Excel spreadsheet with the original ERA explorer data

climate education quality mark Sept 25

Climate Change Glossary and Resources

Select a letter to see a definition of the terms in the climate change association tool. Alternatively, to find a teaching resource associated with any of the terms, use the ‘all climate change’ drop-down menu on the right. Not all the terms have associated resources yet, but we are adding new ones all the time.

Scotland’s Curriculum – Trees for Net Zero

Resource produced in collaboration with MEI

This resource comprises several stand-alone activities which may be used separately.

Brief overview of session ‘logic’

  • Why trees are good
  • People are planting trees – estimates around what the numbers look like in terms of land use
  • Some companies encourage you to offset flights by planting trees – how many trees for one flight?
  • How much carbon do trees capture and store?
  • How does the amount of carbon captured and stored by a tree change during its lifecycle?
  • What happens to that carbon when a tree dies?
  • Can you plant a tree to offset a flight?
  • What is Net Zero?

Mathematical opportunities offered

  • Estimation and proportional reasoning
  • Developing a sense of scale of large numbers
  • Interpretation of data, statistics, graphs, infographics in context
  • Critiquing graphs
  • Analysing and comparing data in order to develop and present a conclusion
  • Making assumptions
  • Making predictions
  • Reading scales
Climate Change Quality Mark Content

Scotland’s Curriculum Trees for Net Zero

Resource produced in collaboration with MEI

Note that this session is made up of separate activities which may be used independently.

Brief overview of session ‘logic’

  • Why trees are good
  • People are planting trees – estimates around what the numbers look like in terms of land use
  • Some companies encourage you to offset flights by planting trees – how many trees for one flight?
  • How much carbon do trees sequester?
  • How does the amount of carbon sequestered by a tree change during its lifecycle?
  • What happens to that carbon when a tree dies?
  • Can you plant a tree to offset a flight?
  • What is Net Zero?
  • Can trees be used to achieve Net Zero?

Mathematical opportunities offered

  • Estimation and proportional reasoning
  • Developing a sense of scale of large numbers
  • Converting between m2 and km2
  • Interpretation of data, statistics, graphs, infographics in context
  • Critiquing graphs
  • Analysing and comparing data in order to develop and present a conclusion
  • Making assumptions
  • Making predictions
  • Reading scales
Climate Change Quality Mark Content

Maths for Planet Earth

Climate-based questions for students and teachers. A team of students and academics at the University of Oxford developed these Maths for Planet Earth questions.

Weather and Climate worksheet

Background Information:

Climate is the average weather over a long time period (30 years) for a particular region or place. The climate affects a number of environmental factors within the region including the type and growth of vegetation and wildlife. The climate is determined by large scale factors such as the Earth’s orbit around the Sun, the position of the continents and the composition of the atmosphere. Weather describes the short-term state of our atmosphere. This may include information about the air temperature, precipitation, air pressure and cloud cover. Our local weather changes daily due to the movement of air in our atmosphere.

Experiences and Outcomes:

I can investigate the relationship between climate and weather.

two dice tally

Difference between weather and climate

You will need:

2 dice

Tally chart for numbers 2-12 Graph paper

Method:

In pairs, throw the two dice about 100 times and record the combined score shown each time. 

Draw a bar graph of the results.

Results:

The results should show a smoothish distribution, with a score of 7 being most frequent. Ask each group to predict what their score will be if they throw the dice one more time – they can’t. However, with one more throw, the mean of all the scores will stay about the same (about 7). In the same way, the weather may be very different from day to day but the climate, the weather we ‘expect’, stays about the same.

If you don’t have access to dice, you can do this activity online at https://www.metlink.org/blog/weather-climate-extreme-weather-and-chaos-theory/ 

Extension:

Can the students design a concept cartoon to illustrate the difference between weather and climate? See https://www.stem.org.uk/system/files/elibrary-resources/legacy_files_migrated/1292WEATHER.PDF and http://www.millgatehouse.co.uk/concept-cartoons-research/ 

References/Resources: 

A YouTube video showing an owner and his dog, as an analogy for weather and climate

Key Stage 3 – Trees and Carbon Capture

Resource produced in collaboration with MEI

Brief overview of session ‘logic’

  • Why trees are good
  • How much carbon do trees capture and store?
  • How does the amount of carbon captured and stored by a tree change during its lifecycle?

Mathematical opportunities offered

  • Interpretation of data, statistics, graphs, infographics in context
  • Critiquing graphs
  • Analysing and comparing data in order to develop and present a conclusion
  • Making assumptions
  • Making predictions
  • Reading scale
Climate Change Quality Mark Content

Key Stage 3 – Trees for Net Zero (Extended Resource)

Resource produced in collaboration with MEI

Brief overview of session ‘logic’

  • Why trees are good
  • People are planting trees – estimates around what the numbers look like in terms of land use
  • Some companies encourage you to offset flights by planting trees – how many trees for one flight?
  • How much carbon do trees capture and store?
  • How does the amount of carbon captured and stored by a tree change during its lifecycle?
  • What happens to that carbon when a tree dies?
  • Can you plant a tree to offset a flight?
  • What is Net Zero?

Mathematical opportunities offered

  • Estimation and proportional reasoning
  • Developing a sense of scale of large numbers
  • Interpretation of data, statistics, graphs, infographics in context
  • Critiquing graphs
  • Analysing and comparing data in order to develop and present a conclusion
  • Making assumptions
  • Making predictions
  • Reading scales
Climate Change Quality Mark Content

Core Maths – Trees and Carbon Capture

Resource produced in collaboration with MEI

Brief overview of session ‘logic’

  • Why trees are good
  • How much carbon do trees sequester?
  • How does the amount of carbon sequestered by a tree change during its lifecycle?

Mathematical opportunities offered

  • Interpretation of data, statistics, graphs, infographics in context
  • Critiquing graphs
  • Analysing and comparing data in order to develop and present a conclusion
  • Making assumptions
  • Making predictions
  • Reading scales
Climate Change Quality Mark Content
MetLink - Royal Meteorological Society
We use cookies on this site to enhance your user experienceBy clicking any link on this page you are giving your consent for us to set cookies. More info

By clicking any link on this page you are giving your consent for us to set cookies. More info