Lesson 1.5 - Measures of Spread & Percentiles

Key Question: Is the American Dream still alive?

Content: Measures of Center | Percentiles | Visualizing Data

Video

Student Items

Handout: pdf, doc

Mastery Check: link

CODAP Activity: link

CODAP Handout: pdf, doc

Teacher Items

Handout Key: pdf, doc

Mastery Check Key: link

Slide Deck: pdf, ppt

CODAP Handout Key: pdf, doc

CODAP Video: link

Course Resources

Resources for teaching our High School Statistics curriculum.

  • Lesson Flow - timing and flow of class, using our lesson materials
  • Pacing Guide - pacing our units, with daily or block schedules
  • Alignment Guide - aligning our lessons to national and state standards for high school statistics
  • Classroom Routines - a guidebook of classroom routines embedded within our lessons

Teaching Resources

Resources for teaching with Skew The Script.

Lesson Notes

Lesson-specific insights from the creators of this lesson.

GIF

In this lesson, students continue to consider home ownership and building wealth. It builds on the prior lesson, as we introduce measures of spread alongside measures of center. Once students have engaged with both measures of center and spread, they will have the opportunity to explore a housing dataset using CODAP. Using the CODAP activity link, they will access a preloaded data file and explore it using the steps in the CODAP handout. Then, they’ll have the opportunity to explore the data set on their own.

Learning Targets
  • Calculate and interpret measures of spread (standard deviation, range, IQR)
  • Calculate and interpret percentiles
  • Calculate summary statistics and visualize distributions of single quantitative variables with technology
Learning Progression

Fresh from considering measures of center, perhaps from new perspectives, students now move on to address and describe measures of spread. While the range will be likely familiar to most students, and interquartile range to some, standard deviation is likely new to most. For standard deviation, the goal of this lesson is to see how it is calculated, but not to be able to do it by hand independently. Problems and contexts throughout the course will either provide the standard deviation value or students will utilize technology to find it. Then, students will be asked to interpret the standard deviation value, in context. Once students have developed or refreshed their understanding of interquartile range, they will build on that knowledge to find percentiles. In the next lesson (1.6), students will combine their understanding of measures of center and spread to develop 5-number summaries. As students add to their ‘statistical toolbox,’ they may begin to recognize the advantages and disadvantages of these summary statistics for quantitative data, and determine which measure of center or spread is most appropriate in a given situation. Informally developing this differentiation will support them in decision making for later lessons.


Before proceeding: Familiarize yourself with the lesson materials linked above (e.g. handout, handout key, slides, video). Then, for additional background and teaching tips from the lesson creators, check out the sections below.


  • If your students do not have much experience with summation notation (Σ), you may need to explain the way this notation works when covering the standard deviation formula. Although covering the formula for standard deviation may not feel necessary (students will normally use technology to compute the standard deviation), understanding the underlying process behind the formula will help them develop a deeper conceptual understanding of what the standard deviation represents. If your students do not have much experience with summation notation (Σ), you may need to explain the way this notation works when covering the standard deviation formula. Although covering the formula for standard deviation may not feel necessary (students will normally use technology to compute the standard deviation), understanding the underlying process behind the formula will help them develop a deeper conceptual understanding of what the standard deviation represents.
  • It may be helpful to call out the usefulness of IQR to identify how the middle 50% of data behaves. Doing so will help students to recall it as different from the 5-number summary they will see in lesson 1.6. Similarly, describing standard deviation as a way to measure how the data is dispersed away from the mean differentiates it from the range, which is simply the difference between the extreme data values.
  • One nice connection to make with students is the relationship between percentiles and quartiles, where Q1 approximately corresponds to the 25th percentile and Q3 approximately corresponds to the 75th percentile.

First, download this lesson's slide deck and handout key to see the prompt and sample responses for the Lesson Starter. Then, check out the additional background notes below.

Instructional routine: Notice & Wonder. The lesson handout provides a Notice & Wonder T-frame for students to capture their notes and ideas. It is important that students recognize the difference between a noticing (observation) and a wondering (question that comes to mind). You can find more background on implementing a Notice & Wonder here.

Purpose & Background: The goal of this Lesson Starter is to compare and contrast data displays with the same summary statistics. Having recently considered the mean and the median of a data set in context, students may have developed their own mental picture of how these statistics visually appear in graphs. This Lesson Starter is designed to challenge that thinking and to show that measures of center can’t always tell the whole story. As datasets that are clearly different but with the same measures of center, students will be challenged to recognize what similarities the sets must have and what causes their graphs to look so different. Offered without context, this Notice & Wonder is designed to provide students with the opportunity to solidify their understanding of measures of center before looking more broadly at measures of spread.

First, download this lesson's handout key and read through its Discussion Question section. Then, check out our model discussion norms and the additional background notes below.

  • When motivating the Discussion Question, some students may look at the situation as inherently unfair, but others may see it as a ‘change in the times’ with renting being the norm and finding new ways to build wealth through other types of investments. This latter stance offers opportunities for additional discussion of the implications of the change, such as people not being tied to the same roots that home ownership may require.
  • Other topics that may inform the discussion are considerations of what else has changed. For example, in the 1970s, many people stayed with the same company for 20-30 years and often lived in the same place during that time. In 2024, the median tenure for employees at their current jobs was only 3.9 years. Changing employers or careers more frequently often means moving more frequently, which makes home ownership less attractive.
  • Students may express concerns that it will be harder to build wealth and pass it on to their children. Home ownership (and the equity developed over time) sometimes gives parents the opportunity to give their children a ‘head start’ by helping them to pay for college or even get that first mortgage for a home of their own.
  • Depending on the context in which you teach, you may find that your student population has a lopsided opinion on this Discussion Question. As a facilitator, it’s valuable to encourage students to voice the “other side,” so that students have a robust discussion that mirrors the broader civic discourse on this topic.
  • Data sets of home values and incomes tend to have a right skew. Hence, the median is often a more useful measure of center for these variables than the mean. This may be a helpful point to reference during the next lesson in the course sequence, which specifically explores the median’s resistance to outliers and skew.
  • The data explored by students in CODAP is a random sample from the famous Ames housing data set. The data set includes uniquely detailed data about housing, compiled from research of the housing market in Ames, Iowa. The city of Ames is home to Iowa State University. Like most college towns, the city of Ames has a diverse housing stock, including homes with a variety of features, sizes, and values. The full data set includes more than 20 variables for nearly 3,000 homes. The sampled data set students use to explore CODAP was curated to remove variables less recognizable or significant to students, and to make it manageable for students (300 cases) to analyze.
  • To motivate each measure of spread, we continue to work with a small data set. This permits students to develop conceptual understanding of these measures by seeing their manual calculation. Then, we show students how to make use of technology to find summary measures of larger sets of data.
  • Squaring differences is another key idea that occurs in the standard deviation formula. Also used in a variety of methods across statistics (variation, weighting errors, regression, coefficient of determination) students will see the strategy of squaring differences several times throughout the course.
  • Students sometimes have these two common questions about the standard deviation formula. Some background is provided below. Note that each explanation is beyond the scope of our course, but can be helpful for curious students.
    • Why do we square the differences to get rid of negatives? Why not just get the absolute value? There are two central reasons for this: i) Squaring emphasizes larger differences, which can be helpful for surfacing outliers. ii) Squares have nicer mathematical properties than absolute values. In particular, models that try to minimize the standard deviation often take the derivative of the standard deviation function. It’s much easier to find the derivative of a square than an absolute value. Check out a more thorough discussion of this topic here.
    • Why do we divide by n - 1, instead of simply dividing by n? The reason is based on the concept of degrees of freedom. When calculating the sample standard deviation, we use the sample mean and every data value. However, with the sample mean in hand, we don’t actually need to be given every data to recreate the full sample. The final data value can be inferred from the sample mean and the remaining n - 1 data values. Hence, there are only n - 1 varying pieces of information in the standard deviation formula, or n - 1 degrees of freedom. This is why we divide by n - 1. A fuller discussion can be found here.

Student Supports

Lesson-specific resources to support all learners.

  • Eliciting student thinking about why we ‘get rid of the negatives’ is worthy of exploration and discussion to help students make sense of the standard deviation formula. A motivating example is considering two data values that are equidistant from the mean, in opposite directions. When we sum their differences from the mean, the negative difference cancels the positive difference, and we get a value of 0, making it appear that there is no error or variation at all, even though we know that both data values have some variation from the mean. This is why squaring the differences to ‘get rid of negatives’ is important for measuring spread.
  • For the exploration of CODAP, a step-by-step walkthrough video is provided among the lesson materials. For students who would benefit from extra support with the interface, having the walkthrough video pulled up alongside their window for CODAP can be helpful.
  • Vocabulary used in the context of the lesson may include words that are unfamiliar or have several meanings. In particular, the following mathematical terms may need clarification or a definition provided:
    • Standard deviation
    • Population standard deviation
    • Sample standard deviation
    • Range
    • Interquartile range (IQR)
    • Percentile
    • Inclusive
    • Exclusive
  • In addition, the following contextual terms may need clarification or a definition provided:
    • Federal reserve
    • Middle class
  • Provide sentence stems to aid students in discussing prices of houses and what that means within a larger context. See these specific examples and discussion supports.
    • If young people aren’t earning enough money to buy a home, then ___________________
    • If young people only rent their homes, then ___________________________
    • Higher housing prices mean _____________________________
    • Today’s young adults may not be buying homes because_______________________________