Lesson 1.B.5 - Grouping in Experiments
Key Question: Has social media caused declines in mental health?
Content: Completely Randomized, Randomized Block, & Matched Pairs Experiments
Alignment: CED Topic 1.13.B-1.13.D
Video
Course Resources
Resources for teaching our AP® Statistics curriculum.
- Lesson Flow - timing and flow of class, using our lesson materials
- Pacing Guide - pacing our units, with daily or block schedules
- CED Alignment Guide - aligning our lessons to the AP® Statistics Course and Exam Description
Teaching Resources
Resources for teaching with Skew The Script.
- Discussion Norms - our model discussion norms for the classroom
- Letter to Parents - letter to share with parents about our nonpartisan approach
- Teaching Math on Civic Topics - tips for teaching math lessons that cover civic topics
Lesson Notes
Lesson-specific insights from the creators of this lesson.
From the radio, to television, and to video games, there’s a long history of new technology that makes adults wonder: “Are the kids going to be ok?” For the most part, the worries have turned out to be more noise than signal. However, some new data suggests that a more modern tech boom – social media – may be different. In this lesson, students investigate the relationship between social media and mental health. Then, they explore how different study designs allow researchers to ask: Is the relationship just association, or actually causation?
- Describe the features of different experimental designs (completely randomized, randomized block, matched pairs)
- Evaluate the advantages and disadvantages of different experimental designs
- Describe the generalizability of results from an experiment
Before proceeding: Familiarize yourself with the lesson materials linked above (e.g. handout, handout key, slides, video). Then, for additional background and teaching tips from the lesson creators, check out the sections below.
- We’ve found that many teenagers tend to be curious and open to having discussions about mental health and social media. However, that openness quickly closes if they feel that they’re being “given a lecturing” about social media. Starting the lesson with the examples (in the lesson slides and video) of past concerns about radio, television, and video games provides a helpful “we know some concerns have been overblown in the past” framing to the lesson. Then, presenting the more recent statistics about mental health declines feels less like a lecture, and more like the presentation of a compelling question: are these trends just associated with the spread of social media, or are they causally connected?
- Many students are frequent social media users. Bringing that expertise into the room can feel empowering, and we find that students are often excited to dive into this lesson and share their own experiences. At the same time, other students may not feel as comfortable discussing their own experience with social media, especially as it relates to mental health. We recommend keeping this lesson’s discussions at the statistical level, while also allowing students to share their own experiences according to their own level of comfort.
- To motivate the more complex experimental designs (randomized block and matched pairs), it’s helpful to explicitly motivate the problem of variation. In an ideal world, we could give someone a treatment, then rewind time, give them a different treatment, and then measure the difference in their outcomes. Because this isn’t possible, we have to randomly assign different individuals to different treatments. The problem with this approach is that different treatment groups could be composed of very different individuals, making the effect of treatment difficult to detect. Randomized block and matched pairs designs attempt to mitigate this problem by making the treatment groups as comparable as possible, making any treatment effect easier to detect.
First, download this lesson's Handout Key and read through its Discussion Question section. Then, check out our model discussion norms and the additional background notes below.
- This question represents another exploration of the concept of generalizability. Because the experiment is performed with college students, the results can’t be fully generalized to other age groups. So, students have to make their own inferences about how the treatment effect might differ for high school students. Surfacing this limitation during discussion can be helpful for reinforcing the concept of generalizability.
- Because of the generalizability limitation mentioned above, answers to the Discussion Question can (and should) vary. The important point to emphasize is that students should provide robust reasoning for their opinion, grounded in the context of the lesson and by statistical reasoning.
- Although they’re less frequent, experimental studies of social media and mental health have been performed with high school aged individuals (or with mixed samples of high school and college aged individuals). For example, Kleemans, Daalmans, Carbaat, & Anschütz (2018) and Davis & Goldfield (2024). That said, because these studies are so few in number and test different treatment conditions, it’s difficult to use them to make strong inferences about this lesson’s Discussion Question.
- The opening of the lesson mentions increased rates of major depressive episodes, anxiety disorders, and emergency department visits for self-harm among young people. It’s possible that the first two indicators could be attributed to more frequent detection and reporting of mental health disorders, rather than true increases in the underlying prevalence of these disorders. However, the same can’t be said about the third metric, since standards for reporting among emergency departments have not changed over the time periods mentioned in the lesson.
- Students may wonder: In the Iowa State social media experiment, why did researchers only ask participants to limit their social media use, rather than fully controlling it? In the full paper, the researchers state, “One consistent finding has been that complete abstinence from social media may not be sustainable for the average user. A less strict approach is to limit social media use by monitoring. Monitoring limited usage, as opposed to abstinence, may be more sustainable and practical.” So, researchers wanted to know if a more sustainable approach that required less coercion – merely telling folks to reduce their social media time – would still produce an effect. And it did.
- Students interested in this topic further can explore the increasing body of experimental research on social media use and mental health, including the following articles: Brailovskaia, Swarlik, Grethe, Schillack, & Margraf (2023), Davis & Goldfield (2025), Graham, Mason, Riordan, Winter, & Scarf (2021), Hunt, All, Burns, & Li (2021), Kleemans, Daalmans, Carbaat, & Anschütz (2018), Lambert, Barnstable, Minter, Cooper, & McEwan (2022), Thai, Davis, Mahboob, Perry, Adams, & Gold (2023), and Yuen, Koterba, Stasio, et al. (2019).
- The lesson discusses the matched pairs design of having each participant engage in both treatment conditions, with the order of those conditions randomized. A matched pairs design can also be applied to situations in which different participants are paired together, before each participant is randomly assigned to a different treatment. For example, many experiments seek to recruit identical twins – who have almost identical DNA – to serve as matched pairs in experiments. In fact, some Universities even have twin registries that help connect researchers with twins for studies.
- Randomized block and matched pairs designs can be connected to a statistical idea that arises later in the course: power. By reducing variation between treatment groups, these experimental designs increase the power to detect significant differences between treatment groups. This can help reduce study costs, as researchers can recruit fewer participants, while still retaining the power to detect treatment effects. That said, because students have not yet been introduced to the concept of power, it’s best to wait to surface this connection until later in the course.
- Although the Iowa State researchers did not implement a randomized block design or a matched pairs design, they used advanced data analysis methods (beyond the scope of AP Stats) that mimic these designs. For example, researchers used multiple regression models to control for gender when measuring treatment effects. This is similar to comparing treatment effects only within gender blocks. In addition, researchers had every student take a pre and post psychological evaluation. The pre scores were used as a regression control, allowing researchers to control for individuals’ baseline evaluations when measuring treatment effects. This is similar to a matched pairs comparison at an individual level.
Student Supports
Lesson-specific resources to support all learners.
- For supporting student understanding of the advantages of matched pairs designs, it can often help to use the example of identical twin studies. Identical twins have almost identical DNA. Imagine we recruit four sets of identical twins to participate in an experiment comparing two treatments (A and B). What would be the better approach: Randomly assigning two sets of twins to treatment A and the other two sets to treatment B? Or splitting each pair of twins, such that one twin in each group randomly gets assigned to A, while the other gets B? The second plan is best, as it reduces variation between the subjects in group A and group B. Then, we compare outcomes within each group of twins, so that the treatment effect will be easier to detect.
- Similarly, the example above can be used to motivate the advantages of randomized block experiments. Although we may not have twins, we can still try to group participants according to a similar trait (a blocking variable). Then, if we can compare treatment across individuals who share that trait, the treatment effect will be easier to detect. In fact, we can think of “being an identical twin” as a particularly powerful blocking variable.
- Vocabulary used in the context of the lesson may include words that are unfamiliar or have several meanings. In particular, the following mathematical terms may need clarification or a definition provided:
- Control group
- Placebo effect
- Single blind study / single-masked study
- Double blind study / double-masked study
- In addition, the following contextual terms may need clarification or a definition provided:
- Social media
- Mental health
- Scarecrow