How did you do? Did you think one car performed better than the other? What data were you paying attention to as you were racing?
Before we analyze the data your class has collected, let’s look at a sample dataset, called sample1. In the following visualization app, you first have the Group ID which represents a sample taken from a previous group of players. The X Variable can either be Car, Order, or Player ID. The Y Variable can either be Finish Time or Top Speed Reached. Try switching between different X and Y variables to see what different data pops up. Then, complete the questions below the app to make sure you understand how this app works.
To make good decisions with data, you often need to consider more than what you can see in a single graph or table. Sometimes there’s an outside influence that the original experiment did not account for. For example, confounding variables (variables that the researcher did not include in the study, but that might be connected to both the independent variable and the dependent variable) may influence the results. In order to evaluate whether the car speeds are truly different, we should identify and consider possible confounding variables. List at least two potential confounding variables that might get in the way of determining which car is faster.
In the Racer App above replace sample1 with the Group ID used by your class.
a. Look carefully through the data from your class. Identify any players that you think should be removed from the dataset.
b. Do the results from your class agree with the sample1 data? In what ways (if any) do they disagree?
c. Do you believe the tutorial data from your class provides convincing evidence that one car is better than the other?
d. Based on the previous question, what evidence did you use to determine that one car was better than another, or what additional evidence would be needed before you could determine that?
Think about different ways we could have conducted this study to improve the quality of our data.
a. Are there better variables to use? For example, what would be the benefits of using Top Speed Reached as the response variable instead of Finish Time?
b. What are the benefits of randomizing the order of the cars?
c. How many races would need to be completed to convince you that one car is better than the other?