yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Matched pairs experiment design | Study design | AP Statistics | Khan Academy


4m read
·Nov 11, 2024

The last video, we constructed an experiment where we had a drug that we thought might help control people's blood sugar. We looked for something that we could measure as an indicator of whether blood sugar is being controlled, and hemoglobin A1c is actually what people measure in a blood test. We have a whole video on it on KH Academy, but it is an average measure of your blood sugar over roughly a 3-month period.

So that's the explanatory variable: whether or not you're taking the pill. The response variable is, well, what does it do to your hemoglobin A1c? We constructed a somewhat classic experiment where we had a control group and a treatment group. We randomly assigned folks into either the control or the treatment group, and to ensure that one group or the other, or I guess both of them, don't end up with an imbalance—in the case of the last video, an imbalance of men or women—we did what we call a block design.

We took our 100 people, and we just happened to have 60 women and 40 men. We said, okay, well, let's split the 60 women randomly between the two groups and let's split the 40 men between these two groups, so that we have at least an even distribution with respect to sex. We would measure folks’ A1c’s before they get the treatment or the placebo. Then we would wait three months of getting either the treatment or the placebo, and then we'll see if there's a statistically significant improvement.

Now, this was a pretty good, and it's a bit of a classic experimental design. We would also do it so that the patients don’t know which one they’re getting: placebo or the actual treatment. So, it's a blind experiment, and it would probably be good if even the nurses or the doctors who are administering the pills, who are giving the pills, also don’t know which one they’re giving. So, it would be a double-blind experiment.

But this doesn’t mean that it’s a perfect experiment, and there seldom is a perfect experiment. That’s why it should be able to be replicated; other people should try to prove the same thing, maybe in different ways. But even the way that we designed it, there’s still a possibility that there are some lurking variables in here. Maybe we took care to make sure that our distribution of men and women was roughly even across both of these groups, but maybe through that random sampling, we got a disproportionate number of young people in the treatment group.

And maybe young people responded better to taking a pill. Maybe that, you know, it changes their behaviors in other ways, or maybe older people, when they take a pill, they decide to eat worse because they say, “Oh, this pill is going to solve all my problems.” And so, you could have these other lurking variables like age, or where in the country they live, or other types of things that just by the random process might get uneven in one way or another.

Now, one technique to help control for this a little bit—I shouldn’t use the word control too much—another technique to help mitigate this is something called matched pairs design. Matched pairs design of an experiment is essentially, instead of going through all of this trouble, saying, “Oh boy, maybe we do block design, all this random sampling,” instead, you randomly put people first into either the control or the treatment group.

Then we do another round; you measure, and then you do another round where you switch. The people who are in the treatment go to the control, and the people who are in the control go into the treatment. So, we could even extend from what we have here. We can imagine a world where the first three months, we have 50 people in this treatment group. We have another 50 people in this control group that are taking the placebo.

We see what happens to the A1c’s, and then we switch, where this group over here, then—and they don’t know—they don’t know, first of all, ideally it’s a blind experiment, so they don’t even know they were in the treatment groups. Hopefully, the pills look identical. So now that same group for the next three months is now going to be the control group.

They got the medicine for the first three months, and we saw what happens to their A1c. Now they’re going to get the placebo for the second three months, and then we are going to see what happens to their A1c. Likewise, the other group is going to be switched around: the folks that used to be getting the placebo could now get the treatment. They are now going to get the treatment.

The value here is that because everyone is going through both—being in the control group for one period and in the treatment group for another—and they don’t know when which one is happening, you are less likely to have a lurking variable, like age or geographic region or behavior, cause an imbalance or somehow skew the results or give you biased results.

So, this is an interesting thing, and you know, even what I’ve talked about in this video and the last one, these are just different ways to approach it. As you construct experiments, this is in medicine; you’ll obviously construct experiments in other fields. It’s important to think about what types of things are practical to do and also have the best chance at giving you, I guess you could say, real, unbiased information as to, in the case of an experiment, the efficacy of something or whether a certain explanatory variable really does drive a causal effect on a response variable.

More Articles

View All
Buying A $0 House: My Real Estate Investing Strategy
What’s up you guys, it’s Graham here. So, as I mentioned in one of my previous videos, I read all of the comments, and yes, that includes the comments where you asked me if I read them. I read them! And it’s by doing this that I can see that anytime I get…
Comparing P-value from t statistic to significance level | AP Statistics | Khan Academy
Jude was curious if the automated machine at his restaurant was filling drinks with the proper amount. He filled a sample of 20 drinks to test his null hypothesis, which is the actual population mean for how much drink there was in the drinks per drink is…
Q&A With Grey: One Million Subscribers Edition
Hi. So I think this is the part where I’m supposed to have a dramatic opening, perhaps the 2001 theme song playing in the background as the number 1,000,000 fades into view, but obviously that’s not happening. Don’t get me wrong, I’m lucky that so many pe…
Molecular evidence for evolutionary relationships examples | High school biology | Khan Academy
An investigation was carried out on four different plant species to determine which of three species was most closely related to an unknown plant species. The results of the investigation are shown in the data table below. Which plant species appears to b…
Differentiating related functions intro | Advanced derivatives | AP Calculus AB | Khan Academy
We are told the differentiable functions x and y are related by the following equation: y is equal to the square root of x. It’s interesting, they’re telling us that they’re both differentiable functions. Even x is a function must be a function of somethi…
Safari Live - Day 333 | National Geographic
This program features live coverage of an African safari and may include animal kills and carcasses. Viewer discretion is advised. This program features live coverage of an African safari. It’s a wonderful Sunday, and you are looking at a lovely drawn go…