yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Example estimating from regression line


3m read
·Nov 11, 2024

Lizz's math test included a survey question asking how many hours students spent studying for the test. The scatter plot below shows the relationship between how many hours students spend studying and their score on the test. A line was fit to the data to model the relationship. They don't tell us how the line was fit, but this actually looks like a pretty good fit if I just eyeball it.

Which of these linear equations best describes the given model? So as you know, this point right over here shows that some student at least self-reported they studied a little bit more than half an hour, and they didn't actually do that well on the test. Looks like they scored a 43 or 44 on the test.

This right over here shows, or like this one over here, is a student who says they studied two hours, and it looks like they scored about a 64 or 65 on the test. This over here, or this over here, looks like a student who studied over four hours, or they reported that, and they got, looks like, a 95 or 96 on the exam.

And so then, these are all the different students; each of these points represents a student. They fit a line, and when they say which of these linear equations best describes the given model, they're really saying which of these linear equations describes or is being plotted right over here by this line that's trying to fit to the data.

So essentially, we just want to figure out what is the equation of this line. Well, it looks like the Y intercept right over here is 20, and it looks like all of these choices here have a y intercept of 20, so that doesn't help us much. But let's think about what the slope is. When we increase by one, when we increase along our x-axis by one, so change in X is one, what is our change in y?

Our change in y looks like, let's see, we went from 20 to 40. It looks like we got went up by 20, so our change in y over change in X for this model, for this line that's trying to fit to the data, is 20 over one. So this is going to be our slope, and if we look at all of these choices, only this one has a slope of 20, so it would be this choice right over here.

Based on this equation, estimate the score for a student that spent 3.8 hours studying. So we would go to 3.8, which is right around, let's see, this would be 3.8, would be right around here. So let's estimate that score. If I go straight up, where do we intersect our model? Where do we intersect our line?

So it looks like they would get a pretty high score. Let's see, if I were to take it to the vertical axis, it looks like they would get about a 97. So I would write my estimate is that they would get a 97 based on this model.

Once again, this is only a model; it's not a guarantee that if someone studies 3.8 hours they're going to get a 97, but it could give an indication of what maybe might be reasonable to expect, assuming that the time studying is the variable that matters.

But you also have to be careful with these models because it might imply, if you kept going, that if you study for nine hours, you're going to get a 200 on the exam, even though something like that is impossible. So you always have to be careful extrapolating with models and keep it, take it with a grain of salt.

This is just a model that's trying to fit to this data, and you might be able to use it to estimate things or to maybe set some form of an expectation, but take it all with a grain of salt.

More Articles

View All
Conditions for a t test about a mean | AP Statistics | Khan Academy
Sunil and his friends have been using a group messaging app for over a year to chat with each other. He suspects that, on average, they send each other more than 100 messages per day. Sunil takes a random sample of seven days from their chat history and r…
How To Build Product As A Small Startup - Michael Seibel
A lot of the problems that I faced in the early stages of my companies were because I didn’t have a process to get product out of the door. Um, instead, my co-founders and I would have long debates, which would often turn into arguments. We wouldn’t write…
Percent from fraction models
So we’re told the square below represents one whole. So, this entire square is a whole. Then they ask us, what percent is represented by the shaded area? So why don’t you pause this video and see if you can figure that out? So, let’s see. The whole is di…
FIRST VIDEO OF NEW SPIDER SPECIES! - Smarter Every Day 78
[Music] Hey, it’s me D. Welcome back to Smarter Every Day. Today’s observation I’m going to share with you is amazing: it’s the discovery of a new species of spider, potentially in the Amazon rainforest. I’m talking about the exact moment that we walked u…
Let's talk about Dave Ramsey and why he doesn't like credit cards!
What’s up you guys, it’s Graham here. So, what are the comments I get a lot of on my channel, especially on my videos about getting a credit card and building your credit history? Comments like, “Dave Ramsey would let me show you drunk yet!” He’d have a …
Comparing with z-scores | Modeling data distributions | AP Statistics | Khan Academy
Before applying to law school in the U.S., students need to take an exam called the LSAT. Before applying to medical school, students need to take an exam called the MCAT. Here are some summary statistics for each exam. For the LSAT, the mean score is 15…