yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Estimating mean and median in data displays | AP Statistics | Khan Academy


3m read
·Nov 11, 2024

We are told researchers scored 31 athletes on an agility test. Here are their scores; it's in this histogram. And what I'm going to ask you is which of these intervals, interval A, B, or C, which one contains the median of the scores and which one, or give an estimate of which one contains the mean of the scores.

Pause this video and see if you can figure that out.

So let's just start with the median. Remember, the median you could view as the middle number, or if you have an even number of data points, it would be the average of the middle two. Here we have an odd number of data points, so it would be the middle number. So what would be the middle number if you were to order them from least to greatest? Well, it would be the one that has 15 on either side, so it would be the 16th data point.

16th data point, and so we could just think about which interval here contains the 16th data point. You could view it for the 16th from the highest or the 16th from the lowest; it is the middle one.

All right, so let's start from the highest. So this interval C contains the 13 highest data points, and then interval B goes from the 14th highest all the way to the 18th highest. So this B contains the median; it contains the 16th highest data point, or if you started from the left, it would also be the 16th lowest data point. So that's where the median is, the median.

Now, what about an estimate for the mean? Well, you have calculated the mean in the past, but when you're looking at a distribution like this, when you're looking at a histogram, one way to think about the mean is it would be the balancing point. If you imagine that this histogram was made out of some material of, let's say, uniform density, where would you put a fulcrum in order to balance it?

If you put the fulcrum right over here, it feels like you would have all—it feels like you would tip over to the left because this is a left-skewed distribution; you have this long tail to the left. If you really wanted to balance it out, it seems like you would have to move your fulcrum in the direction of that left skew, in the direction of the tail.

And so I would estimate to balance it out, it would actually be closer to that, which would be interval A. Interval A would contain the mean. The intention of this type of exercise isn't for you to try to calculate every data point. In fact, they don't give you all the information here and add them all up and then divide by 31.

It's really to estimate and to also get the intuition that when you have a left-skewed distribution like this, you will often see a situation where your mean is to the left of the median. If you have a right-skewed distribution, it would be the other way around.

As we will see, when you see a symmetric distribution, the mean and the median will be awfully close to each other, or when you have a roughly symmetric distribution. If you have a perfectly symmetric distribution, they might be exactly in the same place.

So let's do another example. So here it says we have the ages of 14 co-workers, and what I want you to do is say roughly where is the mean and roughly where is the median? Is it roughly at A, is it roughly at B, or is it roughly at C?

Pause this video and try to figure it out.

So let's first start off with the median. We have 14 data points, so this would be the average of the middle two data points. It would be the average of the seventh and eighth data points. Well, you could say one, two, three, four, five, six, seven, and then the eighth one is here.

So the seventh data point is a 30, and the eighth one is in the 31 bucket, so the average of the two would give you 30.5. Another way that you could think about it is you can just eyeball it and see you have just as many data points below B as you do have above B, and so that also gives you a good indication that B would be where the median is.

So that is where the median is. Now, what about the mean? Well, this is a perfectly symmetric distribution. If I wanted to balance it, I would put the fulcrum right in the middle. So I would say that the mean would also be at B.

And we are done.

More Articles

View All
Morgan DeBaun on Reaching 20M Millennials - With Kat Manalac at the Female Founders Conference
And now I’m really, really excited to introduce you to our next speaker, Morgan DeBon. She’s the founder of Blabbetty. So, Blabbetty has, you know, grown into the largest media company and lifestyle brand for Black Millennials. Morgan started Blabbetty in…
Robinhood REVEALS Their Sneaky Business Model... (Robinhood IPO Filing)
Well, a couple of weeks ago, the commission-free trading app Robinhood submitted their S1 filing to the SEC, which is the initial registration form for new securities based in the US. What this means is that yes, Robinhood is gearing up for their IPO, whi…
How to drive an Exotic Car for Free (Top 10 Best Cars)
What’s up you guys, it’s Graham here. So, this has been the most requested video topic in the history of the entire YouTube internet, and that is: how to car hack and drive an exotic car for free. I’ve been pretty fortunate that, with the last two sports …
Worked example: distance and displacement from position-time graphs | AP Physics 1 | Khan Academy
In other videos, we’ve already talked about the difference between distance and displacement, and we also saw what it meant to plot position versus time. What we’re going to do in this video is use all of those skills. We’re going to look at position vers…
SpaceX Makes History | MARS
T minus 20 seconds. Stage two tanks pressing for flight. Flight computer has control of the vehicle. Do we see anything on the sensors that’s a problem? Anything right now? Nothing. Well, I’ll say go for launch. T minus 10. 9. 8. 7. 6. 5. 4. 3. 2. 1…
Graphs of rational functions: zeros | High School Math | Khan Academy
So we’re told let ( F(x) = \frac{2x^2 - 18}{G(x)} ), where ( G(x) ) is a polynomial. Then they tell us which of the following is a possible graph of ( y = F(x) ). They give us four choices here, and like always, I encourage you to pause the video and see …