yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Median, mean and skew from density curves | AP Statistics | Khan Academy


4m read
·Nov 11, 2024

In other videos, we introduce ourselves to the idea of a density curve, which is a summary of a distribution—a distribution of data. In the future, we'll also look at things like probability density.

But what I want to talk about in this video is to think about what we can glean from the properties, how we can describe density curves and the distributions they represent. We have four of them right over here. The first thing I want to think about is if we can approximate what value would be the middle value or the median for the data set described by these density curves.

So just to remind ourselves: If we have a set of numbers and we order them from least to greatest, the median would be the middle value or the midway between the middle two values. In a case like this, we want to find the value for which half of the values are above that value and half of the values are below.

So, when you're looking at a density curve, you want to look at the area, and you'd want to say, "Okay, at what value do we have equal area above and below that value?" For this one, just eyeballing it, this value right over here would be the median. In general, if you have a symmetric distribution like this, the median will be right along that line of symmetry.

Here, we have a slightly more unusual distribution; this would be called a bimodal distribution. We have two major lumps right over here, but it is symmetric, and that point of symmetry is right over here. So this value, once again, would be the median. Another way to think about it is that the area to the left of that value is equal to the area to the right of that value, making it the median.

But what if we're dealing with nonsymmetric distributions? Well, we'd want to do the same principle. We want to think at what value the area on the right and the area on the left are equal. Once again, this isn't going to be super exact, but I'm going to try to approximate it. You might be tempted to go right at the top of this lump right over here, but if I were to do that, it's pretty clear—even eyeballing it—that the right area right over here is larger than the left area. So that would not be the median.

If I move the median a little bit over to the right, maybe right around here, this looks a lot closer. Once again, I’m approximating it, but it's reasonable to say that the area here looks pretty close to the area right over there. If that is the case, then this is going to be the median. Similarly, on this one right over here, maybe right over here. Once again, I'm just approximating it, but that seems reasonable that this area is equal to that one.

Even though this is longer, it's much lower; this part of the curve is much higher even though it goes on less to the right. So that's the median for well-behaved continuous distributions like this. It's going to be the value for which the area to the left and the area to the right are equal.

But what about the mean? Well, the mean is where you take each of the possible values, weight it by their frequencies, and add all of that up. So for symmetric distributions, your mean and your median are actually going to be the same. This is going to be your mean as well.

If you want to think about it in terms of physics, the mean would be your balancing point—the point at which you would want to put a little fulcrum, and you would want to balance the distribution. So you could put a little fulcrum here, and you could imagine that this thing would balance. That's all comes out of this idea of the weighted average of all of these possible values.

What about for these less symmetric distributions? Well, let's think about it over here. Where would I have to put the fulcrum, or what does our intuition say if we wanted to balance this? We have equal areas on either side, but when you have this long tail to the right, it's going to pull the mean to the right of the median in this case.

So our balance point is probably going to be something closer to that. Once again, this is me approximating it, but this would roughly be our mean. It would sit, in this case, to the right of our median. Let me make it clear: this median is referring to that, the mean is referring to this.

In this case, because I have this long tail to the left, it's likely that I would have to balance it out right over here. So the mean would be this value right over there. There's actually a term for these nonsymmetric distributions where the mean is varying from the median. Distributions like this are referred to as being skewed.

This distribution, where you have the mean to the right of the median—where you have this long tail to the right—this is called right skewed. Now, the technical idea of skewness can get quite complicated, but generally speaking, you can spot it out when you have a long tail in one direction. That's the direction in which it will be skewed or if the mean is to that direction of the median.

So the mean is to the right of the median. Generally speaking, that's going to be a right skew distribution. The opposite of that is here: the mean is to the left of the median, and we have this long tail on the left of our distribution. So, generally speaking, we will describe these as left skewed distributions.

More Articles

View All
Risking My Life To Settle A Physics Debate
This propeller craft was built to settle a physics debate because what its creators claim it can do is so counterintuitive that it seems to violate the law of conservation of energy. So I’ve come here to drive it myself and see if it really works. And is …
See the Sparks That Set Off Violence in Charlottesville | National Geographic
The point of the rally is to, number one, protect this statue because this statue is one of many statues that are in honor of the history of Western civilization and European peoples that are being torn down. [Applause] The policies that liberals have put…
Worked example: Approximation with local linearity | AP Calculus AB | Khan Academy
We’re told the function ( f ) is twice differentiable with ( f(2) = 1 ), ( f’(2) = 4 ), and ( f”(2) = 3 ). What is the value of the approximation of ( f(1.9) ) using the line tangent to the graph of ( f ) at ( x = 2 )? So pause this video and see if you c…
Creativity break: What can we do to expand our creative skills? | Algebra 1 | Khan Academy
There are so many ways that you can expand your creative and math skills without even really realizing that you’re doing it. Like for me, I’m a big board game fan. Um, I realize that there are so many games that involve math and learning how to solve pro…
Buddhism | World History | Khan Academy
We’re now going to talk about one of the most significant figures in all of human history, and that is Sedara Gotha, who would later be known as Buddha, as the Awakened One or the Enlightened One. Now, before we get into his life, let’s think about the co…
Syria, Israel, Ukraine - A New Decade of War
10 years ago, in 2014, we asked if war was over. Based on long-term trends in the last century, it seemed violent conflict was on the decline and the world more peaceful than ever. Our video began with the Russian invasion of Ukraine and the war between H…