yego.me

Classifying shapes of distributions | AP Statistics | Khan Academy


·Nov 11, 2024

What we have here are six different distributions, and what we're going to do in this video is think about how to classify them or use the words that people typically use to classify distributions.

So let's first look at this distribution right over here. That's the distribution of the lengths of houseflies. Someone went out there and measured a bunch of houseflies and then said, “Hey, look, there are many houseflies that are between 0.60 and 0.65 centimeters long.” It looks like there are about 40 houseflies there.

And then between 0.65 and 0.70 centimeters, there are about 30 houseflies. Between 0.55 and 0.60 centimeters, it looks like it's about the same amount. This type of distribution is usually described as being symmetric. Why is it called that? Because if you were to draw a line down the middle of this distribution, both sides look like mirror images of each other. This one looks almost exactly symmetric, but more typically, when you're collecting data, you'll see roughly symmetric distributions.
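The mirror-image idea can be turned into a rough numeric check. The sketch below is only an illustration, not a method from the video: the function name `mirror_mismatch` is made up here, and apart from the three middle bins (about 30, 40, and 30 houseflies) the counts are hypothetical.

```python
def mirror_mismatch(counts):
    """Fraction of observations that would have to move to make the
    histogram a perfect mirror image of itself about its center."""
    total = sum(counts)
    mirrored = counts[::-1]
    # Every mismatched pair of bins is visited twice, so halve the sum.
    moved = sum(abs(a - b) for a, b in zip(counts, mirrored)) / 2
    return moved / total


# Middle three bins follow the video (about 30, 40, 30); the rest are hypothetical.
housefly_counts = [5, 15, 30, 40, 30, 15, 5]
# Entirely hypothetical counts with most of the data piled up on the right.
lopsided_counts = [2, 3, 5, 10, 20, 56]

print(mirror_mismatch(housefly_counts))  # 0.0 for a perfect mirror image
print(mirror_mismatch(lopsided_counts))  # much closer to 1 than to 0
```

A value near zero corresponds to "roughly symmetric"; how small is small enough is a judgment call rather than a fixed rule.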

Now here we have a distribution that gives us the dates on pennies. Someone went out there, observed a bunch of pennies, and looked at the dates on them. They saw many pennies (it looks like a little more than 55) with a date between 2010 and 2020, while very few pennies had a date earlier than 1980. When a distribution has a long tail to the left, as you can see right over here, it is known as a left skewed distribution.

Now in future videos, we'll come up with more technical definitions of what makes it left skewed, but the way that you can recognize it is that the high points of your distribution are on the right, and then you have this long tail that skews it to the left. Now, the opposite of left skewed, you might say, would be a right skewed distribution, and that's exactly what we see right over here.

This is a distribution of the number of representatives each state has, and as you can see, most of the states in the United States have between 0 and 10 representatives. It looks like a little over 35 states fall into this bucket. None of them actually have zero, since every state has at least one representative, but they would still fall into this bucket, while very few have more than 50 representatives.

So here, where the bulk of our distribution is to the left, we have this tail that skews us to the right. This is known as a right skewed distribution. Now, if we look at this next distribution, what would this be? Pause this video and think about it.
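One common way to put a number on left versus right skew is the third standardized moment, often called the skewness coefficient. The sketch below assumes that definition, which may not be the one later videos use. All counts are hypothetical, the function names are made up for illustration, and the 0.5 cutoff is an arbitrary choice.

```python
def histogram_skewness(midpoints, counts):
    """Skewness (third standardized moment) of binned data, treating every
    observation as if it sat at the midpoint of its bin."""
    n = sum(counts)
    mean = sum(m * c for m, c in zip(midpoints, counts)) / n
    variance = sum(c * (m - mean) ** 2 for m, c in zip(midpoints, counts)) / n
    third_moment = sum(c * (m - mean) ** 3 for m, c in zip(midpoints, counts)) / n
    return third_moment / variance ** 1.5


def describe_skew(skewness, threshold=0.5):
    """Turn a skewness value into a label; the threshold is an assumption."""
    if skewness < -threshold:
        return "left skewed"
    if skewness > threshold:
        return "right skewed"
    return "roughly symmetric"


# Hypothetical penny dates: decade midpoints, with most pennies being recent.
penny_mid = [1965, 1975, 1985, 1995, 2005, 2015]
penny_counts = [2, 3, 5, 10, 20, 56]

# Hypothetical representatives per state: most states in the 0 to 10 bucket.
reps_mid = [5, 15, 25, 35, 45, 55]
reps_counts = [36, 8, 3, 1, 1, 1]

print(describe_skew(histogram_skewness(penny_mid, penny_counts)))  # left skewed
print(describe_skew(histogram_skewness(reps_mid, reps_counts)))    # right skewed
```

The sign carries the meaning: a negative value indicates a tail to the left, a positive value a tail to the right.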

Well, this could be a distribution where someone went around the office and surveyed how many cups of coffee each person drank per day. If they found someone who drank one cup of coffee per day, maybe this would be them. And they found another person who drinks one cup of coffee, so that's them. Then they found three people who drank two cups of coffee.

Well, this is a very similar situation to what we saw with the dates on pennies. A large amount of our data fell into this rightmost bucket of three cups of coffee, but then we had this tail to the left. So this would be left skewed.

Now these two distributions on the right are interesting. One could argue that this distribution here, which is telling us the number of days that we had different high temperatures, looks roughly symmetric or even exactly symmetric, because if you did that little exercise of drawing a dotted line down the middle, it looks like the two sides are mirror images of each other.

Now that would not be technically incorrect, but when you see these two peaks, this would typically be called a bimodal distribution. So even though bimodal distributions can sometimes be symmetric or roughly symmetric, you want to be more precise. Here, when you have these two peaks, that's where the "bi" comes from; you would call it bimodal. This makes sense because you have a lot of warm days, which might happen during the summer, and a lot of cold days, which might happen during the winter.
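Counting peaks can also be sketched in code. This is a deliberately naive illustration with hypothetical temperature counts: a bin counts as a peak only if it is strictly taller than both neighbors, so noisy real data would usually need smoothing first, and flat-topped peaks are not detected.

```python
def count_peaks(counts):
    """Count bins that are strictly taller than both neighboring bins.
    Bins beyond either end of the histogram are treated as empty."""
    peaks = 0
    for i, height in enumerate(counts):
        left = counts[i - 1] if i > 0 else 0
        right = counts[i + 1] if i < len(counts) - 1 else 0
        if height > left and height > right:
            peaks += 1
    return peaks


# Hypothetical number of days per high-temperature bucket:
# one cluster of cold days and one cluster of warm days.
temperature_counts = [5, 20, 45, 20, 8, 20, 45, 20, 5]
# Hypothetical single-peaked counts for comparison.
single_peak_counts = [5, 15, 30, 40, 30, 15, 5]

print(count_peaks(temperature_counts))  # 2, so "bimodal"
print(count_peaks(single_peak_counts))  # 1
```

Note that the temperature counts here are also a perfect mirror image of themselves, which matches the point that a bimodal distribution can be symmetric while "bimodal" remains the more precise label.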

Now this last distribution here, the results from die rolls, one could argue as well that this is roughly symmetric. It's not perfectly symmetric, but when you compare the left and right sides of this dotted line, they look roughly the same. But a more exact classification here would be that it looks approximately uniform. So rather than calling it a symmetric distribution or a roughly symmetric distribution, most people would classify this as an approximately uniform distribution.
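"Approximately uniform" means every outcome shows up about equally often. As a rough sketch with hypothetical die-roll counts, one could compare each count to the count expected if all faces were equally frequent. The function name is made up here, and the 20% tolerance is an arbitrary assumption, not a standard cutoff.

```python
def max_relative_deviation(counts):
    """Largest gap between any bin's count and the perfectly even count,
    expressed as a fraction of the even count."""
    expected = sum(counts) / len(counts)
    return max(abs(c - expected) for c in counts) / expected


# Hypothetical results of 100 die rolls, for faces 1 through 6.
die_counts = [17, 15, 18, 16, 17, 17]
# Hypothetical lopsided counts for comparison.
lopsided_counts = [2, 3, 5, 10, 20, 56]

TOLERANCE = 0.2  # assumed cutoff for "approximately uniform"

print(max_relative_deviation(die_counts) <= TOLERANCE)       # True
print(max_relative_deviation(lopsided_counts) <= TOLERANCE)  # False
```

A formal test such as a chi-square goodness-of-fit test would be the more rigorous option; this check only mirrors the visual judgment described above.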
