yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Z-score introduction | Modeling data distributions | AP Statistics | Khan Academy


3m read
·Nov 11, 2024

One of the most commonly used tools in all of statistics is the notion of a z-score. One way to think about a z-score is it's just the number of standard deviations away from the mean that a certain data point is. So let me write that down: number of standard deviations. I'll write it like this: number of standard deviations from our population mean for a particular data point.

Now let's make that a little bit concrete. Let's say that you're some type of marine biologist, and you've discovered a new species of winged turtles. There's a total of seven winged turtles; the entire population of these winged turtles is seven. So you go, and you're actually able to measure all the winged turtles. You care about their length, and you also want to care about how those lengths are distributed. Lengths of winged turtles.

All right, and let's say—and this is all in centimeters—these are very small turtles. So you discover, and these are all adults: there's a two centimeter one, there's another two centimeter one, there's a three centimeter one, there's another two centimeter one, there's a five centimeter one, a one centimeter one, and a six centimeter one. So we have seven data points. From this, I encourage you, at any point, if you want, pause this video and see if you want to calculate what is the population mean. Here we're assuming that this is the population of all the winged turtles.

Well, the mean in this situation is going to be equal to—you could add up all these numbers and divide by seven, and you would then get three. Then using these data points and the mean, you can calculate the population standard deviation. Once again, as a review, I always encourage you to pause this video and see if you can do it on your own. But I've calculated that ahead of time: the population standard deviation in this situation is approximately around to the hundredths place, 1.69.

So with this information, you should be able to calculate the z-score for each of these data points. Pause this video and see if you can do that. So let me make a new column here. Here I'm going to put the z-score. If you just look at the definition, what you're going to do for each of these data points—let's say each data point is x—you’re going to subtract from that the mean, and then you're going to divide that by the standard deviation.

The numerator idea over here is going to tell you how far you are above or below the mean, but you want to know how many standard deviations you are from the mean. So then you'll divide by the population standard deviation. For example, this first data point right over here, if I want to calculate its z-score, I will take 2 from that, I will subtract 3, and then I will divide by 1.69. I will divide by 1.69, and if you got a calculator out, this is going to be negative 1 divided by 1.69. If you use a calculator, you would get this is going to be approximately negative 0.59, and the z-score for this data point is going to be the same; that is also going to be negative 0.59.

One way to interpret this is this is a little bit more than half a standard deviation below the mean. We could do a similar calculation for data points that are above the mean. Let's say this data point right over here—what is its z-score? Pause this video and see if you can figure that out. Well, it's going to be 6 minus our mean, so minus 3, all of that over the standard deviation—all of that over 1.69. This, if you have a calculator—and I calculated it ahead of time—this is going to be approximately 1.77. So more than one but less than two standard deviations above the mean.

I encourage you to pause this video and now try to figure out the z-scores for these other data points. Now an obvious question that some of you might be asking is why. Why do we care how many standard deviations above or below the mean a data point is? In your future statistical life, z-scores are going to be a really useful way to think about how usual or how unusual a certain data point is. That's going to be really valuable once we start making inferences based on our data.

So I will leave you there. Just keep in mind it's a very useful idea, but at the heart of it, a fairly simple one. If you know the mean, you know the standard deviation. Take your data point, subtract the mean from the data point, and then divide by your standard deviation. That gives you your z-score.

More Articles

View All
Richard Carranza on how NYC is handling school closures during Covid-19 | Homeroom with Sal
Hi everyone! Welcome to the daily homeroom live stream, which is something that all of us at Khan Academy started up once we started having mass school physical closures. I should say many seems like a lifetime ago. It’s just a way to keep in touch, have …
Estimating to subtract multi-digit numbers | Grade 5 (TX TEKS) | Khan Academy
So we have two subtraction problems here that I want you to estimate. I first want you to estimate what 51,384 minus 28,251 is, and then I want you to estimate what 761,023 minus 18,965 is. This little squiggly equal sign means approximately, so you’re on…
Here's Why I AM the BEST Salesman in the World! | Kevin O'Leary
[Music] So, Shark Tank, luck. People don’t understand it’s a really grueling, tough, long day, and you got to be sharp because you’re basically buying and selling millions of dollars worth of product. So, that means you can’t go there with a hangover. Bel…
JERRY BLOOP!!! Uninformed Video Game Reviews
[Music] [Applause] Vsauce! Michael here with a special treat for you today. It’s a guy named Jerry Bloop, who’s never played a video game in his life, but yet reviews them anyway. Played by a real person named Kevin, who does play video games and has a g…
Hear Kids' Honest Opinions on Being a Boy or Girl Around the World | National Geographic
Um, my name is Hil Kack. I’m 9 years old, and I’m 9 years old. The best thing about being a boy is like a boy, being very sporty. The best thing about being a girl is because girls can do a little bit more things than boys. [Music] The best thing about …
Post-Truth: Why Facts Don't Matter Anymore
This is the challenge of a YouTuber, which is, you know, pushing the record button and actually filming something. Because you never know: “Are people going to hate it?” Or “Is it good enough?” Have you thought through what you’re going to say. I’ve not t…