yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Sampling distribution of sample proportion part 1 | AP Statistics | Khan Academy


6m read
·Nov 11, 2024

  • [Instructor] So I have a gumball machine right over here. It has yellow, and green, and pink, and blue gumballs. Let me throw a few blue ones in there. And what we're going to concern ourselves in this video are the yellow gumballs.

And let's say that we know that the proportion of yellow gumballs over here is P. This right over here is a population, population parameter. And for the sake of argument, just to make things concrete, let's just say that 60% of the gumballs are yellow, or 0.6 of them.

Now let's review some things that we have seen before. I'm gonna define our Bernoulli random variable, let's call this capital Y, which is equal to one if when we take one random gumball out of that machine we get a, we pick a yellow gumball, and it's equal to zero if when we pick one random gumball out of that machine we don't pick yellow, so not yellow.

From previous videos we know some interesting things about this Bernoulli random variable. We know its mean. We know the mean of our Bernoulli random variable. It's going to be the proportion of yellow balls in this population, so it's going to be equal to P, which in this particular case we know is 0.6, and we know what the standard deviation of this Bernoulli random variable is.

It is going to be P times one minus P, actually that's the variance. We want to take the square root of that to get the standard deviation. So in this particular scenario, that's going to be the square root of 0.6 times 0.4. Fair enough. This is all review so far.

But now let me define, let me define another random variable X, which is equal to the sum of 10 independent, independent trials, trials of Y. Now, we have seen random variables like this before. This is a binomial random variable.

Now, what do we know about its mean and standard deviation? Well, in previous videos we know that the mean of this binomial random variable is just going to be equal to n times the mean of each of the Bernoulli trials right over here, so n times P, which in this particular situation is going to be, n is 10, we're doing 10 trials, and P is zero, 0.6, which is equal to six.

And that makes sense. If 60% of the balls here are yellow and if you were to take a sample or if you were to take 10 trials, so if you were to take ten balls one at a time, they have to be independent, so you keep looking at them and then replacing them, but if you took 10, then you would expect that six of them would be yellow.

They're not always going to be six yellow, but that would be maybe what you would expect. All right, now what's the standard deviation here? So our standard deviation is equal to, and we have proved this in other videos, it's equal to the square root of n times P, times one minus P.

Notice you just put an n right over here under the radical sign. And so this is going to get us in this particular situation to 10 times 0.6 times 0.4 and then the square root of everything. So all of this is review. If it's unfamiliar, I encourage you to review some of the videos on Bernoulli random variables and on binomial random variables.

But what we're going to do in this video is think about a sampling distribution and it's going to be the sampling distribution for a sample statistic known as the sample proportion, which we actually talked about when we first introduced sampling distributions.

So let's say, so let's just park all of this, this is background right over here. Let's start taking samples of 10. And I didn't pick that randomly. I want to make it reconcile with what we do with our random variable here.

And so let's take a sample of 10 gumballs and let's calculate the proportion that are yellow. And so we will call that our sample proportion, and I might as well just do that in yellow. So we want to calculate the sample proportion that are yellow.

And what is this equivalent to? Well, you could say, Well this is equivalent to my random variable X. I want to count the number of gumballs that are yellow and then I'm gonna divide it by the sample size. So I'm gonna divide it by n.

And in this case, it would be X divided by 10. I know what some of you are thinking. Wait, wait, wait. Hold on for a second. X is sum of 10 independent, independent trials right over here.

To be independent, you can't just take 10 gumballs. You have to take them one at a time and then replace them back in order for them to be truly independent. But remember, we have our 10% rule.

We have our 10% rule, which tells us that if a sample is less than or equal to 10% of the population, that you can treat each of the gumballs in this situation as being independent. So let's just say for the sake of argument that there are 10,000 gumballs in here, and so we can feel pretty good that these samples, that each of the things in the sample are independent of each other by our 10% rule.

And so each of these are, each of these 10 gumballs what we see, they are going to be independent, I'm going to put them in quotes, by the 10% rule. And so in that situation, we can make this claim or we can feel good that this claim is roughly true.

And so let's say for that first sample that we do, our sample proportion is equal to 0.3, so three of our gumballs just happened, three of our 10 gumballs just happened to be yellow. Then we'd do it again.

So we take another sample. We calculate our sample proportion, this statistic again, remember, it's trying to estimate our population parameter, and let's say that time it happens to be seven out of 10, and we just keep doing that.

And if we keep doing that, and we plot it on a dot chart or a dot distribution I guess we could say, where our possible outcomes you could have zero out of 10, one out of 10, two, three, four, five, so half of them, six, seven, eight, nine, 10, so that would be all of them.

And so you could plot okay, 0.3 is one, two, three, that's one scenario where I got, where my sample proportion is 0.3. Then 0.7, that's one situation where I got a 0.7.

And let's say I were to take another sample of 10 and I were to get 0.7, then you would plot that over here. And if you kept taking samples and kept calculating these sample proportions and you kept plotting it here, you would get a better, and better, and better approximation for the sampling distribution of the sampling proportion.

But how can we actually characterize the true sampling distribution for the sample proportion? What is going to be the mean of this sampling distribution and what is going to be the standard deviation?

Well, we can derive that from what we see right over here. The mean of our sampling distribution of our sample proportion is just going to be equal to the mean of our random variable X divided by n.

It's just going to be the mean of X divided by n, which is equal to what? Well, the mean of X is n times P. This is n times P. You divided it by n, you're going to get P.

And that makes sense. One way to think about it, the expected value for your sample proportion is going to be the proportion of gumballs that you actually see. And so this also a good indicator that this is going to be a reasonably unbiased estimator.

Now let's think about the standard deviation for our sample proportion. Well, we can just view that as our standard deviation of the binomial random variable X divided by n.

So this is going to be equal to the square root of n times P, times one minus P, all of that over n, which is the same thing as if we put this, if we divide by n inside the radical, it'd be the same thing as the square root of nP times one minus P over n squared.

Divide the numerator and denominator by n, you will get the square root of P times one minus P, all of that over n. And so in this particular situation, where our parameter is 0.6, where our population parameter is 0.6, so it's going to be 0.6, that's the true proportion for our population.

And then what would out standard deviation be for our sample proportion? Well, it's going to be equal to the square root of 0.6 times 0.4, all of that over 10.

And we can get a calculator out to calculate that. So if we take 0.6 times 0.4 equals, divided by 10, equals, and then we take the square root of that, and we get it's approximately 0.15. Approximately 0.15.

More Articles

View All
Overview of the Middle Ages | World History | Khan Academy
Growing up, we all have impressions of the Middle Ages. We read about knights in shining armor, castles with moats, and towers. But when were the Middle Ages? The simple answer: the Middle Ages in Europe are the roughly 1,000 years from the fall of the Ro…
The Philosophy Of Cold Showers
Taking cold showers has become a normal part of my life. Even though I’ve taken many, cold showers still suck every single time to this day, but it takes me considerably less effort than the first time I took one. Besides the many physical benefits of col…
Power rule (with rewriting the expression) | AP Calculus AB | Khan Academy
What we’re going to do in this video is get some practice taking derivatives with the power rule. So let’s say we need to take the derivative with respect to x of 1 over x. What is that going to be equal to? Pause this video and try to figure it out. So…
LearnStorm Growth Mindset: Khan Academy's math content creator on learning strategies
I’m Jeff Dodds. I’m a math content creator here at Khan Academy. I had a fixed mindset for a really long time. I was convinced that I was a math person and I was smart and I was just good at math. So, I remember the first time I really struggled. I was t…
How to subtract mixed numbers that have unlike denominators | Fractions | Pre-Algebra | Khan Academy
Let’s try to evaluate 7 and 6 9ths - 3 and 25ths. So, like always, I like to separate out the whole number parts from the fractional parts. This is the same thing as 7 + 6⁄9 - 3 - 25⁄100. The reason why I’m saying -3 and -25⁄100 is this is the same thing…
Nature's Grand Show: Exploring a Season of Wonder in Canada | National Geographic
In a world that often feels consumed by the rush of daily life, there’s something profound about standing before nature’s grand show, experiencing landscapes that leave us with this humbling sense of scale. It nurtures our souls and heightens our senses. …