yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Conditions for valid t intervals | Confidence intervals | AP Statistics | Khan Academy


4m read
·Nov 11, 2024

Flavio wanted to estimate the mean age of the faculty members at her large university. She took an SRS, or simple random sample, of 20 of the approximately 700 faculty members, and each faculty member in the sample provided Flavio with their age. The data were skewed to the right, with a sample mean of 38.75. She's considering using her data to make a confidence interval to estimate the mean age of faculty members at her university.

Which conditions for constructing a t-interval have been met? So pause this video and see if you can answer this on your own.

Okay, now let's try to answer this together. So there's 700 faculty members over here. She's trying to estimate the population mean, the mean age. She can't talk to all 700, so she takes a sample, a simple random sample of 20. So the n is equal to 20 here. From this 20, she calculates a sample mean of 38.75.

Now, ideally, she wants to construct a t-interval, a confidence interval using the t-statistic. That interval would look something like this: it would be the sample mean plus or minus the critical value times the sample standard deviation divided by the square root of n. We use a t-statistic like this and a t-table and a t-distribution when we are trying to create confidence intervals for means where we don't have access to the standard deviation of the sampling distribution, but we can compute the sample standard deviation.

Now, in order for this to hold true, there are three conditions, just like what we saw when we thought about z-intervals. The first is that our sample is random. Well, they tell us here that she took a simple random sample of 20, and so we know that we are meeting that constraint. And that's actually choice A: the data is a random sample from the population of interest.

So we can circle that in.

The next condition is the normal condition. Now, the normal condition when we're using, when we're doing a t-interval, is a little bit more involved because we do need to assume that the sampling distribution of the sample means is roughly normal. Now, there are a couple of ways that we can get there. Either our sample size is greater than or equal to 30. The central limit theorem tells us that, then our sampling distribution, regardless of what the distribution is in the population, that the sampling distribution actually would then be approximately normal.

She didn't meet that constraint right over here. Here, her sample size is only 20. So so far, this isn't looking good.

Now, that's not the only way to meet the normal condition. Another way to meet the normal condition, if we have a smaller sample size, smaller than 30, is one: if the original distribution of ages is normal, so original distribution normal, or even if it's roughly symmetric around the mean, so approximately symmetric. But if you look at it, they tell us that it has a right skew. They say the data were skewed to the right with a sample mean of 38.75. So that tells us that the data set that we're getting in our sample is not symmetric, and the original distribution is unlikely to be normal.

Think about it. It's not going to be. You're likely to have people who are, you could have faculty members who are 30 years older than this 68 and three-quarters, but you're very unlikely to have faculty members who are 30 years younger than this, and that's actually what's causing that skew to the right. So this one does not meet the normal condition. We can't feel good that our sampling distribution of the sample means is going to be normal, so I'm not going to fill that one in.

Choice C: individual observations can be considered independent. So there are two ways to meet this constraint. One is if we sample with replacement. Every faculty member we look at after asking them their age, we say, "Hey, go back into the pool," and we might pick them again until we get our sample of 20. It does not look like she did that. It doesn't look like she sampled with replacement.

Even if you're sampling without replacement, the 10% rule says that, look, as long as this is less than 10, or less than or equal to 10, of the population, then we're good. And the 10% of this population is 70. 70 is 10% of 700, and so this is definitely less than or equal to 10, and so it can be considered independent.

Thus, we can actually meet that constraint as well.

So the main issue where our t-interval might not be so good is that our sampling distribution—we can't feel so confident that that is going to be normal.

More Articles

View All
Cost vs Quality in Edtech – Keith Schacht, Avichal Garg, and Geoff Ralston
A vitro you found it prep me in 2001, sold it ten years later in 2011. That was actually the year we found it. Imagine K12, the world’s first educational technology accelerator. And, Keith, you founded Mystery Science, I think in 2013. We just celebrated …
Ray Dalio on The Big Debt Cycle
Just frame for us your thoughts on debt for a second. How do you think about debt as an absolute construct or a relative construct, especially sovereign debt? You know, there is a US debt, but then there are also every other 182 countries who have a ton o…
Comparing animal and plant cells | Cells and organisms | Middle school biology | Khan Academy
So, let’s play a game of spot the difference. Now, if you were asked to spot the difference between these two pictures, you’d probably laugh and say that’s too easy because it’s obvious that this picture of a lion on the left is nowhere close to looking …
Rise Again: Tulsa and the Red Summer | Full Documentary
[Music] So [Music] [Music] [Music] I feel a very strong spiritual connection to what’s happening in Tulsa. You know, I had to be there when they dug into the ground for the first time to search for Black people who were killed in the 1921 Tulsa race massa…
Welcome to Intro to Computer Science! | Intro to CS - Python | Khan Academy
Welcome to KH Academy’s intro to computer science course in Python! Let’s learn more about what this course has to offer. In this course, you’ll learn the fundamentals of programming, from variables to conditionals, loops, functions, and data structures.…
Kevin O'Leary V2
Actually, I was born Terrence Thomas Kevin O’Leary. My dad was Irish and he loved long names, but when they got me home, everybody realized it was going to be total confusion because dad was named Terry too. So the next thing I knew, I was Kevin. Two year…