yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Marginal and conditional distributions | Analyzing categorical data | AP Statistics | Khan Academy


3m read
·Nov 11, 2024

Let's say that we are trying to understand a relationship in a classroom of 200 students between the amount of time studied and the percent correct. So, what we could do is we could set up some buckets of time studied and some buckets of percent correct. Then, we could survey the students and/or look at the data from the scores on the test. Afterward, we can place students in these buckets.

What you see right over here is a two-way table, and you can also view this as a joint distribution along these two dimensions. One way to read this is that 20 out of the 200 total students got between 60 and 79 percent on the test and studied between 21 and 40 minutes.

So, there's all sorts of interesting things that we could try to glean from this. But what we're going to focus on in this video is two more types of distributions other than the joint distribution that we see in this data. One type is a marginal distribution. A marginal distribution is just focusing on one of these dimensions. One way to think about it is you can determine it by looking at the margin.

For example, if you wanted to figure out the marginal distribution of the percent correct, what you could do is look at the total of these rows. The counts right over here give you the marginal distribution of the percent correct.

40 out of the 200 got between 80 and 100, and 60 out of the 200 got between 60 and 79, so on and so forth. Now, a marginal distribution could be represented as counts or as percentages. If represented as percentages, you would divide each of these counts by the total, which is 200.

So, 40 over 200, that would be 20 percent. 60 out of 200, that would be 30 percent. 70 out of 200, that would be 35 percent. 20 out of 200 is 10, and 10 out of 200 is 5. So, this right over here, in terms of percentages, gives you the marginal distribution of the percent correct based on these buckets.

You could say 10 percent got between a 20 and a 39. Now, you could also think about marginal distributions the other way. You could think about the marginal distribution for the time studied in the class. Then, you would look at these counts right over here. You'd say a total of 14 students studied between 0 and 20 minutes.

You're not thinking about the percent correct anymore. A total of 30 studied between 21 and 40 minutes. Likewise, you could write these as percentages. This would be 7, this would be 15, this would be 43 percent, and this would be 35 right over there.

Now, another idea that you might sometimes see when people are trying to interpret a joint distribution like this or get more information or more realizations from it is to think about something known as a conditional distribution. A conditional distribution is the distribution of one variable given something true about the other variable.

For example, an example of a conditional distribution would be the distribution of percent correct given that students study between, let's say, 41 and 60 minutes. To think about that, you would first look at your condition. Okay, let's look at the students who've studied between 41 and 60 minutes. That would be this column right over here.

Then, that column, the information in it can give you your conditional distribution. An important thing to realize is a marginal distribution can be represented as counts for the various buckets or percentages, while the standard practice for a conditional distribution is to think in terms of percentages.

So, the conditional distribution of the percent correct given that students study between 41 and 60 minutes would look something like this. Let me get a little bit more space. If we set up the various categories: 80 to 100, 60 to 79, 40 to 59, continued over here, 20 to 39, and 0 to 19. What we'd want to do is calculate the percentage that fall into each of these buckets given that we're studying between 41 and 60 minutes.

So, this first one, 80 to 100, it would be 16 out of the 86 students. We would write 16 out of 86, which is equal to 16 divided by 86. I'll just round to one decimal place; it's roughly 18.6 percent.

18.6 approximately equal to 18.6. Then, to get the full conditional distribution, we would keep doing that. We would figure out the percentage for 60 to 79, which would be 30 out of 86, whatever percentage that is, and so on and so forth in order to get that entire distribution.

More Articles

View All
JUST BOUGHT MY 5TH PROPERTY!!
What’s up you guys? So I actually made this video two weeks ago, and then as soon as I was about to post it, I thought, “What if it doesn’t go through? What if something happens? What if I don’t close on it? I’ll look like a total idiot if I upload this.”…
Strategies for subtracting more complex decimals with tenths
Some more examples subtracting decimals. So let’s say we want to figure out what 2 - 1.2 is. Pause this video and see if you can calculate this. So, there’s multiple ways to tackle this. One way is you could say, look, this is the same thing as 2 - 1, 2 …
Fractions greater than 1 on the number line
We’re asked to move the dot to 7⁄6 on the number line, so pause this video. I can move this dot right over here, but I encourage you: pause the video and put your finger on where 7⁄6 would be on the number line. All right, now let’s work on this together…
How Did the 'Unsinkable' Titanic End Up at the Bottom of the Ocean? | National Geographic
It took three years to build and less than three hours to sink. The most iconic shipwreck in history, the Titanic, held as the most beautiful and luxurious boat of her time. The Titanic set sail once and for all from Southampton, England, to New York City…
Fishing in Thorne Bay | Life Below Zero
COLE: You ready to reel a fish in, Willow? WILLOW: Yeah. COLE: It’s been a while, huh? WILLOW: Yeah. COLE: We’ll see. Well, today, Timber and Willow, Willow mostly, they both been asking to go fishing. So, see if we can just pull one winter king in. K…
How I make money mining bitcoins
Eric Elliott: “I’m an internet developer. I am a Bitcoin miner. Coin is a decentralized cryptocurrency, basically a virtual form of money. Bitcoin is controlled by a software algorithm in order to control the amount of Bitcoins that are released into the …