yego.me
💡 Stop wasting time. Read Youtube instead of watch. Download Chrome Extension

Identifying individuals, variables and categorical variables in a data set | Khan Academy


2m read
·Nov 11, 2024

We're told that millions of Americans rely on caffeine to get them up in the morning, which is true. Although, if I drink caffeine in the morning, I'm very sensitive; I wouldn't be able to sleep at night.

Here's nutritional data on some popular drinks at Ben's Beans Coffee Shop. All right, so here we have the different names of the drinks, and then here we have the type of the drink, and it looks like they're either hot or cold. Here we have the calories for each of those drinks, here we have the sugar content in grams for each of those drinks, and here we have the caffeine in milligrams for each of those drinks.

Then we are asked, "The individuals in this data set are," and we have three choices: Ben's Beans customers, Ben's Beans drinks, or the caffeine contents. Now, we have to be careful; when someone says the individuals in a data set, they don't necessarily mean that they have to be people; they could be things. The individuals in this data set—each of these rows—are referring to a certain type of drink at Ben's Beans Coffee Shops.

So, the different types of drinks that Ben's Beans offers, those are the individuals in this data set. So, they're Ben's Beans drinks. Next, they ask us the data set contains, and they say how many variables and how many of those variables are categorical.

So, if we look up here, let's look at the variables. So, this first column—that's essentially giving us the type of drink—this wouldn't be a variable; this would be more of an identifier. But all of these other columns are representing variables.

So, for example, type is a variable; it can either be hot or cold. Because it can only take on one of kind of a number of buckets, it's either going to be hot or cold; it's going to fit in one category or another. And you don't just have two categories; you could have more than two categories, but it isn't just some type of variable number that can take on a bunch of different values.

So, this right over here is a categorical variable. Calories is not a categorical variable; you could have something with 4.1 calories; you could have something with 178. Things aren't fitting into nice buckets.

Same thing for sugars and for the caffeine; those are quantitative variables that don't just fit into a category. And so, here I would say that we have four variables: one, two, three, four, one of which is categorical. So, that would be choice A over here.

More Articles

View All
Area of trapezoid on the coordinate plane | High School Math | Khan Academy
So we have a trapezoid here on the coordinate plane, and what we want to do is find the area of this trapezoid just given this diagram. Like always, pause this video and see if you can figure it out. Well, we know how to figure out the area of a trapezoi…
Representing systems of equations with matrices | Matrices | Precalculus | Khan Academy
I’m a big fan of looking at the same problem in different ways or different ways to conceptualize them. For example, if I had a system of three equations with three unknowns, let me just make one up: Three x minus two y minus z is equal to negative one. …
Chaos: The Science of the Butterfly Effect
Part of this video is sponsored by LastPass. More about LastPass at the end of the show. The butterfly effect is the idea that the tiny causes, like a flap of a butterfly’s wings in Brazil, can have huge effects, like setting off a tornado in Texas. Now …
Daily Homeroom: Congratulations Class of 2020!
Hi everyone! Welcome to Khan Academy’s daily homeroom live stream. For those of you all who do not know what this is, this is something that we thought of when we started seeing mass school closures. We know that people are going to be at home, socially d…
The Stock Market Just Peaked
What’s up, Graham? It’s guys here. So, between record high inflation, imminent rate hikes, and outsized earnings, there’s no denying that there’s a lot of uncertainty and opposing viewpoints in the market right now. On the one side, we have some of the m…
Curvature formula, part 4
So, we’ve been talking about curvature, and this means, uh, you’ve got some sort of parametric curve that you might think of as parameterized by a vector-valued function s of t. Curvature is supposed to measure just how much this curve actually curves. So…