
Dataset individuals and categorical variables


3m read
·Nov 11, 2024

So we have this question that says millions of Americans rely on caffeine to get them up in the morning, and that is probably true. Although for me, if I drink even a little bit of caffeine in the morning, I won't be able to sleep that night. Here's nutritional data on some popular drinks at Ben's Beans Coffee Shop.

We have the names of different drinks, and for each of those drinks, we know not only their name, we know whether they are hot or cold, we know how many calories they contain, sugar content, caffeine. All right, now with that, let's try to answer these questions. Like always, try to pause the video and see if you can answer these questions before I do.

And on this one on the right, there's a little bit more below what you see, and actually, I'll just leave it there. So the individuals in this data set are... So when you see the term individuals, you might immediately say, "Oh, they must be talking about people," because in everyday language, when we talk about individuals, we tend to be talking about people. But in statistical language, it's really referring to the different entries of the data set.

So each of these rows of the data set, they're not referring to types of people or customers and things about those customers. So it's not about Ben's Beans customers. I have trouble saying that. Each of these rows represents one of the drinks that are offered at Ben's Beans Coffee Shop. So those are the individuals in statistical language.

The individuals could refer to things: what are the things represented in that data set? Then we're asked what the data set contains: how many variables it has, and whether or not they're categorical.

So first, what's a variable? Well, the variables are the things that can differ between these different entries, between these different individuals that this table, this data set, tells us about. Well, the name is a variable; each of these drinks has a different name, so that's one variable. The type is a variable; they can be hot or cold. The calories are a variable, the sugar content is a variable, and the caffeine is a variable.

So we have five variables over here: the name, type, calories, sugar, caffeine. So we have five variables, so we can rule out the six-variable choices right over there. We're going to rule that one and that one out.
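To make the counting concrete, here is a minimal Python sketch of the same idea: each row is an individual and each column is a variable. The drink names and the five column names come from the discussion above, but every number and every hot/cold label in the rows is a made-up placeholder, since the actual table values aren't reproduced in this transcript.

```python
# Hypothetical stand-in for the Ben's Beans table.
# Drink names and column names follow the transcript; all numbers and
# hot/cold labels are invented placeholders, NOT the values from the video.
drinks = [
    {"name": "Brewed coffee", "type": "hot",  "calories": 5,   "sugar": 0,  "caffeine": 260},
    {"name": "Caffe latte",   "type": "hot",  "calories": 190, "sugar": 17, "caffeine": 150},
    {"name": "Caffe mocha",   "type": "hot",  "calories": 260, "sugar": 31, "caffeine": 175},
    {"name": "Cappuccino",    "type": "cold", "calories": 120, "sugar": 10, "caffeine": 150},
]

# Individuals are the rows: one per drink, not one per customer.
individuals = [row["name"] for row in drinks]
num_individuals = len(drinks)

# Variables are the columns: the attributes recorded for every individual.
variables = list(drinks[0].keys())
num_variables = len(variables)

print("individuals:", individuals)
print("variables:", variables)
print("number of variables:", num_variables)
```

The count of variables depends only on the columns, so it stays at five no matter how many drinks the table lists.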

And so how many of these are categorical? A categorical variable (let me write this down) means that the dimension of the thing you're measuring is going to be placed into some categories, or that the variable takes on values that are the names of categories.

So for example, drink name—this is a categorical variable because the values that drink name can take on, well, those are the actual names of the drinks. These are the categories. It could be brewed coffee, café latte, café mocha, cappuccino, so on and so forth.

So this one right over here is categorical. Now, you might say, "Well, what's another type of variable?" Well, another type of variable is quantitative. A quantitative variable is going to take on numerical values as opposed to categorical values, numerical values that are measuring some type of attribute.

So, let's go through these. So what about type? Well, type isn't a numerical value; it's taking on a category. It's either going to be hot or cold, so that is also going to be categorical. But these other three, these take on numerical values that represent, in the case of calories, the number of calories a serving has, the amount of sugar in the sugar variable, or the amount of caffeine in the caffeine variable.

So these three right over here, these are quantitative variables. So you have five total variables, two of which are categorical and three of which are quantitative. And so that is this choice right here: five variables, two of which are categorical. The two categorical ones are the name of the drink and whether it is hot or cold.
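The sorting into categorical and quantitative can be sketched in Python as well. This is only a rough illustration: the helper `classify` is a hypothetical name, the row values are invented placeholders rather than the table from the video, and "every value is a number" is just a heuristic. A numeric code such as a ZIP code is still categorical, so real data needs a judgment about what the numbers measure.

```python
# Hypothetical rows: column names follow the transcript, values are placeholders.
drinks = [
    {"name": "Brewed coffee", "type": "hot",  "calories": 5,   "sugar": 0,  "caffeine": 260},
    {"name": "Caffe latte",   "type": "hot",  "calories": 190, "sugar": 17, "caffeine": 150},
    {"name": "Caffe mocha",   "type": "cold", "calories": 260, "sugar": 31, "caffeine": 175},
]


def classify(values):
    """Heuristic: a column whose values are all numbers is quantitative;
    anything else (names, labels) is treated as categorical."""
    is_number = lambda v: isinstance(v, (int, float)) and not isinstance(v, bool)
    return "quantitative" if all(is_number(v) for v in values) else "categorical"


# Classify each variable (column) by looking at the values it takes on.
kinds = {var: classify([row[var] for row in drinks]) for var in drinks[0]}

categorical = [var for var, kind in kinds.items() if kind == "categorical"]
quantitative = [var for var, kind in kinds.items() if kind == "quantitative"]

print("categorical:", categorical)
print("quantitative:", quantitative)
```

With these placeholder rows, the name and type columns land in the categorical list and the other three in the quantitative list, matching the answer reached above.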
