You need data literacy now more than ever – here’s how to master it | Talithia Williams
Literacy of the 21st century is more than just reading and writing; it really is understanding information and understanding data. If you've ever felt like your phone is listening to you, often it is. You may talk about green rugs, and then all of a sudden you pull up Facebook or Instagram, and you scroll down—it's like, whoa, here's an advertisement for a green rug. That data is being transferred real-time and then turned into a commercial or an ad for you.
With the rise of AI, it's easy to use that for corruption. As a society, when we help move everyone towards statistical literacy and data literacy, it forces us to be very objective in our analysis, to take in all different viewpoints and different pieces of information. It empowers everyone because we can all make our own informed, rational choices instead of depending on the belief of someone else or trusting the belief of someone else because we don't understand the information that's right in front of us.
My name is Dr. Tiia Williams, and I am a math professor and a science communicator. So every day we're doing math and working with numbers just to live in society. Really, all of us are statisticians and data scientists and mathematicians. If I want to get a ride to the airport and I do Uber or Lyft, I'm looking at it, and I'm interacting, and it's telling me a distance, a time, and a price. Or exercise data—you know, the number of steps that you took per day or, you know, what was the range of your heart rate today? How much movement did you have? All of these things can not only help us make decisions but can help our doctor understand when things are going wrong in our system or when things are working right.
There are so many ways that math and statistics show up in the beauty of nature. The Fibonacci sequence is a wonderful example of how it shows up in the pattern of leaves on a plant or things like fractals. Fractals are seen in the human body with the way that blood vessels sort of grow and move around and expand. So many of these mathematical concepts that we study and understand have direct occurrences in the world around us—in ways that are just beautiful and awe-inspiring.
Part of my excitement is to make those connections and help people see that if they walk outside, there's mathematics right in front of them that they can be really excited about. What's really critical to understand when you analyze data is, first, what does it have the power to tell you, and what does it not have the power to tell you? People think a statistical model is the end-all, be-all, and it's not.
If I wanted to paint a narrative or tell a story that was not true, I could very easily do that, and many people would not know—especially if it feeds an underlying belief, right? Then people will take that story, whether it's true or not, and latch on to it. It's almost like being a detective. When you give me a set of data, it's my job to uncover what's hidden in it.
The first thing I'm looking for is, where was the data collected? Is it a reputable source? Is it accurate? The other thing that I'm often looking for is understanding that correlation does not imply causation. For example, if you mapped the ability to tie shoes relative to age, you would see that you tie them much more effectively when you go from zero to 10 years old. But it's not just because your age is increasing; it's because you're becoming nimble with your hands, and now you can reach down and tie your shoes.
I have to be careful that once I uncover those relationships, I understand the true root cause of them. Then I want to know: do I have a representative sample? Often, data that we collect is data that has come from society in some way, and so the same biases that we might possess end up in the data set. We call that confounding variables. As a statistician, if there's a bias in the data, it's my job to find it, uncover it, pull it out, and get rid of it.
There was a healthcare algorithm that was developed so that if you were to show up and present with symptoms, it would use this historical data to predict what your treatment should be. It was pretty accurate; it was very good. However, it didn't take into account patient race, which made the statisticians believe that it was a very unbiased model. Once the model was in practice, they found that it was biased because built into the data were some underlying racist practices in terms of how Black and brown patients showed up and would often get assigned to lesser treatments. The model actually perpetuated that bias.
If we don't uncover those biases, those confounding variables, the correlations that don't imply causations beforehand, we end up implementing it in a model, putting it into a real-life situation, but that affects people or outcomes or decisions, and those decisions then have consequences. When you have a sort of data-literate component of society moving in the direction of AI and machine learning, and folks in society who could care less about it—who are more maybe just consumers—we end up with people making decisions for us that we don't understand or we've had very little influence on.
Because so many of our daily decisions depend on numbers, math, and data, it's important that as a society, we are data literate and statistically literate so that we are not only aware of the decisions that are being made on our behalf, but we're a part of them.