Snippet on Random Sampling vs. Bias

I wish I knew. To me, that’s one of the great epistemological problems we all face: you go around the world and you just don’t get a random experience of it. You get your little slice. You get the people you know, who are probably very similar to you in lots of ways. I think about this a lot, because these are the kinds of questions we all care about but don’t get data on. How do people act when they’re in conflict, when they’re fighting with each other? What makes a good relationship? How do people approach death? You don’t get a random sample of this. I don’t know how the world approaches this. I know how my very particular demographic world approaches this.

It’s really hard to break out of. I think the first thing you can do, and I think math helps with this, is just to see that that’s the problem: you’re seeing this tiny, tiny little slice that’s very biased to be like you. The world around you is going to resemble you a little bit.

The exercise I like is, I’ll ask a class: think about Wikipedia. What fraction of pages do you think have pictures? If you pick a random Wikipedia page, what percentage have pictures? And it’s like, I never go to a Wikipedia page without a picture. John Travolta? Yeah, they’ve got pictures of John Travolta. There’s pictures everywhere. So I’ve never seen anybody guess below 90%. Maybe one kid who didn’t use Wikipedia very much guessed like 80% or something. But everybody thinks it’s going to be 95 plus. A lot of people think it’s going to be effectively 100. And then you say, okay, everybody in the room, go find a couple of Wikipedia pages and report back. So they’ll go find the Wikipedia page for chicken. It’s like, oh yeah, there’s a picture of a chicken right there. And they come back and, yeah, it’s 100% in the sample. You say, okay, great.

Now, the reason it’s Wikipedia, not Instagram or TikTok or some equivalent thing, is that social media doesn’t give you a random button. But Wikipedia is this beautiful human institution here in the 21st century. It’s built to be accessible and transparent, and so they’ve got a random button. So click random article, do it 10 times, and come back and tell me how many have pictures. With a room of 20 kids, you get 200 examples, and it’s about half. Because what you realize is, when you look at Wikipedia, you go to the page for the Oppenheimer movie, and then you go to the page for the Barbie movie. And yeah, those pages have pictures. But if you click a random article, you get the 2010 National Swimming Championships in Belarus, or a train station in Sri Lanka, or a midterm election in Ireland in 1997. You get stuff that you don’t think of. Those aren’t the pages you’re thinking to go to. And you realize, oh, I’m just going to the big famous pages. I’m getting these very biased glimpses of reality. And if you can really randomly sample it, you see that so much is missing from your daily experience. We all see the big stuff. We all see the famous stuff. There’s stuff that’s visible and shows up over and over again. And then there’s the invisible kind of dark matter of every population.
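The classroom exercise above can be mimicked with a small simulation. This is only a sketch on made-up data, not a measurement of Wikipedia: the page count, the traffic curve (weight proportional to 1/rank^1.5), and the picture rates (98% for the most-visited 5% of pages, 48% for the rest) are all assumptions chosen so that roughly half of all pages have a picture. It compares 200 draws from a uniform "random article" button with 200 draws weighted by popularity, which stands in for ordinary browsing.

```python
import random

def build_pages(n_pages, rng):
    """Build made-up pages as (traffic_weight, has_picture) tuples."""
    pages = []
    for rank in range(1, n_pages + 1):
        # Assumption: traffic falls off steeply with popularity rank.
        weight = 1.0 / rank ** 1.5
        # Assumption: famous pages nearly always have a picture,
        # while the long tail has one a bit under half the time.
        p_picture = 0.98 if rank <= n_pages // 20 else 0.48
        pages.append((weight, rng.random() < p_picture))
    return pages

def picture_fraction(sample):
    """Fraction of sampled pages that have a picture."""
    return sum(has_picture for _, has_picture in sample) / len(sample)

rng = random.Random(0)  # fixed seed so the run is repeatable
pages = build_pages(10_000, rng)

# "Random article" button: every page is equally likely.
uniform_sample = rng.choices(pages, k=200)

# Ordinary browsing: pages are visited in proportion to their traffic.
traffic_weights = [weight for weight, _ in pages]
browsing_sample = rng.choices(pages, weights=traffic_weights, k=200)

uniform_frac = picture_fraction(uniform_sample)
browsing_frac = picture_fraction(browsing_sample)

print(f"random button: {uniform_frac:.0%} of sampled pages have pictures")
print(f"browsing:      {browsing_frac:.0%} of sampled pages have pictures")
```

Under these assumptions the browsing sample should land well above the random-button sample, even though both are drawn from the same set of pages. The only thing that changes is how the sample is picked.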
