with a zero in it, I could probably just zero that out. Or I'm gonna remove that; I don't think that's gonna be useful to our dataset. And this is where tailoring your dataset to get the most relevant data is applicable, right? We've gotta ask: does it make sense to keep that in, or should we pull it out? How should we pull it out? Should we base it on how many at-bats they had, and so on? I'm just gonna take all of these rows, down through all of the zeros for sure, and remove them. Let's take it up to like .059; everything above that we will keep and assume is legitimate data. So I'm gonna right-click and delete all of that stuff.

And then if I sort the batting average from Z to A, if we have these really large batting averages, that would probably indicate that they didn't have a lot of at-bats, because that would be quite a substantial batting average. So I'm gonna say, let's just remove at least this top one, maybe these two. I'm gonna remove these two, and these are somewhat subjective as to whether we should remove them or not, right? Because you have to... but that's it.

So now we've got the batting averages, which are the heart of the data. And I'm gonna say, let's pick that up and use it to do our statistical analysis. So we've done this before: we're gonna make a skinny column D. We could do this with our eyes closed. Even if some jerk tied our eyelashes to our nose hairs, forcing our eyes closed and making them water at the same time, we could still do this at this point. So let's pull in the data; let's make this the age and the batting average. I'll make the headers black and white: Home tab, Font group, black and white for the headers. Let's center it: Alignment group, center. Then our normal calculations. This is gonna be the mean and the standard deviation, and let's pull this in a bit.
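For anyone following along outside the spreadsheet, the trimming just described can be sketched in Python. This is only a rough sketch: the rows below are made-up illustrative values, not the actual dataset, and the names `LOW_CUTOFF`, `TOP_N_TO_DROP`, and `trim` are hypothetical. The .059 cutoff and the "drop the top two" choice are the subjective calls from the walkthrough.

```python
# Hypothetical (age, batting_average) rows for illustration only.
# None marks a player with no recorded batting average.
rows = [
    (24, 0.000),
    (31, 0.045),
    (28, 0.221),
    (26, 0.254),
    (29, 0.310),
    (23, 0.667),
    (35, 1.000),
    (27, None),
]

LOW_CUTOFF = 0.059   # keep only averages above this value
TOP_N_TO_DROP = 2    # subjective choice: drop the two largest averages


def trim(rows, low_cutoff=LOW_CUTOFF, top_n=TOP_N_TO_DROP):
    """Drop blanks and low averages, then remove the top_n largest averages."""
    # Keep rows that have a batting average strictly above the cutoff.
    kept = [r for r in rows if r[1] is not None and r[1] > low_cutoff]
    # Sort by batting average, largest first (the "Z to A" sort).
    kept.sort(key=lambda r: r[1], reverse=True)
    # Slice off the top_n suspiciously large averages.
    return kept[top_n:]


trimmed = trim(rows)
print(trimmed)
```

Whether a cutoff on the average itself is better than a cutoff on the number of at-bats is a judgment call; a different dataset might justify a different rule.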
We don't need it to be that large. And then I'm just gonna do the average: equals AVERAGE of the ages of the baseball players, Ctrl+Shift+Down, and we get 28. So I'm not gonna be in there, unless I'm that crazy phenomenon that could... I'm gonna copy that to the right, and for the batting average, just go to the Home tab, Number group, and decimalize it. Let's put three decimals, and there's the average batting average, about .2. If we percentize that, it's 22%. So remember, when we were talking about baseball, the general thing is that the likelihood of actually getting a hit is not that high; 22% is the average.

That's according to this statistic, and you can see how we trimmed the data that we have over here, which might be a different way of trimming the data than some other stats you might look at, which might use different techniques. But that's what we'll go with for now.

Standard deviation: equals STDEV, and we want the one for the sample (STDEV.S) this time. I'm gonna pick up the age, Ctrl+Shift+Down as we've seen before, Enter. And there it is. Let's decimalize it: Home tab, Number group, decimalize. Then copy it to the right: fill handle, dragging it to the right. And so there we have the standard deviation here. Home tab, Number group, let's add a couple more decimals on that one.

All right, so then we could say, well, what does this data look like in terms of a histogram? Let's check that out. Let's take the ages and say Insert, Chart, and make a histogram of this. And so there's our age histogram. I should have put it up top, so let's delete that. And then I'm gonna go up to the top and insert it there so I don't have to drag it up: Insert, Chart, Histogram. All right, so this is our age histogram.
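The same two summary numbers can be sketched in Python with the standard library. The values below are invented for illustration, not the real player data, and `summarize` is a hypothetical helper name. `statistics.stdev` divides by n - 1, so it plays the role of the sample standard deviation used in the sheet.

```python
import statistics

# Hypothetical values for illustration; not the actual player data.
ages = [24, 25, 28, 29, 31, 31]
batting_averages = [0.198, 0.221, 0.254, 0.233, 0.187, 0.245]


def summarize(values):
    """Return (mean, sample standard deviation) for a list of numbers."""
    # statistics.stdev uses n - 1 in the denominator (sample, not population).
    return statistics.mean(values), statistics.stdev(values)


age_mean, age_sd = summarize(ages)
ba_mean, ba_sd = summarize(batting_averages)

print(f"age: mean={age_mean:.2f}, sd={age_sd:.2f}")
# The :.0% format shows the batting average "percentized".
print(f"BA:  mean={ba_mean:.3f} ({ba_mean:.0%}), sd={ba_sd:.3f}")
```

If the whole population were in the sheet rather than a sample, `statistics.pstdev` would be the counterpart to use instead.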
And so it looks like it's somewhat bell shaped: the middle part is the higher point and then it tapers off, although you've got this peak at 24 to 25 and then this other peak at 28 to 29. And then you go way out here, and I'm past that; I would be an outlier out there. But still, if I wanted to, I could do it. I could do it right here, right now; I just don't want to. If I did, I would do it, but it's stupid, because I don't need to do that.

Anyway, I'm gonna say Ctrl+Shift+Down on the batting average, and notice I have some blank data down here still. I thought we removed all of the blank data. Let's go back up top and fix that. Batting average: I'm sorting by batting average. Oh, I deleted the low ones, but not the blanks. Let's go down and delete all of that and do this again: Ctrl+Shift+Down. Everything below here I'm gonna delete. So I'm gonna delete the blanks. If they have no batting average, we don't even want you here. You didn't make the team. What are you even doing on the team if you don't have a batting average? Get out of here.

Let's do it again. Let's do the mean again: equals AVERAGE, making sure we pick up the right data. Copy that to the right, decimalize it. And then the standard deviation, let's do that again: equals STDEV. Go back to the minors, man. Go back to the minors. Oh, that's mean. Whatever, that's how it is when you're in the big leagues and you can't hack it. Home tab, we're gonna say, okay.

So then let's do our age histogram again, now that we have this data: Insert, Chart, Histogram. So there we have it. All right, so this is our age. Okay. And then let's do one for the batting average: Ctrl+Shift+Down, Ctrl+Backspace, and Insert, Chart, Histogram. Boom. Oh, no, I did that again. Let's pull it down. And this is gonna be the batting average, BA. And so again, it looks kind of bell shaped. It's tapering off.
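The blank-row cleanup and the histogram can be sketched in Python as well. This is a rough sketch under stated assumptions: the list is invented, blanks are represented as `None`, and `BIN_WIDTH` and `histogram` are hypothetical names. The spreadsheet picks its own bin edges, so the counts here will not match its chart exactly.

```python
from collections import Counter

# Hypothetical batting averages; None stands in for a blank cell.
batting_averages = [0.221, None, 0.254, 0.198, None, 0.310, 0.245, 0.233]

# Drop the blanks before computing anything, as in the walkthrough.
clean = [v for v in batting_averages if v is not None]

BIN_WIDTH = 0.05  # assumed bin width, chosen only for illustration


def histogram(values, bin_width):
    """Count how many values fall into each fixed-width bin.

    Returns a dict mapping each bin's lower edge to its count,
    ordered from the lowest bin to the highest.
    """
    # v // bin_width gives the index of the bin that contains v.
    counts = Counter(int(v // bin_width) for v in values)
    return {round(k * bin_width, 3): counts[k] for k in sorted(counts)}


hist = histogram(clean, BIN_WIDTH)
for lower_edge, count in hist.items():
    # Print a simple text bar for each bin.
    print(f"{lower_edge:.2f}-{lower_edge + BIN_WIDTH:.2f}: {'#' * count}")
```

A tall middle bin with shorter bins on either side is the "kind of bell shaped" look described above.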
We've got these outliers that maybe we should have trimmed off over here, and the middle point is around the mean. So that doesn't necessarily give us an indication that there's any kind of correlation, but it might sometimes give us some ideas for our hypotheses about the data.

So now let's take our formula for the correlation calculation. I'm gonna make column H large. And we'll say this is going to be smaller here. Oh, now I'm moving it. That's not what I wanted to do. Make a smaller one here. And then let's make a smaller one here. Okay. And then we'll pull this into the large H. All right. So it's still probably quite large. I don't think I need... anyway, that's okay. So then let's take our data and do our calculation for the age and the batting average. I'm just gonna take those two and paste them over here. And so there we have it. Let's make our header tab this way, going to the,