Hands On Analysis - Simple Linear Regression - Baseball Data
Suppose we are interested in whether there is a relationship between batting average (X)
and number of home runs (Y) a player hits. Some fans might argue, for example, that
those who hit lots of home runs also tend to strike out a lot, so that their batting
average is lower. Let us check it out, using a regression of the number of home runs
against the player’s batting average (hits divided by at bats). Because baseball batting
averages tend to be highly variable for a low number of at bats, we restrict our data set to
those players who had at least 100 at bats for the 2002 season. This leaves us with 209
players.
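
The setup described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the records below are made-up toy values, not the 2002 data, and the names players, prepare, and fit_line are hypothetical helpers introduced here. The sketch computes each batting average as hits divided by at bats, keeps only players with at least 100 at bats, and fits the least-squares line of home runs (Y) on batting average (X).

```python
# Hypothetical toy records: (at_bats, hits, home_runs).
# These values are invented for illustration; they are NOT the 2002 season data.
players = [
    (550, 165, 30),
    (480, 120, 12),
    (90, 40, 1),    # dropped below: fewer than 100 at bats
    (300, 81, 8),
    (600, 186, 25),
    (20, 2, 0),     # dropped below: fewer than 100 at bats
]

MIN_AT_BATS = 100  # the cutoff stated in the text


def prepare(records, min_at_bats=MIN_AT_BATS):
    """Return (xs, ys): batting averages and home runs for qualifying players."""
    kept = [(hits / at_bats, home_runs)
            for at_bats, hits, home_runs in records
            if at_bats >= min_at_bats]
    xs = [x for x, _ in kept]
    ys = [y for _, y in kept]
    return xs, ys


def fit_line(xs, ys):
    """Ordinary least squares for y = b0 + b1 * x; returns (b0, b1)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b1 = sxy / sxx              # slope
    b0 = y_bar - b1 * x_bar     # intercept
    return b0, b1


xs, ys = prepare(players)
b0, b1 = fit_line(xs, ys)
print(f"players kept: {len(xs)}")
print(f"fitted line: home_runs = {b0:.2f} + {b1:.2f} * batting_average")
```

With the real data, a negative slope would support the fans' argument that home run hitters have lower batting averages, while a positive slope would point the other way.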