Sample Data Sets
The Baseball data set contains performance measures and salary levels for regular hitters and leading substitute hitters in Major League Baseball for the year 1986 (Reichler 1987). There is one observation per hitter.
The following list describes each variable:
player’s name
number of times at bat (in 1986)
number of hits (in 1986)
number of home runs (in 1986)
number of runs (in 1986)
number of runs batted in (in 1986)
number of bases on balls (in 1986)
years in the major leagues
career at-bats
career hits
career home runs
career runs
career runs batted in
career bases on balls
player’s league at the end of 1986
player’s division at the end of 1986
player’s team at the end of 1986
positions played (in 1986)
number of putouts (in 1986)
number of assists (in 1986)
number of errors (in 1986)
salary, in thousands of dollars (in 1986)
The position variable in the Baseball data set is encoded as follows:
13 | First base and third base
1B | First base
1O | First base and outfield
23 | Second base and third base
2B | Second base
2S | Second base and shortstop
32 | Third base and second base
3B | Third base
3O | Third base and outfield
3S | Third base and shortstop
C | Catcher
CD | Center field and designated hitter
CF | Center field
CS | Center field and shortstop
DH | Designated hitter
DO | Designated hitter and outfield
LF | Left field
O1 | Outfield and first base
OD | Outfield and designated hitter
OF | Outfield
OS | Outfield and shortstop
RF | Right field
S3 | Shortstop and third base
SS | Shortstop
UT | Utility
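The position encoding above can be used as a simple lookup table when the data are processed outside SAS. The following Python sketch is one way to do that. The names `POSITION_CODES` and `decode_position` are hypothetical and are not part of the data set or of any SAS product. The codes and descriptions are copied from the list above.

```python
# Hypothetical lookup table: codes and descriptions copied from the
# position encoding listed in this section.
POSITION_CODES = {
    "13": "First base and third base",
    "1B": "First base",
    "1O": "First base and outfield",
    "23": "Second base and third base",
    "2B": "Second base",
    "2S": "Second base and shortstop",
    "32": "Third base and second base",
    "3B": "Third base",
    "3O": "Third base and outfield",
    "3S": "Third base and shortstop",
    "C": "Catcher",
    "CD": "Center field and designated hitter",
    "CF": "Center field",
    "CS": "Center field and shortstop",
    "DH": "Designated hitter",
    "DO": "Designated hitter and outfield",
    "LF": "Left field",
    "O1": "Outfield and first base",
    "OD": "Outfield and designated hitter",
    "OF": "Outfield",
    "OS": "Outfield and shortstop",
    "RF": "Right field",
    "S3": "Shortstop and third base",
    "SS": "Shortstop",
    "UT": "Utility",
}


def decode_position(code: str) -> str:
    """Return the description for a position code, or a fallback message.

    Assumption: surrounding whitespace and letter case in the input are
    not significant, so the code is normalized before the lookup.
    """
    key = code.strip().upper()
    return POSITION_CODES.get(key, f"Unknown position code: {key}")
```

For example, `decode_position("3S")` returns the description for third base and shortstop, and a code that is not in the table returns a message that begins with "Unknown position code".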
Copyright © SAS Institute, Inc. All Rights Reserved.