There has been growing and large disparity in recent years among the
payrolls of salaries of major league sports team. In order to acquire the players,
very high prices are being paid by many teams so that team could be made up of
best players. It hasbeen seen that high budget teams are often successful such
as New York Yankees; this raises the question that either tag of high prices worth
the improvement in players’ performance or not. Many statistics such as on base
percentage plus slugging, stolen base, and batting average can be examined to evaluate
their impact on the salariesof players.
The higher salaries are expected to be correlated to good players’
performance. OPS takes on base percentage plus slugging of players and
normalize the number around whole league and accounts for ballparks kind of
external factors. It is a solid tool for players’ performance evaluation at the
place and it is a good measure to rank players switching the teams. The number
of tears of experience refers to time period between a player joined the team
till present working time and it also matters to evaluate the performance of
the players.This project aims to identify the relationship between salary of
player and their performance in major league sports and predicting the salary
of baseball based on the performance as well in the field. The model for this
project is explained as:
Wins = number of
games won = dependent variable
Errors = number
of errors committed = independent variable
ERA = team ERA =
independent variable
HR = number of
homes runs = independent variable
League =whether
the team places in the National League or the American League. League code
variable is added by using 1 for the American League and 0 for the National
League.
Payroll = Players Pay = Independent variable
SB = Stolen base = independent variable
BA = Batting average = independent variable
On base percentage plus slugging is major independent variable because
this variable plays an important role to identify the performance of player.
Following is the general form of variables:
Definition of
Variables of Pay and Performance
in Major League Sports
Wins: Wins refer to the number of games that league
has won.
League: League refers to two sports league i.e. National
League and the American eague.
HR: It refers to the number of home runs. The
expected result of this variable is positive.
Payroll: It refers to players’ salary that is what players
are being paid for their work in a team. The expected result of this variable
is positive.
Stolen base:In major league sports, stolen base occurs
when players is not entitled to some base and advances to that base and it is
ruled by the official scorer that action of the runner is credited by advances.
It often occurs when ball is being pitched by pitcher to home plate and runner
advances to a next base. The expected result of this variable on dependent
variable is negative.
Batting average:the average score of a player is called batting
average. For example, in cricket, runs scored by a batsman on one completed
innings and in baseball, the safe hits of batter at bat per official times is
knowns as batting average. The expected result of this variable on dependent
variable is positive.
Data Description
This project is based on data of above mentionedvariables collected form
“Bovée, C.L., Thill, J.V. and Raina, R.L., 2016. Business communication today.
Pearson Education India.” The data is presented in following table:
Team
|
League
|
Wins
|
ERA
|
BA
|
HR
|
SB
|
Errors
|
Payroll
|
|
NL
|
65
|
4.81
|
0.250
|
|
86
|
102
|
60.7
|
|
NL
|
91
|
3.56
|
0.258
|
|
63
|
126
|
84.4
|
|
AL
|
66
|
4.59
|
0.259
|
|
76
|
105
|
81.6
|
|
AL
|
89
|
4.20
|
0.268
|
|
68
|
111
|
162.7
|
|
NL
|
75
|
4.18
|
0.257
|
|
55
|
126
|
146.9
|
|
AL
|
88
|
4.09
|
0.268
|
|
160
|
103
|
108.3
|
|
NL
|
91
|
4.01
|
0.272
|
|
93
|
72
|
72.4
|
|
AL
|
69
|
4.30
|
0.248
|
|
91
|
110
|
61.2
|
|
NL
|
83
|
4.14
|
0.263
|
|
99
|
101
|
84.2
|
|
AL
|
81
|
4.30
|
0.268
|
|
69
|
109
|
122.9
|
|
NL
|
80
|
4.08
|
0.254
|
|
92
|
123
|
55.6
|
|
NL
|
76
|
4.09
|
0.247
|
|
100
|
103
|
92.4
|
|
AL
|
67
|
4.97
|
0.274
|
|
115
|
121
|
72.3
|
|
AL
|
80
|
4.04
|
0.248
|
|
104
|
113
|
105
|
|
NL
|
80
|
4.01
|
0.252
|
|
92
|
98
|
94.9
|
|
NL
|
77
|
4.58
|
0.262
|
|
81
|
101
|
81.1
|
|
AL
|
94
|
3.95
|
|
142
|
68
|
78
|
97.6
|
New York Mets
|
NL
|
79
|
3.70
|
0.249
|
128
|
130
|
87
|
132.7
|
|
AL
|
95
|
4.06
|
0.267
|
|
103
|
69
|
206.3
|
|
AL
|
81
|
3.56
|
0.256
|
|
156
|
99
|
51.7
|
|
NL
|
97
|
3.67
|
0.260
|
|
108
|
83
|
141.9
|
|
NL
|
57
|
5.00
|
0.242
|
|
87
|
127
|
34.9
|
|
NL
|
90
|
3.39
|
0.246
|
|
124
|
72
|
37.8
|
|
|
|
|
0.257
|
|
55
|
73
|
97.8
|
|
AL
|
61
|
3.93
|
0.236
|
|
142
|
110
|
98.4
|
|
NL
|
86
|
3.57
|
0.263
|
|
79
|
99
|
93.5
|
|
AL
|
96
|
3.78
|
0.247
|
|
172
|
85
|
71.9
|
|
AL
|
90
|
3.93
|
|
|
|
105
|
55.3
|
|
AL
|
85
|
4.22
|
0.248
|
|
58
|
92
|
62.7
|
|
NL
|
69
|
4.13
|
0.250
|
|
110
|
127
|
61.4
|
Presentation and Interpretation of Results of Pay and Performance in Major League Sports
Correlation Analysis of Major League Sports
|
Wins
|
League
|
ERA
|
BA
|
HR
|
SB
|
Errors
|
Payroll
|
Wins
|
1
|
|
|
|
|
|
|
|
League
|
0.049402548
|
1
|
|
|
|
|
|
|
ERA
|
-0.681075006
|
0.144781586
|
1
|
|
|
|
|
|
BA
|
0.460875034
|
0.224153518
|
0.058049927
|
1
|
|
|
|
|
HR
|
0.437799827
|
0.114081832
|
0.08710859
|
0.317179144
|
1
|
|
|
|
SB
|
0.034441106
|
0.270324406
|
-0.203424468
|
-0.176253311
|
-0.307739372
|
1
|
|
|
Errors
|
-0.634078486
|
-0.015525505
|
0.479930323
|
-0.16569051
|
-0.279342476
|
-0.132660629
|
1
|
|
Payroll
|
0.34902618
|
0.148628679
|
-0.142205861
|
0.296919745
|
0.26227353
|
-0.1603734
|
-0.216774
|
1
|
|
League
|
Wins
|
ERA
|
BA
|
HR
|
SB
|
Errors
|
Payroll
|
|
|
|
|
|
|
|
|
|
Mean
|
0.466666667
|
81
|
4.073333333
|
0.257266667
|
153.7666667
|
98.63333333
|
101
|
91.01666667
|
Standard Error
|
0.092641111
|
2.00917436
|
0.076555642
|
0.00189672
|
6.119440162
|
5.697445149
|
3.19662178
|
6.984363625
|
Median
|
0
|
81
|
4.07
|
0.257
|
151
|
92.5
|
102.5
|
84.3
|
Mode
|
0
|
80
|
3.56
|
0.268
|
149
|
68
|
126
|
#N/A
|
Standard Deviation
|
0.507416263
|
11.00470119
|
0.419312519
|
0.010388765
|
33.51755416
|
31.20619228
|
17.50861857
|
38.25493507
|
Sample Variance
|
0.257471264
|
121.1034483
|
0.175822989
|
0.000107926
|
1123.426437
|
973.8264368
|
306.5517241
|
1463.440057
|
Kurtosis
|
-2.126913265
|
-0.652291187
|
0.170077771
|
-0.817470219
|
1.797977689
|
-0.025766615
|
-0.759303791
|
1.62727711
|
Skewness
|
0.14076918
|
-0.489521729
|
0.492762757
|
0.092907631
|
0.991335172
|
0.697212811
|
-0.2699435
|
1.133049818
|
Range
|
1
|
40
|
1.64
|
0.04
|
156
|
117
|
58
|
171.4
|
Minimum
|
0
|
57
|
3.36
|
0.236
|
101
|
55
|
69
|
34.9
|
Maximum
|
1
|
97
|
5
|
0.276
|
257
|
172
|
127
|
206.3
|
Sum
|
14
|
2430
|
122.2
|
7.718
|
4613
|
2959
|
3030
|
2730.5
|
Count
|
30
|
30
|
30
|
30
|
30
|
30
|
30
|
30
|
The above table represents the correlation analysis. It can be seen that
wins variable is positively correlated with league, ERA, andBA. While the
variable is negatively correlated with HR, SB, and errors. The wins are highly
correlated with league. From the total 7 variables only two variables are
negatively correlated with the Wins these are; ERA and errors which have values
roundabout the -0.681075006 and -0.634078486 respectively. Meanwhile the
remaining five variables as; League, BA, HR, SB and payrolls are positively related
with the Wins. The negative values shows that there is negative relationship
among the dependent and independent variables and the positive values shows
positive relationship.
Summary Statistics of Major League Sports
The summary statistics of variables is given in the above table
containing means, standard error, standard deviation, median, and mode etc. The
above data shows that HR has highest mean value followed by error, SB, and payroll
while the BA has lowest mean value. The summarystatics analysis used to
represent the minimum and maximum values of the variables and it also can calculate
the ranges along with the mean mode medianfor each variable individually. This
tables is also represents the values of the Skewness
andKurtosis that is commonly used to measure
the relative size of the two tails and combined sizes of the two
tails respectively. The probability of the two tails also can be measure by the
analysis.The value of the leagues is the best match for the Kurtosis.
Regression Analysis of Major League Sports
SUMMARY OUTPUT of Major League Sports
|
|
|
|
|
|
|
|
|
|
|
Regression
Statistics
|
|
|
|
|
Multiple R
|
0.931129925
|
|
|
|
|
R Square
|
0.867002938
|
|
|
|
|
Adjusted R Square
|
0.824685691
|
|
|
|
|
Standard Error
|
4.607729093
|
|
|
|
|
Observations
|
30
|
|
|
|
|
|
|
|
|
|
|
ANOVA
|
|
|
|
|
|
|
df
|
SS
|
MS
|
F
|
Significance
F
|
Regression
|
7
|
3044.914317
|
434.9877596
|
20.48816966
|
2.89702E-08
|
Residual
|
22
|
467.0856827
|
21.2311674
|
|
|
Total
|
29
|
3512
|
|
|
|
|
|
|
|
|
|
|
Coefficients
|
Standard
Error
|
t
Stat
|
P-value
|
Lower
95%
|
Intercept
|
39.7199497
|
26.13628013
|
1.51972467
|
0.142822471
|
-14.48337776
|
League
|
-0.061828574
|
1.918998774
|
-0.032219184
|
0.974587782
|
-4.041588449
|
ERA
|
-16.80095049
|
2.496407119
|
-6.730052309
|
9.16822E-07
|
-21.97818198
|
BA
|
385.2606269
|
92.33324928
|
4.172501563
|
0.000395919
|
193.773188
|
HR
|
0.113088294
|
0.030441284
|
3.714964627
|
0.001205621
|
0.049956934
|
SB
|
0.021301062
|
0.032870296
|
0.648033787
|
0.523669467
|
-0.046867759
|
Errors
|
-0.097128388
|
0.061543682
|
-1.578202406
|
0.128790918
|
-0.224762173
|
Payroll
|
0.010435676
|
0.024790363
|
0.420956958
|
0.677870466
|
-0.040976391
|
Interpretations of Major League Sports
The above given tables shows the regression analysis which is usually represent
the effects of the independent variable on the dependent variables. In the
coefficients table the value of the coefficient for all variablesconsidered for
applying the equation.
Y= a+ bx
Y=a+ League x1+ ERA x2+ BA x3+ HR x4+ SB x5+ Errors x6+ Payroll x7
Y=39.7199497+-0.061828574x1+-16.80095049 x2+385.2606269x3+0.113088294x4+0.021301062x5+-0.097128388x6+0.010435676x7
The above table is showing that there is postiche relationship among all
of these variables except three variables as; Errors, League and ERA. Because
these three variables have negative relationship with the wins. It shows the winning power of the supports can
be decrees by increasing the errors leagues and ERA. All the variables has
significant values less than 0.05 but only league, ERA and payroll are not
significant for the Wins.
In the table of the model summary the value of the adjusted R square is
0.824685691 which shows wins has 82% influence on the independents variables.
Conclusion Major League Sports
It is concluded
that salaries is major key that enforce the person to continue to his jobs. The higher salaries are expected to be
correlated to good players’ performance. OPS takes on base percentage plus
slugging of players and normalize the number around whole league and accounts
for ballparks kind of external factors. It has been
concluded in this paper that there is the positive relationship between salary of player and their
performance in major league sports and predicting the salary of baseball based
on the performance as well in the field. By increasing
the salaries the performance of the players will be enhance as well. It has
been observed by applying the regression and correlation that there is the
significant positive relationship among the salaries and good players’ performance.