eERA (beta) - adjusting for Inherited Runs
On June 3, 2011 the Nationals were battling the Diamondbacks with Yunesky Maya on the mound. In the bottom of the 5th inning, the score was 1-0 in favor of Arizona. Four batters and one out later, the bases were loaded, and Stephen Drew was at bat. Jim Riggleman came out and brought in lefty specialist Doug Slaten, who had at the time a sparkly ERA of 2.19 for the season. Drew hammered the first strike he saw for a triple which scored all three of the runners Maya had put on base. Coffey then entered the game and stranded Drew at 3rd base, keeping Slaten’s shiny ERA intact once more.
The way in which each Earned Run is tied to a runner who is put on base is one weakness of the ERA statistic when measuring performance. To this point, Slaten had allowed 50% of inherited runners to score, yet his ERA didn’t show it. It was after this game that Slaten was sent to the disabled list with was said to be an elbow injury.
Unfortunately, the only decent statistic for tracking how a pitcher backs up his teammates is the Inherited Score Percentage, which just shows what percentage of inherited runners a pitcher allows to score. Though useful in its own way, it does not allow comparison between pitchers with the ease of a statistic like ERA, and it has no regard for starting pitchers, who of course never have inherited runners on their hands.
I looked at this problem, and wondered if it would not be possible to split each earned run that scored as an inherited run up between the two different pitchers involved. The formula would have to be simple. At first, I decided to simply score each base given up by a pitcher to a runner who eventually scored as .25 runs earned. Soon, though, it became clear that it would be far more equitable, and not much more difficult, to incorporate some situational statistics.
| OUTS | RATE_1B | RATE_2B | RATE_3B |
| 0 | 0.380 | 0.600 | 0.826 |
| 1 | 0.253 | 0.410 | 0.652 |
| 2 | 0.123 | 0.227 | 0.289 |
That chart contains Tom Tango's calculations, based on Retrosheet data, of the chance a runner has to score from a given base by out. Using these percentages, I was able to divide up earned runs between pitchers. If, for example, a starting pitcher is pulled with one out and a runner on 2nd who later scores, that pitcher is credited with .41 runs, and the relief pitcher who let that runner score is credited with .59 runs. Using a season of play by play data, I was able to track who really earned some of those earned runs everybody has been earning.
In the example above, Slaten took over from Maya with the bases loaded and allowed all three runners to score. Normally, that would mean three earned runs for Maya, but under this system, those three runs would be split up. Maya left with the bases loaded and one out, which means he gets (.253+.410+.652) runs - 1.315 ER . Slaten allowed them to score, so he picks up the balance of the three earned runs - 1.685 ER. It is certainly no perfect system, but it gives us new information than simple ERA does not. I call it eERA (earned Earned Run Average).
Here is a chart showing the 2011 Nationals pitching staff, ordered by innings pitched.
| Name | IP | ER | ERA | Run Diff | eERA | % diff | IS | IS% |
| John Lannan | 184 | 76 | 3.71 | -3.95 | 3.52 | -5.2% | ||
| Livan Hernandez | 175 | 87 | 4.47 | -3.96 | 4.27 | -4.6% | ||
| Jordan Zimmermann | 161 | 57 | 3.18 | -2.03 | 3.07 | -3.6% | ||
| Jason Marquis | 120 | 53 | 3.97 | -3.96 | 3.67 | -7.5% | ||
| Tom Gorzelanny | 105 | 47 | 4.03 | -0.65 | 3.97 | -1.4% | 0 | |
| Tyler Clippard | 88.1 | 18 | 1.83 | 3.10 | 2.15 | 17.2% | 10 | 22% |
| Drew Storen | 75.1 | 23 | 2.75 | -1.84 | 2.53 | -8.0% | 2 | 20% |
| Ross Detwiler | 66 | 22 | 3.00 | 0.17 | 3.02 | 0.8% | 1 | 25% |
| Henry Rodriguez | 65.2 | 26 | 3.56 | 0.82 | 3.68 | 3.2% | 7 | 35% |
| Chien-Ming Wang | 62.1 | 28 | 4.04 | 0.00 | 4.04 | 0.0% | ||
| Todd Coffey | 59.2 | 24 | 3.62 | 1.06 | 3.78 | 4.4% | 7 | 19% |
| Sean Burnett | 56.2 | 24 | 3.81 | 8.98 | 5.24 | 37.4% | 19 | 44% |
| Collin Balester | 35.2 | 18 | 4.54 | 0.81 | 4.75 | 4.5% | 4 | 44% |
| Yunesky Maya | 32.2 | 19 | 5.23 | -1.54 | 4.81 | -8.1% | 0 | |
| Ryan Mattheus | 32 | 10 | 2.81 | 3.46 | 3.79 | 34.6% | 8 | 28% |
| Tom Milone | 26 | 11 | 3.81 | -0.78 | 3.54 | -7.1% | ||
| Stephen Strasburg | 24 | 4 | 1.50 | 0.00 | 1.50 | 0.0% | ||
| Doug Slaten | 16.1 | 8 | 4.41 | 6.71 | 8.11 | 83.9% | 15 | 47% |
| Cole Kimball | 14 | 3 | 1.93 | 2.07 | 3.26 | 69.1% | 4 | 80% |
| Brian Broderick | 12.1 | 9 | 6.57 | -0.33 | 6.33 | -3.7% | 5 | 100% |
| Brad Peacock | 12 | 1 | 0.75 | 1.34 | 1.75 | 133.7% | 2 | 100% |
| Craig Stammen | 10.1 | 1 | 0.87 | 0.35 | 1.17 | 34.8% | 1 | 11% |
| Chad Gaudin | 8.1 | 6 | 6.48 | -0.29 | 6.17 | -4.8% | 1 | 14% |
| Atahualpa Severino | 4.2 | 2 | 3.86 | 0.35 | 4.53 | 17.4% | 2 | 50% |
Run Diff: Run differential between Earned Runs and earned Earned Runs.
IS: Number of inherited runners allowed to score.
IS%: Percentage of inherited runners allowed to score.
Starting pitchers enjoy, as would be expected, a lower eERA than ERA. Taking off even this prorated portion of inherited earned runs scored lowers their ERA by over five percent in some cases, which is far from insignificant.
There is a lot of variation among relief pitcher performance here. As expected, Doug Slaten’s eERA is much higher than his ERA – a stunning seventy-five percent higher. No other Nationals pitcher has an ERA, eERA, FIP or xFIP as high as Slaten’s 8.11 eERA. Also struggling this year was Sean Burnett, whose eERA was more than a full run higher than his ERA.
Much more surprising are the numbers belonging to All-Star Tyler Clippard, who put together a stellar season, yet has an eERA that is a seventeen percent higher than his ERA. While he did not allow an abnormally high number of inherited runners to score, a lot of the runs he gave up came via the home run, which allows runners to score from 1st base quit easily regardless of the number of outs.
One relief pitcher who recorded a lower eERA than ERA was Drew Storen, as he locked down the closer role and had a terrific year. The only two inherited runners to score on his watch were scored as unearned runs.
eERA (beta) is a young and undisproven statistic. It is only vaguely scientific, yet I believe it helps cast light on the shameful racket that is ERA laundering. Every year, hundreds of major league starters and relievers are saddled with earned runs they never truly earned, and one of their teammates quietly reaps the benefits of the system. eERA is one more small step toward exposing this corruption and bringing justice to the way performance in pitching is calculated.
Don’t be satisfied with ERA when you can have earned Earned Run Average.
39 comments
|
3 recs |
Do you like this story?
Comments
Interesting
You mentioned/questioned that my version would penalize a player’s “adjusted ERA” for the runners he leaves on, regardless of whether they score, and would give a bonus to relievers for stranding runners they inherit. I like the notion, because a pitcher shouldn’t be let off the hook for bad play just because the team happens to have a super-stopper ready to come in and clean up others’ messes. I also like the idea of tying outs into the formulas, as we mentioned.
Still, I like it. And I still don’t know how to put tables into fan posts…
Rob
-- In baseball we trust.
At its simplest, that problem is addressed by WHIP
but at its most wackiest and beautiful you would have a system where every action has a situational run value attached to it which is turned into an ERA-like number. A leadoff walk is worth say .3 runs, so a pitcher would earn .3 runs for that at bat regardless of the inning. It would be very complicated, but if it is possible, someone should certainly do it for posterity’s sake.
WPA might be your answer
It is entirely based on the situation, and has a set run value for every single base state in every inning with ever score.
by William.Hatheway on Oct 4, 2011 2:44 PM EDT up reply actions
I would prefer to take the context out of the equation
A reliever who turns a 7-run lead in the eighth into a 3-run lead in the eighth has done serious damage, but probably doesn’t even move the WPA marker.
Rob
-- In baseball we trust.
You'd want something context-neutral, though, maybe WPA/LI?
"If you ain't got the pants, you ain't got a chance." --PerryMason (on the sartorial component of being a Real Ballplayer)
I feel stupid.. they don’t need to add up to 1 run at all. If we use Tom Tango’s old numbers:
Chance of scoring, from each base/out state
0 outs 1 out 2 outs
1B .38 .25 .12
2B .61 .41 .21
3B .86 .68 .29
The number listed in that table would be the percentage of the run credited to the original pitcher if that inherited runner scored. It would not take any more math than my original method, and would be much more fair. Thoughts?
I guess these are similar to the ones I proposed:
1B .25 .17 .08
2B .50 .33 .17
3B .75 .50 .25
My values typically run low, but they’re not too far off (save for runner on 3rd w/1 out)
Rob
-- In baseball we trust.
steroid era, non-steroid era? more numbers from Tango
Chance of scoring from base, all Retrosheet years.
OUTS RATE_BAT RATE_1B RATE_2B RATE_3B
0 0.155 0.380 0.600 0.826
1 0.111 0.253 0.410 0.652
2 0.064 0.123 0.227 0.289
Chance of scoring from base, 1993-2007 (excluding 1999).
OUTS RATE_BAT RATE_1B RATE_2B RATE_3B
0 0.167 0.397 0.614 0.842
1 0.120 0.266 0.417 0.663
2 0.070 0.132 0.232 0.292
My weights do have the advantage
of not having to be recalculated every few years.
Rob
-- In baseball we trust.
I like that part
but the difference between .75 runs and .83 runs is pretty substantial.
The updated retrosheet numbers are the lowest of the three I found, but still much higher than yours.
It's not that much
The margin of error for that .83 estimate is probably at least .05 anyway, my guess.
Rob
-- In baseball we trust.
Why worry about whether the runner scores at all?
Go for straight base/out run expectancies. A pitcher gets eER equal to runs scored, plus the run expectancy of the situation when he leaves the game (for relievers, minus the run expectancy of when he enters the game).
A pitcher who gives up 2 runs, then leaves with one out and runners corners (RE = 1.2 runs) has 2 (runs given up) +1.2 (leaving RE) = 3.2 eERA. Doug Slaten comes in and gives up a sac fly and an RBI double and gets hooked, leaving runners 2nd with two outs (RE = 0.3), has 2 (runs given up) + 0.3 (leaving RE) – 1.2 (starting RE) = 2.9 eER. Tyler Clippard comes up and gets a pop up, getting 0 (runs gives up) -0.0 (leaving RE) – 0.3 (starting RE) = -0.3 eER. I think this shows the value of relievers in an interesting way, since good ones would take runs away.
"If you ain't got the pants, you ain't got a chance." --PerryMason (on the sartorial component of being a Real Ballplayer)
For eERA as I have designed it, I purposely do not want to change the number of ER within a team, to allow it to mesh with existing statistics smoothly. I am simply shifting earned runs around from player to player.
What you are suggesting is a great idea, as I said above in my first comment, before I accidentally dumped a reply meant for the comment just below on it. Doing all that would create a WHIP/ERA situational superstat which would measure actual performance like nothing else.
I haven't thought this through carefully, but...
…I think you’ll end up with the same total number of ER (or possibly RA), since the “extra” ER you get from charging RE values to the exiting pitcher will get subtracted from the pitcher who follows.
"If you ain't got the pants, you ain't got a chance." --PerryMason (on the sartorial component of being a Real Ballplayer)
you're right.. it should be the same aggregrate ER, just a much weirder "ERA"
that may be seem rather unfair to starters.. still worth pursuing I think.
This is more or less what I proposed
Using the weights above instead.
Guess who ends up with the best aERA using this technique?
Craig Stammen! A negative net aERA!
Rob
-- In baseball we trust.
How to tie outs into the eERA formula
I imagine that to implement the use of outs in what I attempted above, we would need nine separate values for each base advanced, one for each base advanced per out. The value of moving around the bases would have to add up to 1 run in each case, but with a greater share of that run falling to the reliever the more outs he started with.
Do you have any ideas what we could use as a basis for that calculation?
Nice work
It’s good that Doug Slaten’s suckitude caused a “vaguely scientific” discovery!
Check out DC is for Baseball and 2011 Nationals Draft Info!
by what Juneau about that? on Oct 4, 2011 2:43 PM EDT reply actions
Very nice.
As has so often happened over the course of the year, I noticed Brad Peacock’s eERA affected by SSS. That one relief outing sure ballooned his eERA. I think the coefficients are somewhat arbitrary, but budgets, etc., consume so much of my brain I don’t have sufficient cells left to figure out how to compensate…
Brain: "Pinky, are you pondering what i'm pondering?"
Pinky: "Yes, ... wait, ... no, ... never mind"
You mean his first inning, when Desmond made an error that the scorekeeper decided to call a base hit?
You know, the second time in that game that Desmond got away from an error?
(I’m not saying this because I dislike Desmond, I actually respect him, I’m saying this because I’m a stats guy and Peacock was corn-holed by the scorekeeper…and Desmond)
Skins rule
So, I stated earlier that I could "theoretically" use BBRs game logs to find the aERA
if I ever got around to building a parsing script. Well, now I see that I can’t do it… the game logs do not give enough information to determine whether runs that are charged to a pitcher scored while he was on the mound or after. So now I don’t know how I could calculate the number.
Rob
-- In baseball we trust.
I thought that aERA ignored who runs were charged to in real life..
since the earned runs are handed out for each at bat regardless whether actual runs were scored. You’d just have a value assigned to every possible outcome of each at bat and add that number to the pitcher who was standing on the mound that at bat.
Well, dangit, now I'm going to have to download the 2011 play logs and see if I can write a query to do the math.
Does anyone know a quicker way to do it that downloading one team at a time using the Play Finder?
"If you ain't got the pants, you ain't got a chance." --PerryMason (on the sartorial component of being a Real Ballplayer)
That's the idea
What I would need is simply what’s the score when the pitcher comes in, and what’s the score when he leaves (in addition to the data which IS there, i.e. the number of outs and the baserunners.) All they show is the relative score before and after, e.g. ahead 3 / behind 2.
The calculation would then go:
aER = runs scored while pitcher is on mound – %runs based on baserunners inherited + %runs based on baserunners left to next pitcher.
Then obviously, the sum of all aERs will be the actual number of runs scored. NB: I’m ignoring “earned” and “unearned” here, so “aERA” is probably not the best name for the stat, but I’m going with it anyway! :-)
Rob
-- In baseball we trust.
Of course,
knowing the pitcher came in with the team ahead by 3 and left with the team behind 2 is not enough — he gave up at least five runs, but it’s not known how many runs the offense scored for him in the meantime. The one exception is when he enters and leaves in the same inning.
Rob
-- In baseball we trust.
Could you use Fangraph play logs to find this out?
They give every bit of information you’d need, I think.
by William.Hatheway on Oct 4, 2011 8:39 PM EDT up reply actions
9 PM EST - eERA has now been updated to 2.0b
The formula now uses Tom Tango’s retrosheet calculculations (link in fanpost), so each run division depends on outs as well as inherited baserunners. At the moment, I am trying to track down an error I believe to be in my dataset, but by and large the data should be more useful and accurate than that from the first incarnation
More mistakes weeded out. Slaten’s eERA mysteriously continues to climb.. I really don’t know what if it’s him or me, but nobody can double check my work without going through all the play by play data for every game this year.
However, it appears, whether I transferred all my data correctly or not, that eERA does an surprisingly good job of reflecting a layer of performance that ERA does not show but that we all know exists.
I'm a bit skeptical on some points
HRod, in particular. According to my calculations (using my weights), he should get credit for 3.917 runs for baserunners he inherits, and he should get penalized 5.583 runs for the baserunners he leaves to other relievers, for a net penalty of 1.667 runs. It seems to me that you have a neutral total with your weights, which is vastly different, far more than the difference one can expect just because of different weights.
Rob
-- In baseball we trust.
Yes, I am indeed using a neutral total
It is central to what I was trying to accomplish with eERA. All I am doing is moving existing ER around to “fix” ERA, so I do not use credits and such. This ensures that eERA is directly relateable to ERA proper.
Once you add a credit for runners left on base, you are abandoning the establish system of earned runs. Your system does not pretend to improve/fix ERA like eERA does. However, your system can be useful and should be developed further.
In the long run, your system should prove very useful in evaluating relievers, since it measures all inherited base runners, while eERA does not. This allows relievers to get credit for stranding base runners, and should measure their performance in that part of the game quite effectively.
Rodriguez is not affected a lot under eERA because while he both slatened and was slatened for five earned runs. He should come out looking worse under your system because he allows a lot of baserunners.
I did something related with RE24
A few days ago I posted something on the Nats Noodles blog that showed how you can use Fangraphs’ RE24 (which is based on the 24-state run expectancy matrix) to adjust relief pitchers’ ERA for inherited runners. Using RE24 has the advantage that it depends only on what happens while the pitcher is on the field, and doesn’t depend on whether a later pitcher lets a runner score.
Thanks for taking the time to comment
ER24 may be the closest thing to an all-inclusive system for measuring performance that exists today. Obviously, its weakness is the fact that it ignores defense, meaning pitchers and batters get credit for defensive miscues. Yes, I do call it a weakness.. in a perfect world, defenses would defend perfectly and we would not have this problem of unearned runs. Given that pitchers are forced to pitch in an environment where other players make mistakes, they should not be penalized for the mistakes those other players make. The ER system does not deal with this problem perfectly, but it tries. I love what RE24 comes so close to doing, but because I can not rely on it to accurately measure any player’s individual contributions, I tend to ignore its existence most of the time. :)
I like what you’ve done with ER24 on your blog though, especially the idea of comparing ER24 to wRAA. I’d love to see those numbers for the whole team, though I suppose it wouldn’t take much time or effort to do that myself. Btw, Kershaw beats Halladay by a mile using RE24.. welcome to the NL, Doc.

by 























