Sabermetrics And You: The Big Three, Part 1 - Batting Average
You can read all of the articles in the "Sabermetrics And You" series here.
Sabermetrics Reading Level (1-5): 1 - Beginner. This is an introductory-level article designed to give you a better understanding of basic sabermetric principles and/or stats. It should be well-trodden ground for Amazin' Avenue veterans.

It used to be that if you wanted the measure of a hitter you needed to know only three things about him: his batting average, his RBIs, and his home runs. Maybe you could throw stolen bases in there, but really those were the big three of traditional batter evaluation.
- Batting average told you how good a hitter your guy was. The old axiom was that a good hitter, a .300 hitter, still failed 70% of the time. If a guy hit .300 you knew he could swing the bat and was definitely the kind of player you'd want in the first half of the lineup. Heck, fill your squad with eight or nine .300 hitters and you'd be set.
- RBI told you pretty clearly whether or not your guy was a run producer. It didn't so much matter what his batting average was. If he knocked in 100 runs a season that guy was a middle-of-the-order threat and an All-Star. He knew how to drive in runs; that was his job.
- Home runs told you if your hitter was a slugger or not. He probably pulled the ball a lot and hit 25+ dingers a season. He likely also had a lot of RBIs, but some guys could pile up RBIs with a bunch of doubles, sacrifice flies, and clutch singles. Those were fine, but our heroes hit home runs.
Today we're going to take a look at batting average, what it tells us, what it doesn't, and what better alternatives we have to address its shortcomings.
WHAT'S WRONG WITH BATTING AVERAGE?
Batting average (AVG or BA) tells you one very specific thing: how often a player got a hit. The formula is strikingly simple: just divide hits by at-bats. Almost anyone can understand it which is probably what has made it so appealing. However, it has two crippling problems that almost entirely drain it of its usefulness.
-
Batting average completely ignores walks, which are often just as good as singles. On balance are they quite as good as singles? No, not really, but they're about two-thirds as valuable as singles, and batting average doesn't count them at all. Nor does it count hit batsmen, which are roughly as valuable as walks.
Let's take a very simple example.
Batter A comes to the plate 100 times, collecting 30 singles and grounding out 70 times.
Batter B comes to the plate 100 times, collecting 15 singles and 50 walks, grounding out 35 times.Batter A has a .300 batting average and reached base safely 30% of the time.
Batter B has a .300 batting average and reached base safely 65% of the time.Batter B is clearly the better hitter, and by a significant margin. Batting average wouldn't tell us that, though. Instead, batting average would see these as two identical hitters, which is a massive lapse of reason.
Here's a real example from 2010.
The Rangers' Vladimir Guerrero hit .300 (178 hits in 593 at-bats). He walked 19 times and was struck by a pitch four times.
The Indians' Shin-Soo Choo hit .300 (165 hits in 550 at-bats). He walked 83 times and was struck by a pitch 11 times.The currency of a baseball game is the out. A team has a finite number of outs per game (usually 27), and the team that maximizes its opportunities while conserving its outs is most often the winner. In plain terms, while the fundamental objective of an offense is to score runs, the surest way to score runs is to avoid making outs. In roughly the same number of plate appearances, Choo reached base (i.e. avoided making an out) 259 times while Guerrero did so just 197 times. That's a huge difference in value that batting average sweeps under the rug.
- Batting average treats all hits as if they were equal. Singles are not as good as doubles, nor are doubles as good as triples, nor are triples as good as home runs. Singles, then, are clearly not as good as triples, and they're laughably inferior to home runs. Batting average is blind to these distinctions that even a tee-ball player would instantly recognize. If batting average is the first thing you reach for when determining a player's offensive production, this fact should trouble you.
Here's another simple example.
Batter A comes to the plate 100 times, collecting 30 singles and grounding out 70 times.
Batter B comes to the plate 100 times, mashing 30 home runs and grounding out 70 times.Again, Batter A looks like a scrawny hobo compared to the muscly-armed Batter B, but batting average tells you they're exactly the same player.
Here's a real example from 2010.
The Mariners' Chone Figgins hit .259. He had 132 singles, 21 doubles, two triples, and one home run.
The Blue Jays' Jose Bautista hit .260. He had 56 singles, 35 doubles, three triples, and 54 home runs.These two hitters came to the plate approximately as often as one another and had virtually the same batting average. The vast majority of Figgins's hits were singles; just 24 of his hits went for extra bases. Meanwhile, Bautista, the AL home run champ, collected 92 extra-base hits, four times as many as Figgins. Bautista hit for prodigious power, while Figgins slapped the ball around, yet batting average tells you nothing about that difference in potency.
WHAT'S BETTER?
At the very least, batting average should be relegated to the "marginally useful" drawer in your baseball research toolbox. Without even getting into more advanced baseball metrics, replacing batting average in your go-to repertoire with two other stats -- on-base percentage (OBP) and slugging percentage (SLG) -- would leave you far better equipped to evaluate and compare individual hitters. Each stat addresses one of the two areas that batting average neglects.
On-base percentage tells you how often a hitter avoids making an out, or, in more obvious terms, how often he reaches base safely. Its inclusion of walks and hit-by-pitches as vital components of offensive value give you a far better accounting of a player's contributions than batting average ever could.
Going back to our earlier example, while Vladimir Guerrero and Shin-Soo Choo each hit .300 this past season, on-base percentage reveals the stark difference between the two hitters. Equals in batting average, Choo's .401 OBP was considerably better than Guerrero's .345 mark. On-base percentage tells us quite distinctly what batting average could not -- that Choo was a much better hitter than Guerrero despite identical batting averages.
Here's the math, for those who are interested:
![]() |
Slugging percentage addresses problem #2 above, in that it does a far better job than batting average at distinguishing between different types of hits. Home runs count four times as much as singles and twice as much as doubles, doubles are twice as valuable as singles, and so on.
In our earlier real-world example, Chone Figgins and Jose Bautista each hit around .260 in 2010. However, while Figgins slugged just .306, Bautista slugged .617! These two players could hardly be less similar, but their batting averages suggest they're essentially the same player.
Again, the math if you're interested:
![]() |
THIS ARTICLE IS LONG. CAN YOU SUM IT UP FOR ME?
Batting average neglects important offensive attributes like walks and hitting for power. On-base and slugging percentages account for these shortcomings and help provide a much more comprehensive profile of a batter's contributions at the plate.
Even if you don't read another article in this series, simply replacing batting average with on-base percentage and slugging percentage will dramatically improve your baseball IQ. Equipped with these useful but alarmingly simple tools, you'll be prepared the next time someone tells you that so-and-so is a good hitter because he hit .300. "No," you'll retort. "He barely walked and he hit nothing but singles. That guy isn't really a good hitter at all."
SO, IS BATTING AVERAGE COMPLETELY USELESS?
Not at all, it's just not nearly as useful as it should be given its ubiquitous use in traditional baseball writing and reporting. Batting average generally does a good job of describing how well a player hits the ball and reaches base safely as a result. That's certainly a valuable skill, and while it is included in OBP and SLG (as a portion of a larger, more comprehensive measurement of batting ability), if you need to know specifically how often a player reached base via a hit, batting average will do exactly that (and little else). In a later column we will discuss how batting average (and to a lesser extent on-base and slugging percentages) is a function of not just a player's hitting ability, but also of things that he has very little control over: defense and luck.
WHAT ELSE?
If two stats are too many for you, you can add on-base and slugging percentages together to arrive at a single stat abbreviated OPS (On-base Plus Slugging). Doing so will give you one easy number that can be used for quick-and-dirty comparisons. OPS is not without its problems. For one, it assumes on-base and slugging percentages are of equal value when they're not. OBP is almost twice as valuable as slugging, so on-base percentage is shortchanged a bit by OPS. Nevertheless, adding OBP and SLG together, clumsy as it may be, gives you a pretty useful single stat to work with.
If you want to take AVG, OBP, and SLG analysis even further, check out run correlations, which illustrate the impact of individual metrics on overall run scoring. Fascinating stuff, but perhaps more math than some are comfortable with.
We'll cover ways of adjusting OPS -- and more advanced alternatives to it -- in future columns.
QUESTIONS & ANSWERS
I know that .300 is a good batting average. What's a good on-base percentage or slugging percentage?
Since different seasons (and different eras) have varying degrees of offensive dominance, your first step in evaluating a player's on-base or slugging percentage is to compare it to the rest of the league. You can find average league values on Baseball-Reference.com's league pages. For instance, in 2010 the average NL on-base percentage was .324 and the average slugging was .399. Generally, for recent seasons a .350 on-base percentage is pretty good and a .400 on-base percentage is outstanding; a .450 slugging percentage is pretty good and a .550 slugging percentage is outstanding.
Have a question about this article? Send us an e-mail and we might answer it here.
35 comments
|
7 recs |
Do you like this story?
Comments
Excellent job Eric
Please continue this saber lesson on the other basic stats we need to understand with respect to pitching and WAR, etc.
Thanks.
"Never throw a slider to The Glider."
- Ed Charles, No. 5
"Who has more fun than people?"
- Ralph Kiner
tl;dr
I don’t think anyone who doesn’t understand the flaws of batting average would ever want to read so much.
But it was well written! I read it!
Very nice. I'm looking forward to more of the advanced things.
"Blinding ignorance does mislead us. O! Wretched mortals, open your eyes!"
Gil Hodges IS a Hall of Famer.
AA Gamethread Embiggening Record Holder- 453 posts (10/03/10)
by Brooklyn Dodgers Mets Fan on Dec 9, 2010 1:03 PM EST reply actions
Math?
Batter B comes to the plate 100 times, collecting 15 singles and 35 walks, grounding out 50 times.
15/65 is not a .300 hitter (That’d be 15/50).
Cool.
Though I still think the popularity of batting average owes more to its giant head start than to its simplicity.
Thank you very much for this
"WHO WOULD LEAD?! THE CLOWN?!"
by I'mGivingYouARaise on Dec 9, 2010 2:50 PM EST reply actions
Thanks a lot for this
It’s real value, IMHO, will be giving the link to newer folks who are trying to understand this site and why we are so weird about the old bromides. Rather than explaining all of this in a comment, we can just link to this. But you make one big mistake:
Singles are not as good as doubles, nor are doubles as good as triples, nor are triples as good as home runs. Singles, then, are clearly not as good as triples, and they’re laughably inferior to home runs.
Yeah, but no single has ever killed a rally
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
I think that GS single killed a rally.
In lobby for: Jaime Cevallos, Zack Lutz, orange unis
The Unwritten Rules of AA
In that game
I needed a rally. I never realized how awful the 7th inning beer cutoff was until deep extras.
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
MookieTheCat
I understand sabermetrics perfectly, but like you said, thanks to whoever posted this. Good for those who are just learning about it.
Seriously?
Here is a link to your comments at other blogs. You do not put other user’s names in the subject line at Lone Star Ball. Why do you do it here then, despite a myriad requests to stop?
by James Kannengieser on Dec 9, 2010 5:16 PM EST up reply actions 1 recs
This is phenomenal
Great work. Looking forward to more posts.
"That guy mvhsbball is really an insufferable schmuck." - FuquaManuel
by Scott Coleman on Dec 9, 2010 3:58 PM EST via mobile reply actions
One small argument
I don’t think that BA is completely worthless. While there are better (and more complicated) stats to show a batter’s worth, you can still see that a guy batting .320 is (usually) better than a guy batting .196. I use this example because everyone seems to love Carlos Pena, and are going nuts over the fact that the cubs signed him. I don’t care how many times you take a walk, you hit sub-Mendoza, you are not a good player.
"Reynolds batted under .200 last season and still managed a league-average wRC+."
http://www.fangraphs.com/blogs/index.php/offseason-notes-rule-5-is-alive/
In lobby for: Jaime Cevallos, Zack Lutz, orange unis
The Unwritten Rules of AA
Two points.
1) BA isn’t worthless, and I even went to (what I thought to be) reasonable lengths to say so.
2) A sub-.200 hitter might still be a decent player even though he doesn’t hit for average. He’s certainly not a complete hitter, but he might not be awful if he can walk a lot and hit for power when he does get hits.
There certainly aren’t very many players capable of something like this (Adam Dunn, Jason Giambi at one point, maybe Carlos Pena), and in general if you hit under .200 you probably aren’t going to be good at those other things — drawing walks, hitting for power — either.
Sorry
1) I wasn’t attacking your article, in fact I thought that it was outstanding
I was trying to point out to some, who very commonly like to throw away traditional stats, saying that they are outdated.
Many will cite a low BABIP and say that he really was a better player than traditional stats say, however he still hit what he hit.
"he still hit what he hit"
Sure, that’s an account of what happened, but if a guy hits .300 in a season it doesn’t mean his true talent is a .300 hitter because of all of the things that go into hitting the ball and reaching base safely. That — trying to figure out what he’s likely to do next year and beyond — is where further scrutiny of base hits (and everything else) is most valuable. After all, what’s more important to a GM: what a player did, or what he is likely to do?
Again
I agree with you completely.
However many on this board (NOT YOU) will say that his (insert advanced stat here) was better than his actual number and say player X is better than his stats say. That is no consolation to the team employing him. However while that is great for POSSIBLY predicting future performance it is about as acurate as using traditional statistics.
The only caveat being if there is one outlier year (which you would see using both types).
This is a good intro into sabermetrics for people like me.
I Rec this!
Proud supporter of a New York baseball team and a Boston football team. Yeah, deal with it!
"We don’t listen to the hype. I don’t think we ever have. We really take after our coach and he says ‘When you win, say little. When you lose, say less.'"--Tom Brady
OBPIP
“If two stats are too many for you, you can add on-base and slugging percentages together to arrive at a single stat abbreviated OPS (On-base Plus Slugging). Doing so will give you one easy number that can be used for quick-and-dirty comparisons. OPS is not without its problems. For one, it assumes on-base and slugging percentages are of equal value when they’re not. OBP is almost twice as valuable as slugging, so on-base percentage is shortchanged a bit by OPS.”
A stat I prefer (primarily because I invented it, so I have a vested interest) is OBPIP — On Base Plus Isolated Power. You can calculate it easily by either adding on base percentage to isolated power (duh) or by subtracting batting average from OPS.
My position is that it’s a more useful stat than BA, OBP, SP or OPS, while still being much easier to calculate than Win Shares, wOBA, etc.
I can tell you absolutely that it works like a charm in games like Statis-Pro, Strat-O-Matic, etc (where I also break it down into OBPIP vs. left and OBPIP vs. right) in predicting how players you draft will do in a particular PAST season of the game, or who you should bat against a specific starting pitcher.
A stat maven on the Out of the Park Baseball forum tells me that in theory it still has the same problem as OPS (discounting the value of OBP), but in practice it seems to be more accurate, at least for my (gaming) purposes. If anyone who knows a lot more about statistics than I do, which would be most of you, would like to take a shot at it and let us know how it fares in the ‘real’ world, I’d greatly appreciate it.
This is a good stat, at least for another perspective on a quick and dirty stat that could have some meaning
I’m not quite sure that it’s a level 1 though, as isolating things is a little bit abstract. Still, thanks for this, it seems like it has some promise (like all stats, with some caveats attached).
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
by MookieTheCat on Dec 10, 2010 12:11 AM EST up reply actions
I use the same stat
but I call it OPI (OBP + ISO)
Must be one of those “great minds think alike” things.
"The Mets are gonna be amazing!" - Casey Stengel
Muchas gracias, Mookie and Russ
Thanks for the positive reinforcement. And Russ, if you ever need to establish priority of authorship, I started using OBPIP in 1999.
I use the same stat for hitters and pitchers (for pitchers it’s OBPIP Allowed or OBPIPA). Prior to 1999 I used (for pitchers only) a variant of Pitchers’ Ratio (RAT) in which I added three times homers allowed to the base runners allowed before dividing by 9. This helped correct for pitchers who routinely give up the long ball, but was never satisfactory. The only real point in its favor was that it could be easily calculated using the resources I then had available. I called it ARAT (Adjusted Ratio), but it’s long dead.
I briefly toyed with an SPS rating (slugging plus steals), in which you add stolen bases to total bases before dividing by at bats. Stolen bases was quickly amended to net stolen bases (steals – caught stealing), but within a year I gave up on it.
The only other stat I routinely calculate is RPA, or runs produced average, which is: (runs scored + runs batted in – home runs)/plate appearances. (Maybe should be RPP?) It’s more useful than runs scored or runs batted in, since it’s less dependent upon (though not independent of) position in batting order and who you have hitting around you. Obviously it will still be influenced if you’re on a very high or low scoring team, but again it’s easy to calculate, which is my prime concern.
Somebody really out to invent a machine that would compute this stuff.
I think the machine has already been invented
It’s likely sitting right in front of you as you read this :)
It’s really a matter of programming and testing, which I am very bad at so I leave it to those who know what they’re doing.
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
by MookieTheCat on Dec 11, 2010 3:24 PM EST up reply actions
"the fundamental objective of an offense is to avoid making outs"
I imagine that some people reading this might say “no, the fundamental objective of an offense is to score runs”. Hence RBI > BB! etc. I understand what you’re trying to say but I’m not sure that’s the best way to put it.
"I think Murdertron makes a good point though."
No, that's the right way to put it IMO.
It’s something of a derivative concept and might not be intuitively pleasing, but there’s enough math behind it to shore it up. If someone has an issue with the phrasing, then they should ask the question “why do you believe this to be the case?” It’s really not that hard a concept to get once you have it explained.
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
by MookieTheCat on Dec 11, 2010 5:08 PM EST up reply actions
I disagree
The objective of an offense, in order to win baseball games, is to score runs. Avoiding making outs is the best way to do it. Obviously, it’s necessary to avoid outs to score runs. But you don’t pursue the goal of avoiding outs in a vacuum. It’s done in pursuit of the ultimate objective of an offense, which is to score.
Obviously, I’m not disagreeing with the underlying meaning. I’m just thinking about how to maximize the accessibility of this to the masses.
"I think Murdertron makes a good point though."
Ah ok
I get your point, but to me you might as well get to the heart of the matter by the most direct route without restating that goal as the means to that goal. Yes, the point is to score runs. I guess I view that point as so apparent that it means very little in the scheme of how to run a baseball team. There are many ways to score runs, and the difference between these must be expressed as exactly that: what’s the best means of all these to accomplish that goal.
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
by MookieTheCat on Dec 12, 2010 1:09 AM EST up reply actions
Yeah, I hear you
Was just trying to anticipate knee-jerk responses. Some people will look for any reason to disagree, even with things that are plainly logical. Though maybe those people are not really the audience for this anyway…
"I think Murdertron makes a good point though."
Can't disagree with this
I guess if Eric’s goal was to speak to those who haven’t been indoctrinated yet, then you have a good point.
It's a triumph of number crunching over the human spirit...aaaaaand, it’s about time. -- Play-by-Play Announcer, The Simpsons.
by MookieTheCat on Dec 12, 2010 11:38 PM EST up reply actions
































