Saturday, February 16, 2008

Equivalent Average Unmasked

Runs Batted In was created in the late 1800s. A few teams created the statistic to show how good they were. In fact, some sportswriters of the day realized it's inherent bias towards hitters in the middle of the order and disregarded it. The little guys with pointy hats and horse-drawn carriages knew what they were talking about. RBI would not surface as widely accepted statistic until after the dead ball era was over. Eventually it became THE way to grade an offensive players "production." We all know why it's a bad statistic.

Batting average has its flaws as well. If you go out on the street and ask someone what batting average is, they will respond with something sounding like this: How often a player gets a hit. Wrong. Batting average does not tell us how often a player gets a hit. It tells how often a player gets hit when while deciding to throw out some times he goes up to the plate for no reason other than we feel like it. It also fails to tell us to what type of a hit the player got. A single is not worth the same as a double. This is why we use on base average and slugging average. Then again is slugging average really any better? Well yes and no. It tells you the type of hit, but it still has the first problem of batting average. We're partitioning the times the player comes up to bat and excluding one for inherently biased reasons. Is on base average any better? It fixes the first problem, but fails to solve the second problem of batting average. It acknowledges all plate appearances, but it makes a walk and a home run equal.

We can sum on base average and slugging average for OPS, but then again who says that the relationship for that is better. Instead we can try to develop a system that solves both problems. Enter equivalent average. This post is going to describe anything and everything about EqA so you can come up with the exact EqA's BaseballProspectus comes up with. One of the criticisms for EqA is that BP develops it in a black box. No one knows how they arrive at it. They do spell out the method here. You can do all the things they do. You'll find out that the league leaders in EqA are generally around .300. BP's EqA leaders are generally around .350 or so. You can play around with the stuff in that article for days and never come up with anything remotely close to their EqA. Sorry. As TangoTiger put it: Opening up the black box will not cause a single dent on [BaseballProspect's] bottom line.

What I am going to tell you is everything and why Baseball Prospectus is doing what they do. It's rather simple. In fact it's essentially what people say mathematicians criticize sabermetricians for: Units. People who dislike sabermetrics generally say real mathematicians would hate their "work" because they shed units completely. This really isn't true. Everything in EqA is measured in relatively precise units that in the end cancel out leaving an answer in runs.

Now let's go on and attack the two major problems with oba, slg, and avg. We need to create some sort of rate statistic that includes getting on base and hitting for extra bases as well as stealing a base efficiently. The first thing that is calculated answers all of these problems in what they feel is the best way. We'll call this Raw:

Raw = (SF + SH + 1.5*BB + 1.5*HBP + 1.5*SB + 2*1B + 3*2B + 4*3B + 5*HR)/(SF+SH+BB+HBP+SB+CS+AB)

What is Raw measuring? It's essentially scaled bases per opportunities of moving up a base. Intuitively the idea that walks are worth more than sacs, but not quite as much as singles is good. Raw EqA addresses our two problems effectively, only adding in SB and CS, which can be described as a third problem with each oba, slg and avg. So in the end what does raw measure? Scaled Bases per PA+CS. It gives a numeric value of production. Now we can use Raw and convert it to runs. For a team we do this with this equation:

EqR = (Raw/LgRaw )^2* PA * LgR/LgPA

So what is EqR doing? It's measuring the relative production of the team divided by what an average team does squaring and multiplying it by PA and the runs per PA an average team scores. The squared term is based on the idea that the relationship between Raw/LgRaw and runs is not linear. This makes sense because when you add good hitters your other good hitters get more guys on base and each of their hits cause more runs. Now since we're looking at EqR on a team level and we want it on the player level let's look at that.

First, an assumption: The player in question is being analyzed by an average team in his home park. This assumption is needed to derive the equation most people see for EqR. Now, to look at the change in EqR for some change in Raw, take the derivative of EqR with respect to Raw. We get this equation:

dEqR = 2*Raw/LgRaw*PA*LgR/LgPA

Now we're adding some guy to this team, but a team only has nine slots it can play. So what are we doing? We're replacing an average player on this team and adding this players production. So basically we have our runs minus an average player's runs in the same PA. We're NOT measuring runs over an average player. We're measuring all of the runs created by a player. So our equation becomes:

dEqR = 2*Raw/LgRaw*PA*LgR/LgPA - PA*LgR/LgPA

Now we can factor out PA*LgR/LgPA resulting in the equation for EqR for a player you'll see at BP, only they drop the dEqR and call it EqR.

EqR = (2*Raw/LgRaw - 1) * PA* LgR/LgPA

Generally people look at that and say what the heck are they doing? Now you know why you're subtracting 1 and multiplying the ratio by two. Here is where we can multiply this by our park factor to normalize for parks, if desired. Now we want to scale EqR and to some rate statistic. What should we use? Outs of course. Why? Outs are the stopclock in baseball. We have 9 sets of 3 outs. We can bat as long as we want as long as we don't make those outs. So we decide to make our rate be something close to runs per out used. So then we get this equation, that you can find at BP, albeit not in the article I linked to regarding how to compute EqA (lol).

EqA = (EqR/Out/5)^.4

First let's analyze the "units". We have runs divided by outs, which is want we wanted. Pay no attention to the .4 right now. The thing that should cross your mind is what crosses everyone's mind: Why the hell do they divide by five? WHY? This is where everyone gets lost. In fact if you follow the calculations done in this thread and divide by five you will won't get the EqA BP computes. This is the black box, so to speak. Remember, average EqA is supposed to be .260. If you plug all this in you'll get the league average to be about .266 or so, depending on the season. IT DOESN'T WORK. 5 is more or less a constant that forces the average to be equal to .260. How do we do that?

Well League average is going to be (LgR/LgOut/C)^.4. Since we want to "force" EqA to be equal to .260 for an average player, simply set that equation equal to .260 and solve for C. So C =(LgR/LgOut)/.260^2.5. This number tends to be around 5, ranging anywhere from 4.6 (Japan Central League) to about 5.6 (2007 AL). The 2007 National League was about 5.2.

And there, with the above information you can get the exact answers that BP gets for EqA and puts on their player cards. In fact, If you want to you can find out the park factors to extra digits. I've gotten to the point where the average "error" on the EqA I come up with is .000226 compared to their's. Remember that their EqA is the ring of integers divided by 1000. In other words: It's rounded after three digits. Theoretically, the average error in rounding then will be .00025, which is actually greater than the error I come up with.

So there you have it. EqA perfectly. Now go look up EqR on BP and you'll see this:

EqR = 5*Out*EqA^2.5

Oh and 1/2.5=.4, so solving that equation for EqA gives us the EqA=(EqR/Out/5)^.4. Look familiar? Oh, but now we're all smart enough to realize that the five isn't five.

And yes, in case you noticed LgRuns gets canceled out. If you plug in everything you get:

EqA = ((2*Raw/LgRaw - 1) * PA* LgR/LgPA) * Out * LgOut/LgR*.26^2.5)^.4
EqA = ((2*Raw/LgRaw - 1) * PA * Out * LgOut/LgPA * .26^2.5)^.4

When you ever want to scale EqA to some league average production based on runs, it's going to cancel out....which of course makes sense.

236 comments:

«Oldest   ‹Older   201 – 236 of 236
モバゲー said...

モバゲーでは友達から恋愛まで、様々な出 会いを探せる無料のコミュニティサイトです。常時サポートスタッフが掲示板をチェック、サクラや業者を排除しておりますので安心してご利用いただけます

スタービーチ said...

日本最大級の出会いコミュニティ「スタービーチ」で探しませんか。素敵な出会いを経験して理想の人と楽しい思い出を作りましょう

スタービーチ said...

スタービーチなら好みの女性がきっと見つかる!会員数ナンバーワンのスタビでご近所さんを探そう

スタービーチ said...

日本最大級の出会いコミュニティ「スタービーチ」で探しませんか。素敵な出会いを経験して理想の人と楽しい思い出を作りましょう

モバゲー said...

出会い専門のモバゲーSNSが誕生!メル友、恋人、セフレetc…貴方の理想に合った関係になれちゃいます。素敵な人と過ごしたいならココで見つけてみませんか

スタービーチ said...

最近スタービーチが女の子のコミュニティーサイトで話題中みたい!!夏休み目前ってのもあり登録比率に女性の数が半端ね~、やっぱり夏に一人は辛すぎや~とか思ってたら、暇な娘多すぎ・彼女簡単に出来ちゃった

スタービーチ said...

出 会いを探しているのなら会員数ナンバーワンのスタービーチ!開放的な夏休みはここで新しい出 会いを見つけよう!

グリー said...

グリー発信!!出会い専門SNS誕生、今話題のgreeから出会いをGETしよう。greeにしかできない事が盛りだくさん!!気軽にメールするだけで出会えるサイト・・・貴方の理想の異性と出会いませんか

gree said...

話題のgreeで新しい出会いを見つけよう!夏休みに入ってヒマしてる女の子が急増中!無料期間中にご近所の女の子をGETしちゃおう

セフレ said...

セフレを探しているならここ!割りきった体の関係を探している女性と出会いのチャンス!セックス目的なので会ってすぐにヤれちゃいます

モバゲー said...

モバゲー専門の出会い掲示板が誕生!出会いを求めている方にはオススメのサイトです。幸せな時間を過ごしたい方、淋しい思いをしたくない方はぜひご覧ください

逆援助 said...

逆援助でお金持ちになる事が可能なこの時代。セックスのテクニックさえあれば簡単に大金をゲットする事ができます

スタビ said...

スタビでできる新しい恋・・・素敵な出合いをするならスタビから始めてみませんか?女性に面識のない方でもサポート機能があるから安心してご利用できます。是非一度利用してみて下さい

SM度チェッカー said...

気になる人の性癖を知ることは、その人に近づく良いキッカケや話題作りに成ります。そんな時にSM度チェッカーを使ってみて下さい。話が合い一気に仲良くなり、どのまま・・・な関係にその日の内に為れるよ。なんと言っても今は夏休みですからね

モバゲー said...

モバゲー出会い掲示板は色々な種類の出会いを扱っております。登録無料で新しい恋の始まる事は確実です

ツイッター said...

今話題沸騰中!?新感覚出会いをツイッターで体験しませんか?リアルタイムで相手が何をしてるか分かるから、理想の人かどうかが簡単に分かる。貴方の理想の人をGETしませんか

mコミュ said...

素敵な出 会 いで愛を育む♪理想の人と楽しめる関係を築きませんか?mコミュでしか味わえない幸せを掴みましょう

スタービーチ said...

スタービーチで会える!?理想の異性をGETしよう☆素敵な出会いばかりだから求めている関係も作りやすい!!貴方が求めているのはどういった恋ですか?

名言チェッカー said...

他の人が言ってる名言や格言って良い事言ってるな~とか思ってる方、名言チェッカーで今日から自分に相応しい言葉を見つけませんか!!これでどんな人にも一目置かれる存在に為れますよ

モバゲー said...

モバゲーで出会いをすれば楽しい事は間違いありません。暑いからこそ出会いを楽しむべきなのです。登録無料で簡単に利用可能!

モバゲータウン said...

モバゲータウンでは恋愛から出合いまでのキッカケをつかめる無料のコミュニティサイトです。常時サポートスタッフが掲示板をチェック、サクラや業者を排除しておりますので安心してご利用いただけます

スタビ said...

スタビが今一番アツイのはご存じでしょうか?夏休みで出会いを探している娘とすぐに会えちゃうんです。登録無料でここまで出会える所は他には存在しません。今登録して良いパートナーに巡り合おう

スタービーチ said...

出会いのシーズン、夏到来!スタービーチでご近所さんと知り合っちゃおう!ひと夏の体験も女の子は求めている

モバゲー said...

モバゲーでついに出会いができる!?楽しめる出会い、求めていた出会いはココから始まる。素敵な出会いでまずは関係づくりwしていきましょう

gree said...

greeで素敵な時間を過ごしたい・・・そんな願望を叶えてくれるサイト誕生!!今までにないドキドキ感と興奮をこのグリーで楽しみましょう

スタビ said...

スタビで出会いができる!!いつでもどこでも出会いが可能なスタービーチで最高の出会いをしてみませんか

mコミュ said...

簡単な出逢いはココでできる☆素敵な出逢いをmコミュで体験していきませんか?楽しめる出逢いを経験するならここしかない!!まずはお試しを

ツイッター said...

新時代突入!ツイッターで始まる出逢い…ここでしかできない出逢いが新しい風を巻き起こす!!素敵な巡りあわせを体験していこう!

SMチェッカー said...

あなたの秘められたSM度がわかるSMチェッカー!簡単な質問に答えるだけで自分の隠された部分が分かります!みんなで試してみよう

モバゲー said...

今やモバゲーは押しも押されもせぬ人気SNS!当然出 会いを求めてる人も多い!そこで男女が出 逢えるコミュニティーが誕生!ここなら友達、恋人が簡単にできちゃいますよ

モバゲー said...

もう夏休みも終わりに近づき、この夏最後の思い出を作りたいと焦ってる方が、モバゲーのコミュニティーに書かれてましたよ!!折角なんで夏の思い出作りに協力して自分も美味しい思いをしてみるのはどうですか?大手スポンサーサイトが付いてるので全部タダですよ

グリー said...

最近はどこのSNSサイトも規制ばっかりで、ちょっと出 合いに関して書き込みするとアク禁食らうけど、夏休み終盤に差し掛かり色々なサイトを調べた結果、グリーだけはどうも規制が緩んでるみたいです。今がチャンスの時期ですよ

ツイッター said...

ツイッターで出 合 いを求めるのです。気の合う異性と交流して楽しいひと時をお過ごしください。登録無料で使えるので気軽さは100点満点!

モテる度チェッカー said...

夏休みももう終わりに近づきこのまま一人は寂しいのちがう?そこでモテる度チェッカーを使い自分がなぜモテないか診断してもらいましょう。10~20代の女性アドバイザーが多数在籍してるので、アドバイスを参考に夏終盤を楽しもう

スタービーチ said...

友達探しや恋人探しならやっぱり定番のスタービーチ!ヒマな女の子たちとすぐに会えちゃう、スタビで友達を増やしちゃおう

グリー said...

今年のクリスマスも後少しですね。グリー内でもクリスマスに備えて異性と交流を持つコミュニティが活発で、自分も今年のクリスマスにお陰で間に合いました!!みなさんもイブを一人で過ごさなくても良いように、グリーで異性をGETしよう

«Oldest ‹Older   201 – 236 of 236   Newer› Newest»