Wednesday, January 18, 2012

Beyond a Boxscore



One of my favourite blogs is an American Baseball blog that attempts to explain the information that can be found beyond a box score.  To all those non-baseball fans, a box score generally lists what a player did throughout the game. Typically includes a different box for pitchers and batters.[1]  I’ll try to avoid boring you with history, but it adds an important factor to this conversation. The boxscore was invented to let people know how their teams were doing while they were on the road. So when the Boston Braves played the Philadelphia Athelthics , a reporter would send home the information to the Boston Globe, letting them the information.  So this history loops back in, because for a while, different reporters would [naturally] report different statistics. They reported their batting averages, how many hits a batter got, if he walked or so on.  What this did was, well create a market for statistics. As the process continued, newspapers began to report the same statistics; a unified thing known as a boxscore. If a player was substituted, he would be added under the position that he took over for.
Boxscores changed the way we looked at the game; it gave those of us not at the game, a way to look at how well the player was doing. It allowed us to quantify the portion the player was contributing to a win against the opponent.  So the blog, as mentioned above, tried to think beyond the basic statistics. Bill James [for all you moneyballers] did the same thing. He decided to take a look at what these statistics meant, and how well they measured the players contribution to a win. Well long behold, he (as did others) found out that those statistics did an average job, but not the best job at describing the outcome of the game.
Here is the problem with soccer (football for those abroad). Soccer doesn’t have a boxscore, and people only care about few thigns. They want to know how well their team is doing in the league [as presented by a table], who has the most amount of goals, and how many clean sheets a team/keeper  has.  ESPN [by now you know my thoughts on them] decided to create their own version of a boxscore, but with a soccer twist. They gave the following statistics, saves,goals, shots (on goal), Time of Possession, yellow cards, red cards, free kicks, offsides and  fouls. Here is the kicker, not one of those statistics is a useful predictor of the outcome. That is, if you model points (3 for a win 1 for a draw and zero for a loss), you find that not one of them predicts the points a team receives. I’ve started to take note of this, as I would watch New Castle United play its games. New Castle United, currently sixth in the table, has been outshot all but three of its games. [2] What we need is something that will reflect a basic understanding (leave the advanced stuff for nerds like us). How about a player by player boxscore, something that describes turnovers forced, shots allowed, break aways, defenders drawn, TOP (per player), minutes played (per player) tackles won, tackles lost and headers.  Which team has allowed the most amount of headers this season?  All these add to important was to play the game. Liverpool (bias be aware), could use this information for when to play Andrew Carrol over Luis Suarez. If a team allows a lot of breakways, move your faster players up front. If the team plays a conservative D, bring in Carroll and work on set pieces. You would be amazed that these statistics are out there, but few people are recording them.



[1] Baseball’s sister (or brother we are gender neutral here) sport cricket, also has something similar.

[2] The three games they weren’t outshot, they lost two of them

No comments: