Monday, December 26, 2011

Sports: The final frontier for nerds to takeover

The word nerd has negative connotations and is applied derisively to describe people who focus on academics then on oh so important social matters like partying, drinking and everything that is wasteful. The Wikipedia search for Nerd  suggests "...the word is derived from "knurd" ("drunk" spelled backwards), which was used to describe people who studied rather than partied.". For better definition of nerd than what is on Wikipedia, read this on NPR. While nerd calling exists widely in the west, it is not so outwardly manifested in the eastern culture where academic success is given due respect. But nerd calling does exist in the east. I know because I have been called one in both the worlds.

During high school and beyond, I was a kid who was picked the last when spitting into two teams for a game of cricket. During my Bachelors days, I had a friend who had an issue with my academic pursuits(he almost always lost). He would suggestively say that more than academic skills, practical skills were necessary to survive in the world. Few years later, I was in the USA and my supervisor at work thought it(I) was weird that I could add two random three digit numbers in few seconds without using a calculator. I was surprised at her behavior because I wasn't even the best among my friends back home when it came to skills related to basic arithmetic without the use of calculator.

With introduction and personal anecdotes out of the way, let me get to the main part. I recently read a book called "Money Ball" by Michael Lewis. I liked the authors other works(books and magazine articles on Economy of US and the world) and bought this book. It was a good read as most of his works are. Coincidentally, the movie starring Brad Pitt by the same name based on the book was released at about the time I was reading the book. Moneyball, published in 2003, is The story of Oakland A's general manager Billy Beane's successful attempt to put together a baseball club on a budget by employing computer-generated analysis to draft his players.

In this book, he describes how the traditional scouts are enamored by the looks of the athletes and judge them by what they felt were good players. The traditional scouts are very bad at picking good players. Billy Beane used computers, spreadsheets and statistics to identify undervalued players, sign them on to play for his team and was very successful in winning baseball games.

The nerds have done well later in life in almost every field, like science, engineering, technology, medicine. It is in entertainment and sports they failed to make inroads for a long time. Other than being the lead men/women, the entertainment industry has slowly but surely been taken over by professional writers, graphics artists, production assistants, technicians etc. Now shows are made with leads portraying nerds. Some sitcoms that come to my mind are '30 Rock'(Tina Fey plays Liz Lemon, a head writer), 'How I met your mother'(the lead Ted Mosby by Josh Radnor is an architect and professor), 'The Big Bang Theory'(my favorite and all 4 men are well stereotypical nerds).

The only field left was sports. Just like in entertainment, the nerds and geeks are making inroads into sports. Nerds have found a way to fit analytics((in baseball they are called sabermetrics) ,what they can do without a blink, to change sports. In baseball, other teams have tried to emulate what Billy Beane did with the Oakland As. Other sports are quickly following suite. NBA is one such sport.  The current champions, Dallas Mavericks, employed analytics that helped them win the championship. They have a Basketball Analytics team headed by Roland Beech and is credited for helping the team win the championship. Here is an ESPN article on Dallas Mavericks employing data geekery to win championship. Analytics have been part of NFL for some time now. Here is short history of it. The same is the case for NHL and a good site for hockey analytics is http://hockeyanalytics.com/.

Then there is the Sloan Sports Analytics Conference that is held every year in Boston by MIT Sloan School of Business. In 2011, into it's fifth year since the start in 2007, the conference has grown to 1500 attendees with 300 in the waiting list. In 2007 about 175 attended the conference. That's a leap of  850% in 5 years and is widely covered by the print and television media now. Guest speakers for 2011 include Micheal Lewis, Malcolm Gladwell, Bill Simmons, Mark Cuban etc. This is just to say how popular this conference has become. The research paper grand prize winner for 2011 was an Indian guy Arup Sen(He is a PhD candidate in Economics at Boston University). In this paper Arup Sen provides evidence of moral hazard in long term guaranteed contracts using data from NBA. Of course this is not the first time that ‘contract-year effect’ was suggested, excepting Arup Sen showed evidence, with statistical analysis build in, that players productivity peaks in the last year of the contract and drops in the first year of their long term contract. Another favorite subject of mine is the 'hot hands' and 'crunch-time' lore. There is lot of debate on hot hand in basketball. Analytics say that it does not exists while Athletes vow that it exists. Whatever side you are on, how do counter this - Illusion of Hot Hand makes athletes take bad shots. This is supported in another research paper that professional basketball players may overgeneralize from their very recent experience to their expected future performance. Similarly it is being shown that the widely held belief that some start are better in crunch time is not so true. Kobe Bryant will surely disagree with that analysis.

There are other aspects of sports, other than the actual players playing the game, that is important at the professional level. Some of these aspects are scouting, picking talent in the draft, trading talent, managing the team, coaching etc. I don't want to cover every aspect of sports here as it is beyond the scope of my knowledge. However, there is a general perception that an ex-athlete is better at picking talent and managing the team. Back in the days, whenever India lost a match or series in the game of cricket(which was most of the time unlike now), the reason discussed was that retired athletes are not given enough role in picking the talent and managing the team. On the face of it, this argument is simple. Who better to gauge how good a player is, than one who played the sport at a high level for a long time? However, there is evidence that it is not to be so. For instance, Michael Jordan who is considered to be the best ever to play basketball, has proven so far highly ineffective at picking talent and at making the team he owns, the Charlotte Bobcats, successful. The scouts are mostly players who did not make it big and most of their picks turn out to be duds. This is true in every sport. Here is a video from the Sloan conference that discusses the Draft and how most of the draft picks are busts. The problem is scouts and retired athletes are all looking at a prospect who has the looks. They are carried away by the physical aspects and the recent good showing of the player than the actual metrics of the players athletic abilities in the sport. Analytics can chafe that information out and with technology better analytics can be produced in sports where it is lacking now.

The other part of sports that geekery is making inroads is from the efficiency standpoint of referees. Previously referees could commit mistakes(still do) and get away(still do) with it. The professional leagues levy fines on the players for expressing dissent and reduce accountability on part of the referees for their mistakes. Analysts now are trying to quantify the efficiency of referees and thereby are bringing changes to how sports are regulated on the field. Referee Analytics is now a part of what is discussed at the Sloan Conference. Here is one article quantifying referee bias in Soccer. The introduction of replay in many sports is a welcome change to minimize errors(blunders in some cases) that have direct relation to the outcome of the game. In cricket, a constant problem with referees(called umpires) is with regards to run outs, LBW and nicks. The on field umpires were making and continue to make bad calls with those cases and the television instant replay has fixed the first problem of calling run-outs. All they have to do, is make a call to the third umpire when in doubt. Instant replay is widely used in almost every sport in US. Virtual Eye ball tracking technology and Hot Spot technology address the latter two issues with umpiring in cricket and are now used in the Umpire Decision Review System introduced in international cricket last year.There was a loud outcry and the Indian Cricket management did not accept it as the technologies were not perfect. I would rather take the side of technology than humans when it comes to these close calls in cricket.

With all this hyperbole on the geek adventures in sports, a note of caution is in order. Analytics can only supplement and not take over human decisions. When taken to extremes it has proven to cause trouble. Let us take an example of Long Term Capital Management, from the world of financial markets, that was too Too Big To Fail in 1998 and was bailed out by the Fed. LTCM was a hedge fund started by John Meriwether. It had on its board Myron Scholes and Robert C. Merton who won Nobel Prize in 1997 for their work on creating a model to discover the price of an option. Their work was built upon an earlier work by Louis Bachelier - He is credited with being the first person to model the stochastic process now called Brownian motion.

Scholes and Merton in collaboration with then deceased Black developed a mathematical model for the financial market called The Black–Scholes model or Black–Scholes-Merton  or simply Black Model. The mathematical model they derived was a partial differential equation, now called the Black–Scholes equation, which gives one the price of the option with time variable. The problem however was with the assumptions in the model. The Brownian movement assumes that the movements are small, random devoid of external influences or biases. Plotting these movements will generate the familiar bell curve also known as normal or Gaussian distribution. However the stock markets with its risks and frequent wild movements does not behave like a gas molecule. In fact other mathematical models have been proposed to represent the financial markets. One such model supported by Nassim Tabeb of the Black Swan fame is Power Law. LTCM could not in it's mathematical models anticipate the rare events and incurred heavy losses from the 1997 Asian Financial crisis and Russian default in 1998 and deadly combo with leverage(at the height of euphoria they were levered 25:1). It had to be bailed out by a consortium of banks and the fed.


There is great debate still being waged on the caused of the recent financial crisis dubbed the great recession. Of the many reasons some blame has been levied on the excessive reliance on flawed mathematical models VaR(derived by the risk management team at JPM)  and the Gaussian Copula Function by David Li. You can read more on how the Var and Copula Function contributed to the crisis here and here respectively. The weird thing is that both these models use the Gaussian Distribution as the core and  are widely popular of their simplicity. They spent much time developing a simple model for a complex system with frequent occurring big risks. Both models underestimated the potential risk that they were supposed to measure.

What does mathematical model that over simplifies a financial market got to do with sports and nerds getting into it. Nothing I suppose, but I wanted to caution the excessive reliance on technology and statistical models on measuring players and the sports in general. There is a definite need for advanced statistics that measures the value of players better than the current popular metrics used. For eg., average ppg used in NBA is skewed towards players that play lot more minutes and take lot more shots than say a defensive expert whose contributions to the game are largely overshadowed with this metric. One simple metric that measures other contributions in NBA is Efficiency. A little more complex number called PER(Player Efficiency Rating) was created by John Hollinger of ESPN. PER takes into contributions almost all aspects of basketball including the minutes and pace of the game played. It is a step in a right direction and is getting accepted as a measure of player's current value. However this number has problems - it rewards inefficient shooting and can produce a distorted picture of some defensive specialists who produce few blocks and steals. Please find here a good analysis of problems with PER. I personally think that both efficiency and PER are both skewed towards offensive abilities of a player and are highly correlated with the ppg of players. Clearly a better model is still wanted in NBA.

Thankfully it does not use Gaussian distribution to predict the future value of the player so team managements can load up on a single player. But wait, some teams have already done that without any mathematical model. Eg., Eddy Curry with the Knicks till 2010, Gilbert Arenas with the Magic and Rashard Lewis with the Wizards are just examples of bad contracts. Thankfully there is the amnesty clause that came of the new CBA. Bad Contracts have one advantage though. It gives the owners a good excuse to come back to the table to discuss CBA and demand bigger piece of the revenue pie as reward for their own mistakes. I digress.

We already discussed baseball and how moneyball highlighted the use of analytics by a certain team in the sport. The book highlights the undervalued usage of the metric On-base Percentage. Please listen to this podcast that throws more light on it as I am not a big follower of baseball to provide good analysis.

Lets jump to cricket. A lot of data is collected in cricket, but most of it is centered around the batsmen or the offensive side of the game. Most Man of the Match (equivalent to MVP in American sports) awards go to batsmen. A bowler rarely gets the MVP award unless a player takes at least 4 wickets and the match is won, this too when there is no other big contribution by a batsmen. The value of a batsmen is usually measured by his batting average and the number of 100's and 50s he has scored. This to me is a very skewed metric. A batsmen that comes in the first three positions has a chance to play lot more  balls and hence a chance to have better batting average and better chance at scoring more 100s and 50s. What we consider great batsmen come onto the crease at the first three positions. This however skews other important metrics like, how many wins did he contribute to, how many partnerships did he generate, how well did he rotate the bat, how many dot balls did he have, how many chances did he give away(how lucky he was on any given day) that the opposition failed to capitalize on. Similarly for a bowler, the metrics that are used are wickets taken and average runs per over given away. This again fails to measure the contribution to the win, the conditions in which he was playing, how many dot balls did he have(the more the better), how many good length balls did he bowl, did the bowler in tandem get any wickets etc. Fielding is another contribution that is given scant respect. No metrics exist on how much a fielder contributes to the win of a team. Other than who caught the ball when a player gets out, nothing about fielding is measured. A good thorough review of some concerns related to cricket stats can be found here.

With all this I have to conclude with the saying from a superhero nerd aka Spider Man comic strip - 'With Great powers come great responsibility'.



While nerds are making inroads into sports, they are making inroads into the fashion business with their looks. Look for athletes like Dwyane Wade, Lebron James, Kevin Durant who sport the nerdy look quite often. Kevin Durant has taken it a step further with a back pack in tango. Grantland has a feature on this new trend in NBA - The rise of the NBA nerd. More power to my peeps.


A few more links of interest can be found below.
Redistribution: Blocking the Revenge of the Nerds?
Who’s a Nerd, Anyway?
The beauty of the geek

Sloan Sports Analytics Conference Recap
Will Kuntz '06, Real Life MoneyBall "Scout"
Shane Battier - The No-Stats All-Star

Prof. Daniel Kahneman - Nobel prize winner for prospect theory
Reinforcement Learning and Investor Behavior
The Gambler's and Hot-Hand Fallacies: Theory and Applications
Recency: Hot-hands and the Gambler's fallacy
The hot hand belief and the gambler’s fallacy in investment decisions under risk
Success Rates of Traders
What Makes a Rogue Trader Tick?
Hot Hand in Sports
Option Traders Use (very) Sophisticated Heuristics, Never the Black–Scholes–Merton Formula
Warren Buffett on LTCM
Eight Days - The battle to save the American financial system

No comments: