Wednesday, April 3, 2013

Do Matchups Matter?

The following is my entry for Stat Geek Idol. The contest is run by TeamRankings.com. Check out some of the entries from last year's competition here.

Introduction:

There are 347 teams in Division I college basketball. The nature of the sport allows for all different kinds of styles of play. Every team has varying player personnel and coaching philosophy. College basketball analysts are given the tough task of forecasting the end result of games featuring contradicting styles. It seems undeniable that, in some cases, certain teams can be bad matchups for other teams. Still, quite frequently analysts just say what sounds good. To illustrate this point, let’s look at a first round matchup from this year’s NCAA tournament:


The four factors (shooting, rebounding, taking care of the ball, and drawing fouls) are a very good way to assess style of play. The Minnesota-UCLA matchup featured the best offensive rebounding team in the country (Minnesota) and the 263rd defensive rebounding team in the country (UCLA). A smart analyst would point this compatible Minnesota strength and UCLA weakness out, but what does it really mean for the expected outcome of a game? On one hand, Minnesota should kill UCLA on the offensive boards, possibly creating a huge advantage for Minnesota. On the other hand, Minnesota kills just about everyone on the offensive boards. UCLA wouldn't be able to stop the lethal Minnesota rebounding attack regardless, so maybe this is a waste of an opponent weakness for Minnesota.

Essentially, the question I am asking here is simply: If you are really good at one of the four factors, would you rather play a team that is normally good at defending that factor (strength on strength) or really bad at defending that factor (strength on weakness)? At first thought, strength on weakness feels like the right choice. The goal of the following analysis is to try to answer this question.


Part 1: The effects of opponent on each four factor

To begin this study, I compiled a sample size of every single Division I college basketball game from 2009, 2010, 2011, and 2012. Games from 2013 (up until around the first week of March) were also included. I wound up with exactly 26,000 games to draw conclusions from.

In order to look at what happens when a {good/bad} offensive {eFG/TO/OR/FTR} team played a {good/bad} defensive {eFG/TO/OR/FTR} team, I had to define what exactly good or bad means. I decided that any team in the 90th percentile or better of a given four factor was “good” at that skill and any team in the 10th percentile or worse of a given four factor was “bad” at that skill.



NOTE: Good/bad is just the opposite for defense (i.e. - you want your opponent’s eFG% to be low)

The next step was to use these definitions of good and bad to find instances of strengths meeting strengths, weaknesses meeting weaknesses, and so on in the 26,000 game sample. First, let’s take a look at what happens when a good shooting team plays a good defensive shooting team:


The above table shows that there were 409 games where a good offensive eFG% team played a good defensive eFG% team. The offense averaged an eFG% of 54.5% on the season. However, when they played a good eFG% defense, that number decreased to 49.8%.

I did this same analysis for all types of matchups and all the four factors. The results are below:


There is a lot going on here, but the two biggest takeaways are:

1. Bad vs. bad brings out more good than good vs. good brings out bad. Basically, when two bad teams at one factor play each other, the offense improves a lot. When two good teams at one factor play each other, the offense does not diminish quite as much.

2. The defense controls FTR the most and eFG% the least. If you look at the percent change column, an offenses ability to get to the foul line changed a lot depending on the defense. On the other hand, an offenses ability to make shots did not change nearly as much. This is consistent with past research on similar topics.


Part 2: The effects of style on efficiency

Part 1 showed exactly what happens to the individual four factor based on opponent, but that is only so helpful in determining if there is evidence for good/bad matchups. The more important thing to look at is the effects of style on points per possession. Let’s go back to UCLA-Minnesota. Say UCLA decided that they needed to make an extra effort to keep Minnesota off the offensive glass. This decision might come at the risk of a different four factor. Maybe UCLA focusing on defensive rebounding diminishes their ability to create turnovers. This idea wouldn’t show up in the part 1 results, but it would show up in points per possession.

To look at the effects of efficiency, I first calculated an expected points per possession using simply the ORtg (adjusted for schedule) of the offense and the DRtg (adjusted for schedule) of the defense. This expected PPP was made without looking at matchups or style of play at all. Then, the expected PPP could be compared to the actual PPP. If the two numbers significantly differ, that means that mismatches in four factors can give us more information on which team will most likely win the game.


As you can see, matchups had virtually no effect on the actual points per possession of the game. I was able to predict PPP by simply using the offensive and defensive averages extremely effectively. Here are the final key takeaways from the tables above.

1. Four factor matchups don’t increase prediction accuracy. If we once again go back to Minnesota-UCLA, this means that we shouldn’t have looked too far into the offensive rebounding advantage. Simply looking at which team is better efficiency wise is adequate.

2. FTRate had very little effect on the points per possession of an offense. If you look at the Actual PPP column, there is not much change in general. This particular study indicates the eFG% is the most important four factor, followed by OR%, TO%, and finally FTR.


Conclusion

It would be foolish to say that specific matchups have no effect on the outcome of a basketball game. It doesn't mean that matchups can’t possibly matter just because this study shows no evidence for it. However, the study does indicate that it may not be wise to focus too much on the compatibility of the strengths and weaknesses of opponents. Trying to breakdown strength and weaknesses may be a futile activity. Simply put, the best way to predict the winner of a game appears to be just picking the better of the two teams.

5 comments:

  1. FWIW (probably not much) I've come to the same conclusion myself, albeit from a completely different direction. Various predictive methods that ought to be able to tease out and take advantage of these sorts of matchups invariably do no better at prediction than a straight linear regression. If I get a chance, I'll post in a little more detail about it on my blog.

    ReplyDelete
  2. Glad to hear you came to the same conclusion. If you do post your findings, feel free to put the link here.

    ReplyDelete
  3. Interesting stuff!

    Next question from your findings - do these mis-matches affect which teams COVER THE SPREAD?! haha..

    But seriously, nice idea and work.

    ReplyDelete
  4. Great, interesting work. I'm trying to wrap my head around the PPP conclusion. I guess I was hoping for a more significant kernel of evidence, but it does make sense that the expected PPP would be close to the actual PPP.

    An interesting next step would be to figure out how teams reach their expected PPP even though one of their 4 factors is impacted fairly significantly.

    For example: Take a -Bad TO% team vs. a +Good TO% team.

    The average impact is a 14% increase in TO% by the -Bad team. How do they make up the difference in PPP? Is it through a greater eFG%, OR%, or FTRate?

    The take away could be that, a -Bad TO% generally increases their OR% to reach the expected PPP. Or a -Bad eFG% team generally increases its FTRate.

    By finding out how a certain match-up impacts another of the 4 factors, maybe you could then analyze that match-up as a predictor.

    I'm rambling but just a thought. Great work again.

    ReplyDelete
  5. Ryan -

    I think you're on the right track for the next step in this analysis. The idea of opponent compatibility is (to my knowledge) fairly new, so there is definitely more to discover.

    The thing I want to address from you comment is when you asked "How do they make up the difference in PPP?"

    The answer here is that they probably don't. My expected PPP takes into account both the offense and the defense. So in the case of a 14% increase in TO%, that offense is probably performing well below their normal average. The goal was to see if extreme stylistic mismatches change PPP beyond what we would expect from how generally good the defense is (PPP allowed). Hope that makes sense.

    ReplyDelete