Showing posts with label Chance Creation. Show all posts
Showing posts with label Chance Creation. Show all posts

Friday, 19 July 2013

Rate of Attack and Creative Efficiency (RACE) to Goals Model


Yes, yet another model looking at the quality of chances and finishing of football teams.

This is something that I had hoped to have finished before last season ended, unfortunately life got in the way and it got delayed. Since then there have been a number of very interesting analyses done, including those by @colinttrainor (like this) and @11tegen11 (like this), which have continued the good work done by @footballfactman (like this), where they have put in a significant amount of work to look at where shots are taken from and what the conversion rates are for shots from those areas. Hopefully I can hang on to their coattails.

Personally, I am far too lazy to collect all that data, so I have let the experts (Opta) decide upon chance quality for me, and I hope to make the model as simple as possible. In my blog so far I have looked at how Liverpool and Tottenham have performed in terms of finishing and creativity, and to add some context, I compared them to the league and Top 4 average. Whilst compiling the numbers, I noticed that the League averages were quite consistent year on year over each of the past 3 seasons, and realised that I could create a theoretical average team that I could use as a benchmark to compare the performance of all the Premier League teams.

I am sure I am not telling anyone anything new when I say that the amount of goals a team scores is essentially dependent on 3 things, the amount of shots they take, the quality of chances they create, and the quality of their finishing, and it’s against these metrics that I will be comparing teams against.

Rate of Attack
This is very simply the amount of shots a team takes, and can be measured on a per game (SpG) or per season (SpS) basis. Yes, I know that not all attacks end with a shot and I am basically just using total shots, but I wanted the model to have a 'racey' acronym, so Rate of Attack it is.

On average, each team takes about 14.5 SpG, or about 550 SpS. Between 9-10% of all shots end up with a goal, and this has been found to be consistent season upon season and across different leagues. For those that don’t know, this is called the Reep Ratio, after an amateur statistician named Charles Reep, who looked at various stats, including the conversion rate of shots, in the 1950s. 

Creative Efficiency (%CCC)
This is a measure of the creativity of the team and quality of chances they have, and this is where I am relying on Opta to decide upon what is a good chance, as I am using their Clear Cut Chance (CCC) for this. A CCC is one of Opta’s few subjective statistics, and whilst a full description is not given, a brief description is given by Opta in their Event Definitions under Big Chance (here

“A situation where a player should reasonably be expected to score usually in a one-on-one scenario or from very close range.”

Creative Efficiency (%CCC) is measured as a proportion of Clear Cut Chances to Total Shots.
A team with a high %CCC will, over time, create chances that are easier to score from than the average team. Whilst CCCs make up only about 13% of all shots in the Premier League, they are vitally important, as for each of the last 3 seasons, around 52% of all goals have been scored from a CCC. It should be noted that CCCs include penalties, and whilst I did consider removing them from the analysis as they have their own average conversion rate, I decided to include them for a few reasons, there will be some open play CCCs that will be easier to score from than a penalty, I also think that teams that attack more or are more creative will tend to get more penalties, at least over the long term, and that should be included in their Creative Efficiency, and finally because I want to keep the model simple and with as few adjustments as possible. 

Obviously when you multiply a team’s Rate of Attack by their %CCC, you will get the number of shots which are CCCs. The remaining shots will be what I will call, as I can’t think of a more appropriate term, the Non-CCCs. The two types of shot have their own average conversion rate, and the model analyses the quality of finishing of both types of chance by comparing the goal expectancy (number of chances multiplied by the average conversion rate) to actual goals scored for each type of chance.

CCC Conversion
To give an indication of the average difficulty of a CCC compared to the average shot, it is on average about 4x easier to score a CCC as they have an average conversion rate of just under 38%. It should be remembered though that there is a large range in the probability of a CCC being scored, Sam Green of Opta has said (here) he considers the base probability to start at about 20% and it of course goes up to 100%.

Non-CCC Conversion
The average conversion rate of Non-CCCs is slightly above 5%. The reason why I won’t classify them along the lines of a ‘difficult’ chance is that with the goal expectancy range for individual shots being between 0% and 20%, anything with an expectancy above 10% will still be easier than average.

The Numbers  
Here are the hard numbers I have collected for the past 3 seasons.


 




And these are the benchmark ratios/rates that I have either mentioned or will be using for the theoretical average team.



So, how did each team perform last season? In terms of number of shots, Liverpool lead the way by far with 740 shots over the season, 59 more than Tottenham took, the next best team, and not far off double the amount of shots that Stoke had.





























It may not come as much of a surprise to see that Manchester United had the best %CCC, with 21% of the efforts being from a CCC, compared to 18.3% for 2nd placed Manchester City. To put this difference into perspective, whilst Man City took 98 more shots than Man Utd, they only had 3 more CCCs. Liverpool had 178 more shots, but with a %CCC of ‘only’ 13.6% (still above average) had 17 less CCCs.

In terms of shot conversion, the team with the best conversion rate for CCCs was, yes you’ve guessed it, the team who scored the most goals, Manchester United with 44.1% of them scored. The team with the worst conversion of CCCs was, yes you’ve guessed it, the team who scored the least…oh, it was actually Manchester City, with only 28.9%, I didn’t guess it either. So, City had 3 more CCCs, but scored 17 fewer CCCs, a significant amount.

The team with the best conversion rate of Non-CCCs was Chelsea at 7.4% leading to goals, and this time we do find the expected QPR at the bottom of the pile with only 3.1%.

I’ll admit that the table above is a little hard to read though, we’ve got different units and magnitudes of measurement and its hard to see how well each team is doing overall, so lets add some context and measure each teams performance as the percentage change from our benchmark team.

Now things become a bit clearer. We can see that despite only taking 3% more shots more than the benchmark, Manchester United’s %CCC was a whopping 62% higher than average, which goes some way to explaining why their total shot conversion was so much stronger than everyone else at 14.2%. However they also significantly outperformed both conversion rate metrics, meaning they scored almost 13 goals more than expected if they had average finishing. If they had scored at average rates, their total conversion rate would still have been the highest in the league though at a touch under 12%.





























Only 2 teams managed to beat the benchmark for all 4 metrics, Man Utd and Arsenal. Of the other top teams, Chelsea and Tottenham had a relatively poor %CCC, Man City were poor at converting their CCCs, Liverpool were poor at converting their Non-CCCs, and Everton were poor at converting both types of chances.

At the other end of the table, only 2 teams performed worse on all 4 metrics compared to the benchmark as well, unsurprisingly QPR, with the other team being Newcastle. Reading were very good at finishing their chances, its just that they struggled to create any.

So what does this all look like when we convert these metrics to expected goals and how did the teams compare? There were 3 big outperformers, Chelsea (15.7 goals above expected), Man Utd (+12.6), and Arsenal (+10.4) whilst there was 2 big underperformers in QPR (-12.3)  and Everton (-10.1). For those of you who are into your ‘proper’ statistics, I’ve calculated the Mean Absolute Percentage Error for the model over the last 3 seasons as 10% and the Root Mean Squared Error as 7 goals. Its been a loooong time since I studied statistical methods, so I may have used the wrong error measurements, but I think that shows that the model isn’t too bad.



I’ll finish with how my model differs from those I’ve mentioned which look at shot location. I’ll start with the weaknesses. The first is that my model is far less granular as I have lumped the 87% of all shots that are Non-CCCs with the same goal expectancy, which means that the type of analysis that I can do with my model probably can’t go quite as deep as the others. Due to the creative efficiency element, I think the model is only applicable to teams and won’t be able to do player analysis. There is an element of trust in Opta that they are consistent when collecting the CCC data as it is subjective, particularly as we do not know their precise definition, although having read this (here), I think its fair to assume they are consistent. And because we do not know exactly how Opta define CCC, I think it will be very difficult to see how or if the metrics change depending on the Game state as the info of when a CCC occurred is not available. Whilst on average there are just under 4 CCCs per game, so it might be possible by watching the highlights or reading the match reports to figure it out for most games, in some cases however, as shown by @analysesport (here), it would be very difficult. Another issue is the relative lack of CCC data, it is not freely available (you need to pay for a subscription at www.eplindex.com for the data), it only goes back 3 years, and as far as I am aware, there is no  CCC data publicly available for leagues other than the Premier League

The positives are that it is very easy to collect and analyse the data, you only need the number of games played by a team, their total shots and the number of CCCs they’ve had to be able to estimate the number of goals they should have scored. One of the issues with simply using shot location, as discussed by @mixedknuts (here), is that it does not take into account the positioning of the defenders. For instance a player may take a shot in the central area of the box but have 4 defenders and the keeper between him and the goal, so the probability of a goal would be low, equally a player may break an offside trap and have the ball outside the area but be 1 on 1 with the keeper, so the likelihood of scoring would be quite high. This model at least separates out those chances where the defenders are not making a significant difference to the difficulty of a goal being scored, and whilst these only make up 13% of the chances, they do make up 52% of the goals.

Hopefully, if I get enough time, I’ll look at how repeatable these metrics are and if they could be of use for predicting matches and also look at how the teams performed on these metrics from a defensive point of view.   

You can follow me on twitter at @The_Woolster


Data taken from www.eplindex.com

Friday, 29 March 2013

Liverpool: Has there been progression in a season of transition?


In my first post, I looked at Liverpool’s conversion and creativity measures, and noted that when it came to converting chances, Liverpool were not only still lagging behind the average for the Top 4, but also the league as a whole. But I also pointed out that Liverpool, under the new management of Brendan Rodgers, are a team in transition and that an improvement should be expected over time as the players become used to his system, and that the recent goal tally at that point, was probably showing this improvement.

I am going to look further into this and track how these measures have changed over the season. I am also going to introduce form into the measures to show how Liverpool has performed over short term periods. To give some extra context, I will also include Liverpool’s average from the whole of previous season, and the same for the League as an average as well as the Top 4 from last season (I have used last season as it’s a complete season with all teams having played each other). For the form measure, I will be using a 6-game moving average as this is what we are used to seeing in the media, it should not give too much or little significance to individual matches, and I also believe that over 6 games you should generally see an equal number of Home and Away fixtures, which is not the case for other even numbered amount of games.

I’ll start by looking at shots per game. In my first post I mentioned my surprise that Liverpool had taken more shots than any other team in the top 5 leagues in Europe, as I had been expecting the number of shots to reduce under a more patient approach. Since then, Liverpool still have the highest shots per game in those leagues. This is one area Liverpool were relatively strong last season, with an average just slightly lower than that of the Top 4. This season the rate of shots started off more or less in line with last season up until Game 17, in which Liverpool had 29 shots against Aston Villa, and there was a big increase in form (the 9 shots against Chelsea also dropped out of the 6-game average equation), since then Liverpool’s shots per game has been at a level above what the Top 4 achieved last season, and as we know, the highest in the top leagues in Europe.

 
Shooting accuracy (this excludes blocked shots, as we don’t know if they were on target or not) started off very poorly, and the season average has only recently started to creep above the level of last season. This has been significantly affected by the poor start however, as the 6-game average has been above last season’s since match 14, although it still has not quite reached the league or Top 4 average from last season yet.


Next I will look at Shot Conversion and Shots on Target Conversion (these conversion rates exclude own goals, as its not clear from the stats alone whether an own goal came from a shot or from something like a misplaced clearance). As you might expect, they follow a very similar pattern. Two things to note are that this season's Shots on Target Conversion ratio is relatively stronger than the Shot Conversion ratio when compared to last season's League and Top 4 averages, and that between match 6 and 11 it was even stronger. As I mentioned in my previous post, I think that last season Liverpool were unlucky in that a number of goalkeepers had outstanding games against them and I think the relative improvement is down, in part, to that. If we look at the peak between game 6 an 11, we see that game 6 was the 5-2 win against Norwich, in which Liverpool had 16 shots, with only 5 on target, but 4 of those were goals (there was one own goal), and 3 were from outside the area, for a massive Shots on Target Conversion of 80%. The other thing to note is that since Match 22 for Shot Conversion, and Match 20 for SOT conversion, Liverpool have been more or less at or above the Top 4 average from last season.

  

Next up is Clear Cut Chance (CCC) Conversion. For those that don’t know what a CCC is, it is one of Opta’s few subjective stats, although they don’t give a full description, it can broadly be described as a chance where the attacker is probably central to goal with only the keeper to beat, however I don’t think it should be thought of as a chance that should be scored, as statistically speaking, just under 40% of all CCCs in the Premier League are scored, so on average they are a bit harder than a 50/50. The conversion rate for CCCs is much more variable than the other conversion rates, this is because in some games there will be few or even no CCCs, which means that both very high and very low single game conversion rates are far more likely. It is clear to see however that there has been a general improvement over the season, following a start of only 2 CCCs scored in the first 8 games, and that Liverpool have been significantly more clinical at taking easier chances than they were last season.


I will now look at a measure of how creative Liverpool have been, which is the proportion of chances that they have which are CCCs. One of the main premises of the possession based football that Brendan Rodgers likes is that the team will be patient and wait for good chances to score rather than take the first opportunity to shoot. But we’ve seen, counterintuitively, that Liverpool are taking more shots, so have they been creating better opportunities? Liverpool started the season not showing the patience that was required of them and rushing too many shots, and after that slow start, Liverpool’s season average proportion of CCCs has only just edged above last season’s in the last 2 games. However in terms of form, Liverpool have been more creative than last season since Match 20, and around or above the Top 4 average since Match 22. It should not be a surprise that conversion charts have a similar shape to this one, as the easier the chance, the more likely it is to be scored. In my view, this is the clearest indication that Rodgers’ methods, from an attacking point of view, are starting to take shape, although I still think they can often show that impatience, as this excellent forensic analysis by the Sports Analysis blog of the Swansea game shows.



The final graph is the CCCs per game, which in essence is a combination of the first graph and the last one shown. The graph has a similar shape to the one above, however due to the increase in the shots per game form from Match 15, relatively speaking, Liverpool are doing even better in this metric, and have been showing Top 4 form since Match 20.

 
If you were to just look at the averages of these conversion and creativity metrics so far this season, they would show somewhat of an improvement over last season, particularly in terms of conversion rates. They would not take into account the poor start to the season that Liverpool had however, or the steady improvements they have since made, or that they have consistently been showing the attacking strength of a Top 4 team for close to half the season (Match 18, the 4-0 win against Fulham, can probably be pinpointed as the game when things really started to click). It seems to me that Rodgers’ has decided to build from the front, and that he may now be close to having the team playing, at least offensively, how he would like. Of course football is about defence as well as attack, and it is Liverpool’s defence that has garnered the most criticism this season, and in my next post I will look to see if there has been any progression in similar stats from a defensive point of view.

Looking ahead to the final 8 games of the season, 3 of the remaining matches are against teams from the bottom 4, with only 2 teams being higher than Liverpool, so if Liverpool can maintain their recent form (losses against West Brom and Southampton aside), then they should have a relatively good run in. One thing that could of concern though is that in the last 3 games Liverpool have had only 8, 12 and most recently 10 shots in those games, which are their worst, 4th worst and 3rd worst totals of the season respectively. Liverpool fans will hope that this is a short term blip rather than a downward change in form.

Data taken from www.eplindex.com
.

Thursday, 7 February 2013

Liverpool: They shoot, they score, but should less be more?



The above tweet from WhoScored was tweeted prior to game against Norwich, and 9 goals in 3 games has seen the shot conversion rate increase to 8.4% and no doubt move Liverpool up the rankings. But it got me got me thinking, one of the premises of the patient, possession based style of football that Brendan Rodgers would like Liverpool to play is that it should in theory create better chances, but with the likely effect that less chances are actually created. So what has happened, how is it that Liverpool have created more chances than anyone else in Europe whilst also having one of the lowest conversion rates?
Whilst a relatively crude measure, as they are driven by both the quality of the finishing as well as the quality of chances created, I’ll start by looking at shooting accuracy and conversion rates, as they can give an indication of how easy the chances created are to score, as on average, a player will convert a greater proportion of easier chances.

Shot per Game
Shooting accuracy
Total shot conversion
Chance conversion
Shots on target Conversion
CCC conversion
Liverpool 2011-12
17.6
40.2%
6.3%
8.2%
20.3%
29.3%
Liverpool 2012-13
19.1
40.5%
8.4%
11.7%
29.0%
37.9%

Conversion rates exclude own goals. Shooting Accuracy excludes blocked shots. Chance = A shot on or off target, excluding blocked shots. CCC = Clear Cut Chance
The first thing that jumps out is that Liverpool are on average taking 1.5 shots per game more this season compared to last. So much for a more patient build up play! Whilst shooting accuracy has changed little, conversion rates have seen an improvement, and what is also noticeable is the easier the chance, so shots on target conversion and CCC conversion, the greater the improvement.

Is this down to creating easier chances or just better finishing? It is well known that Liverpool’s conversion rates were poor last season, as discussed in this article on EPLIndex.com, but they become even worse when you consider that Liverpool actually created a lot of good chances. When the shot difficulty is also taken into account, as has been done at team level by Differentgame, and on individual player level by Opta, then it is clear just how bad Liverpool’s finishing was last season, so perhaps it couldn’t get any worse and the only way was up. 

I think it is important to not only compare to how Liverpool did last season, but also compared to the rest of the league, and as we can see, Liverpool are lagging behind when looking at shooting accuracy and shots and chance conversion, and its not until we again look at the easier chances that they start to look more respectable. However, Liverpool’s current aspirations are to return to being a Champions League club, and when compared to the top 4 averages, Liverpool’s shooting accuracy and conversion rates look even worse. There is clearly some work to be done.
  

Shooting accuracy
Total shot conversion
Chance conversion
Shots on target Conversion
CCC
conversion
Liverpool
40.5%
8.4%
11.7%
29.0%
37.9%
League Average
44.8%
9.8%
13.2%
29.5%
37.6%
Top 4 Average
48.0%
11.4%
15.5%
32.3%
38.4%


So what’s happened since last season? Well, for a start, 2 of Liverpool’s worst culprits are not with the club this season, Kuyt having left for Turkey and Carroll on loan at West Ham, although that was countered by two of the better finishers in Bellamy and Maxi also leaving. The other big difference is the massive improvement by Suarez’, and having taken 29% of Liverpool’s shots this season, as well as 27 more shots than anyone else in the league, it is his improvement that is driving Liverpool’s improvement. 2+2=11’s blog  has taken a closer look at Suarez’ improvement from last season to this, but when you take Suarez’ numbers out of the equation, whilst the rest of the team is converting more than the previous season, they are still not good enough when you compare to the rest of the league, although only having one recognised striker for most of the season is unlikely to have helped.


Shooting accuracy
Total shot conversion
Chance conversion
Shots on target Conversion
CCC conversion
Liverpool 2011-12
40.2%
6.3%
8.2%
20.3%
29.3%
Liverpool 2012-13
40.5%
8.4%
11.7%
29.0%
37.9%
Liverpool 2012-13
(exc. Suarez)
38.0%
6.8%
9.7%
25.6%
26.5%


Are Liverpool creating the easier chances we would expect them to? Well, by a number of measures, they do not appear to be. If we consider that the more defenders there are between the player shooting and the goal then the harder the shot is, as there is a less clear sight of goal, possibly more pressure being applied to the person taking the shot, and more chance of a shot being blocked, then the percentage of shots being blocked is a good proxy for difficulty of the type of shots being taken. Last season 22.8% of Liverpool’s shots were blocked, this season that has increased to 28.7%. Another measure we can look at is what proportion of chances being made are Clear Cut Chances, the greater the proportion, the easier the chances in general that are being created. Last season 17.9% of Liverpool’s chances were Clear Cut Chances, this year it has fallen to 17.0%. We can also look at the rate of Clear Cut Chances per game, this has also reduced this season, although only by a small amount from 2.4 per game to 2.3 per game, using the CCC conversion rate, this could have amounted to another goal or 2 scored so far this season.


% Shots Blocked
%CCC
CCC per Game
Liverpool 2011-12
22.8%
17.9%
2.4
Liverpool 2012-13
28.7%
17.0%
2.3


So Liverpool are not are not necessarily creating easier chances, but their conversion rates have gone up, so the quality of their finishing must have improved, right? Well, at the most basic level we’ve seen that shooting accuracy has barely increased. Another way to see if it is improved finishing that has improved is to look at shot placement and if the shots are in the corners, but in this case, shot placement between seasons is also pretty similar. Last season saw 46.4% of shots in the bottom corner and 13.0% in the top corners, whilst this season has seen 47.1% and 11.6% respectively. However it did seem that many goal keepers saved their best game of the season for Liverpool last year, and as noted in the blog by Opta posted above, both Suarez and Kuyt were particularly unlucky, missing out on an expected 11.8 goals between them due to good goal keeping by Opta’s calculations (although Bellamy was fortunate with 3 of his goals). When you also consider that Liverpool had a record breaking number of shots hit the woodwork, it becomes clear how much of a role luck, although I think the term chance is more appropriate, can play.

Perhaps counter intuitively, despite the change in style introduced by Rodgers, Liverpool are in fact creating more chances than last season, and it is this increase in chance creation, as well as a big improvement in the finishing of easier chances that should be scored that has seen Liverpool’s goal return improve this season. However they are a team in transition, and my guess is that they are now creating better quality chances than they were earlier in the season. 25 goals in the last 10 league games certainly gives that impression anyway! Where once there were few players making runs into the box and shots were taken with little chance of scoring from, they are now combining better decision making and improved movement to create easier chances. The addition of Sturridge, as we have already seen, and Coutinho, as well as the return to fitness of Borini, should help in terms of both creating and finishing chances, although we may see less shots being taken. Brendan Rodgers will hope that by this time next year, Liverpool will have a conversion rate that is better than 87 of the teams in Europe’s top 5 leagues, rather than worse.



Data taken from www.eplindex.com