College Football's Legs Race (Part 1)

View as article

Lance Roffers

Junior
Joined
Feb 23, 2018
Messages
2,099
Recruiting is the lifeblood of any program. You can have all the X’s & O’s but you won’t win big without the Jimmy’s & Joe’s. Today, you will find numerous sites almost solely developed to the recruiting business. Fans of programs gather around their computer screens to see which prospect might be considering their school this week. In this series, I will take a look at the ever-growing “leg’s race” happening in college football and examine how data can help a program identify recruits.

A few years back I read an excellent article from the Harvard Sports Analytics Collective about the NFL Combine and the impact that athleticism testing had on the outcome of their NFL performance. It was a fascinating read and got me to thinking about the impact of athleticism testing in the college recruiting world. As such, I started on my journey towards a better understanding of how important athleticism is to a player transitioning into college football at the Power-5 level.

Methodology
To start you out, here’s a chart with positional results of athletic data from Harvard’s study to whet your appetite (correlation is reflected by color):
Combine Positional Data.webp

What I’m attempting to do is determine if there is a correlation in athletic testing results and All-Conference players at the Power-5 level. The athletic testing results come from verified Combine data such as Nike Opening, Under-Armour All-American Combine etc. The data includes regional events.

Therefore, my sample includes all players with verified testing results from these events from 2014-2018. Percentile rank shown for any player represents their percentile rank in SPARQ score against other individuals in their positional grouping. For the percentiles I only included individuals who completed all events and were able to calculate a SPARQ score from. Within any All-Conference percentages for each testing event I took all individuals who completed that event, so some individuals in those results might not have completed all events.

In addition to percentile rank for SPARQ scores at each position, I also calculated the number of standard deviations above/below the mean for each category and summed the totals. Here is a chart showing positional results from the Combine study with error bars included (bigger the bar, the wider the range):
Positional Graph.webp

I also wanted to test the results on individual teams’ recruiting classes. I chose the following teams as my sample: Alabama, LSU, Florida, Georgia, Florida State, Clemson, Ohio State, and USC. This sample group includes the elite teams (Alabama, Clemson), peer state teams (Florida, Florida State), a team from every area of the country (Ohio State and USC representing the Midwest and West, respectively), and a couple of programs that Miami recruits against often (LSU, Georgia).

The final area of data I included in my study was that of the ESPN300 recruiting lists and how they compared to predicting an All-Conference player versus simple athletic testing data. Recruiting lists have every resource available to them when making their list (camps, offers, athletic testing, production, interviews, film) versus just athletic testing data so the expectation would be that a list of 900-1200 players would perform much better than data-only.

Team Data
As you would expect, the team data results for Miami were not great and show just how far Miami has to go from a recruiting standpoint to catch up to the elite schools. I am especially happy to see that the results pass the “smell test” for the most part. The team results use the total of the standard deviations above/below the mean for each team rather than average SPARQ score. This way you are adjusting for positional differences in average SPARQ testing. For context, here are the average SPARQ scores for the positional groups for teams in our sample:
SPARQ Averages.webp


Here are the total number of Standard Deviations for the peer teams from 2014-2018:
Team SD's.webp


Ohio State has recruited some supremely talented athletes during this timeframe and it shows up in these results. It’s interesting to note that Florida actually rates lower than Miami when it comes to recruiting athleticism. The data mirrors the W/L records of the teams fairly well, except for Clemson, who outperformed their athleticism recruiting by the greatest margin. Note: One of the limitations of the study is that not every player chooses to participate in athletic testing and cannot be included in the data.
Records.webp


Breaking it down by year also matches up fairly well with what you would expect to see from an athleticism testing standpoint, as Miami’s best recruiting class was in 2017. The worst recruiting class was in 2015. Here are the results with a yearly class rank for Miami among the nine teams. If Miami wants to improve on a national level, these results need to improve.
Miami.webp


All-Conference Data
The driver of this study is to test for markers of athletic traits that continually popped up for players on All-Conference teams in the Power-5 leagues. The intention isn’t to say that if you have this trait you will definitely be an All-Conference player but rather to identify that others have consistently hit certain markers in order to identify recruits that have the highest percentage chance of success.

One observation that stands out is players who tested in the 90th percentile of a position group almost always become starting players at the P5 level and most become All-Conference of some type. Players in parenthesis represent a sampling of players who fit the criteria listed for each position.

QB- (Lamar Jackson, DeShaun Watson, Josh Rosen, Sam Darnold, Drew Lock, Jalen Hurts, Tate Martell, Jarren Williams)

It might surprise many of you to learn how athletic most All-Conference QB’s turn out to be in college. Players such as Sam Darnold and Josh Rosen are not known as athletic, dual-threat QB’s, but they tested out at very high levels. Here are the results of the QB data for P5 All-Conference Players:
QB.webp

The obvious correlation here is with foot quickness/speed. This makes a lot of sense if you think of how often a QB must manipulate the pocket in a tight space to keep a play alive. This information also joins very well with the NFL Combine data in weight, 40-yard dash, Short-shuttle having a large correlation to success.

RB- (Nick Chubb, JK Dobbins, AJ Dillon, Travis Homer, Bryce Love, Saquan Barkley, Lorenzo Lingard, Travis Etienne, Dalvin Cook, Joe Mixon)

RB.webp

For a position known for athleticism as much as RB, the data shows that there are merely minimum thresholds that must be met in order to profile as an All-Conference player. Having solid athleticism combined with excellent vision and strength seem to be the keys at this position.

WR- (Jeremiah Payton, Marquez Ezzard, Mike Harley)

WR.webp

The only position group of the entire study that didn’t show a high correlation between athletic testing and P5 All-Conference performance was the WR group. Much like the data regarding the NFL Combine showed little correlation, so to does the HS data. Route-running seems to be by far the biggest component to being a successful WR at any level. There was almost no correlation with height, weight, speed, SPARQ etc. Short-shuttle was the only metric that showed a positive and statistically relevant correlation which ties in heavily with route-running.

TE- (Hunter Bryant, Alize Jones, Brevin Jordan, OJ Howard, Brian Polendey, Chris Herndon, Dalton Schultz)

TE.webp

Much like WR’s, TE’s is the position that shows the second least correlation to athleticism testing, but there are a couple of clear markers to pass at this position, with minimum speed and strength performance required. It makes sense at a position like TE that fast and strong would be requirements. Weights ranged from 210 to 240 coming out of HS.

OT- (Mitch Hyatt, Jonah Williams, Kai-Leon Herbert, Martez Ivey, Mason Cole, Toa Lobendahn, Kolton Miller, Greg Little, Alex Leatherwood, Cam Robinson)
OT.webp

One position that showed heavily correlation with athleticism testing was at OT where being able to move and hit certain testing markers was imperative. For the most part, T’s that became All-Conference players were in the 260-pound range and added good weight over a period of a couple of years and had good athleticism. In fact, three of the T’s weighed less coming out of HS than recent Miami commit Zion Nelson.

OG- (Ross Pierschbacher, Braden Smith, Wyatt Teller, Sean Welsh, Saahdiq Charles)

OG.webp

Another position group where the testing markers match up very well with what you would expect of what asked of them on a football field. The G group is asked to go up against the biggest and strongest defensive players, so it is no surprise that the Powerball event matches up so well with excellent performance at the P5-level. Again, most All-Conference OL are not sloppy coming out of HS as the average weight of this group was 288 coming out of HS.

OC- (Michael Jordan, Brian Allen, Frank Ragnow)

OC.webp

You ask the C to anchor against a NT and to get to the second level with movement skills. Therefore, it makes sense that you want C’s who are quick, strong, and somewhat explosive. The results of the athletic testing measurables matches up perfectly with the position.

DE- (Lorenzo Carter, Derek Barnett, Tyquan Lewis, Austin Bryant, Chad Thomas, Carl Lawson, Jon Garvin, Josh Sweat, K’Lavon Chaisson, Xavier Thomas, Micah Parsons)

DE.webp

I did expect that SPARQ as an overall testing metric to correlate better at the position, but results showed more marginal correlation. What did show significant correlation though were common-sense factors such as strength, quickness, and explosiveness. For the most part they weighed more than expected, with only Duke Ejiofor of Wake Forest coming in at less than 215 pounds (he weighed 196 pounds). While the short-shuttle metric for 100% is fairly high at 4.87, the average was 4.58 and better reflects the type of numbers I would be looking for. The average 40-yard dash was only 4.98, so this metric really hits the mark of what you need as a minimum. I expected that number to be lower, honestly.

DT- (Solomon Thomas, Rashan Gary, Ed Oliver, Da’Shawn Hand, Taven Bryan, Gerald Willis, Raekwon Davis, Breeland Speaks, Jeffery Simmons, Harrison Phillips, Maurice Hurst, Vita Vea)

DT.webp

Perhaps the one position above all others, where if you post elite athleticism numbers, you are probably going to be a college star is DT. Only the Pac-12 (with two players) and Tim Settle (Virginia Tech) had even one below-average athlete at the position make All-Conference. The best course of action at this position seems to be to get 260-280 pound athletic freaks and let them gain weight and keep their athleticism.

LB- (Malik Jefferson, Dylan Moses, Raekwon McMillan, Dorian O’Daniel, Rashaan Evans, Jerome Baker, Devin White, Waynmon Steed, Roquan Smith, Micah Kaiser, Skai Moore)

LB's.webp

Really, the big surprise for me here was the weight threshold. I expected a lot of players to be in the 180-pound range that were rangy in space and super athletic. The average weight coming out was 216 pounds, with only a couple even under 200 pounds. Much like WR seems to be dominated by route-running as a skill, LB seems to be dominated by an ability to use your instincts for production, but you have to have a certain level of speed and quickness to reach high levels.

CB- (Levonta Taylor, Adoree Jackson, Nigel Knott, Marco Wilson, Mark Fields, Grant Delpit, Trajan Bandy, Budda Baker, Al Blades Jr, Donte Jackson, Mecole Hardeman, Treon Harris, Mike Jackson, Bubba Bolden, Levi Wallace, Tre’Davious White)

CB.webp

At this position there appear to be two types: Either a freak speed athlete who is two or more standard deviations above the mean, or long and rangy CB’s with the arm-length to jam and play the ball away.

S- (Minkah Fitzpatrick, Derwin James, Jamal Adams, Quin Blanding, Justin Reid, Quincy Wilson, MJ Stewart, Tony Brown, Jaquan Johnson)

S.webp

If anything, S’s actually posted more explosive numbers than CB’s, which surprised me.

ESPN300 Rankings
I wanted to have a means of comparison for the data metrics and I settled on the ESPN300 for a few reasons: It is easily accessible, and being the largest list it allowed for a larger sample size to smooth the comparison. For the time period 2014-2018, this allowed for as many as 1500 players possible to be All-Conference in their playing career at the P5 level. Note: I realize very few true freshmen make All-Conference at the P5 level so the number is probably much closer to 1200 than 1500.

I found that the vast majority (99.2%) of players on the list went to P5 schools, so the comparison would match P5 designation for All-Conference status. Seeing as how the list has access to combine performances, film reviews, production scores, All-State designations, offer lists (meaning Saban and Urban etc. can essentially do much of their evaluations for them), athletic testing data, and interviews (insight into their drive and personalities), you would expect the list to perform much better than athletic testing on its own.

Here are the percentages that the ESPN300 produced for the All-Conference lists:
ESPN300.webp

These results are fairly intuitive with the top two positions being RB and DT. RB makes sense as it is the easiest position to transition from HS to college from an evaluation standpoint and DT due to the way athleticism translates to the next level at the position.

To compare the results to the athletic testing data I ran a Ridge Regression rather than a Least Squares Regression for the purposes of regularization. I won’t lengthen the article by going in-depth with the math stuff, but I can tell you that the correlation from simply going off of ESPN300 rankings for P5 All-Conference is around 0.45 and doing nothing other than paring down positions by the above athletic criteria gets you to around 0.38. Getting to 84% of the accuracy of a ranking list from websites that have access to infinitely more information and are able to rely on the best coaches in the country for backup evaluations is pretty impressive in my view. There is still a lot of noise in the data, as you would expect when you are dealing with the volume of high school players that are moving into the division I college level every year, along with injuries, scheme changes, transfers etc. but you are starting to see real value in evaluating athletic testing and the characteristics that make up each positional grouping.

As an example, by filtering the data results at the QB position to include only the players that fit the model characteristics listed above (< 5.0 40, <4.47 SS, 34’ Powerball, 190 pounds or more, >82 SPARQ) you get a nice group that includes out of 27 names:

Justin Fields, Josh Rosen, DeShaun Watson, Sam Darnold, Tua Tagovailoa, Jalen Hurts, Tate Martell, Jarren Williams, Jake Bentley, Sam Ehlinger, Brandon Wimbush, Adrian Martinez, Trevor Lawrence, Drew Lock, Lamar Jackson.

You’re starting from a pretty good place if that’s the type of list returned from simply filtering through athletic testing numbers. Taking that down to include only players from the 14, 15, 16 classes (to give time to establish as the QB) you are returned with 11 players out of 44 total QB’s who completed athletic testing:
QB Example.webp

That list returns a pretty nice group of QB’s and really allows you to limit your focus as a recruiter. Eight of the 11 QB’s have had pretty good college careers, with four of them already having been 1st round NFL draft picks. Will Grier missed the list by nine pounds or he would have been on there as well.

Of Note
I wanted to point out that when I started the endeavor, I almost didn’t push forward due to my belief that HS athletic testing would be so fluid once the athlete hits a college S & C program, I expected huge leaps in the testing performance once they reached the NFL Combine. The NFL Combine seems to be an excellent proxy to use since we are looking for P5 All-Conference potential and individuals who reach that status are generally invited to the Combine at the conclusion of their careers.

Since I also have an NFL Combine testing database available to me, I was able to easily cross-reference the players listed in both databases and found that- on average- players do not improve their athletic testing by massive amounts through college. There are certainly outliers who go to college, enter into a weight program and see their athleticism blow up (Myles Garrett immediately comes to mind), but by-and-large you are who you are as an athlete. In fact, several athletes regressed in their testing after gaining weight in college.

Next Steps
This article is part one of a three-part series. The next installment will review the current recruits in the ’19 class and compare each player to the criteria set for each position in order to determine if they meet certain thresholds for All-Conference potential as well as overall team data with the class. The third installment will review the overall roster and how each position group is positioned to return Miami to a national contender again, as well as where the deficiencies are from an athletic standpoint.
 
Advertisement
Nice work. I wonder if there is something other than all-conference to use as a bench mark. As we saw this year in the ACC, those lists can be crap. Also, maybe someone can think of a good way to normalize WR performance to QB play since the two are so connected. That could be why the athleticism alone doesn't have a correlation to the all-conference list.
 
This is awesome work. I hope we have an analytics department using modern statistics like this to find trends and determine who to go after.

Based on the past, we must have some similar algorithmic type test because even though we don’t get these recruits a lot, we usually identify them early.

Also, seeing Jarren Williams in this article many times gives me a lot of optimism.
 
Advertisement
Nice work. I wonder if there is something other than all-conference to use as a bench mark. As we saw this year in the ACC, those lists can be crap. Also, maybe someone can think of a good way to normalize WR performance to QB play since the two are so connected. That could be why the athleticism alone doesn't have a correlation to the all-conference list.

Thanks for the thoughts.

If you look at the chart showing correlation with the NFL Combine it matches perfectly. I actually think my model ended up faring extremely well in comparison. WR has shown very little correlation with athleticism testing for many years now. Really, what shows the most correlation with the NFL draft is market share percentages. It helps to account for QB play by showing the amount of production the player has in comparison to his overall offense.

I used both 1st & 2nd team All-Conference lists to smooth the issues you cited. The players themselves ended up coming out pretty much like you'd expect on those lists. If the ultimate goal is to fill your roster with good-to-great players, you need to have a benchmark higher than "starter" because there is such little context if you settle for starter. In that prism, Art Sitkowski would have ended up a recruiting win solely because he was a starter this year at a P5 school.

It's not perfect- there is really a difficult problem to make anything in this realm perfect in college football, but the reason I used four years was to give time for development and to include wunderkids as well.
 
I feel like this has to be pinned on this board so that we can come back to it whenever we offer someoone. MODS?????

The hope is that as I add to the model we can use it to determine the number of metrics the recruit satisfies etc. to include in a sort of profile.

That's just sort of my thoughts though, since I just work here, that decision is above my pay grade.
 
I don’t understand anything in your article. But it’s you and all your stuff is well researched and well written. Hopefully Miami finds a way to put you on the pay roll and you can do this for a living.
 
Advertisement
Great work, Lance. In particular, I took note of your conclusion about the "they are who we thought they are" bit about how college athletes are largely what they were in HS (i.e, a cut above the rest, statistically).

Hopefully, this will put the final nail in the coffin of the "let's-recruit-a-bunch-of-2-star-diamonds-in-the rough-because-that-one-guy-that-one-time-became-a-first-rounder" crowd.
 
Looking forward to the next installment! Since route-running is such an important metric re WR's, the hire of Stubblefield, being familiar with the desires of the OC, may work to our advantage even more than someone better at technique or recruiting . . .
 
Advertisement
Advertisement
Can someone get this to the coaching staff? STAT!!!! Seriously tho this is a massively helpfult tool for identifying the right recruits at each position. Bravo!
 
The stat that blew my mind is that 100% of All-Conference QBs ran below a 4.47 shuttle. Foot quickness is paramount.

I went back and looked at Malik Rosier since he came in as a dual-threat QB. Sure enough, he ran a 4.56 shuttle.

Jarren and Tate met the threshold.
Yeah, this definitely paints Jarren and Tate in a positive light.
 
Advertisement
Back
Top