You can see the first Utah boys edition here.
The Problem:
- In cross country we always run into the same issue: the only accurate way to compare teams or athletes is in head-to-head matchups. It's fun to play around with times and see what you can do with that data, but ultimately, when comparing athletes, xc times don't carry much weight.
Limited Options:
- Speed Ratings: Fantastic system developed by Bill Meylan of tullyrunners.com in NY. Precise but virtually impossible to accurately replicate, and reliant on detailed course info and historical data.
- Butler Ratings: Developed by Jimmy Butler and used in partnership with FloTrack to rate collegiate athletes and predict results in 2010-2011. Again, virtually impossible to replicate.
Next Step:
Before I got into running, my athletic background was in xc skiing. After mulling over possible directions to go, I kept coming back to the FIS World Cup points system and Jimmy Butler's xc ratings. Both use different methods to rate performances based on head-to-head matchups.
The Solution:
I spent a lot of time trying to come up with a viable and time-efficient solution. So far, I've failed miserably on the time-efficient part, but in some early tests the system has proven to be relatively accurate. I'll try to explain it without nerding out too much and diving into the gritty/boring details.
- Lower rating = better
- Time relative to course distance is irrelevant, which avoids dealing with inconsistencies in course distance reporting.
- Ratings are given by race, not meet: every race at a meet is rated separately.
- Winner of every race is assigned a 0 base score. Using the formulas, increasing values are then assigned to the rest of the field based on how many seconds back they finished from 1st.
- Penalty: derived for each race based on the quality of the competition
- Base value + penalty = athlete race rating
- Athlete rankings are then based on season average
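To make the steps above concrete, here is a minimal Python sketch. It is not the actual system: the real formulas are not given here, so the base value is assumed to be simply seconds behind the winner times a hypothetical `points_per_second` factor, and each race's penalty is passed in as a plain number rather than derived from the quality of the field. The function names and sample data are made up for illustration.

```python
from statistics import mean


def race_ratings(finish_times, penalty, points_per_second=1.0):
    """Rate one race: the winner gets a base of 0, everyone else
    gets a base that grows with seconds behind the winner.

    ASSUMPTION: linear scaling and a supplied penalty stand in for
    the real (unpublished) formulas.
    """
    winner_time = min(finish_times.values())
    return {
        athlete: (t - winner_time) * points_per_second + penalty
        for athlete, t in finish_times.items()
    }


def season_averages(races):
    """races: list of (finish_times, penalty) pairs, one per race.
    Returns athlete -> average race rating (lower = better)."""
    per_athlete = {}
    for finish_times, penalty in races:
        for athlete, rating in race_ratings(finish_times, penalty).items():
            per_athlete.setdefault(athlete, []).append(rating)
    return {athlete: mean(r) for athlete, r in per_athlete.items()}


# Hypothetical data: finish times in seconds, one penalty per race.
races = [
    ({"A": 960.0, "B": 975.0, "C": 990.0}, 10.0),
    ({"A": 1000.0, "C": 1005.0}, 25.0),
]
averages = season_averages(races)
ranking = sorted(averages, key=averages.get)  # best (lowest) first
```

With this made-up data, athlete A averages 17.5, B averages 25.0 and C averages 35.0, so the ranking is A, B, C. Because only the gap to the winner is used, the course distance never enters the calculation.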
That's a very, very, very simplistic explanation, but it gives the general overview.
Now What?
I first implemented this for the 2015 Texas XC season. I built spreadsheets that had around 6k female athletes and 8k male athletes, compiled thousands of performances from 2015 xc and a few races from 2014, and began crunching numbers. After weeks of wanting to throw my computer out the window, things finally began to operate relatively smoothly.
Test #1:
Nike South Invitational:
Using the athlete ratings, I scored the 2015 Nike South boys elite race based on the teams in the race. Projected vs actual scores and finish place can be seen in the image to the right.
Test #2:
Austin Westlake Invitational:
Using the same process, I scored the Girls Varsity race at the 2015 Austin Westlake invite. Projected vs actual scores and finish place can be seen in the image to the right.
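For anyone curious how a projection like the two above could be put together, here is a rough Python sketch. It assumes the projection simply orders runners by season rating (lower = better) and then applies standard cross country team scoring: each team's first five finishers score, the sixth and seventh can displace other teams' runners, and the lowest total wins. Tie-breaks are ignored, and the function name, parameters and sample data are hypothetical.

```python
from collections import defaultdict


def project_team_scores(entries, ratings, scorers=5, max_counted=7):
    """entries: list of (athlete, team) pairs entered in the race.
    ratings: athlete -> season rating (lower = better).
    Returns [(team, projected_score), ...], best team first."""
    by_team = defaultdict(list)
    for athlete, team in entries:
        by_team[team].append(athlete)

    # Keep each complete team's best `max_counted` runners by rating.
    counted = []
    for team, athletes in by_team.items():
        if len(athletes) < scorers:
            continue  # incomplete teams do not figure in team scoring
        best = sorted(athletes, key=ratings.get)[:max_counted]
        counted.extend((ratings[a], team) for a in best)

    # Projected finish order: lowest rating first.
    counted.sort()
    places = defaultdict(list)
    for place, (_, team) in enumerate(counted, start=1):
        places[team].append(place)

    # Team score = sum of the first `scorers` places; lowest wins.
    scores = {team: sum(p[:scorers]) for team, p in places.items()}
    return sorted(scores.items(), key=lambda item: item[1])


# Tiny hypothetical example, scoring 2 runners per team for brevity.
entries = [
    ("a1", "North"), ("a2", "North"), ("a3", "North"),
    ("b1", "South"), ("b2", "South"), ("b3", "South"),
]
ratings = {"a1": 12.0, "a2": 22.0, "a3": 55.0,
           "b1": 20.0, "b2": 25.0, "b3": 40.0}
projection = project_team_scores(entries, ratings, scorers=2, max_counted=3)
```

In this toy example North projects to 4 points (places 1 and 3) and South to 6 (places 2 and 4). The sketch only knows about the athletes listed in `entries`, so it shares the caveat that a team not racing its full varsity squad will throw the projection off.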
Takeaways:
- Pretty dang accurate. Anything can happen in xc, so race projections are always a shot in the dark. They also assume every team is racing its full varsity squad, among many other factors.
- It's brutally time-consuming, so not very practical in that respect.
- Still testing and tweaking.
- Need to develop a program to handle the data, as it is already overloading my Excel spreadsheets and life.
Utah Application
I have begun implementing this system for Utah. The first boys release is out, and I hope to have the girls' ratings posted by next week. The boys database is built, including 1,600 athletes and over 37,000 performances.
- You can see the boys all-state team scores HERE
- MileSplit subscribers have access to complete boys individual & team ratings by class HERE
- Interested in seeing projections for your upcoming meet?
- Send me teams/entries: travis.miller@flosports.tv