Like any analytics problem, let's start by getting our hands on data. For the optimization problem, we'll need at least two pieces of information:
- Salary information per player
- Player information (position, league)
- Player statistics / metrics to measure value
Ideally we could download this information straight from Draft Kings. However, I didn't want to create an account and it didn't seem straight forward. I took the easier route using Google to find someone that was already posting some of the data I needed.
Draft King Salaries
It was a bit difficult to access the salary information directly on draft kings, but
RotoGuru is nice enough to post the daily data for us. Using the
httr,
dplyr and
stringr packages was easy enough to scrape his website and pull down the salary data.
ESPN Game Score
Next up was some metrics and statistics for each player. My first though was go to ESPN, they have everything right? Well, yes, however, it wasn't easy to grab. Their
daily notes section gives lots of tips on who to pick up, including a nice metric called
Game Score for pitchers. Here's some code that we'll use to grab that data.
Fangraphs Advanced Metrics
Well, game score is certainly handy, but it'd be nice to have a great metric for hitters too. Since I'm a SABR person, I figured why not go for some advanced metrics. Fangraphs is a great site with articles discussing baseball in terms of advanced metrics and hosting an accompanying
glossary for those unfamiliar with them. Here's the code for downloading that data:
No comments:
Post a Comment