PGA Data Scraper

Taken from an effort by Patrick Young on Github

This scraper uses BeautifulSoup to scrape stats between 2010-2017 from the pgatour.com website that is built into a single pandas dataframe. In this notebook, we set the range of years (season) and pickle the dataframe so it can be used in multiple projects.

Patrick goes pretty far to build a very usable dataframe for analysis by merging a bunch of discrete dataframes into a master dataframe laid out to suit machine learning exercises.

 

View this notebook in your browser

 

 

Download the PGA Scraper Notebook