Web Scraping: Tables, PDFS, Ocr: Cleo O'Brien-Udry
Web Scraping: Tables, PDFS, Ocr: Cleo O'Brien-Udry
Cleo O’Brien-Udry
Yale University
25 May 2020
Web scraping: extract data from websites and store on your computer (or
an external server)
1 Find web-page
2 Identify location of relevant data on web-page
3 Import into R
4 Clean data
5 Repeat
How have global levels of voting changed over the last 50 years? Which
countries show similar patterns of turnout and registration; which show
different patterns?
How have global levels of voting changed over the last 50 years? Which
countries show similar patterns of turnout and registration; which show
different patterns?
Data we need:
Country voter turnout data
Covariates (country development indicators, VDEM indicators, etc.)
How have global levels of voting changed over the last 50 years? Which
countries show similar patterns of turnout and registration; which show
different patterns?
Data we need:
Country voter turnout data
Covariates (country development indicators, VDEM indicators, etc.)