Webscraping
These notebooks show how to collect data from websites and curate it into a dataset that other researchers can use. Unfortunately, there is no one-size-fits-all method for webscraping: each website and data source is different. These notebooks therefore document how I do webscraping, though there are many ways to do the same thing and get the same result.
Static Webscraping with BeautifulSoup
Learn to issue requests and parse page content for data collection and curation.
Dynamic Webscraping with Selenium
Automate website interaction in Python for data extraction.
Scraping Reddit
Use the pullpush.io API to discover the depth of Reddit data.
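To give a feel for the collect-then-curate workflow these notebooks cover, here is a minimal sketch. It is not taken from the notebooks: it uses only Python's standard library (`html.parser` and `csv`) as a stand-in for BeautifulSoup so that it runs without installing anything, and the sample HTML, the `LinkCollector` class, and the `scrape_links` and `to_csv` helpers are all hypothetical names made up for illustration.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical page content. A real scraper would download this
# with an HTTP request instead of hard-coding it.
SAMPLE_HTML = """
<html><body>
  <ul>
    <li class="book"><a href="/books/1">Dune</a></li>
    <li class="book"><a href="/books/2">Emma</a></li>
  </ul>
</body></html>
"""


class LinkCollector(HTMLParser):
    """Collect (text, href) pairs for every <a> tag that has an href."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._href = None  # href of the <a> tag currently open, if any
        self._text = []    # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.rows.append(("".join(self._text).strip(), self._href))
            self._href = None


def scrape_links(html):
    """Parse an HTML string and return the collected (text, href) rows."""
    parser = LinkCollector()
    parser.feed(html)
    return parser.rows


def to_csv(rows):
    """Curate the rows into CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["title", "url"])
    writer.writerows(rows)
    return buffer.getvalue()


rows = scrape_links(SAMPLE_HTML)
print(to_csv(rows))
```

The same two steps, extracting fields from page content and writing them out in a tabular format, apply whichever library does the parsing; only the extraction code changes from site to site.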