GW Calendar

2130 H Street NW, Washington DC 20052

Web scraping is a technique for extracting data from web pages using code. While great tools exist in Python and other programming languages, scraping modern websites poses many challenges. This workshop will help you understand the structure of web pages and identify the features of a website that may make scraping easy, hard, or (in some cases) impossible. We'll walk through some examples together, using Python and the tools available in your web browser. Some prior familiarity with Python or another programming language is highly recommended.


This workshop is part of the Using Programming and Code for Research series for anyone who wants to get started with, or learn more about, using programming languages like Python, R, or other applications. These tools can help you collect, manipulate, clean, analyze, and visualize research data or automate many repetitive tasks. If you need personalized assistance with a data analysis, programming, or coding project, consider booking a consultation with one of our librarian-experts. Learn more about our services for programming and coding and for working with data.

All sessions are free to GW students, faculty, staff, and alumni. GW has an institutional commitment to ensuring that all of our programs and events are accessible for all individuals. If you require any accommodations to participate in this event, please contact libraryevents@gwu.edu at least 72 business hours (3 business days) prior to the event.

Event Details