Lucy Training: Web Scraping with Python - Lucy Family Institute for Data & Society

Presenter: Yang Xu

Explore the World of Web Scraping with Scrapy

Web scraping enables you to fetch and extract data from web pages, opening doors to information that might otherwise remain hidden.

The workshop introduces web scraping and includes a hands-on project in which you scrape, parse, and extract the desired data from a webpage using Scrapy.

Scrapy is a widely used open-source Python framework for web scraping. It’s known for its speed, flexibility, and suitability for projects of all sizes.

What You’ll Learn

Grasp web scraping fundamentals.
Master the art of HTML parsing and extracting data.
Use regex for precise data retrieval.
Set up Scrapy for efficient data extraction.

Who Should Attend

This workshop is open to university students and researchers who would like to boost their data skills. Participants are expected to know Python basics, such as the common data types (list, dict, etc.) and how to write functions.

A Friendly Note

We’ll delve into the technical aspects of HTML and how it helps locate data, allowing us to create Python functions for quick extraction. To fully appreciate and make the most of the workshop, some programming experience is recommended.
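To give a feel for how HTML structure helps locate data, here is a minimal sketch that uses only Python's standard library (html.parser and re), not Scrapy itself. The sample HTML, the class names "title" and "price", and the helper names extract_spans and parse_price are all made up for illustration; they are not taken from the workshop materials.

```python
import re
from html.parser import HTMLParser

# Hypothetical page fragment, invented for this illustration.
SAMPLE_HTML = """
<ul>
  <li class="book"><span class="title">Python Basics</span> <span class="price">$12.50</span></li>
  <li class="book"><span class="title">Data Wrangling</span> <span class="price">$30.00</span></li>
</ul>
"""


class SpanCollector(HTMLParser):
    """Collect the text of every <span> whose class attribute matches."""

    def __init__(self, wanted_class):
        super().__init__()
        self.wanted_class = wanted_class
        self.inside = False   # True while we are inside a matching <span>
        self.values = []      # one string per matching <span>

    def handle_starttag(self, tag, attrs):
        if tag == "span" and dict(attrs).get("class") == self.wanted_class:
            self.inside = True
            self.values.append("")

    def handle_endtag(self, tag):
        if tag == "span":
            self.inside = False

    def handle_data(self, data):
        if self.inside:
            self.values[-1] += data


def extract_spans(html, wanted_class):
    """Return the stripped text of all <span class=wanted_class> elements."""
    parser = SpanCollector(wanted_class)
    parser.feed(html)
    parser.close()
    return [value.strip() for value in parser.values]


def parse_price(text):
    """Pull the first number out of a string with a regex, or return None."""
    match = re.search(r"\d+(?:\.\d+)?", text)
    return float(match.group()) if match else None


titles = extract_spans(SAMPLE_HTML, "title")
prices = [parse_price(p) for p in extract_spans(SAMPLE_HTML, "price")]
print(list(zip(titles, prices)))
```

The tag and class name tell the parser where the data lives, and the regex then cleans up the extracted text. A framework such as Scrapy offers selectors that do the locating step more concisely, but the underlying idea is the same.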

When and Where

This immersive workshop will be offered in person in Hesburgh Library on Oct. 3, 3:30 – 5 pm. Enrollment is limited to 10 participants.

Register Now!