Lucy Training Workshop: Quantitative Text Analysis in R

Presenter: Jessica Young

This workshop is broken into two parts; the first focused on cleaning text and the second on analysing cleaned text. Participants are highly encouraged to have at least an intro level understanding of R to make the most of the workshops.

For the first workshop, we will focus on understanding what clean and tidy text means. After learning different ways to read in a variety of documents (such as word, csv, txt, etc.), we will go through built-in functions for cleaning text and then delve into using regular expressions to do more precise cleaning.

The second workshop will focus on getting text summaries and running analyses on cleaned data. Beginning with an understanding of the different major types of analyses done in most social sciences, we will then delve into what type of summaries are normally required for papers and how to create them. The majority of part 2 will focus on understanding and running topic modeling and key word analysis in R.

These workshops will be offered via Zoom and is open to undergraduate and graduate students, faculty, and staff. 

Quantitative Text Analysis in R: Part 1 will take place on Thursday, February 8, from 1-2pm.

Quantitative Text Analysis in R: Part 2 will follow on Thursday, February 15, from 1-2pm.

The workshops require a minimum of 10 registrants.  Fewer than 10 registrants by the registration deadline (February 5) will result in the workshop being cancelled. In the event of cancellation, registrants will be notified.

Please register here by February 5.