HTRC Derived Datasets (HTRC)
Event box

This session will introduce you to HTRC’s derived datasets, how they can be used and for which types of research methods they are suitable. After each dataset is introduced, we’ll get hands-on working with Python code in Google Colab notebooks, where we’ll use the Extracted Features 2.0 and BookNLP for English-Language Fiction datasets to conduct exploratory data analysis and visualization.
Note: working in Google Colab notebooks requires a Google account
Recommended prerequisites: Either Introduction to HathiTrust and HTRC workshop, or some previous experience with HathiTrust or HTRC.
See here for all upcoming HTRC workshops
Check out our Research & Scholarship guides (including TextDataMining Guide with information on HTRC) for self-guided help. For questions, Ask Us.
Any person who requires a reasonable accommodation on the basis of a disability in order to participate in this program should contact htrc-help@hathitrust.org. at least one week prior to the event to arrange for the accommodation.
Presented by HTRC
Related LibGuide: Text Data Mining by SMU Libraries
- Date:
- Monday, October 28, 2024
- Time:
- 2:00pm - 3:30pm
- Audience:
- Faculty/staff Graduates Undergraduates
- Categories:
- Scholarship and Research > Digital Scholarship