The HathiTrust Research Center (HTRC) is the research arm of HathiTrust. It develops tools and resources that enable text or computational analysis of the HathiTrust corpus. This corpus or digital library includes over 10 million volumes (mostly books and journals), 3 million of which are in the public domain. It covers 400 languages and publication dates from 1500 to the present day, representing a broad variety of subjects.
Getting started
- Please visit our guide for details on how to access the HathiTrust Digital Library and HathiTrust Analytics
Learning resources
Workshops
Text Analysis Tasting Menu: A Sampling of Available Tools
- This workshop covers a number of text and data mining tools, including the HathiTrust Research Center (start at 47:20 for a short demo)
- Download the accompanying slides and a cheat sheet comparing various text analysis platforms
Additional resources
- HathiTrust Research Center's documentation guide at the University of Illinois
- HathiTrust Research Center's workshop page
- Materials from previous workshops are available on Google Drive and via the University of Illinois' "train the trainer" curriculum
Utilities