HathiTrust Digital Library: Getting Started

The HathiTrust Digital Library brings together the immense collections of partner institutions in digital form, preserving them securely so they can be accessed and used today and by future generations.

Creating an Account

To get started, go to the HathiTrust Research Center (HTRC) website and click the “Sign Up” button in the upper-right corner of the page. To access much of the functionality, you will need to create an account using your university email address and a password of your choosing.

You can find directions and step-by-step tutorials for using the HTRC in the Research Center's documentation: https://wiki.htrc.illinois.edu/x/CAAb

HTRC Tools and Services

The HTRC provides tools and services for doing text analysis with the HathiTrust collection.

Algorithms

The HTRC includes off-the-shelf algorithms that you can use for basic text analysis tasks, such as topic modeling or making a word cloud. Learn more on the HTRC documentation wiki: https://wiki.htrc.illinois.edu/x/HoJnAQ

HathiTrust+Bookworm

This visualization tool lets you explore word frequency over time. You can read more in this guide: http://guides.library.illinois.edu/htbookworm or on the HTRC documentation wiki: https://wiki.htrc.illinois.edu/x/AoCXAQ

HTRC Derived Datasets

The HTRC releases datasets for text analysis, such as the Extracted Features dataset, which includes words, word counts, and page-level metadata for volumes in the HathiTrust. Learn more here: https://wiki.htrc.illinois.edu/x/WQCGAQ
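To make the idea of page-level word counts concrete, here is a minimal Python sketch that sums per-page counts into volume-level totals. The `volume` dictionary, its keys (`pages`, `seq`, `token_counts`), and the `volume_word_counts` helper are hypothetical and simplified for illustration. They are not the actual Extracted Features file format, which is described in the HTRC documentation linked above.

```python
from collections import Counter

# Hypothetical, simplified stand-in for one volume's page-level data.
# The real Extracted Features files use their own schema; this only
# illustrates the general shape "page -> word -> count".
volume = {
    "title": "Example Volume",  # illustrative metadata, not a real record
    "pages": [
        {"seq": 1, "token_counts": {"whale": 3, "sea": 2}},
        {"seq": 2, "token_counts": {"sea": 4, "ship": 1}},
    ],
}


def volume_word_counts(vol):
    """Sum page-level word counts into one volume-level Counter."""
    totals = Counter()
    for page in vol["pages"]:
        totals.update(page["token_counts"])
    return totals


counts = volume_word_counts(volume)
print(counts.most_common(2))  # [('sea', 6), ('whale', 3)]
```

Because the dataset provides counts rather than full text, this kind of aggregation can be done without access to the underlying page text.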

HTRC Data Capsules

Researchers can provision their own secure virtual machine "capsule" for running their own advanced text analysis workflows. Results are vetted before they are released to the researcher. Documentation is available here: https://wiki.htrc.illinois.edu/x/SAFRAQ

HTRC Introduction Video

Want to learn more about HTRC? This video gives an overview of the research center.

 

Credit and Licensing

Credit

Adapted from A Guide to the HathiTrust Research Center from the University of Illinois Urbana-Champaign Library Scholarly Commons.

Licensing

Except where otherwise indicated, original content in this guide is licensed under a Creative Commons Attribution (CC BY) 4.0 license. You are free to share, adopt, or adapt the materials. We encourage broad adoption of these materials for teaching and other professional development purposes, and invite you to customize them for your own needs.