Skip to main content

Text mining: Web-based resources

Open Access Resources

Login Required, but not licensed by the Libraries

The following links resources are available to anyone in the Penn State Community using their @psu.edu access account.

Specific Library Databases Allowing Text Mining

Most of the libraries' databases do not allow text or data mining due to our license agreements with the vendors. However, some do include permission to do this, and the following providers have kindly offered some text and data mining options for our users. We will continue to work with database vendors to include TDM into future license agreements. 
 
Unless someone is specified as a point of contact, please contact your subject specialist to initiate the process. Some of these links are to specific text mining platforms, rather than full-text access. Any unauthorized web scraping of these databases can result in the vendor cutting off access to the entire campus!
 
At the present moment, LexisNexis does not officially support or provide data/text mining options. However, since text files can be batch downloaded (up to 500 articles at a time in one text file) it is possible to compile your own corpus. Contact Heather Froehlich, Literary Informatics Librarian, for help. Below are the databases we have negotiated TDM for: