Skip to main content

Special Collections: Web Archiving

Information about Penn State's Web archiving program and resources used in web archiving at the Special Collections Library


According to the International Internet Preservation Consortium (IIPC), "Web archiving is the process of collecting portions of the World Wide Web, preserving the collections in an archival format, and then serving the archives for access and use." Penn State University Libraries engages in web archiving to capture, preserve and provide access to web-based content that fits the Libraries’ collection development mission. 

Penn State University Libraries uses the Archive-It vended service from the Internet Archive to crawl and capture content on the Web. More information about Archive-It can be found online at The full scope of University Libraries’ web archives can be found online at

Web content is complex and the technologies used to crawl and capture it are imprecise. Please be aware, as you navigate the content of an archived website, that some content may not have been captured accurately or not included in the scope of the crawling activity, resulting in broken links. If you encounter missing content that you feel should be available, please let us know.