Project to produce comprehensive digital archive of 60 million pages of federal government documents.

Boston Public Library joins project as first Contributing Library, will furnish 50-year run of Congressional Hearings to be digitized.
Omidyar Network and Kahle-Austin Foundation fund phase 1 of two-year project.
Call for participants and investors for phase 2 announced.

SEBASTOPOL, CA—December 27, 2007—Public.Resource.Org, the Internet Archive, and the Boston Public Library announced today the commencement of phase 1 of a project that aims to create a comprehensive digital archive of 60 million pages of government documents over the next two years.

Phase 1 of the project will produce a minimum of 2.5 million pages of digital text using a scanning and optical character recognition (OCR) technology suite developed by the Internet Archive. The Boston Public Library is the first Contributing Library in the program, and has agreed to lend a 50-year run of Congressional Hearings from 1936–1986, as well as a complete copy of the Catalog of Copyright Entries. Scanning will take place at the Boston Library Consortium's Northeast Regional Scanning Center.

Phase 1 is funded by a $250,000 matching grant challenge from Omidyar Network which was met with a matching grant of $250,000 from the Kahle-Austin Foundation.

Phase 2 of the program will produce a full digital archive of 60 million pages of government documents, consisting of the entire corpus of federal publications such as the Congressional Record and the Federal Register. The total cost of Phase 2 is estimated at $6 million and will take approximately two years.

The Internet Archive and Public.Resource.Org are currently soliciting interested parties who might wish to contribute to Phase 2 funding or participate as Contributing Libraries. Contributions to this project are tax-deductible under section 501(c)(3) of the IRS Code.

The scanned documents produced by this project will remain in the public domain and be available in bulk without restriction. In addition to becoming a part of the libraries at the Internet Archive and the Boston Public Library, the project will actively solicit and promote use of these data by commercial, noncommercial, and government groups.

Media Contacts

Brewster Kahle Maura Marx Carl Malamud
Internet Archive Boston Public Library Public.Resource.Org
+1.415.561.6767 +1.617.859.2386 +1.707.827.7290
brewster at archive dot org mamarx at bpl dot org carl at media dot org

About the Internet Archive

The Internet Archive is a 501(c)(3) non-profit that was founded to build an Internet library, with the purpose of offering permanent access for researchers, historians, and scholars to historical collections that exist in digital format.

About the Boston Public Library

The Boston Public Library's mission is to preserve and provide free access to the historical record of our society while serving the cultural, educational and informational needs of the people of City of Boston and the Commonwealth of Massachusetts. For more than 150 years, the Library has been able to fulfill its promise “Free to All.”

About Public.Resource.Org

Public.Resource.Org was founded in 2007 to spearhead the creation of public works projects for the Internet. A 501(c)(3) registered public charity, Public.Resource.Org has worked across all three branches of the U.S. government to enhance the public domain.