Collection Overview

Indiana: State and Local Documents seeks to preserve and facilitate access to government information produced by the agencies of the state government and local governments within the state of Indiana. With emphasis on publications and documents made available on official websites of the Indiana state government agencies, the collection captures the websites of these sites every month. While focus is at the state level, the collection also includes major cities and will ultimately also seek to preserve county websites on an annual basis.

Citing Web Sites in the Archive

Indiana: State and Local Documents. Archived by the Indiana University Libraries Web Archive at <accessed 3 December 2012>

Please cite individual seeds or web pages as follows:

"Title of web page." Title of Collection. Archived by the Indiana University Libraries Web Archive at [URL]. <accessed [date]>


"Indiana Arts Commission." Indiana: State and Local Documents. Archived by the Indiana University Libraries Web Archive at <accessed 2 December 2006>

Selection Criteria

Scope: Currently we are collecting the web sites of any state government agency. We have also identified 4 local government sites to monitor and will be adding additional during 2007.

Volume: Active seeds will be crawled quarterly; County government web sites will be manually archived annually.

Crawl Parameters:

  • Collection Dates: Start Date: July 1, 2006
  • How often captured: Monthly for state agencies; annually for city/counties.

Acquisition Parameters:

  • Depth: Complete
  • Breadth: Links are followed out to one external level.


Archive-It provides full text search capability for all public collections. Alternately, if you know the site you are looking for, enter the URL into the search box, and Archive-It will search for instances of that archived URL.

Archive-It enables searching of both the full text of web sites and the metadata that has been assigned to the seeds, or individual URL's.

The search tool used to provide full-text access to the Library's Web archive collections is powered by the open-source search engine, Nutch.

Some hints on searching:

  • Generally, search results are ranked by relevance according to several factors:
    • how often the query terms appear in the page relative to how often they appear throughout the collection
    • how often the query terms appear in the page compared to the length of the page
    • whether the query terms appear in the URL
    • whether the query terms appear in the hostname
  • The Boolean search default is AND.
  • If you know that what you're looking for is in a specific type of file, you can limit your search to just that format by adding type:[file type] to your search terms.
    • e.g., A PDF document about French Lick might be found using the following string: French Lick type:pdf.
  • If you want to find out about a topic discussed specifically on an archived web site, you can limit your search by adding site:[URL of archived site] to your search terms.
    • French Lick site: will find instances of the term French Lick on the Governor of Indiana's web site.
  • You can refine search results in the following ways:
    • The link to other versions will take you to a list of archived versions that were captured on different dates.
    • The more from... link will take you to other hits from that host.

Since the Indiana University Libraries have been archiving web sites only since spring 2006, you may wish to look for earlier versions of many of the sites in the Library's collections through the Internet Archive's general Wayback Machine. The Wayback Machine, however, is not text searchable; you must know the URL of the site that you would like to view.

Return to Government Information, Maps, and Microform Services.