Graduate School of Library and Information Science, UT Austin  


Shortcuts
Home
E-Reserves
 
GSLIS Links
GSLIS Home
Tutorial Junction
IT Services
 
Contact Info
 

Problems in Permanent Retention of Electronic Records

LIS 392P.12, Unique Number 45195

Instructor: Patricia K. Galloway

Class meeting locations: SZB 464 and (in April) Lab Annex

Class meeting period: Tuesday, 8:30-11:30

Office: SZB 566

Office hours: Tuesday, 1:30-2:30 PM or by appointment

Office telephone: 232-9220

Email: galloway@gslis.utexas.edu

Teaching assistant: Lori Eichelberger

Email: lkeich@gslis.utexas.edu

Office: SZB 445

Office hours: Tuesday 2:00-4:00 PM or by appointment

Course Description

This course is focused upon what happens to electronic records from all sources, including preservation reformatting, once they have crossed the "archival threshold" for permanent retention. As such the course will cover media refreshment, conversion to neutral formats vs emulation to retain original format, migration, and electronic records repository construction and administration. Projects addressing these issues in government agencies, worldwide as well as locally, will be discussed as case studies. Students will also be introduced to existing practices in the information technology field and their appropriateness to archival requirements: code vaulting and escrow; data warehousing; knowledge management. The course will include a consideration of the issues of authenticity and reauthentication in the long-term preservation of electronic records. Finally, we will address issues of access (including privacy and open records) in the context of WWW standards and digital library initiatives.

Objectives

The overall objective of this course is simple: if you as an archives professional are confronted with the need to construct an effective electronic records repository for permanent retention of records, you should know what the major difficulties are, what you need to do to meet them, and where you can go for help. More specifically, students will learn:

1) How to build a physical electronic records repository

2) How to build a logical electronic records repository

3) How to preserve the records with credible authenticity

4) How to provide access to permanent electronic records while keeping them secure

Since there are as yet no universally-accepted standards, you will learn through consideration of the emerging Open Archival Information System (OAIS) model what is required to construct and implement credible standards for your repository.

Assignments

Class participation (25% of grade): Students will be expected to read assigned readings and come to class prepared to discuss them. Assigned readings and the sequence of lectures will be directed at supporting the process of repository building in the project, so discussion in the class will be a vital part of progress on it. For lab classes, students will be expected to attend and participate actively. This class will be too small for anyone to slack!

Semester project (30% of grade): We will build a small records repository, guided by the OAIS model, to contain and give access to a body of archival material. This semester I am working with Dr. Susan Cisco (Railroad Commission as well as our own Records Management instructor) and officials of the Texas Department of Information Resources toward working out a solution for Texas state government email; the strongest idea being pursued is the possibility of providing a central repository for the whole of state government email (accommodating restricted access, administrative retention, digital signatures, comprehensive search capabilities, etc.) and framing a prototype project for designing it. I am proposing that we take a stab at designing it ourselves, so that when the process is undertaken there will be a proposal for people to look at.

During April the "repository" will actually be constructed as students carry out the following tasks, with each student concentrating on their own area of domain expertise:

1) select a corpus of email for experimentation, use text analysis tool to produce auxiliary subject line

2) place email messages in database "warehouse" with appropriate metadata

3) set up maintenance tasks and schedules

4) set up access front-end and usage monitoring

Lab sessions led by the instructor and teaching assistant will be used to get each step started, and students will complete any step not completed in lab outside of class; instructor and teaching assistant will be available to assist as needed. Grading for the project will be on the basis of the instructor's observations of students' efforts, the overall success of the project, and students' evaluations of their their own and each others' contributions to the project.

Domain expert pathfinder (20% of grade): A set of "domain expert" assignments will be discussed at the first class and allocated at the second. Each student will be expected to create a web page on the course server which provides a resource for expertise in their chosen domain, which should include materials researched in developing domain specifications (see below).

Domain specifications (25% of grade): Each student will write a set of specifications for the portion of the repository requirements covered by their domain expertise according to a preset format. Due date for the specifications, March 29, is not negotiable because the specifications will be implemented in the April lab classes; a penalty of one-half letter grade will be assessed for each day the essay is late.

Text(s):

I have ordered the following text, which is available at the Co-op:

Gregory S. Hunter, Preserving Digital Information: A How-To-Do-It Manual (New York: Neal-Schuman, 2000).

Although this text is a handy repository of useful information and references, you should not therefore assume that anyone knows “how to do it.” We will have additional reading assignments to get closer to the cutting edge than this appreciation of the field offers. Students will be expected to read this whole text, whether all segments are assigned for class discussion or not, since the book contains more information of the preservation of digitized objects than we will have time to address in class.

Class Schedule

January 15: Course overview

Discussion:

Course content

Student experience and other courses

Reading:

Hunter, Preserving Digital Information, Chapter 1, Digital Information: The Preservation Challenge”


January 22: Preservation issues: reliability, authenticity, custodianship

Readings:

Hunter, Preserving Digital Information, Chapter 3, “Recent Research in Electronic Records and Digital Preservation”

Luciana Duranti, "Reliability and Authenticity: The Concepts and their Implications," Archivaria 39:1-10.

Heather MacNeil, "Conceptualizing an Authentic Electronic Record," available at: http://www.interpares.org/documents/hm_saa_2000.pdf

David Bearman, "Virtual Archives," available at: http://web.archive.org/web/20001022231601/www.sis.pitt.edu/~nhprc/prog6.html


January 29: Preservation issues: conversion, migration, emulation, reauthentication

Readings:

Hilary Berthon, "The Moving Frontier: Archiving, Preservation and Tomorrow's Digital Heritage."www.nla.gov.au/nla/staffpaper/hberthon2.html

Margaret Hedstrom, "Research Issues in Migration and Long-Term Preservation"  http://web.archive.org/web/20001027145531/http://www.sis.pitt.edu/~cerar/s5-mh.html

Jeff Rothenberg, "Avoiding Technological Quicksand: Finding a Viable Technical Foundation for Digital Preservation," available at www.clir.org/pubs/reports/rothenberg/contents.html

David Bearman, "Reality and Chimeras in the Preservation of Electronic Records," available at: www.dlib.org/dlib/april99/bearman/04bearman.html

Stewart Granger, "Emulation as a Digital Preservation Strategy."  www.dlib.org/dlib/october00/granger/10granger.html


February 5: Preservation issues: specific genre concerns

Readings:

Hunter, Preserving Digital Information, Chapter 5, “Electronic Mail and Web Pages,” and Chapter 6, “Digital Imaging and Preservation”

Anne R. Kenney et al., "Preservation Risk Management for Web Resources: Virtual Remote Control in Cornell's Project Prism," D-Lib Magazine 8,1 (January 2002), available at: http://www.dlib.org/dlib/january02/kenney/01kenney.html

Charles McClure and Timothy Sprehe, Guidelines for Electronic Records Management on State and Federal Agency Websites, available at: http://istweb.syr.edu/~mcclure/guidelines.html

Gregory Lawrence, William Kehoe, Oya Rieger, William Walters, and Anne Kenney, Risk Management of Digital Information: A File Format Investigation, CLIR Report # 93, 2000, available at: http://www.clir.org/pubs/reports/pub93/contents.html


February 12: Preservation issues: managing terabytes; digital archaeology

Readings:

Michael Lesk, “How Much Information is there in the World?” http://www.lesk.com/mlesk/ksg97/ksg.html

Reagan Moore et al., "Collection-Based Persistent Digital Archives--Part 1," D-Lib Magazine, March 2000: www.dlib.org/dlib/march00/moore/03moore-pt1.html

--Part 2, D-Lib Magazine, April 2000: www.dlib.org/april00/moore/04moore-pt2.html

Hunter, Preserving Digital Information, Chapter 4, “Storage, Handling and Preservation Best Practices”

William E. Underwood, "Analysis of Presidential Electronic Records: Final Report," available at: perpos.gtri.gatech.edu/perpos/Final_Report.pdf


February 19: Access issues: The role of metadata

Readings:

Anne Gilliland-Swetland, “Setting the Stage,” in Introduction to Metadata: Pathways to Digital Information, available at: www.getty.edu/gri/standard/intrometadata/2_articles/index.htm

Tony Gill, “Metadata and the World Wide Web,”, in Introduction to Metadata: Pathways to Digital Information, available at: in Introduction to Metadata: Pathways to Digital Information, available at: www.getty.edu/gri/standard/intrometadata/2_articles/index.htm

Mary Woodley, “Crosswalks: the Path to Universal Access?” in Introduction to Metadata: Pathways to Digital Information, available at: www.getty.edu/gri/standard/intrometadata/2_articles/index.htm

Traugott Koch and Stuart Weibel, "Introduction to a Special Issue on Metadata: Selected Papers from the Dublin Core 2001 Conference," Journal of Digital Information 2,2 (2002-01-04), available (this page provides links to all the papers) at http://jodi.ecs.soton.ac.uk/Articles/v02/i02/editorial/

Clifford Lynch, “Access Management for Networked Information Resources,” available at: http://www.cni.org/projects/authentication/authentication-wp.html#Appendix


February 26: How to build a repository: physical/technical issues

Readings:

OAIS model: www.ccsds.org/RP9905/RP9905.html (read sections 1-3)

Texas state computing installation specs: www.state.tx.us/ftp/pub/adrocpol.txt


March 5: How to build a repository: three logical models

Readings:

CEDARS/OAIS model (two documents):

Kelly Russell, "Digital Preservation and the Cedars Project Experience," available at: http://www.rlg.ac.uk/events/pres-2000/russell.html

David Holdsworth and Derek M. Sergeant, “A Blueprint for Representation Information in the OAIS Model,” use the "Wayback Machine" at http://www.archive.org to find: http://gps0.leeds.ac.uk/~ecldh/cedars/nasa2000/nasa2000.html

Data warehouse model: William H. Inmon, Building the Data Warehouse, 2nd ed., Chapter 2; available on e-reserves in two files, part 1 and part 2.

Code vaulting/"escrow" model: "Articles on Escrow (18 short "Cases in Point") at: http://www.dsiescrow.com/techescrow/index.html (choose "How-To Guides," then "articles on escrow")


March 12: SPRING BREAK


March 19: How to build a repository: deciding on components of a logical model

Readings:

Hunter, Preserving Digital Information, Chapter 7, “preserving the Information System”

TSLAC, “Functional Requirements” (1999), available at http://www.tyc.state.tx.us/errc/erecfuncreq.doc

TSLAC, “Electronic Records Standards and Procedures” 2000, available at http://www.tsl.state.tx.us/slrm/state/stateerules.pdf

Public Record Office, Management, Appraisal, and Preservation of Electronic Records, Chapter 5, “Preservation of Electronic Records,” available at: www.pro.gov.uk/recordsmanagement/eros/guidelines/procedures5.htm


March 26: Structuring and implementing a metadata set

Readings:

OIAS model, sections 4-6 and annexes

OCLC/RLG Working Group on Preservation Metadata, “Preservation Metadata for Digital Objects: A Review of the State of the Art,” available at http://www.oclc.org/research/pmwg/documents.shtm

OCLC/RLG Working Group on Preservation Metadata, “A Recommendation for Content Information,” available at: http://www.oclc.org/research/pmwg/documents.shtm

 

April 2: Intake of records to repository (reception of SIP) (lab)

Discussion of domain specifications for transfer, conversion


April 9: Records into database with metadata (AIP) (lab)

Discussion of domain specifications for database function and structure


April 16: Set up to provide maintenance of records (lab)

Discussion of domain specifications for migration, security


April 23: Integrate access with WWW and set up usage monitoring (DIP) (lab)

Discussion of domain specifications for web server function/linkage


April 30: Complete any remaining tasks

Class survey