    INF 389G Introduction to Electronic and Digital Records, Unique 28400 - Schedule, Fall 2017

NOTE: This syllabus is definitely preliminary until the first class meets and may change slightly through the semester if new issues come up--so don't just print or download it and continue to refer to that version. URLs change constantly, so if you find a dead link please do both of the following two things:

1) Stick the URL into the Wayback Machine at http://www.archive.org and see if you can find it there; and

2) Let our TA know one way or the other: if you found it, the Wayback URL for it; or if you didn't find it, so we can do something about it.
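
If you would rather script step 1 than paste URLs in by hand, here is a minimal sketch in Python. It assumes the Internet Archive's public availability endpoint at https://archive.org/wayback/available and the JSON shape it normally returns; the function names are made up for illustration, and find_in_wayback needs a network connection.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the Internet Archive's public availability endpoint.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(dead_url):
    """Build the lookup URL that asks whether a capture of dead_url exists."""
    return AVAILABILITY_ENDPOINT + "?" + urllib.parse.urlencode({"url": dead_url})


def closest_snapshot(response):
    """Return the Wayback URL of the closest capture, or None if there is none."""
    closest = response.get("archived_snapshots", {}).get("closest")
    if closest and closest.get("available"):
        return closest.get("url")
    return None


def find_in_wayback(dead_url):
    """Hypothetical helper: run the lookup over the network."""
    with urllib.request.urlopen(availability_query(dead_url), timeout=30) as reply:
        return closest_snapshot(json.load(reply))
```

Whatever find_in_wayback returns (a Wayback URL, or None if nothing was captured) is what to pass along to the TA in step 2.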

August 31: Background, overview of the field

Reminder about ordering textbook: Jones and Teevan, Personal Information Management. See Text page.

Fill out questionnaire about educational, technical, and archival background (in class)

Discuss course and course assignments, including:

  • New Technology research assignment (15%) Start September 21
  • Personal digital archive management report (35%). Note that if you begin thinking about this project from the beginning, you can make use of class discussions to explore and sharpen your thinking about your own records. Due November 16
  • Digital Preservation Use Case project (35%) Due December 7
  • Class participation (15%)

Topic: Overview of the field of digital recordkeeping--history and ideas of interest in a changing social and communication environment. Lecture-discussion. Be prepared to participate by sharing some of your own experiences with digital materials.

September 7: What is a digital record and how can I deal with it?

New Technology research assignments handed out and discussed; protocol for preparation of Personal Information Management Plan discussed.

Topic: Definitions of "electronic and digital records" and the range of objects and environments that are implicated under this rubric. Discuss the archival view of digital records and the skills that seem to be required for coping with them. Students will be expected to have read the assignments and to be prepared to discuss them critically. For a start, read the readings and then prepare to discuss the questions below--which means: have an answer (or indeed another question) in mind and/or written down with your preparation notes for class.

Questions to prepare for discussion:
1) What do you consider the most valuable part of the archival perspective as outlined by Gilliland? Do you consider that any parts are now outdated, and if so, why?
2) How does your skillset fit with the New Skills inventory in general and as expanded in detail in the DigCCurr matrix? If you have questions about the matrix and the skills, please raise them. How might you update and add to your skills?

Readings

Anne J. Gilliland-Swetland, Enduring Paradigm, New Opportunities: The Value of the Archival Perspective in the Digital Environment (Washington: CLIR, 2000) http://www.clir.org/pubs/reports/pub89/pub89.pdf

Richard Pearce-Moses and Susan E. Davis, "Knowledge and Skills Inventory," in New Skills for a Digital Era (2006), pp. 1-31, available at http://www.archivists.org/publications/proceedings/NewSkillsForADigitalEra.pdf The proceedings of this conference are worth reading as a whole if and when you have time, not least because three of the case studies are from the UT iSchool.

Cal Lee, Matrix of digital curation knowledge and competencies, 2009. Available at http://www.ils.unc.edu/digccurr/digccurr-matrix.html BE SURE TO DRILL DOWN INTO THE DIMENSIONS!!

September 14: Official digital records and regulation by statute and computer code

Topic: How people use and work with electronic records/digital objects, including differences that may be introduced by the electronic medium. Discuss the electronic environment and how it is literally legislated from scratch by computer code (and the implications of "net neutrality"), how real legislation deals with paper and digital records, and how individuals manage their own records. Review the MDAH digital archives project and how it has evolved 1999-2010 (including Sovereignty Commission migration and Walker Sampson's work on moving digital archives to DSpace: http://wsampson.wordpress.com/ October 12 and earlier entries); also discuss the TERM email project and its "failure" (see October 7 also for email). How "record" and "evidence" are constituted and how hard it is to do this.

Questions to prepare for discussion:
1) What digital records should governments keep? Think of this in terms of an issue that interests you (Social Security, taxes, passports, gun licenses, etc.) and find out what the government of Texas actually requires for our discussion.
2) How do the realities of computer technology make it possible (or not) to keep digital records? You might work through this by asking what problems NARA has had in developing a digital archive (see links in the last reading). If you are interested in more detail about NARA's long struggle with e-records, see Bruce Ambacher (ed.), Thirty Years of Electronic Records.

Readings

Lawrence Lessig, Code and other Laws of Cyberspace, Part 1 (1999). Available on Canvas in two files, 1.1 and 1.2 , under Files. This is version 1, which Lessig refers to now as an "ancient text"; the revised version, revised via wiki, can be found as a free download or wiki at http://codev2.cc/ (you can add your own remix). If Lessig's work on how the digital environment changes the impact and meaning of traditional legal frameworks interests you, succeeding books include The Future of Ideas, Free Culture, and Remix.

Uniform Electronic Transactions Act summary: this act, passed in 1999, first made digital records officially acceptable in legal transactions. Find a short general summary at: http://www.uniformlaws.org/ActSummary.aspx?title=Electronic%20Transactions%20Act

"Electronic Records standards and procedures," from TSLAC, revised 2005: http://www.tsl.state.tx.us/slrm/recordspubs/stbull01.html  Also read the Texas public records statutes at Government Code, chapters 441.180-197 (Texas State Library and management of records), 551.021-023 (Open meetings records), 552 (Public Information law: especially subsections 101-136 listing exceptions and 272 on access to electronic records): http://www.statutes.legis.state.tx.us For a list of all the national and international standards that pertain to the official handling of digital records, see a listing here (alas removed from the DIR site and surviving only on the Internet Archive): http://web.archive.org/web/20100616080137/http://www.dir.state.tx.us/pubs/derm/standards/section1.htm

Mississippi Department of Archives and History Electronic Records Project, Management Standards (1999).

State Electronic Records Initiative (SERI; sponsored by CoSA): https://www.statearchivists.org/programs/state-electronic-records-initiative/ This project began in 2011 and is designed to run through 2015 with different aspects of the project.

National Research Council, Building an Electronic Records Archive at the National Archives and Records Administration: Recommendations for a Long-Term Strategy (2005). This was the attempt to bring the project up to speed after long delay. It's available freely downloadable from here: http://www.nap.edu/openbook.php?record_id=11332&page=R1

Federal News Radio, "NARA to suspend development of ERA starting in 2012," 12/17/2010, located here: http://www.federalnewsradio.com/congress/2010/nara-to-suspend-development-of-era-starting-in-2012

September 21: Personal digital information and regulation by computer code

Protocol for preparation of Personal Digital Archiving Plan discussed.

New Technology Presentation 1: VeraPDF 1.8, Lowrance and Lumpkins

Topic: Non-official born-digital (and born-again-digital) objects and how they are managed and preserved. Discuss digital personal records, with a focus on the student project to understand one's own digital belongings and manage a personal archive.

Questions to prepare for discussion:
1) How do the digital tools a person personally uses constrain what they create and what they can personally keep?
2) Is it possible and/or desirable to know and manage the full range of digital objects that a person presently creates? Think about an example or two and why it might be complicated to do so.
3) Is it possible and/or desirable to find out and understand how one's identity is represented on the Internet? How is this a new problem?
4) Should archivists or commercial vendors assist ordinary people to manage their digital belongings? Think of arguments in favor of each alternative.

Readings

Personal Information Management, ed. William Jones and Jaime Teevan (hereafter cited as PIM), pp. 3-75, chapters 1-4. These chapters will begin our investigation of how people really keep digital records, regardless of statutes.

Neil Beagrie, "Plenty of Room at the Bottom? Personal Digital Libraries and Collections," D-Lib 11(6), June 2005, available at http://www.dlib.org/dlib/june05/beagrie/06beagrie.html

Gabriela Redwine, Personal Digital Archiving, DPC Technology Watch Report 15-01, December 2015. Available at http://www.dpconline.org/docs/technology-watch-reports/1460-twr15-01/file

Peter Williams, Jeremy Leighton John, and Jan Rowland, "The Personal Curation of Digital Objects: a Lifecycle Approach," ASLIB Proceedings 61(4), 2009, 340-363. Available under journals from the PCL catalog.

Simson Garfinkel and David Cox, "Finding and Archiving the Internet Footprint" (2009): http://simson.net/clips/academic/2009.BL.InternetFootprint.pdf

Clive Thompson, "A Head for Detail," Fast Company 110 (November 2006): http://www.fastcompany.com/magazine/110/head-for-detail.html This essay is an entertaining account of Gordon Bell's "life-logging" experiment about which more anon.

September 28: Record granularity and metadata

New Technology presentation 2: Save the Bits (DPC), He and Newton

Use Case Assignments made and discussed

Topic: Implications for records creators, archives, and users of record-level description and the generation of metadata to provide it. Discuss the issue of descriptive granularity and review various metadata schemes. Investigate in class metadata created by programs.

Questions to prepare for discussion:
1) How does the need for bitstream-level metadata contradict or make problematic the adoption of minimal processing (aka MPLP) standards in an archive? Is there a solution to this contradiction?
2) What kinds of metadata are needed for keeping your own records? How does this differ (if it does) from the kinds needed for archival collections? For digital library collections?
3) Give/find an example of metadata among your own digital files.

Readings

David Bearman, "Item Level Control and Electronic Recordkeeping" (this is a classic 1996 article that makes a very important point while summarizing the Pittsburgh project; the entire Pittsburgh project website, by the way, had not been archived away from its active location and was lost by a site remodel for the department, but has been recovered and restored; there is a link to it on the Resources page together with the story of how it was recovered): http://www.archimuse.com/papers/nhprc/item-lvl.html

Dublin Core metadata set current version (2011); review this originally resource-discovery oriented metadata set and also investigate how it is expanded as Qualified Dublin Core (the "terms namespace") at http://dublincore.org/documents/dcmi-terms/
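
To get a feel for what a simple Dublin Core description looks like before class, here is a minimal sketch in Python that writes a few elements as XML. The dc namespace URI is the standard one for the fifteen-element set; the wrapper element name "record" and the sample values are assumptions chosen for illustration, not a required container format.

```python
import xml.etree.ElementTree as ET

# Standard namespace for the fifteen-element Dublin Core set.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Serialize a dict of Dublin Core element names and values as XML.

    The <record> wrapper is a hypothetical container used only for this sketch.
    """
    record = ET.Element("record")
    for element_name, value in fields.items():
        child = ET.SubElement(record, f"{{{DC_NS}}}{element_name}")
        child.text = value
    return ET.tostring(record, encoding="unicode")


example = dublin_core_record({
    "title": "INF 389G Introduction to Electronic and Digital Records",
    "date": "2017",
    "type": "Text",
})
print(example)
```

Try describing one of your own files this way and notice which elements you cannot fill in; that gap is part of what the qualified terms try to address.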

Review the work being done for their own internal consistency by the Library of Congress Metadata for Digital Content Working Group here: http://www.loc.gov/standards/mdc/index.html See also the Master Metadata List, available on Canvas.

Review work being done on the PREMIS 2.2 metadata set for digital preservation and read the first 20 pages of this document: http://www.loc.gov/standards/premis/v2/premis-2-2.pdf

Investigate the METS metadata set for packaging digital objects and be prepared to discuss the parts of a METS document by reading the "METS Overview and Tutorial" (2011): http://www.loc.gov/standards/mets/METSOverview.v2.html

October 5: Passive vs active systems for managing desktop records

New Technology presentation 3: Portico, Wahl and Zevnik

Topic: Records Management Applications (RMAs) versus careful and systematic exploitation of existing software. Review the Department of Defense 5015.2 EDMS-RM model and commercial implementations of 5015.2-compliant RMAs, practical efforts at implementation in Texas, Kansas, and Mississippi, automated vs creator-assigned classification, Microsoft's nascent efforts to invade this profit space using features of its widely-used integrated business system SharePoint, and a suggestion on why much of this is doomed to failure without further study of how people manage their "own" records.

Questions to prepare for discussion:
1) How might you be likely to be subjected to a digital records management application at work? If you have had such an experience, be prepared to tell us about it.
2) How detailed must a records management application be in order to actually manage records, all records? What does the STD 5015.2 suggest that it must cover? Are these expectations realistic?
3) Would you consider outsourcing your entire personal recordkeeping to Google or another cloud host? How would you set up such a thing, and what would you want to consider?

Readings

PIM, pp. 90-166, chapters 6-9.

DoD 5015.02 specifications (latest version, dated 2007). This is a big document, but I want you to look through it carefully so you can both see the level of detail that a government standard includes and understand what the federal standard proposes to be able to manage: http://www.esd.whs.mil/Portals/54/Documents/DD/issuances/dodm/501502std.pdf

Here is a blog entry on problems with DoD 5015.02 and its deployment by an expert, Don Lueders (be sure to check out the linked replies from others who don't agree): http://www.aiim.org/community/blogs/expert/On-Why-I-No-Longer-Support-the-DoD-50152-Standard

And here is a report of huge amounts of DoD data erased during the wars in the Middle East from operational laptops: http://www.propublica.org/article/lost-to-history-missing-war-records-complicate-benefit-claims-by-veterans

NARA, Continuing Study of Federal Agency Recordkeeping Technologies (2008), a report from NARA on how well the application of the standard outlined in STD 5015.2 is going in federal agencies: http://www.archives.gov/records-mgmt/resources/recordkeeping-tech-2008.pdf

Patricia Galloway, "Big Buckets or Big Ideas: Classification vs Innovation on the Enterprise 2.0 Desktop," (2008). This paper outlines the so-called "Big Buckets" approach to making desktop records management easier to use and questions its blanket usefulness for records that may be among the most important to keep, available here: http://armaedfoundation.org/wp-content/uploads/2016/12/BBpaper30.pdf

Barry Wheeler, "Personal Archiving--Year End Boot Camp," 1/20/2012 entry about real-life personal digital archiving in the Library of Congress-sponsored blog The Signal: Digital Preservation, by a Digital Projects Coordinator at the LoC: http://blogs.loc.gov/digitalpreservation/2012/01/personal-archiving-%E2%80%93-year-end-boot-camp/

October 12: Centralized vs distributed models: custodianship

New Technology presentation 4: Windows 10 Data Collection, Christner

Topic: Where digital records should be archived and by whom. Discuss the issue of traditional archival custodianship, the challenge of postcustodial models, and the emergence of best practice in the form of the OAIS repository model.

Questions to prepare for discussion:
1) Can digital archives be "a place"?
2) Should there be a distinction between public and private records?
3) Should public records preservation be outsourced? Why or why not?
4) What would the individual person's point of view be on custodianship? What cloud locations might individuals use?

Readings

Luciana Duranti, "Archives as a Place," Archives and Manuscripts 24(2): 242-255 (1996). Available on Canvas.

OCLC-RLG, "Trusted Digital Repositories: Attributes and Responsibilities," May 2002, AKA "OAIS Lite," available at: http://www.oclc.org/research/activities/past/rlg/trustedrep/default.htm

Library of Congress, "How to preserve your own digital materials," http://www.digitalpreservation.gov/you/

October 19: Maintaining the archival bond: Provenance and context

New Technology presentation 5: Budapest OA Initiative, Berry and Forrest

Topic: Provenance and how to maintain it. Discuss what provenance is and how provenance can be provided for digital records; discuss the complexities of multiple or joint provenance issues and changes/accumulation of provenantial history over time.

Questions to consider for discussion:
1) How can you establish the provenance for records that you create? Experiment with this: just look at a file in one of your directories and then see what properties you can see about it (in Windows you'll at least be able to see stuff like when it was created); now open it in your word processor and look at properties again--you should see some additional information. Where is this information coming from? How accurate is it?
2) Go back to the 5015.02 requirements and the discussion we had around it; how does the 5015.2 STD propose to build in maintenance of provenance?
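
If you want to see the file-system half of the experiment in question 1 without clicking through Properties dialogs, here is a minimal sketch in Python. The function name and dictionary keys are made up for illustration. It reports only what the operating system records, not the properties embedded inside the document by your word processor.

```python
import os
import time


def filesystem_properties(path):
    """Collect the properties the operating system records about a file."""
    info = os.stat(path)
    return {
        "size_bytes": info.st_size,
        "modified": time.ctime(info.st_mtime),
        "accessed": time.ctime(info.st_atime),
        # st_ctime is the creation time on Windows but the time of the last
        # metadata change on Unix-like systems, so it is labeled cautiously.
        "created_or_changed": time.ctime(info.st_ctime),
    }
```

Call filesystem_properties on one of your own files, copy the file to another folder or drive, and call it again on the copy. Which values survive the copy, and what does that say about their accuracy as provenance?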

Readings

David Bearman and Richard Lytle, "The Power of the Principle of Provenance," from American Archival Studies: Readings in Theory and Practice, 2000, 345-360, available on Canvas.

Tom Nesmith, "Principle of Provenance," in Encyclopedia of Archival Science, ed. Luciana Duranti and Patricia Franks, 284-288. Available on Canvas.

Shelley Sweeney, "The Ambiguous Origins of the Archival Principle of 'Provenance'," Libraries & the Cultural Record, 43(2), 2008, 193-213. https://muse.jhu.edu/article/237428 (download the pdf).

October 26: Permanence: media, formats, migration, emulation

New Technology presentation 6: Twitter Privacy Update, 2017, Goff and Shook

Topic: How to preserve digital objects over time. We'll discuss two important aspects: what "digital preservation" means and what it is we are trying to preserve.

Questions to prepare for discussion:
1) Considering your personal records, what would you think of as "good enough" preservation for text? What about for photographs? (You can also refer to readings we have already discussed.)
2) What are the major obstacles that you have yourself seen to keeping digital objects that you have created over time?

Readings

Caroline Arms and Carl Fleischhauer, Sustainability of Digital Formats: Planning for Library of Congress Collections: http://www.digitalpreservation.gov/formats/intro/intro.shtml

Jeff Rothenberg, "Ensuring the Longevity of Digital Information," (1999) available at: http://www.clir.org/pubs/archives/ensuring.pdf
This is the serious advocacy piece for archival emulation; Rothenberg has continued to support the approach, and emulation has since become more and more important.

Maureen Potter, "Researching Long Term Digital Preservation Approaches in the Digital Preservation Testbed (Dutch Testbed Digitale Bewaring)," RLG Diginews, June 2002, available at: http://worldcat.org/arcviewer/2/OCC/2009/08/11/H1250008792610/viewer/file2.html

Phil Mellor, Paul Wheatley, and Derek Sergeant, "Migration on Request, a Practical Technique for Preservation" in Lecture Notes in Computer Science (Springer, 2002), 516-526. This piece shows the simple argument for why it is crazy to use the chain-of-interpreters form of migration. Available at http://www.springerlink.com/content/752vmvw0g0w40dj2/ If you can't get this without paying, go through the library catalog.

November 2: Guaranteeing authenticity: security vs access

New Technology presentation 7: How to download/delete your Google data, Barraza and Hernandez

Topic: Authenticity vs access. Discuss the requirements of security for the preservation of digital records.

Questions to prepare for discussion:
1) How can you make sure that a digital object has not been changed? How likely is it that visual inspection of any kind would be adequate?
2) What is an "authentic" digital object? How can a digital object be more or less authentic? Is this a black-and-white issue?
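
One common answer to question 1 is a fixity check: record a cryptographic digest of the file now and recompute it later. Here is a minimal sketch in Python using SHA-256 from the standard library. The function names are made up for illustration, and the choice of SHA-256 is an assumption rather than something the readings prescribe.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def unchanged(path, recorded_digest):
    """Return True if the file's current digest matches the recorded one."""
    return sha256_of(path) == recorded_digest
```

A matching digest shows that the bits are the same as when the digest was recorded. It says nothing about who created the object or whether the recorded digest itself can be trusted, which is where question 2 comes in.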

Readings

Peter Hirtle, "Archival Authenticity in a Digital Age," Authenticity in a Digital Environment (Washington: CLIR, 2000), 8-23; available at: http://www.clir.org/pubs/reports/pub92/hirtle.html The whole report is well worth reading for an overview, since the issues have not changed.

Geoffrey Yeo, "Contexts, Original Orders, and Item-Level Orientation: Responding Creatively to Users' Needs and Technological Change," Journal of Archival Organization 2015 (vol. 12, nos. 3-4), 170-185.

InterPARES, "Findings on the Preservation of Authentic Electronic Records," September 2002; this is the set of principles that the National Archives is using, for better or for worse (focus on pages 11-21), was available at: http://www.gseis.ucla.edu/us-interpares/pdf/InterPARES1FinalReport.pdf; as an exercise, find this by shoving the URL into the Wayback Machine. If you are impatient, use this: http://www.interpares.org/display_file.cfm?doc=ip1_usa_final_report.pdf What does this tell you about how even we should take care to make sure that our links are somehow maintained?

Borja Sotomayor, The Globus Toolkit 4 Programmer's Tutorial (2009), Chapter 9: "Fundamental Security Concepts" (do all the sections of Chapter 9), was available at: http://gdp.globus.org/gt4-tutorial/multiplehtml/ch09.html Now to be found at: http://tkg.im.ncue.edu.tw/wp-content/uploads/2010/04/globus_programers_tutorial.pdf Painful, isn't it?

November 9: Genres of digital records and their management

Guest Speaker: Katherine Cranford, Office of Austin City Clerk, to talk about archiving social media

Topic: Genres of digital records that lack paper analogs and their characteristics and problems. Review of desktop applications output, email, SMS/IM, websites/blogs/wikis, databases, still images, audio and video, etc. Note that many of these genres, especially (but not exclusively) when they are owned by individuals, are migrating into the cloud or never lived anywhere else.

Questions to prepare for discussion:
1) Review these categories of digital objects in terms of your own personal information plan: which of these do you have? Where are they? What do they mean to you?
2) How important are format standards here? What are format standards for? Do you know what the formats of all of your nontext holdings are?

Readings: General overview for records management:

Generally Accepted Recordkeeping Principles: http://www.arma.org/docs/default-source/default-document-library/generally-accepted-recordkeeping-principles_for_website.pdf

Recordkeeping System Functional Requirements. Available on Canvas.

Building a National Strategy for Preservation: Issues in Digital Media Archiving (CLIR, April 2002) provides a series of short summaries of the problems of different genres and media: http://www.clir.org/pubs/reports/pub106/contents.html

Abby Smith, "Distributed Preservation in a National Context," D-Lib Magazine, Vol. 12, No. 6, June 2006, available at: http://www.dlib.org/dlib/june06/smith/06smith.html. Four years later, the Library of Congress reported on the progress of its NDIIPP project to carry out preservation of mostly non-text materials.

November 16: Dealing with ownership: Gating vs sharing

New Technology presentation 8: KIDS REACT TO OLD COMPUTERS (etc.), Dekoning and Martinez

Topic: Discussion of intellectual property issues in providing access to digital records. Also look at the issues raised by information that others own (including public information) available all over the Web for people to aggregate and sell.

Questions to prepare for discussion:
1) It's pretty easy to copy a digital object and use it for anything you want. For example, what do you think about music sharing and the reaction of the music industry?
2) And have you ever heard the expression "Information wants to be free"? What does that mean? How expensive is it to reproduce digital information?
3) Looking at the American copyright law document's listing of enactments, notice how new technologies affect the enactment of new law.
4) If you had written something of which you were very proud and wanted to share it with others, which of the Creative Commons licenses might you choose to protect it? Why did you choose it?
5) What might be the status of personally significant records/information/data that you don't physically control?

Readings

Lessig, The Future of Ideas, Chapter 6, "Commons Lessons," available on Canvas; and look at the Creative Commons website: familiarize yourself with what a CC license is and the kinds of them there can be. http://creativecommons.org/

Current U.S. copyright law, Circular 92: http://www.copyright.gov/title17/ This is a huge document. Look at the "statutory enactments" listed in the preface and then examine the appendices covering the major versions, including 1976 and following.

Peter Suber, "Open Access Overview" (2004). http://www.earlham.edu/~peters/fos/overview.htm

World Economic Forum (the folks from Davos), Personal Data: The Emergence of a New Asset Class (2011), is just one example of how interested others might be in your personal data, available here: http://www3.weforum.org/docs/WEF_ITTC_PersonalDataNewAsset_Report_2011.pdf

If you own a home, look it up by typing your address into Google and then see what some of the real estate sites know about you from public databases. If not, look yourself up on Spokeo.

Personal Digital Records Management Plan DUE

November 30: Access and markup: finding aids, internal markup, metadata, and search

Topic: Markup: what it is and what kinds are most important. Discuss markup as a resource discovery aid and especially the level of granularity of markup.

Questions to prepare for discussion:
1) How is it useful to embed tags into text? What kind of embedded tags do you use every day?
2) How are tags used on webpages to assist in search?
3) How is EAD markup related to digital objects kept in archives?
4) Will conventional finding aids to archival collections become obsolete? What does the literature tell us about how easy (or not) they are to use for different audiences?
5) What kind of value added does an archivist bring to a fonds by creating an archival finding aid?
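
For question 2, it can help to look at the tags a page actually carries. Here is a minimal sketch in Python that pulls the name/content pairs out of a page's meta tags using the standard library's HTML parser. The class and function names are made up for illustration, and the sample page is invented.

```python
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collect name/content pairs from <meta> tags in an HTML page."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        name = attributes.get("name")
        content = attributes.get("content")
        if name and content is not None:
            # Names are lowercased so "DC.title" and "dc.title" match.
            self.meta[name.lower()] = content


def meta_tags(html_text):
    """Return a dict of the meta tags found in html_text."""
    collector = MetaTagCollector()
    collector.feed(html_text)
    return collector.meta


sample_page = (
    '<html><head><title>Schedule</title>'
    '<meta name="keywords" content="archives, digital records">'
    '<meta name="DC.title" content="Schedule">'
    '</head><body></body></html>'
)
print(meta_tags(sample_page))
```

Save the source of a page you use often and run meta_tags on it; compare what the page says about itself with what a search engine shows for it.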

Readings

Text Encoding Initiative (TEI) reference page: http://www.tei-c.org/Support/Learn/intro.xml

Anne Gilliland-Swetland, "Popularizing the Finding Aid: Exploiting the EAD to Enhance Online Discovery and Retrieval in Archival Information Systems by Diverse User Groups," in Pitti and Duff (eds.), Encoded Archival Description on the Internet, 199-225 (2001). Available on Canvas.

Ian Witten, "Text Mining," in Practical Handbook of Internet Computing, 2005, ed. M.P. Singh, pp. 14-22: http://www.cs.waikato.ac.nz/~ihw/papers/04-IHW-Textmining.pdf

Marieke Guy and Emma Tonkin, "Folksonomies: Tidying up tags?," D-Lib 12(1), January 2006. http://www.dlib.org/dlib/january06/guy/01guy.html

Beth Yakel and Polly Reynolds, "The Next Generation Finding Aid..." Case study from New Skills for a Digital Era workshop, June 2006: http://www.archivists.org/publications/proceedings/NewSkillsForADigitalEra.pdf

Mary Flanagan and Peter Carini, "How Games can Help Us Access and Understand Archival Images," American Archivist 75 (Fall/Winter 2012), 514-537. Online via PCL journals.

To expand on this article, view the talk by Luis von Ahn here: http://www.cs.cmu.edu/~biglou/ (click on the item labelled "Google tech talk" to see the Human Computation video).

December 7: Digital Preservation Use Case Project presentations

Topic: Student teams will present their use case projects to the class and the class will offer comments on the presentations.

Digital Preservation Use Case Project report DUE