NOTE: This syllabus is definitely preliminary until the first class meets and may change slightly through the semester if new issues come up--so don't just print or download it and continue to refer to that version. URLs change constantly, so if you find a dead link, please do both of the following:
1) Stick the URL into the Wayback Machine at http://www.archive.org and see if you can find it there; and
2) Let our TA know one way or the other: if you found it, the Wayback URL for it; or if you didn't find it, so we can do something about it.
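For anyone who would rather script step 1, the lookup can be sketched in Python. This is a minimal sketch and not a course requirement: it assumes the Internet Archive's public availability endpoint at https://archive.org/wayback/available and the JSON shape that service returns. The helper names and the sample responses are made up for illustration so the code runs without a network connection.

```python
import json
from urllib.parse import urlencode

# Assumption: the Internet Archive's public availability endpoint.
AVAILABILITY_ENDPOINT = "https://archive.org/wayback/available"


def availability_query(dead_url):
    """Build the lookup URL that asks whether a dead link has an archived copy."""
    return AVAILABILITY_ENDPOINT + "?" + urlencode({"url": dead_url})


def closest_snapshot(response_text):
    """Return the archived URL from an availability response, or None if absent."""
    data = json.loads(response_text)
    snapshot = data.get("archived_snapshots", {}).get("closest")
    if snapshot and snapshot.get("available"):
        return snapshot.get("url")
    return None


# Hypothetical response text, shaped like the service's JSON, so the sketch runs offline.
SAMPLE_FOUND = json.dumps({
    "archived_snapshots": {
        "closest": {
            "available": True,
            "url": "http://web.archive.org/web/20100616080137/"
                   "http://www.dir.state.tx.us/pubs/derm/standards/section1.htm",
            "timestamp": "20100616080137",
            "status": "200",
        }
    }
})
SAMPLE_MISSING = json.dumps({"archived_snapshots": {}})

if __name__ == "__main__":
    print(availability_query("http://www.dir.state.tx.us/pubs/derm/standards/section1.htm"))
    print(closest_snapshot(SAMPLE_FOUND))    # the Wayback URL to pass along to the TA
    print(closest_snapshot(SAMPLE_MISSING))  # None: report that nothing was found
```

Either outcome is worth reporting: a snapshot URL if one exists, or the fact that the lookup came back empty.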
August 31: Background, overview of the field
Reminder about ordering textbook: Jones and Teevan, Personal Information Management. See Text page.
Fill out questionnaire about educational, technical, and archival background (in class)
Discuss course and course assignments, including:
- New Technology research assignment (15%) Start September 21
- Personal digital archive management report (35%). Note that if you begin thinking about this project from the beginning, you can make use of class discussions to explore and sharpen your thinking about your own records. Due November 16
- Digital Preservation Use Case project (35%) Due December 7
- Class participation (15%)
Topic: Overview of the field of digital recordkeeping--history and ideas of interest in a changing social and communication environment. Lecture-discussion. Be prepared to participate by sharing some of your own experiences with digital materials.
September 7: What is a digital record and how can I deal with it?
New Technology research assignments handed out and discussed; protocol for preparation of Personal Information Management Plan discussed.
Topic: Definitions of "electronic and digital records" and the range of objects and environments that are implicated under this rubric. Discuss the archival view of digital records and the skills that seem to be required for coping with them. Students will be expected to have read the assignments and to be prepared to discuss them critically. For a start, read the readings and then prepare to discuss the questions below--which means: have an answer (or indeed another question) in mind and/or written down with your preparation notes for class.
Questions to prepare for discussion:
1) What do you consider the most valuable part of the archival perspective as outlined by Gilliland? Do you consider that any parts are now outdated, and if so, why?
2) How does your skillset fit with the New Skills inventory in general and as expanded in detail in the DigCCurr matrix? If you have questions about the matrix and the skills, please raise them. How might you update and add to your skills?
Readings
Anne J. Gilliland-Swetland, Enduring Paradigm, New Opportunities: The Value of the Archival Perspective in the Digital Environment (Washington: CLIR, 2000) http://www.clir.org/pubs/reports/pub89/pub89.pdf
Richard Pearce-Moses and Susan E. Davis, "Knowledge and Skills Inventory," in New Skills for a Digital Era (2006), pp. 1-31, available at http://www.archivists.org/publications/proceedings/NewSkillsForADigitalEra.pdf The proceedings of this conference are worth reading as a whole if and when you have time, not least because three of the case studies are from the UT iSchool.
Cal Lee, Matrix of digital curation knowledge and competencies, 2009. Available at http://www.ils.unc.edu/digccurr/digccurr-matrix.html BE SURE TO DRILL DOWN INTO THE DIMENSIONS!!
September 14: Official digital records and regulation by statute and computer code
Topic: How people use and work with electronic records/digital objects, including differences that may be introduced by the electronic medium. Discuss the electronic environment and how it is literally legislated from scratch by computer code (and the implications of "net neutrality"), how real legislation deals with paper and digital records, and how individuals manage their own records. Review the MDAH digital archives project and how it has evolved 1999-2010 (including Sovereignty Commission migration and Walker Sampson's work on moving digital archives to DSpace: http://wsampson.wordpress.com/ October 12 and earlier entries); also discuss the TERM email project and its "failure" (see October 7 also for email). Finally, consider how "record" and "evidence" are constituted and how hard it is to do this.
Questions to prepare for discussion:
1) What digital records should governments keep? Think of this in terms of an issue that interests you (Social Security, taxes, passports, gun licenses, etc.) and, for our discussion, find out what the government of Texas actually requires.
2) How do the realities of computer technology make it possible (or not) to keep digital records? You might work through this by asking what problems NARA has had in developing a digital archive (see links in the last reading). If you are interested in more detail about NARA's long struggle with e-records, see Bruce Ambacher (ed.), Thirty Years of Electronic Records.
Readings
Lawrence Lessig, Code and other Laws of Cyberspace, Part 1 (1999).
Available on Canvas in two files, 1.1 and 1.2, under Files. This is version 1, which Lessig refers to now as an "ancient text"; the revised version, revised via wiki, can be found as a free download or wiki at http://codev2.cc/ (you can add your own remix). If Lessig's work on how the digital environment changes the impact and meaning of traditional legal frameworks interests you, succeeding books include The Future of Ideas, Free Culture, and Remix.
Uniform Electronic Transactions Act summary: this act, passed in 1999, first made digital records officially acceptable in legal transactions. Find a short general summary at: http://www.uniformlaws.org/ActSummary.aspx?title=Electronic%20Transactions%20Act
"Electronic Records standards and procedures," from TSLAC, revised 2005: http://www.tsl.state.tx.us/slrm/recordspubs/stbull01.html Also read the Texas public records statutes at Government Code, chapters 441.180-197 (Texas State Library and management of records), 551.021-023 (Open meetings records), and 552 (Public Information law: especially subsections 101-136 listing exceptions and 272 on access to electronic records):
http://www.statutes.legis.state.tx.us For a list of all the national and international standards that pertain to the official handling of digital records, see a listing here (alas removed from the DIR site and surviving only on the Internet Archive): http://web.archive.org/web/20100616080137/http://www.dir.state.tx.us/pubs/derm/standards/section1.htm
Mississippi Department of Archives and History Electronic Records Project, Management Standards (1999).
State Electronic Records Initiative (SERI; sponsored by CoSA): https://www.statearchivists.org/programs/state-electronic-records-initiative/ This project began in 2011 and is designed to run through 2015 with different aspects of the project.
National Research Council, Building an Electronic Records Archive at the National Archives and Records Administration: Recommendations for a Long-Term Strategy (2005). This was the attempt to bring the project up to speed after long delay. It is freely downloadable from here: http://www.nap.edu/openbook.php?record_id=11332&page=R1
Federal News Radio, "NARA to suspend development of ERA starting in 2012," 12/17/2010, located here: http://www.federalnewsradio.com/congress/2010/nara-to-suspend-development-of-era-starting-in-2012
September 21: Personal digital information and regulation by computer code
Protocol for preparation of Personal Digital Archiving Plan discussed.
New Technology Presentation 1: VeraPDF 1.8, Lowrance and Lumpkins
Topic: Non-official born-digital (and born-again-digital) objects and how they are managed and preserved. Discuss digital personal records, with a focus on the student project to understand one's own digital belongings and manage a personal archive.
Questions to prepare for discussion:
1) How do the digital tools a person personally uses constrain what they create and what they can personally keep?
2) Is it possible and/or desirable to know and manage the full range of digital objects that a person presently creates? Think about an example or two and why it might be complicated to do so.
3) Is it possible and/or desirable to find out and understand how one's identity is represented on the Internet? How is this a new problem?
4) Should archivists or commercial vendors assist ordinary people to manage their digital belongings? Think of arguments in favor of each alternative.
Readings
Personal Information Management, ed. William Jones and Jaime Teevan (hereafter cited as PIM), pp. 3-75 (chapters 1-4). These chapters will begin our investigation of how people really keep digital records, regardless of statutes.
Neil Beagrie, "Plenty of Room at the Bottom? Personal Digital Libraries and Collections," D-Lib 11(6), June 2005, available at http://www.dlib.org/dlib/june05/beagrie/06beagrie.html
Gabriela Redwine, Personal Digital Archiving, DPC Technology Watch Report 15-01, December 2015. Available at http://www.dpconline.org/docs/technology-watch-reports/1460-twr15-01/file
Peter Williams, Jeremy Leighton John, and Jan Rowland, "The Personal Curation of Digital Objects: a Lifecycle Approach," ASLIB Proceedings 61(4), 2009, 340-363. Available under journals from the PCL catalog.
Simson Garfinkel and David Cox, "Finding and Archiving the Internet Footprint" (2009): http://simson.net/clips/academic/2009.BL.InternetFootprint.pdf
Clive Thompson, "A Head for Detail," Fast Company 110 (November 2006): http://www.fastcompany.com/magazine/110/head-for-detail.html This essay is an entertaining account of Gordon Bell's "life-logging" experiment about which more anon.
September 28: Record granularity and metadata
New Technology presentation 2: Save the Bits (DPC), He and Newton
Use Case Assignments made and discussed
Topic: Implications for records creators, archives, and users of record-level description and the generation of metadata to provide it. Discuss the issue of descriptive granularity and review various metadata schemes. Investigate in class metadata created by programs.
Questions to prepare for discussion:
1) How does the need for bitstream-level metadata contradict or make problematic the adoption of minimal processing (aka MPLP) standards in an archive? Is there a solution to this contradiction?
2) What kinds of metadata are needed for keeping your own records? How does this differ (if it does) from the kinds needed for archival collections? For digital library collections?
3) Give/find an example of metadata among your own digital files.
Readings
David Bearman, "Item Level Control and Electronic Recordkeeping": http://www.archimuse.com/papers/nhprc/item-lvl.html This is a classic 1996 article that makes a very important point while summarizing the Pittsburgh project. The entire Pittsburgh project website, by the way, had not been archived away from its active location and was lost in a site remodel for the department, but it has been recovered and restored; there is a link to it on the Resources page together with the story of how it was recovered.
Dublin Core metadata set current version (2011); review this originally resource-discovery oriented metadata set and also investigate how it is expanded as Qualified Dublin Core (the "terms namespace") at http://dublincore.org/documents/dcmi-terms/
Review the work being done for their own internal consistency by the Library of Congress Metadata for Digital Content Working Group here: http://www.loc.gov/standards/mdc/index.html See also the Master Metadata List, available on Canvas.
Review work being done on the PREMIS 2.2 metadata set for digital preservation and read the first 20 pages of this document: http://www.loc.gov/standards/premis/v2/premis-2-2.pdf
Investigate the METS metadata set for packaging digital objects and be prepared to discuss the parts of a METS document by reading the "METS Overview and Tutorial" (2011): http://www.loc.gov/standards/mets/METSOverview.v2.html
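To make question 3 and the Dublin Core reading above more concrete, here is a minimal Python sketch of what a simple Dublin Core description looks like when written out as XML. It describes the Gilliland-Swetland report from the September 7 readings. The element names (title, creator, publisher, date, identifier) and the namespace URI come from the Dublin Core element set; the `record` wrapper element and the function name are hypothetical choices made only for this illustration.

```python
import xml.etree.ElementTree as ET

# Namespace of the fifteen-element Dublin Core set.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def dublin_core_record(fields):
    """Serialize (element name, value) pairs as Dublin Core elements in one record."""
    root = ET.Element("record")  # hypothetical wrapper element, not part of Dublin Core
    for name, value in fields:
        element = ET.SubElement(root, "{%s}%s" % (DC_NS, name))
        element.text = value
    return ET.tostring(root, encoding="unicode")


# Values taken from the citation in the September 7 reading list.
SAMPLE_FIELDS = [
    ("title", "Enduring Paradigm, New Opportunities: The Value of the "
              "Archival Perspective in the Digital Environment"),
    ("creator", "Anne J. Gilliland-Swetland"),
    ("publisher", "CLIR"),
    ("date", "2000"),
    ("identifier", "http://www.clir.org/pubs/reports/pub89/pub89.pdf"),
]

if __name__ == "__main__":
    print(dublin_core_record(SAMPLE_FIELDS))
```

Notice how little this says about the file itself (format, size, checksum); that gap is part of what the PREMIS and METS readings address.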
October 5: Passive vs active systems for managing desktop records
New Technology presentation 3: Portico, Wahl and Zevnik
Topic: Records Management Applications (RMAs) versus careful and systematic exploitation of existing software. Review the Department of Defense 5015.2 EDMS-RM model and commercial implementations of 5015.2-compliant RMAs, practical efforts at implementation in Texas, Kansas, and Mississippi, automated vs creator-assigned classification, Microsoft's nascent efforts to invade this profit space using features of its widely-used integrated business system SharePoint, and a suggestion on why much of this is doomed to failure without further study of how people manage their "own" records.
Questions to prepare for discussion:
1) How might you be likely to be subjected to a digital records management application at work? If you have had such an experience, be prepared to tell us about it.
2) How detailed must a records management application be in order to actually manage records, all records? What does the STD 5015.2 suggest that it must cover? Are these expectations realistic?
3) Would you consider outsourcing your entire personal recordkeeping to Google or another cloud host? How would you set up such a thing, and what would you want to consider?
Readings
PIM, pp. 90-166, chapters 6-9.
DoD 5015.02 specifications (latest version, dated 2007). This is a big document, but I want you to look through it carefully so you can both see the level of detail that a government standard includes and understand what the federal standard proposes to be able to manage: http://www.esd.whs.mil/Portals/54/Documents/DD/issuances/dodm/501502std.pdf
Here is a blog entry on problems with DoD 5015.02 and its deployment by an expert, Don Lueders (be sure to check out the linked replies from others who don't agree): http://www.aiim.org/community/blogs/expert/On-Why-I-No-Longer-Support-the-DoD-50152-Standard
And here is a report of huge amounts of DoD data erased during the wars in the Middle East from operational laptops: http://www.propublica.org/article/lost-to-history-missing-war-records-complicate-benefit-claims-by-veterans
NARA, Continuing Study of Federal Agency Recordkeeping Technologies (2008), a report from NARA on how well the application of the standard outlined in STD 5015.2 is going in federal agencies: http://www.archives.gov/records-mgmt/resources/recordkeeping-tech-2008.pdf
Patricia Galloway, "Big Buckets or Big Ideas: Classification vs Innovation on the Enterprise 2.0 Desktop," (2008). This paper outlines the so-called "Big Buckets" approach to making desktop records management easier to use and questions its blanket usefulness for records that may be among the most important to keep, available here: http://armaedfoundation.org/wp-content/uploads/2016/12/BBpaper30.pdf
Barry Wheeler, "Personal Archiving--Year End Boot Camp," 1/20/2012 entry about real-life personal digital archiving in the Library of Congress-sponsored blog The Signal: Digital Preservation, by a Digital Projects Coordinator at the LoC: http://blogs.loc.gov/digitalpreservation/2012/01/personal-archiving-%E2%80%93-year-end-boot-camp/
October 12: Centralized vs distributed models: custodianship
New Technology presentation 4: Windows 10 Data Collection, Christner
Topic: Where digital records should be archived and by whom. Discuss the issue of traditional archival custodianship, the challenge of postcustodial models, and the emergence of best practice in the form of the OAIS repository model.
Questions to prepare for discussion:
1) Can digital archives be "a place"?
2) Should there be a distinction between public and private records?
3) Should public records preservation be outsourced? Why or why not?
4) What would the individual person's point of view be on custodianship? What cloud locations might individuals use?
Readings
Luciana Duranti, "Archives as a Place," Archives and Manuscripts 24(2): 242-255 (1996). Available on Canvas.
OCLC-RLG, "Trusted Digital Repositories: Attributes and Responsibilities," May 2002, AKA "OAIS Lite," available at: http://www.oclc.org/research/activities/past/rlg/trustedrep/default.htm
Library of Congress, "How to preserve your own digital materials," http://www.digitalpreservation.gov/you/
October 19: Maintaining the archival bond: Provenance and context
New Technology presentation 5: Budapest OA Initiative, Berry and Forrest
Topic: Provenance and how to maintain it. Discuss what provenance is and how provenance can be provided for digital records; discuss the complexities of multiple or joint provenance issues and changes/accumulation of provenantial history over time.
Questions to consider for discussion:
1) How can you establish the provenance for records that you create? Experiment with this: just look at a file in one of your directories and then see what properties you can see about it (in Windows you'll at least be able to see stuff like when it was created); now open it in your word processor and look at properties again--you should see some additional information. Where is this information coming from? How accurate is it?
2) Go back to the 5015.02 requirements and the discussion we had around it; how does the 5015.2 STD propose to build in maintenance of provenance?
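The experiment in question 1 can also be tried from a script. The sketch below is only an illustration, assuming a standard Python installation: it reads the properties the file system keeps about a file, then deliberately resets the file's timestamp to show how easily that "evidence" can be rewritten. The file name and the function names are hypothetical.

```python
import os
import tempfile
from datetime import datetime, timezone


def file_properties(path):
    """Collect the basic properties the file system records about a file."""
    info = os.stat(path)
    modified = datetime.fromtimestamp(info.st_mtime, tz=timezone.utc)
    return {
        "name": os.path.basename(path),
        "size_bytes": info.st_size,
        "modified_utc": modified.isoformat(),
    }


def demo():
    """Write a small file, then reset its timestamp to show the value is editable."""
    with tempfile.TemporaryDirectory() as folder:
        path = os.path.join(folder, "letter.txt")  # hypothetical file name
        with open(path, "w", encoding="utf-8") as handle:
            handle.write("Dear class,")
        before = file_properties(path)
        # 946684800 seconds after the Unix epoch is 2000-01-01 00:00:00 UTC.
        os.utime(path, (946684800, 946684800))
        after = file_properties(path)
    return before, after


if __name__ == "__main__":
    before, after = demo()
    print(before)  # modification time is "now"
    print(after)   # same file, but it now claims to date from January 2000
```

The properties a word processor shows come from inside the file rather than from the file system, so the two sources can disagree; that disagreement is worth bringing to class.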
Readings
David Bearman and Richard Lytle, "The Power of the Principle of Provenance," from American Archival Studies: Readings in Theory and Practice, 2000, 345-360, available on Canvas.
Tom Nesmith, "Principle of Provenance," in Encyclopedia of Archival Science, ed. Luciana Duranti and Patricia Franks, 284-288. Available on Canvas.
Shelley Sweeney, "The Ambiguous Origins of the Archival Principle of 'Provenance'," Libraries & the Cultural Record, 43(2), 2008, 193-213. https://muse.jhu.edu/article/237428 (download the pdf).
October 26: Permanence: media, formats, migration, emulation
New Technology presentation 6: Twitter Privacy Update, 2017, Goff and Shook
Topic: How to preserve digital objects over time. We'll discuss two important aspects: what "digital preservation" means and what it is we are trying to preserve.
Questions to prepare for discussion:
1) Considering your personal records, what would you think of as "good enough" preservation for text? What about for photographs? (You can also refer to readings we have already discussed.)
2) What are the major obstacles that you have yourself seen to keeping digital objects that you have created over time?
Readings
Caroline Arms and Carl Fleischhauer, Sustainability of Digital Formats: Planning for Library of Congress Collections: http://www.digitalpreservation.gov/formats/intro/intro.shtml
Jeff Rothenberg, "Ensuring the Longevity of Digital Information," (1999) available at: http://www.clir.org/pubs/archives/ensuring.pdf
This is the serious advocacy piece about archival emulation, and Rothenberg has continued to support the approach as emulation has become more and more important.
Maureen Potter, "Researching Long Term Digital Preservation Approaches in the Digital Preservation Testbed (Dutch Testbed Digitale Bewaring)," RLG Diginews, June 2002, available at: http://worldcat.org/arcviewer/2/OCC/2009/08/11/H1250008792610/viewer/file2.html
Phil Mellor, Paul Wheatley, and Derek Sergeant, "Migration on Request, a Practical Technique for Preservation" in Lecture Notes in Computer Science (Springer, 2002), 516-526. This piece shows the simple argument for why it is crazy to use the chain-of-interpreters form of migration. Available at http://www.springerlink.com/content/752vmvw0g0w40dj2/ If you can't get this without paying, go through the library catalog.
November 2: Guaranteeing authenticity: security vs access
New Technology presentation 7: How to download/delete your Google data, Barraza and Hernandez
Topic: Authenticity vs access. Discuss the requirements of security for the preservation of digital records.
Questions to prepare for discussion:
1) How can you make sure that a digital object has not been changed? How likely is it that visual inspection of any kind would be adequate?
2) What is an "authentic" digital object? How can a digital object be more or less authentic? Is this a black-and-white issue?
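One common way to approach question 1 is a fixity check: record a cryptographic digest of the file when you take custody, then recompute and compare later. The Python sketch below is an illustration under stated assumptions, not a method prescribed by the readings. SHA-256 is simply one widely available choice, and the file name and function names are hypothetical. A matching digest shows only that the bits are unchanged since the digest was recorded; it says nothing about who created the object or whether it was authentic to begin with.

```python
import hashlib
import os
import tempfile


def sha256_of(path, chunk_size=65536):
    """Compute the SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_unchanged(path, recorded_digest):
    """Compare the file's current digest with the one recorded earlier."""
    return sha256_of(path) == recorded_digest


def demo():
    """Record a digest, verify it, then change one character and verify again."""
    with tempfile.TemporaryDirectory() as folder:
        path = os.path.join(folder, "minutes.txt")  # hypothetical file name
        with open(path, "wb") as handle:
            handle.write(b"Meeting called to order at 9:00.")
        recorded = sha256_of(path)
        intact = is_unchanged(path, recorded)
        with open(path, "wb") as handle:
            handle.write(b"Meeting called to order at 9:30.")
        altered = is_unchanged(path, recorded)
    return intact, altered


if __name__ == "__main__":
    print(demo())
```

The one-character edit in the demo would be easy to miss by eye, which bears directly on the second half of question 1.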
Readings
Peter Hirtle, "Archival Authenticity in a Digital Age," Authenticity in a Digital Environment (Washington: CLIR, 2000), 8-23; available at: http://www.clir.org/pubs/reports/pub92/hirtle.html The whole report is well worth reading for an overview, since the issues have not changed.
Geoffrey Yeo, "Contexts, Original Orders, and Item-Level Orientation: Responding Creatively to Users' Needs and Technological Change," Journal of Archival Organization 2015 (vol. 12, nos. 3-4), 170-185.
InterPARES, "Findings on the Preservation of Authentic Electronic Records," September 2002; this is the set of principles that the National Archives is using, for better or for worse (focus on pages 11-21), was available at: http://www.gseis.ucla.edu/us-interpares/pdf/InterPARES1FinalReport.pdf; as an exercise, find this by shoving the URL into the Wayback Machine. If you are impatient, use this: http://www.interpares.org/display_file.cfm?doc=ip1_usa_final_report.pdf What does this tell you about how even we should take care to make sure that our links are somehow maintained?
Borja Sotomayor, The Globus Toolkit 4 Programmer's Tutorial (2009), Chapter 9: "Fundamental Security Concepts" (do all the sections of Chapter 9), was available at: http://gdp.globus.org/gt4-tutorial/multiplehtml/ch09.html Now to be found at: http://tkg.im.ncue.edu.tw/wp-content/uploads/2010/04/globus_programers_tutorial.pdf Painful, isn't it?
November 9: Genres of digital records and their management
Guest Speaker: Katherine Cranford, Office of Austin City Clerk, to talk about archiving social media
Topic: Genres of digital records that lack paper analogs and their characteristics and problems. Review of desktop applications output, email, SMS/IM, websites/blogs/wikis, databases, still images, audio and video, etc. Note that many of these genres, especially (but not exclusively) when they are owned by individuals, are migrating into the cloud or never lived anywhere else.
Questions to prepare for discussion:
1) Review these categories of digital objects in terms of your own personal information plan: which of these do you have? Where are they? What do they mean to you?
2) How important are format standards here? What are format standards for? Do you know what the formats of all of your nontext holdings are?
Readings: General overview for records management:
Generally Accepted Recordkeeping Principles: http://www.arma.org/docs/default-source/default-document-library/generally-accepted-recordkeeping-principles_for_website.pdf
Recordkeeping System Functional Requirements. Available on Canvas.
Building a National Strategy for Preservation: Issues in Digital Media Archiving (CLIR, April 2002) provides a series of short summaries of the problems of different genres and media: http://www.clir.org/pubs/reports/pub106/contents.html
Abby Smith, "Distributed Preservation in a National Context," D-Lib Magazine, Vol. 12, No. 6, June 2006, available at: http://www.dlib.org/dlib/june06/smith/06smith.html. Four years later, the Library of Congress reported on the progress of its NDIIPP project to carry out preservation of mostly non-text materials.
November 16: Dealing with ownership: Gating vs sharing
New Technology presentation 8: KIDS REACT TO OLD COMPUTERS (etc.), Dekoning and Martinez
Topic: Discussion of intellectual property issues in providing access to digital records. Also look at the issues raised by information that others own (including public information) available all over the Web for people to aggregate and sell.
Questions to prepare for discussion:
1) It's pretty easy to copy a digital object and use it for anything you want. For example, what do you think about music sharing and the reaction of the music industry?
2) And have you ever heard the expression "Information wants to be free"? What does that mean? How expensive is it to reproduce digital information?
3) Looking at the American copyright law document's listing of enactments, notice how new technologies affect the enactment of new law.
4) If you had written something of which you were very proud and wanted to share it with others, which of the Creative Commons licenses might you choose to protect it? Why did you choose it?
5) What might be the status of personally significant records/information/data that you don't physically control?
Readings
Lessig, The Future of Ideas, Chapter 6, "Commons Lessons," available on Canvas; and look at the Creative Commons website: familiarize yourself with what a CC license is and the kinds of them there can be. http://creativecommons.org/
Current U.S. copyright law, Circular 92: http://www.copyright.gov/title17/ This is a huge document. Look at the "statutory enactments" listed in the preface and then examine the appendices referring to the major versions, including 1976 and following.
Peter Suber, "Open Access Overview" (2004). http://www.earlham.edu/~peters/fos/overview.htm
World Economic Forum (the folks from Davos), Personal Data: The Emergence of a New Asset Class (2011), is just one example of how interested others might be in your personal data, available here: http://www3.weforum.org/docs/WEF_ITTC_PersonalDataNewAsset_Report_2011.pdf
If you own a home, look it up by typing your address into Google and then see what some of the real estate sites know about you from public databases. If not, look yourself up on Spokeo.
Personal Digital Records Management Plan DUE
November 30: Access and markup: finding aids, internal markup, metadata, and search
Topic: Markup: what it is and what kinds are most important. Discuss markup as a resource discovery aid and especially the level of granularity of markup.
Questions to prepare for discussion:
1) How is it useful to embed tags into text? What kind of embedded tags do you use every day?
2) How are tags used on webpages to assist in search?
3) How is EAD markup related to digital objects kept in archives?
4) Will conventional finding aids to archival collections become obsolete? What does the literature tell us about how easy (or not) they are to use for different audiences?
5) What kind of value added does an archivist bring to a fonds by creating an archival finding aid?
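For questions 1 and 2, it can help to see embedded tags pulled out of a page by a program, since that is roughly what an indexing tool does first. The Python sketch below is a small illustration using only the standard library; the sample page, its contents, and the class and function names are all made up for this example, and real search engines weigh many signals beyond these tags.

```python
from html.parser import HTMLParser


class MetaTagCollector(HTMLParser):
    """Collect the name/content pairs from a page's <meta> tags."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag == "meta":
            attributes = dict(attrs)
            name = attributes.get("name")
            if name:
                self.meta[name] = attributes.get("content", "")


def collect_meta(html_text):
    """Parse a page and return its named <meta> tags as a dictionary."""
    parser = MetaTagCollector()
    parser.feed(html_text)
    return parser.meta


# Hypothetical page, invented for this example.
SAMPLE_PAGE = """
<html>
  <head>
    <title>Finding aid: Example family papers</title>
    <meta name="description" content="Guide to a hypothetical collection of family papers.">
    <meta name="keywords" content="finding aid, family papers, correspondence">
  </head>
  <body><p>The visible text says nothing about keywords.</p></body>
</html>
"""

if __name__ == "__main__":
    for name, content in collect_meta(SAMPLE_PAGE).items():
        print(name, "->", content)
```

Try viewing the source of a page you visit often and compare what its tags claim with what the page actually shows.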
Readings
Text Encoding Initiative (TEI) reference page: http://www.tei-c.org/Support/Learn/intro.xml
Anne Gilliland-Swetland, "Popularizing the Finding Aid: Exploiting the EAD to Enhance Online Discovery and Retrieval in Archival Information Systems by Diverse User Groups," in Pitti and Duff (eds.), Encoded Archival Description on the Internet, 199-225 (2001). Available on Canvas.
Ian Witten, "Text Mining," in Practical Handbook of Internet Computing, 2005, ed. M.P. Singh, pp. 14-22: http://www.cs.waikato.ac.nz/~ihw/papers/04-IHW-Textmining.pdf
Marieke Guy and Emma Tonkin, "Folksonomies: Tidying up tags?," D-Lib 12(1), January 2006. http://www.dlib.org/dlib/january06/guy/01guy.html
Beth Yakel and Polly Reynolds, "The Next Generation Finding Aid..." Case study from New Skills for a Digital Era workshop, June 2006: http://www.archivists.org/publications/proceedings/NewSkillsForADigitalEra.pdf
Mary Flanagan and Peter Carini, "How Games can Help Us Access and Understand Archival Images," American Archivist 75 (Fall/Winter 2012), 514-537. Online via PCL journals.
To expand on this article, view the talk by Luis von Ahn here: http://www.cs.cmu.edu/~biglou/ (click on the item labelled "Google tech talk" to see the Human Computation video).
December 7: Digital Preservation Use Case Project presentations
Topic: Student teams will present their use case projects to the class and the class will offer comments on the presentations.
Digital Preservation Use Case Project report DUE