Developing a Digital Archive with Limited Resources

Similar documents
University of Massachusetts Amherst Libraries. Digital Preservation Policy, Version 1.3

Metadata for Photographs SHN Post-Conference Workshop - ATALM 2016 Part 2: Image Digitization

Planning for Digitization

Starting a Digitization Project: Basic Requirements

Digital Projects Made Easy: It s All about Partnerships

Digitization and Scanning Basics at RRLC Planning a Digitization Project: November 27, Vision & Goals:

GUIDELINES FOR THE CREATION OF DIGITAL COLLECTIONS

The NEW IUScholarWorks at Indiana University. Repositories, Journals, and Scholarly Publishing

Written in Light: Creating Access to Photographic Collections

Digitisation Plan

Photo Digitization. Pre-Digitization (including planning) Digitization. Post-Digitization

A Digitisation Strategy for the University of Edinburgh

Title: Self-service, Express Digital Scanning: KIC Bookeye High Speed Scanners

Submittal Exchange Design Team User Guide

Scanning Guide for Adobe Photoshop

Methodology for Agent-Oriented Software

Hyperion System 9 Financial Data Quality Management. Quick Reference Guide

Omeka Archive: Primary Sources & Metadata

Module 1A: Record images of ledger/card or catalog/field notes (materials not stored with specimens)

Maloney, Cath. LAARC Version 05 Date 08/05/2013

Inclusion: All members of our community are welcome, and we will make changes, when necessary, to make sure all feel welcome.

ANSYS v14.5. Manager Installation Guide CAE Associates

What is a collection in digital libraries?

DIGITISATION FOR PRESERVATION AND ACCESS A technical perspective

13. The Digital Archive and Catalogues of the Vanuatu Cultural Centre: Overview, Collaboration and Future Directions

Exploring Public Access: Conservation Treatment Documentation as an Online Digital Collection

PaperCut PaperCut Payment Gateway Module - CBORD Quick Start Guide

PRESERVATION POLICY HOWARD-TILTON MEMORIAL LIBRARY Updated July 2013 PRESERVATION PRIORITIES AND SELECTION FOR TREATMENT

Scanning. Records Management Factsheet 06. Introduction. Contents. Version 3.0 August 2017

Digital Projects Made Easy: It s about Partnerships

TeesRep policy document

Using Career Portfolio for an Academic Job Search

INFS 326: COLLECTION DEVELOPMENT

1. Digital Humanities in the Library: Challenges and Opportunities for Subject Specialists. Copyright 2015 by The Association of College & Research

Administration Guide. BBM Enterprise on BlackBerry UEM

Demonstrating in the Classroom Ideas of Frequency Response

INTEGRATED DATABASE PROJECT

Oklahoma State University Policy and Procedures

Curtis Laws Wilson Library

Reviewing Your Tax Return In Your Portal

2015 OHRAB Final Grant Report. Preserving and Sharing Alliance s Historical Documents ext. 101

6.12. SilverFastJobManager. SilverFast JobManager for Film Scanner. Overview

Issues in Emerging Health Technologies Bulletin Process

University of Kansas. The University of Kansas Libraries

A Guide to Image Management in Art Centres. Contact For further information about this guide, please contact

Legacy FamilySearch Overview

-f/d-b '') o, q&r{laniels, Advisor. 20rt. lmage Processing of Petrographic and SEM lmages. By James Gonsiewski. The Ohio State University

Catching Up: Creating a Digital Preservation Policy After the Fact

Using the Tax Research Center

PaperCut PaperCut Payment Gateway Module - Blackboard Quick Start Guide

Network Scanner Guide for Fiery S300 50C-KM

Ansible Tower Quick Setup Guide

INFS 326: COLLECTION DEVELOPMENT MRS. FLORENCE O. ENTSUA-MENSAH

Convergence of Knowledge and Culture

Basics. Relationships Matter

UNIGIS University of Salzburg. Module: ArcGIS for Server Lesson: Online Spatial analysis UNIGIS

RESEARCHING THE NATIONAL ARCHIVES. Compiled by: Sandra M Barnes From

Figure 1 The Raith 150 TWO

LIBRARY AND ARCHIVES POLICY

Photography Protocol. Software: ZEN Blue 2012

LIS 688 DigiLib Amanda Goodman Fall 2010

Ansible Tower Quick Setup Guide

MYSA Direct Member Club Registration Setup Manual. Contents

Setup and Walk Through Guide Orion for Clubs Orion at Home

Antenie Carstens National Library of South Africa. address:

Concept Connect. ECE1778: Final Report. Apper: Hyunmin Cheong. Programmers: GuanLong Li Sina Rasouli. Due Date: April 12 th 2013

El Camino College Fine Arts Division Photography Department Program Review Fall 2008 Conducted by Professor Darilyn Rowan

HOUSE OF COMMONS JOB DESCRIPTION

Darcy Armstrong Digital Libraries Spark 3. The Sargent John P. Davidson Collection

emuseum User Guide Delaware Art Museum s Online Collection

CONTENTS NCOMPAS MEDIA 1 USER MANUAL

Summer of Sleuthing Saving Our Stories

Switching to Sub Category and Collapsible Skins

TABLE OF CONTENTS. Logging into the Website Homepage and Tab Navigation Setting up Users on the Website Help and Support...

RosterPro by Demosphere International, Inc.

Hyperion System 9 Financial Data Quality Management

EDUCATION GIS CONFERENCE Geoprocessing with ArcGIS Pro. Rudy Prosser GISP CTT+ Instructor, Esri

Optimizing Digital Drawing Files and BIM Models for Measurement and Estimating

PELLISSIPPI STATE TECHNICAL COMMUNITY COLLEGE MASTER SYLLABUS. ARCHITECTURAL 3D MODELING W/LAB CID 2112 (formerly CID 2115)

Rebecca Hernandez-Gerber CINE-GT Metadata Creation Assignment. value to myself: my video and computer game collection. At first, this may seem a

Guidelines for the Professional Evaluation of Digital Scholarship by Historians

Digital Negative. What is Digital Negative? What is linear DNG? Version 1.0. Created by Cypress Innovations 2012

Project Description. Multispectral Image Capture System The Sixth Sensor

Integrated Tertiary Management System

<Insert Picture Here> Using ERPi for EBS/FDM Data Loads into HFM

A Web-Based Genealogy System

ICOM CIDOC Dresden 2014 Short Paper. Documentation Photography: An Integrated Process

G e t t i n g S t a r t e d G u i d e. ID Card Managers. Getting Started Guide. 1 P a g e

Self Learning Game Software Requirements Specification Joint Document Version 1

PORTFOLIO ACTIVITIES 2012 / 2013

Preparing for digitisation Skill session 15 September 2007

Guide to the Clark County, Nevada United States Bicentennial Collection

Storybird Instructions for Patrons

Case Study. British Library 19th Century Book Digitisation Project

Duplicate Checker User Guide for Parishes

Digital Asset Management 7. Interactive Media and Game Development process

MANUAL FOR ONLINE DRAWING APPROVAL TRACKING SYSTEM

Drawing Management Brain Dump

Trends in. Archives. Practice MODULE 8. Steve Marks. with an Introduction by Bruce Ambacher. Edited by Michael Shallcross

Positioning Libraries in the Digital Preservation Landscape

Transcription:

Wilfrid Laurier University Scholars Commons @ Laurier Library 2004 Developing a Digital Archive with Limited Resources Greg Sennema Wilfrid Laurier University, gsennema@wlu.ca Follow this and additional works at: http://scholars.wlu.ca/lib_pub Part of the Library and Information Science Commons Recommended Citation Sennema, Greg, "Developing a Digital Archive with Limited Resources" (2004). Library. 20. http://scholars.wlu.ca/lib_pub/20 This Article is brought to you for free and open access by Scholars Commons @ Laurier. It has been accepted for inclusion in Library by an authorized administrator of Scholars Commons @ Laurier. For more information, please contact scholarscommons@wlu.ca.

Developing a Digital Archive with Limited Resources Greg Sennema Digital Resources and Reference/Instruction Librarian Hekman Library Calvin College and Calvin Theological Seminary 1855 Knollcrest Circle SE Grand Rapids MI 49546-4402 616-526-8455 gsennema@calvin.edu Autobiographical Note Greg Sennema is the Digital Resources and Reference/Instruction Librarian at the Hekman Library, Calvin College and Calvin Theological Seminary. Grand Rapids, Michigan, USA. Abstract This article describes how a small liberal arts college library created a digital library using existing staff and financial resources. The Hekman Digital Archive (HDA) was created using a digital media archive module of the library s ILS, and is maintained by one library staff member with the help of numerous student assistants. Keywords Academic Libraries, Digital Libraries, Internet

Developing a Digital Archive with Limited Resources Introduction Hekman Library staff first considered developing a digital archive while investigating options to upgrade its Integrated Library System (ILS) in the spring of 2000. Serving primarily the students, staff, and faculty of Calvin College and Calvin Theological Seminary in Grand Rapids, Michigan, the Hekman Library provides access to over 1.6 million items including approximately 175,000 ebooks and over 10,000 ejournals. Along with the myriad of issues library staff needed to consider in a new ILS, staff were also mindful that individuals within Calvin s academic community were interested in storing digital files that could be used for classroom instruction. With this in mind, and after reviewing proposals from three ILS vendors, the library purchased an ILS from Sirsi which included Hyperion, a module for cataloging, storing, and making digital files accessible on the Web. This article describes the events, processes, challenges, and decisions that brought the Hekman Digital Archive (HDA) into existence, and turned it into the valuable resource that it is today.[1] Digital Media Archive: Getting Started The decision to purchase a new ILS was made in the spring of 2000, and in the fall of that year Sirsi provided on-site training to the library s 17 FTE staff. In addition to providing training for the catalog module (Sirsi Unicorn), Sirsi also scheduled specific training sessions for creating a digital library using the digital media archive module Hyperion. At the time of this training, library staff had little prior knowledge of what a digital archive was supposed to be, including the basic

definition of digital archive, and how it might differ from or resemble what library literature commonly referred to as a digital library. Furthermore, no significant money or additional staff could be devoted to the digital archive beyond the cost of the Hyperion software and a new server. Other stakeholders the library director, the library systems programmer, and the curator of the college s archives were invited to participate in the two day on-site training. In addition to providing guidance on how to use Hyperion, the Sirsi representative helped participants develop a pilot project by reviewing issues such as staffing, handling of materials, scanning specifications, digital masters and derivative images, Dublin-Core metadata standards, quality control, and delivering the digital files to the Web. For a pilot project, library staff identified 700 historical images from the college and seminary archives that would be digitized in conjunction with Calvin s 125 th anniversary celebrations in May of 2001.[2] Using a pilot project as the basis for training proved helpful in formulating the basic workflow that would be employed for the digital archive. The ultimate goal was to develop foundational infrastructure, rather than to recreate the infrastructure each time a new digital collection was created. As such, library staff wanted to mimic the procedures purchasing, processing, cataloging, shelving, etc. that were already in place in the library s acquisitions and cataloging departments. The digital archive s technical services area was created immediately after training. Both a flatbed and slide scanner were purchased, and two student assistants were hired to complete the scanning and basic metadata for each

image. Scanning for the digital master was completed according to specifications outlined in the literature at that time.[3] During the time it took for each image to be scanned, the student assistants recorded the images descriptive information onto a spreadsheet using metadata rules defined by the digital archive s metadata elements. The plan to create the 125 th Anniversary Archives using Hyperion was not realized because of the time needed to properly configure the library s new ILS, and the author s need for additional training. Library staff decided to create a temporary Web-accessible digital collection, and the library systems programmer swiftly produced a searchable Web portal which displayed records that included metadata, thumbnails, and various resolution sizes. An online administrative component allowed library staff to add, edit, and delete the images and the metadata in this temporary portal. After the 125 th Anniversary Archive Web site was completed, the author focused on creating the infrastructure for a permanent digital archive. The library had a server to store the digital files, client-based software for managing records, and a Web-accessible portal. However, the then current version of the Hyperion software was difficult to use. Using it to upload digital files, move and manipulate menu hierarchies, and enter and edit metadata could only be completed with deliberate effort by a metadata librarian. However, Calvin did not have a metadata librarian, and it was unlikely that already overworked library staff and student assistants working sporadically on digital archive tasks would be able to master Hyperion without constant facilitation. To overcome this considerable

obstacle, the author envisioned that the work could be shared with other library staff and student assistants using simple web-accessible forms. Hekman Digital Archive (HDA) The challenge was to create a digital archive without the oversight of a full time staff person, while at the same time creating an efficient workflow that ensured high quality digital files and metadata, and a timely turnaround on processing this data. Another consideration was how to display the permanent digital archive to the Calvin community. Library staff decided to make the digital media archive resemble the library s online catalog, using similar layout and graphical features, and to call the resource the Hekman Digital Archive (HDA) instead of using Sirsi s brand name Hyperion. The HDA would be comprised of digital collections arranged according to a hierarchical menu system (as defined by Hyperion) with searchable metadata. The first official HDA collection after the pilot project was entitled Calviniana, comprised of digitized files of images and medals depicting John Calvin collected by the H. H. Meeter Center for Calvin Studies.[4] The creation of this collection was helpful since it offered the opportunity to see Hyperion in action, and to further understand its capabilities and limitations. It also provided staff with additional scanning experience, and in particular, experience in digitizing three-dimensional objects. [Take in Figure 1] Figure 1 The medals collected by the Meeter Center are comprised of coins and other commemorative medals. The medals were digitized using a flatbed

scanner. In addition to the thumbnails and metadata, this figure shows the Medium Resolution image in a pop-up window. [Caption] Hekman Digital Archive Queue (HDAQ) The amount of time spent on creating Calviniana, particularly the effort expended using Hyperion for this collection, convinced the author to find an alternative workflow for the digital archive. The library systems programmer suggested an alternative based upon the 125 th Anniversary Archives. A modified version of the administrative component of the Anniversary Archives could be used as the environment (or software ) that library staff and student assistants would use to create the digital archive content. As with the Anniversary Archives, student assistants could be employed to complete the digitization process, and simultaneously to record the basic metadata for each file. This online administrative module was named HDAQ, with the Q referring to the queue of records that are waiting to be uploaded into the HDA by library staff after they are created in the HDAQ by student assistants. After digitizing the images, the student assistant would log into HDAQ with a username that pre-determines particular collection variables. Clicking Add a new entry (see Error! Reference source not found.), the student processes the image. In HDAQ, this means that a piece of software named ImageMagick is utilized to create various lower resolution jpeg images (including a thumbnail) from the original higher resolution tiff image. During the several minutes that this process takes to complete on the server, the student enters metadata onto an

online form. By the time the metadata has been entered, the image processing has been completed, and the next record can be created. [Take in Figure 2] Figure 2 -- HDAQ's main screen. [Caption] In addition to creating HDA records, HDAQ is also used to manage characteristics of each collection. For example, HDA records are placed within a hierarchical menu. During the initial Hyperion training with Sirsi, library staff decided that the hierarchy would contain four top level menu items: the Hekman Library, Heritage Hall, the Meeter Center, and Academic Departments. Beneath these four menu items, submenus could be added according to the type of collection. In the case of the Meeter Center, the Calviniana collection was placed immediately below the main menu item. Beneath the Academic Department menu item the names of Calvin s departments would be listed, and beneath these the names of various collections created within the department. HDAQ logins could be created that would force all created records into a particular menu item. This feature would later become important when additional student assistants would be hired to create records for an increasing number of collections. Finally, HDAQ is the location in which staff can access the queue of records created by student assistants in order to review the quality of the image and the metadata, and to add Library of Congress (LC) subject headings before each record is uploaded to its final location in the HDA. After being loaded into the HDA, the records are no longer accessible from the HDAQ, and subsequent changes to the images or the metadata are completed by using Hyperion.

HDA records rely on two file naming schemes, one for the original digital file, and the other for the HDA record. Since each HDA image is part of a digital collection, it is assigned a letter code abbreviation (based on the collection s name) followed by sequential numbers. After being entered into HDAQ, the image number information is placed in the Identifier metadata field, while the HDAQ record (the image along with the metadata) is assigned an auto-generated number based on the department to which the digital collection belongs. For example, the final image in a collection of theater history slides might be th1672, and the first number of a subsequent collection of costume history slides be assigned as ch1, but their corresponding HDA identifiers would be auto-generated based on their entry into the system. Adding HDA Collections In May 2001, with one HDA collection completed (the Anniversary Archives would be incorporated into the HDA the following year), library staff were approached by a professor from the college s Communication Arts and Science (CAS) department who requested that the library create a digital collection of slides covering the history of theater. Library staff were pleased with this request since the professor offered the services of a student assistant to complete the scanning and basic metadata. Unlike the Anniversary Archives or Calviniana images, the primary purpose of the digital slides in the Theater History Digital Archive Collection was for classroom instruction, and in this case, the image quality did not have to meet the high resolution specifications followed for these previous collections. Although copyright was not an issue for this

collection, it is accessible only to members of the Calvin community. Hyperion allows for various levels of access to individual records or whole menu items, including an Extended Public level which makes a resource freely accessible on the web, and a Public level which requires users to login using ID and password information stored on the college s authentication database. The Theatre History collection offered library staff the first occasion to use HDAQ. The student assistant was trained to scan the slides and to record metadata from information printed on the slide frame, with additional information coming from monographs and other online sources. After the scanning and data gathering was complete, the student assistant was shown how to use HDAQ. His HDAQ login determined that all the records he created would be automatically be assigned an access level of Public, and that they would be placed in the Academic Department/CAS/Theatre History position of the hierarchical menus. Further menu divisions, such as /Medieval/Western European/Miracle Plays were specified when the metadata was entered into HDAQ. As the student assistant continued to add records in HDAQ, library staff reviewed the work, added LC subject headings, and uploaded each record. Upon completion of the this collection, library staff noted that HDAQ proved adequate to accept the image files and metadata, as well as to synchronize this data to its final home in the HDA. Library staff also realized that it was not possible to create one workflow that would effectively deal with each digital collection. The images in the Theatre History collection, for example, were scanned at a lower resolution since the professor was not interested in

preservation as much as in the immediate access of the data by students and faculty in the classroom. In addition to image quality, other questions had to be addressed for each collection. For example, would student assistants be hired by the library or by a college department? Would the same person complete the scanning and the metadata? Where would the collection be placed within the hierarchy in relation to other collections from the same department? Would the collection be available to the public, or restricted to the Calvin community. HDA: Defining a Purpose The library was also approached by the college s Public Relations staff who proposed storing their digital files on the HDA. PR staff wanted to manage their digital images in an intuitive system more than just the simple network directory they were using that allowed each image to be indexed and arranged within directories for easy retrieval and manipulation for their PR purposes. After reviewing their proposal, the library declined their request. Library staff suggested that they consider using photo management software, and pointed out that they were asking us to use the HDA in a manner analogous to using an online catalog to manage their office documents and publications. The author took this opportunity to refine HDA documentation by clarifying the HDA s purpose as a tool to complement classroom instruction, and to describe what types of digital collections would be considered for inclusion.[5] To expand the number and types of HDA collections that could be used for in-class instruction, library staff approached professors from Calvin s history and biology departments who were known to incorporate visual aids into their

instruction. None of the professors was able to commit time to scan or record metadata, so library staff offered to complete the majority of this work, requesting only that the professors commit to adding metadata to a few fields, particularly title and description. All metadata was recorded onto a spreadsheet containing column labels (metadata fields) and rows (records) allowing recurrent data to be easily copied and pasted. Once complete, the student assistant would use the spreadsheet to enter each record into HDAQ. However, by this time (June 2002), the library systems programmer had incorporated the Anniversary Archive records into the HDA using a script that converted records from HDAQ s SQLqueried database tables into files acceptable to a batch-load utility in Hyperion. Library staff soon realized that this script could be used to batch load spreadsheet data directly into the HDA through HDAQ. This batch load utility resulted in a quick turn-around time between collection conception to realization. It also allowed library staff to reload collections if revisions were necessary.[6] By the summer of 2003, library staff had enough experience in creating HDA collections to entertain a request from Calvin s Art department to digitize their collection of over 10,000 slides. With this many slides, library staff realized that in order to incorporate this number of images into the HDA in a timely manner, student assistants would need to be involved in reviewing metadata and completing subject analysis. Using a combination of Library of Congress Subject Headings volumes, OCLC WorldCat, the Getty online Provenance Index, and other online searches, the student assistant was able to efficiently and effectively update and upload over 1500 HDAQ records within two months.

Conclusion The success of the HDA is difficult to measure, especially when compared to some of the impressive digital collections being made available on the Web, collections with innumerable records and polished user interfaces. On the other hand, library staff were able to move from a vague concept to a digital archive of over 5000 records within three years. Furthermore, this work was accomplished using less than 10% of the author s time, less than 10% of a part time systems programmer, and perhaps 20% of student assistants time. With the current level of staff time devoted to HDA collections, it is difficult to imagine a future that is much different than merely adding more records, collection by collection, department by department. However, increasing professional staff time devoted to the HDA would significantly enhance the usefulness of the HDA, opening up the possibility of accommodating additional formats such as audio, visual, and full text, and recovering control over metadata and subject standards. Additional staff would also allow the library to incorporate more diverse and larger collections at one time, or to quickly create collections for special events and exhibitions. Finally, library staff would be able to work closely with the college s Information Technology (IT) and Audio-Visual departments, providing seamless access between the library s archival collections and the rest of the technological initiatives throughout Calvin s campus, and through merging areas of expertise represented by all these departments.

Endnotes [1] Visit the HDA at http://www.calvin.edu/library/tutohelp/hda/index.stm [2] The Heritage Hall Archives serve Calvin College and Calvin Theological Seminary, as well as the Christian Reformed Church, the denomination to which the college and seminary belong. [3] For example, see Kenney, A., & Rieger, O. (2000), Moving theory into practice: digital imaging for libraries and archives, Mountain View, CA, Research Libraries Group. [4] The H. H. Meeter Center for Calvin Studies, see http://www.calvin.edu/meeter for more information. [5] See http://www.calvin.edu/library/tutohelp/hda/purpose.stm [6] After viewing the Plant Taxonomy digital collection in the HDA, the biology professor pointed out that the protocol of displaying plant species in italics. Adding italics, or other types of formatting, is not offered as short-cut options in Hyperion, and would require that the title field of each record incorporate the html code for italics (e.g., <i>osmunda regalis</i>). The batch load utility made it possible to quickly reload the data with the proper html codes.