ArchivesSpace-Archivematica-DSpace Workflow Integration. Mike Shallcross, Code4Lib Midwest, July 24, 2015 [email protected]


Page 1: Shallcross code4lib-midwest 20150724

ArchivesSpace-Archivematica-DSpace Workflow Integration

Mike Shallcross Code4Lib Midwest

July 24, 2015 [email protected]


Background

• Bentley Historical Library (est. 1935)

– Official archives of U-M and repository for papers/records of individuals and organizations in MI

– 2014 reorg: consolidated physical/digital processing

– BHL currently hosts 1+ TB of digital archives in DSpace

• April 2014: $355K grant from Mellon Foundation

– Partner with U-M Library (administratively separate)

– Integrate functionality of ArchivesSpace, Archivematica, DSpace in an end-to-end workflow


Ye Olde Days…

• 1997-2009:

– Highly manual procedures for born-digital content (minimal preservation actions)

– Very limited resources (staff, storage, IT support)

• 2010-2011: MeMail Project (email preservation grant)

– Additional staff and storage infrastructure

– Developed more robust workflows (still manual)

• 2011-2014:

– Automation of key steps in workflow: AutoPro

– Standardize metadata creation/collection


AutoPro

• Composed of 28 Windows CMD.EXE scripts.

• Guides staff through a six-step workflow with a single interface for 20+ apps/utilities.

– Virus scans and checksum calculation.

– Identify sensitive personal data

– Create preservation copies of files.

– Arrange, package, and describe content.

• Documents all actions: log files and PREMIS

• NOT a long-term solution

– Limitations of user interface/shell scripts

– Support and maintenance not sustainable
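
The checksum and logging steps listed above can be sketched in a few lines. This is a minimal illustration in Python, not AutoPro itself (which was a set of CMD.EXE scripts); the function names, the JSON-lines log layout, and the choice of SHA-256 are all assumptions made for the example. The event label borrows the PREMIS event type term "message digest calculation", but the output is not a PREMIS serialization.

```python
import hashlib
import json
from datetime import datetime, timezone
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream a file through SHA-256 so large files are never fully loaded into memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def log_fixity(root: Path, log_path: Path) -> list:
    """Checksum every file under root and append one JSON line per file to log_path.

    Hypothetical helper: the log layout is invented for this sketch.
    Keep log_path outside root so the log is not checksummed on later runs.
    """
    events = []
    for path in sorted(p for p in root.rglob("*") if p.is_file()):
        events.append({
            "event": "message digest calculation",
            "file": path.relative_to(root).as_posix(),
            "algorithm": "SHA-256",
            "digest": sha256_of(path),
            "timestamp": datetime.now(timezone.utc).isoformat(),
        })
    with log_path.open("a", encoding="utf-8") as log:
        for event in events:
            log.write(json.dumps(event) + "\n")
    return events
```

Appending to the log rather than overwriting it keeps a running record of every action taken on a transfer, which is the property the "documents all actions" bullet describes.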


Available Community Solutions

• ArchivesSpace:

– Archival management system.

– Create accession records, track locations, generate archival description (EAD).

• Archivematica:

– Ingest tool: produces Archival Information Packages (AIPs)

– Extensive technical and preservation metadata.

• DSpace:

– Preservation and access

– Persistent URLs, secure/managed storage, access controls


Can we make these play nice?


Goals

• Facilitate creation/reuse of metadata

• Streamline the ingest and deposit of content in repository

• Find solutions that meet Bentley needs but are flexible and scalable for others

– Modular: institutions may adopt some, none, or all

– Employ open standards so that other repository platforms could be used

• Share code and documentation with archives and digital preservation communities


Project Status

• April-August 2014: developer search (failed)

• May 2014: MLibrary announces plans for Hydra

• Dec. 2014: budget reallocation and contract with Artefactual Systems

• Jan. 2015: Artefactual Systems site visit

– Review current procedures and collections

– Define use cases

– Analyze existing functionality of platforms, identify development needs

• Jan.-April 2015: Workflow refinement, user stories, and development prioritization plus BHL tool evaluation and data migration

• April 2015: agile development sprints begin


Key Development Tasks

• ‘Appraisal and Arrangement’ tab in Archivematica

– Characterize transfers, identify sensitive data, preview content

– Tag content for restrictions, deaccession, and intellectual arrangement of materials

• Archivematica-ArchivesSpace integration

– ASpace ‘pane’ within Archivematica interface

– Create/edit ASpace components; drag & drop content to associate with description and create SIPs

– Archivematica creates/updates records in ASpace using API

• Automate deposit of AIPs in DSpace; update ArchivesSpace with handles
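
As a rough illustration of what "create/update records in ASpace using API" and "update ArchivesSpace with handles" can involve, the sketch below builds (but does not send) two HTTP requests against the ArchivesSpace backend API: a login call and a call that creates a digital object whose file version points at a repository handle. It is not the project's code. The base URL, function names, and the decision to reuse the handle as the `digital_object_id` are assumptions; the endpoint paths, the `X-ArchivesSpace-Session` header, and the record fields follow the ArchivesSpace API as commonly documented and should be checked against the version in use.

```python
import json
import urllib.parse
import urllib.request

# Assumption: a local test install with the backend on its usual port.
ASPACE_URL = "http://localhost:8089"


def login_request(username: str, password: str) -> urllib.request.Request:
    """Build the login call; the JSON response is expected to carry a `session` token."""
    body = urllib.parse.urlencode({"password": password}).encode("utf-8")
    url = f"{ASPACE_URL}/users/{urllib.parse.quote(username)}/login"
    return urllib.request.Request(url, data=body, method="POST")


def digital_object_request(session: str, repo_id: int, title: str,
                           handle_url: str) -> urllib.request.Request:
    """Build a request that records a deposited AIP's handle as a digital object."""
    record = {
        "jsonmodel_type": "digital_object",
        "title": title,
        # Assumption for this sketch: reuse the handle as the object identifier.
        "digital_object_id": handle_url,
        "file_versions": [{"file_uri": handle_url}],
    }
    return urllib.request.Request(
        f"{ASPACE_URL}/repositories/{repo_id}/digital_objects",
        data=json.dumps(record).encode("utf-8"),
        headers={
            "X-ArchivesSpace-Session": session,
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing either request to `urllib.request.urlopen` would send it. Building the requests separately from sending them keeps the payloads easy to inspect and test without a running ArchivesSpace instance.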


1. Accession Record (AS = ArchivesSpace)


2. Transfer (AM = Archivematica)


3. Characterize Content (AM)
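
To give a sense of what characterizing a transfer means, here is a deliberately simple sketch that summarizes a directory by file extension. It is an assumption-laden stand-in: the slides do not say how Archivematica performs this step, and real format identification generally inspects file signatures rather than trusting extensions. The function name and report shape are hypothetical.

```python
from collections import Counter
from pathlib import Path


def characterize(root: Path) -> dict:
    """Summarize a transfer by file extension: file counts and total bytes."""
    counts, sizes = Counter(), Counter()
    for path in root.rglob("*"):
        if path.is_file():
            # Normalize case so ".TXT" and ".txt" are counted together.
            ext = path.suffix.lower() or "(none)"
            counts[ext] += 1
            sizes[ext] += path.stat().st_size
    return {ext: {"files": counts[ext], "bytes": sizes[ext]} for ext in sorted(counts)}
```

A summary like this helps an archivist spot unexpected formats or unusually large groups of files before deciding how to arrange a transfer.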


4. ID Sensitive Data (AM)
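
The idea behind this step can be shown with a small pattern-matching sketch. This is not how the project's tooling works (the slides do not name the scanner), and dedicated tools are far more thorough. The patterns below, for U.S. Social Security number shapes and payment-card-like digit runs filtered by a Luhn check, are assumptions chosen for illustration, as are the function names.

```python
import re

# Illustrative patterns only; real scanners cover many more data types.
SSN_PATTERN = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
CARD_PATTERN = re.compile(r"\b(?:\d[ -]?){13,16}\b")


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    checksum = 0
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:  # double every second digit, counting from the right
            digit *= 2
            if digit > 9:
                digit -= 9
        checksum += digit
    return bool(digits) and checksum % 10 == 0


def find_sensitive(text: str) -> list:
    """Return (kind, matched_text) pairs for SSN-like and card-like strings."""
    hits = [("ssn", match.group()) for match in SSN_PATTERN.finditer(text)]
    for match in CARD_PATTERN.finditer(text):
        candidate = match.group().strip(" -")
        if luhn_valid(candidate):  # drops digit runs that merely look like cards
            hits.append(("card", candidate))
    return hits
```

The Luhn filter is what keeps order numbers and other long digit strings from being flagged, which matters when staff must review every hit by hand.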


5. Browse and Review Content


6. Tag content (AM)


7. Associate content with archival description (AM)


8. Ingest SIPs (AM) & Create Records (AS)


9. Deposit data (from AM, via SWORD) and metadata (from AS, via ResourceSync)
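
A SWORD deposit is, at its core, an HTTP POST of a packaged file to a collection URI. The sketch below builds such a request without sending it. It assumes SWORD v2 headers, HTTP Basic authentication, and a DSpace METS packaging identifier, none of which the slides specify; the function name and the example collection URI are hypothetical, and the exact headers and packaging value should be confirmed against the target repository.

```python
import base64
import hashlib
import urllib.request

# Assumption: packaging identifier for DSpace METS packages; confirm for your repository.
METS_DSPACE_SIP = "http://purl.org/net/sword/package/METSDSpaceSIP"


def sword_deposit_request(collection_uri: str, package_bytes: bytes, filename: str,
                          username: str, password: str) -> urllib.request.Request:
    """Build (but do not send) a SWORD v2 style binary deposit request."""
    credentials = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    headers = {
        "Authorization": f"Basic {credentials}",
        "Content-Type": "application/zip",
        "Content-Disposition": f"attachment; filename={filename}",
        # Hex MD5 lets the server verify the package arrived intact.
        "Content-MD5": hashlib.md5(package_bytes).hexdigest(),
        "Packaging": METS_DSPACE_SIP,
        "In-Progress": "false",
    }
    return urllib.request.Request(collection_uri, data=package_bytes,
                                  headers=headers, method="POST")
```

For example, `sword_deposit_request("http://localhost:8080/swordv2/collection/123456789/2", data, "aip.zip", user, pw)` would target a hypothetical local DSpace collection. On success, a SWORD server typically returns a deposit receipt, and the item's persistent identifier can then be written back to the archival description.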


Systems of Record

• ASpace: administrative, descriptive, and rights metadata

• Archivematica: technical and preservation metadata

• Repository: manage content and enforce access restrictions


Updates and More Info…

• Society of American Archivists demo: 8/22/2015

• 2015 iPRES: poster session and workshop (with ArchivesSpace, Archivematica, and BitCurator)

• Artefactual Systems appraisal & arrangement wiki

• Artefactual Labs GitHub

• Jira user stories

• Project blog: http://archival-integration.blogspot.com/

• Twitter: @umbhlcuration

Questions?


Thanks!