An overview
Wellcome Library
Digitisation Programme
@WellcomeDigital
#welldigi
Wellcome
Digitisation
Photography
19m images digitised
Who are we?
What have we got?
- Books - mainly 19th century
- Early Printed Books
- Archives
- Grey literature, serials
- Manuscripts: Western and Arabic
- Films
- Artworks, posters, prints, transparencies
- High resolution, Hasselblad/Leaf
- Oversize books and manuscripts
- Items required for facsimile printing
- Artworks, posters
- Medium resolution, Canon Mark III
- Archives
- Books, grey lit
- Copy stands
- "Guardian" book systems
- Conservation book cradle system
- Flat copy stands
- TT and Scribe book scanners
- The free library for the "incurably curious"
- Focus on history of medicine, health
- Wellcome Trust: "A global charitable foundation dedicated to achieving extraordinary improvements in health by supporting the brightest minds"
- Striving to "embed biomedical science in the historical and cultural landscape"
- Library is part of the Wellcome Collection, "exploring the intersection between medicine, life and art"
Dedicated staff
Formats
- 3 FT Photographers
- 1 FT Digitisation support officer + ad hoc support
- Several FT/PT Project managers
- 1 FT Digital Ingest Coordinator + ad hoc support
- 1 FT Programme Manager
- 1 FT Digital Curator
- 21 on-site contractors (mostly Internet Archive)
- Plus large contributions from the cataloguing, conservation, web content editing, systems management and library support teams
- Images
- Capture in RAW (in-house)
- Convert to JPEG 2000 via TIFF
- Lossy compression 8:1 or 10:1
- Data
- MARC21/AACR2 or ISAD(G) for metadata
- ALTO XML for full-text
- METS XML for digital object administration
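As a rough illustration of what the 8:1 and 10:1 lossy compression ratios listed above mean for file size, the arithmetic can be sketched as follows. The page dimensions, channel count and bit depth are assumptions chosen for the example, not figures from the programme, and the function names are hypothetical.

```python
def uncompressed_bytes(width_px: int, height_px: int,
                       channels: int = 3, bits_per_channel: int = 8) -> int:
    """Size of the raw pixel data (e.g. an uncompressed TIFF payload), in bytes."""
    return width_px * height_px * channels * bits_per_channel // 8


def compressed_target_bytes(raw_bytes: int, ratio: float) -> int:
    """Approximate JPEG 2000 size for a compression ratio of `ratio`:1."""
    return round(raw_bytes / ratio)


# Hypothetical 4000 x 6000 pixel, 24-bit RGB page image (assumed, not from the slides).
page = uncompressed_bytes(4000, 6000)
for ratio in (8, 10):
    target = compressed_target_bytes(page, ratio)
    print(f"{ratio}:1 -> {target / 1_000_000:.1f} MB (from {page / 1_000_000:.1f} MB)")
```

Under these assumptions a 72 MB uncompressed image becomes roughly 9 MB at 8:1 and 7.2 MB at 10:1.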
Systems
Metadata
Goobi
Preservica
Cataloguing
Deliverable units
- Permanent home for media
- Exports media via API
- Creates/exports technical metadata
- Handles format migration
- The unit digitised and made available is:
- The "thing" that is catalogued*
- The "thing" you would order/reserve*
- All content is catalogued before digitisation
- All records available via Encore "single search"
- Links to digital content in Encore records
- One user interface for digital and non-digital
- Archives hierarchy built into Encore interface
- Manages workflows
- Media import
- Format conversion
- Format validation
- Ingest to Preservica...
- Tracking, error reporting
- Metadata editor
- METS creation & export
- "Middleware"
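The "Format validation" step in the workflow list above can be illustrated with a very small check. This is only a sketch of the idea, assuming the files being checked are JP2 files; it is not how Goobi performs validation, and the function name is hypothetical. It tests only the 12-byte JP2 signature box at the start of the file, which is far less than a full conformance check.

```python
# The JP2 file format begins with a fixed 12-byte signature box.
JP2_SIGNATURE = bytes.fromhex("0000000C6A5020200D0A870A")


def looks_like_jp2(data: bytes) -> bool:
    """Return True if `data` starts with the JP2 signature box.

    This is a cheap sanity check only; it does not verify that the
    rest of the file is a well-formed JPEG 2000 image.
    """
    return data[:12] == JP2_SIGNATURE


# Usage: a JP2-like header passes, a baseline JPEG header does not.
print(looks_like_jp2(JP2_SIGNATURE + b"...rest of file..."))
print(looks_like_jp2(b"\xff\xd8\xff\xe0"))
```

A check like this is useful as an early filter before ingest, because it rejects mislabelled files without decoding any image data.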
Digitisation-related metadata
Storage
Delivery system
- ~50 TB of storage
- Mostly "working" storage
- Preservica currently holds >20 TB
- "Volatile" cache and "permanent" cache
- Storage requires very active management!
- Structural information
- pagination
- covers
- TOCs etc.
- Access restrictions
- Licences/conditions of use
- Preservation metadata
- Sensitivity and copyright admin
- "Digi" codes to tag collections
- Bespoke Digital Delivery System (DDS)
- Requests content from Preservica or cache
- Controls access rights
- IIPImage converts JP2 to JPEG
- Converts METS to JSON
- Delivers media and metadata to Player
- http://bit.ly/1RTQtEm
- http://wellcomelibrary.org/moh/
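The "Converts METS to JSON" step listed above can be sketched in a few lines. This is a minimal illustration, not the DDS implementation: the sample METS fragment, the JSON shape and the function name are all assumptions made for the example. Only the METS namespace and the `structMap`/`div` elements with their `TYPE`, `ORDER` and `ORDERLABEL` attributes come from the METS schema itself.

```python
import json
import xml.etree.ElementTree as ET

METS_URI = "http://www.loc.gov/METS/"
METS_NS = {"mets": METS_URI}

# Hypothetical, heavily simplified METS document (real files carry far more).
SAMPLE_METS = """<mets:mets xmlns:mets="http://www.loc.gov/METS/">
  <mets:structMap TYPE="PHYSICAL">
    <mets:div TYPE="physSequence">
      <mets:div TYPE="page" ORDER="1" ORDERLABEL="cover"/>
      <mets:div TYPE="page" ORDER="2" ORDERLABEL="1"/>
    </mets:div>
  </mets:structMap>
</mets:mets>"""


def mets_pages_to_json(mets_xml: str) -> str:
    """Extract the page sequence from the physical structMap as JSON."""
    root = ET.fromstring(mets_xml)
    pages = []
    for struct_map in root.findall("mets:structMap", METS_NS):
        if struct_map.get("TYPE") != "PHYSICAL":
            continue
        # Walk every nested div and keep only those typed as pages.
        for div in struct_map.iter(f"{{{METS_URI}}}div"):
            if div.get("TYPE") == "page":
                pages.append({
                    "order": int(div.get("ORDER")),
                    "label": div.get("ORDERLABEL"),
                })
    return json.dumps({"pages": pages}, indent=2)


print(mets_pages_to_json(SAMPLE_METS))
```

Flattening the structural map into a simple page list like this gives a browser-based player the pagination and cover information it needs without having to parse XML on the client.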