Workflow Design

Presents some of the key aspects of the workflow system used for digitization processes within the Dept. of Digitization at the University of Würzburg.
by Hendrik Schöneberg on 5 September 2014

Transcript of Workflow Design

A Scalable, Distributed and Dynamic Workflow System for Digitization Processes
Workflow System
Who
Why
What
Digitization Center of Würzburg University Library
project management
minimize error rate
medieval manuscripts
historical maps
prints
special requests
Sources
Average Project Statistics
around 300 scans per project
at least 400 DPI resolution
~350 MiB per scan / ~100 GiB per project
uncompressed TIFF with color-profiles
Manuscripts
Maps
[...]
Impacts on Design
Errors
Performance
Integration
Human
Technical nature
Domain specific
Compute intensive image operations
I/O operations
Extensibility
Support established platforms / archives
Provide service for others
Actor A
Actor B
internal state
internal state
Actors
send and receive messages
messages delivered to mailbox
one message at a time is processed
internal state never exposed
no need to lock over state
no need to synchronize calls
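The mailbox idea above can be sketched in plain Java. This is a minimal illustration, not Akka's API: the class name `CounterActor`, the message strings and the `tell` / `askCount` methods are hypothetical names chosen here. A single-threaded executor stands in for the mailbox, so messages are processed one at a time and the internal counter needs no lock.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Hypothetical minimal actor, for illustration only (not Akka's API).
final class CounterActor {
    // The executor's queue plays the role of the mailbox; its single
    // (daemon) worker thread processes exactly one message at a time.
    private final ExecutorService mailbox = Executors.newSingleThreadExecutor(r -> {
        Thread t = new Thread(r, "counter-actor-mailbox");
        t.setDaemon(true);
        return t;
    });

    // Internal state: only ever touched on the mailbox thread, so no lock is needed.
    private int scansSeen = 0;

    // Asynchronous send: enqueue the message and return immediately.
    void tell(String message) {
        mailbox.execute(() -> receive(message));
    }

    // Message handler; messages other than "scan" are ignored.
    private void receive(String message) {
        if (message.equals("scan")) {
            scansSeen++;
        }
    }

    // The state is not exposed directly; a caller asks for it with a message
    // and receives the answer as an asynchronous reply.
    CompletableFuture<Integer> askCount() {
        CompletableFuture<Integer> reply = new CompletableFuture<>();
        mailbox.execute(() -> reply.complete(scansSeen));
        return reply;
    }

    // Stop accepting new messages; already queued messages are still processed.
    void stop() {
        mailbox.shutdown();
    }

    public static void main(String[] args) {
        CounterActor actor = new CounterActor();
        for (int i = 0; i < 300; i++) {
            actor.tell("scan");
        }
        System.out.println("scans seen: " + actor.askCount().join());
        actor.stop();
    }
}
```

Because the count request travels through the same mailbox as the scan messages, it is answered only after every earlier message from the same sender has been handled.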
A
B
D
E
remote VM
local VM
Actors
arranged in hierarchies
crash affects only node and its subtree
'parents' can provide supervision strategies
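A supervision strategy of the 'restart' kind could look roughly like the sketch below. It is plain Java under stated assumptions, not Akka's supervision API: `RestartingSupervisor`, `childFactory` and `maxRestarts` are hypothetical names, and retrying the failed message after a restart is a simplification made here for brevity.

```java
import java.util.function.Function;
import java.util.function.Supplier;

// Hypothetical 'restart' supervision sketch, for illustration only (not Akka's API).
final class RestartingSupervisor<I, O> {
    private final Supplier<Function<I, O>> childFactory; // creates a fresh child
    private final int maxRestarts;                       // restart budget
    private Function<I, O> child;
    private int restarts = 0;

    RestartingSupervisor(Supplier<Function<I, O>> childFactory, int maxRestarts) {
        this.childFactory = childFactory;
        this.maxRestarts = maxRestarts;
        this.child = childFactory.get();
    }

    // Forward the message to the child. If the child crashes, discard it,
    // create a fresh one and retry. Once the restart budget is used up,
    // the failure is escalated to the caller (this supervisor's own parent).
    O handle(I message) {
        while (true) {
            try {
                return child.apply(message);
            } catch (RuntimeException crash) {
                if (restarts >= maxRestarts) {
                    throw crash;
                }
                restarts++;
                child = childFactory.get();
            }
        }
    }

    int restarts() {
        return restarts;
    }

    public static void main(String[] args) {
        int[] created = {0};
        RestartingSupervisor<String, String> supervisor =
            new RestartingSupervisor<String, String>(() -> {
                int id = ++created[0];
                return message -> {
                    if (id == 1) {
                        throw new IllegalStateException("simulated crash");
                    }
                    return "child " + id + " processed " + message;
                };
            }, 3);
        System.out.println(supervisor.handle("page-005.tif"));
    }
}
```

The crash stays contained: only the failing child is replaced, and the caller notices nothing unless the restart budget runs out.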
The End
Thanks for your time!
Integration
"... let it crash!"
Akka toolkit -- www.akka.io
concurrency and parallelism abstraction
implements Actor Model
asynchronous, event-driven
Enterprise Integration Patterns
useful message routing patterns
'Pipes and Filters'
'Wiretap' (accounting)
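The 'Wiretap' pattern copies every message to a side channel without changing the main flow, which is what makes it suitable for accounting. A minimal plain-Java sketch follows; the class `Wiretap`, the method `tap` and the list used as accounting log are hypothetical names chosen for illustration, not part of the presented system.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Function;

// Hypothetical wiretap sketch, for illustration only.
final class Wiretap {
    // Wrap a processing step so that every incoming message is first copied
    // to a tap channel (e.g. an accounting log) and then processed as usual.
    static <I, O> Function<I, O> tap(Function<I, O> step, Consumer<I> tapChannel) {
        return message -> {
            tapChannel.accept(message);  // side channel sees the message
            return step.apply(message);  // main flow continues unchanged
        };
    }

    public static void main(String[] args) {
        List<String> accountingLog = new ArrayList<>();
        // Placeholder step: it only builds a string, no real export happens.
        Function<String, String> export =
            Wiretap.<String, String>tap(file -> "exported " + file, accountingLog::add);
        System.out.println(export.apply("page-005.tif"));
        System.out.println("accounted: " + accountingLog);
    }
}
```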
Blur Detection Module
Fixture Removal Module
export modules
TEI: P5
METS / MODS
online publishing platforms
long term storage facilities
Hendrik Schöneberg
schoeneberg@informatik.uni-wuerzburg.de
Dr. Hans-Günter Schmidt
hans-guenter.schmidt@bibliothek.uni-wuerzburg.de
Winfried Höhn
winfried.hoehn@informatik.uni-wuerzburg.de

"Throughput time has decreased by at least 50% by using the workflow system."
-- Digitization Center of Würzburg University Library
Evaluation
distributed computing
self-healing, elastic hierarchies
backup-strategies for specific errors
Benefits:
Results:
Akka
REST
Event-Driven Design
J2EE
EIP
Integration
Key Ingredients
Feedback
EIP: Pipes and Filters
Blur Detection
Angle Analysis
Context Vector Analysis
ToC Extraction
source
analysis modules
information retrieval modules
[...]
destination
Quality Assurance
Feedback
(Structural) Metadata
Filters / Processors
error detection
information retrieval
export / publishing
General Idea:
message routed to destination
processed by filters
filters may change message
Filter a
Filter b
Filter c
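The general idea (a message is routed to its destination, processed by filters on the way, and filters may change it) can be sketched as follows. This is plain Java under stated assumptions: `ScanMessage`, `FilterPipeline` and the two placeholder filters are hypothetical names, and the filters only attach a note instead of performing real image analysis.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.function.UnaryOperator;

// Hypothetical pipes-and-filters sketch, for illustration only.
final class FilterPipeline {
    private final List<UnaryOperator<ScanMessage>> filters;

    FilterPipeline(List<UnaryOperator<ScanMessage>> filters) {
        this.filters = new ArrayList<>(filters);
    }

    // Route the message from source to destination through every filter in order.
    ScanMessage route(ScanMessage message) {
        ScanMessage current = message;
        for (UnaryOperator<ScanMessage> filter : filters) {
            current = filter.apply(current);
        }
        return current;
    }

    public static void main(String[] args) {
        // Placeholder filters: they only tag the message.
        UnaryOperator<ScanMessage> blurDetection = m -> m.withNote("blur: checked");
        UnaryOperator<ScanMessage> angleAnalysis = m -> m.withNote("angle: checked");
        FilterPipeline pipeline =
            new FilterPipeline(Arrays.asList(blurDetection, angleAnalysis));
        ScanMessage result = pipeline.route(new ScanMessage("page-005.tif"));
        System.out.println(result.file() + " -> " + result.notes());
    }
}

// Hypothetical message: a scan file name plus the notes attached so far.
final class ScanMessage {
    private final String file;
    private final List<String> notes;

    ScanMessage(String file) {
        this(file, Collections.<String>emptyList());
    }

    private ScanMessage(String file, List<String> notes) {
        this.file = file;
        this.notes = notes;
    }

    String file() {
        return file;
    }

    List<String> notes() {
        return notes;
    }

    // Filters do not mutate a message in place; they return a changed copy.
    ScanMessage withNote(String note) {
        List<String> copy = new ArrayList<>(notes);
        copy.add(note);
        return new ScanMessage(file, Collections.unmodifiableList(copy));
    }
}
```

Since every filter has the same shape (message in, message out), a new analysis or export module can be added by appending one more entry to the list, without touching the existing filters.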
Blur Detection
Angle Analysis
Map Annotation Module
Online Publishing
analysis modules
information retrieval modules
[...]
destination
Quality Assurance
Feedback
Metadata
source
export modules
Web-optimized Images
Inbound
Outbound
Business Logic
Akka
REST
Event
Evaluation
Advantages
Results
Loose Coupling
Easy Extensibility of Endpoints
Responsive Applications
Support for multitude of clients
Lightweight Java-Client (Java WebStart)
Aggregation-Plugins
WebService: Online Order Service
ToDo
Analysis Modules
Export Modules
object recognition modules
cover more error types
additional information retrieval modules
provide additional export targets
Europeana
World Digital Library
OAI-PMH
[ ... ]
e.g. TCP-Connection
can span multiple VMs
Pipes and Filters Example: Manuscript
Pipes and Filters Example: Map
Setup
cluster of dual-core CPUs
Examples
http://libri-kiliani.eu/
http://franconica.de/
Task
image rescaling
Example
Source: http://vb.uni-wuerzburg.de/ub/mpthf69/pages/mpthf69/5.html
Content: 8th century manuscript of the Pauline Epistles
"With very little effort Würzburg's digital collections can easily be shared with national and international cultural heritage portals."
-- Digitization Center of Würzburg University Library
Scala
C
D