Loading presentation...

Present Remotely

Send the link below via email or IM

Copy

Present to your audience

Start remote presentation

  • Invited audience members will follow you as you navigate and present
  • People invited to a presentation do not need a Prezi account
  • This link expires 10 minutes after you close the presentation
  • A maximum of 30 users can follow your presentation
  • Learn more about this feature in our knowledge base article

Do you really want to delete this prezi?

Neither you, nor the coeditors you shared it with will be able to recover it again.

DeleteCancel

Research DLCM

No description
by

Pierre-Yves Burgi

on 13 September 2016

Comments (0)

Please log in to add your comment.

Report abuse

Transcript of Research DLCM

Project Organization
Final words
DLCM's Data Census
49 interviews
30 different disciplines
6 Higher Education Institutions
Creating the framework for a successful research data management in Switzerland
Research DLCM
The Swiss way...
Data-Intensive Science and Campus IT, Research Data Census, Montana State University, 2015
Guidelines
Data Management Plan
Tools for data analysis
Publication & preservation of scientific data
Electronic Laboratory Notebooks (ELN) & Laboratory Information Management Systems (LIMS)
Storing, linking, and annotating of all digital data generated during the research process
Storage
Snapshots of processed data with contextual information
Cost-effective short- and long-term data preservation
Further data use for not-as-yet devised purposes
Staff
Experts in data management
Expressed needs
Deliverables
Data Management Plan (Templates / Tools)
Deliverables
ELN/LIMS for life sciences and other disciplines
Virtual Research Environment for DH
IT solutions for handling complex datasets
Deliverables
OAIS-compliant preservation infrastructure
Data publication toolbox
Web interfaces (API)
Cost model(s)
Deliverables
Training in RDM for students & researchers
Helpdesks in coordination with libraries
Deliverables
Outreaching other institutions (RD Swiss community)
Annual event (First at Rolex Learning Center, November 29, 2016)
International collaborations and networking
Frontends for researchers
openBIS
Salsah
SLIMS
other
Light Tool Box
EUDAT-like tools for managing active research data
Long-Term Preservation
Digital POWRR (Preserving digital Objects with Restricted Resources) Project
Fedora commons
LOCKSS(S) for "Swiss"
Training & Consulting
Examples
73% of researchers want to learn more about data services and infrastructure
Over 50% of researchers expect qualified staff, guidelines, policies for dealing with research data
Pierre-Yves Burgi - DLCM Project Director
Eliane Blumer - DLCM Project Coordinator
André Jelicic - DLCM Project Business Analyst

1. Provide value propositions to researchers (based on use cases)
3. Build on existing expertise and services
Develop partenariats with for instance:
Swiss National Supercomputing Centre (CSCS) in Lugano
Swiss Data Science Center (SDSC) at EPFL and ETHZ
With other Swiss actors (e.g. eSCT)
2. Propose business models for sustainable services at national level
feasibility
desirability
viability
... 3 key principles
MOOCs
Digital Curation Center & other similar structures
Train2Dacar
eScience Coordination Team (eSCT)
archiving vs. active working zone
Guidelines, Policies
DLCM Swiss Brand
DLCM Swiss Portal (a.i. www.dlcm.ch)
Cost Models
Lot Of Copies Keep Stuff Safe (LOCKSS)
DuraCloud like
LOCKSS
SAFE Archiving Federation Private LOCKSS Network (SAFE PLN)
Based on the Byzantine Generals Problem 3m+1 generals can cope with m traitors
With m = 2, we get 7 nodes (dark archives); for archiving 1 TB, we need 8.4 TB storage
SAFE PLN = G3 + 5 European Universities
DuraCloud Like
Based on Fedora Commons
2 distant copies
"cloud compliant" technology
e.g. Amazon S3 + Glacier / SDSC
for 10 years
(CC Digital Law)
Researcher-centered approach
Reproducibility
n=3026
(23 Austrian Institutions)
n=3026
(23 Austrian Institutions)
n=3026
(23 Austrian Institutions)
From R.D. Peng, Science 334, 1226 (2011)
N=18 (articles published in Nature Genetics 2005/2006)
n=3026
(23 Austrian Institutions)
A datum is a "Discernible differences in physical states, in terms of space, time, and energy"
Data?
What is it?
Utility of data?
Their ability to carry information
and in turn information can modify an expectation
Use of data?
Understanding a dataset requires knowledge of the context of its creation, as a datum is not a carrier of meaning
Finality of data
Part of the daily researcher’s job
Stepping stone to the production of articles
Storage?
Quantity of information carried by a given data set vs. subjective amount of information possibly extracted
credibility
autority
compliance with founder guidelines
data (well) managed
publish results and data
Among all information accumulated in memory knowledge occupies only a small part and thus most part of memory is filled with unprocessed information.
Source: Dolgonosov 2012
Data Life Cycle?
DCC Curation Lifecycle Model (from Higgins 2007)
The Ellyn Montgomery, USGS Data LifeCycle Diagram
NDIIPP Preserving Our Digital Heritage
USGS Data Management Plan Fremwork (DMPf)
Data Life Cycle Models, which one?
DLCM CUS P2 Model
Research workflow
Single endpoint
Open-Ended Work
Time Series Data
DLCM Model
Main target audiences
www.dlcm.ch
62 TB per square inch
source: Kalff et al. 2016
Full Topographic Map of Mercury -
composed of 100,000 spacecraft images, which necessitated 4,104 orbits around the planet.
https://www.usgs.gov/news/first-global-topographic-map-mercury-released
for each track
BMC for a National Portal
self-description leads to Gödel end
discipline specific
look for generic tasks
Not just data!
versioning
Research data management policy template will be presented at the next Research delegation at Swissuniversities
Full transcript