Data Management

Data Management Plans for NSF Funding

[Instructions for Use]

1. Types of data, samples, and other materials to be produced

The project will generate several types of data:

  • Raw data as collected by [instruments and methods].  We expect to generate [# Terabytes, Gigabytes] during the course of the project.
  • Samples as [collected/created] through [methods].  These will be held in [preservation method] for a period of [time].
  • Software Programs for the [processing/creation/manipulation] of [samples/raw data].
  • Processed data is data generated through the manipulation of raw data.  We expect to generate [# Gigabytes] of processed data.
  • Curriculum Materials will be generated by the documentation of the materials and methods used in this project.
  • Compiled data is data compiled from an analysis of the processed data and will typically consist of tables and figures.  We expect to generate [# Gigabytes] of compiled data.

2. Standards to be used for data and metadata format and content

  • Raw data will be generated and stored in the formats in which they are provided by the [instruments and methods].  The file formats allow for the data to be examined by standard programs for this type of research.  File formats for Raw Data are [e.g. Text or ASCII data, Instrument-specific files].
  • Software Programs created by this project will read [file types] and output [file types] that can(not) be read by other programs such as [e.g. MATLAB, R, Sparky, LabView].
  • Processed data are typically [types of files: ASCII, Binary, PDB files, etc.], and are readable by standard software such as [MATLAB, R, Excel, SigmaPlot, etc.].
  • Compiled data utilizes formats supported by standard office and presentation software, such as Adobe Acrobat, Microsoft Word or [Other Programs].

3. Methods and policies for providing access and enabling sharing

After the required period of copyright ownership by the publisher has expired, the compiled data will be available through the following methods.

  • Raw Data will initially be stored on the [instruments/data acquisition machines] and then transferred to the [Biophysics file server/Value Storage/Other].  Unless restricted by [policy], access can be requested through the PI via email: [email address].  It will be retained for at least [#] years after conclusion of the project, or [#] years after the last request for such materials, whichever is later.
  • Processed Data (under 200 GB) will be stored in Deep Blue indefinitely and will be available to the scientific community online.  A copy will also be retained on the Biophysics file server for [#] years after the conclusion of the project, or [#] years after the last request for such materials, whichever is later.
  • Processed Data (over 200 GB) will be stored on the Biophysics file server.  Access can be requested through the PI via email: [email address].  It will be retained for at least [#] years after conclusion of the project, or [#] years after the last request for such materials, whichever is later.
  • Compiled Data will be published as part of articles in archival journals; additional materials will be deposited as supplemental materials and made accessible through the journals in accordance with their policies.
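
The size and retention rules above can be checked with a short script when filling in the bracketed values.  The following Python sketch is illustrative only and is not part of any University system.  The function names and the year counts in the usage lines are hypothetical stand-ins for the [#] placeholders.  The plan does not say where a data set of exactly 200 GB goes, so the sketch assumes the Biophysics file server.

```python
from datetime import date
from typing import Optional

DEEP_BLUE_LIMIT_GB = 200  # size threshold stated in the plan


def add_years(d: date, years: int) -> date:
    """Shift a date by whole years; Feb 29 falls back to Feb 28."""
    try:
        return d.replace(year=d.year + years)
    except ValueError:
        return d.replace(year=d.year + years, day=28)


def processed_data_location(size_gb: float) -> str:
    """Primary storage location for a processed data set, by size.

    Assumption: exactly 200 GB is treated as "over" the limit.
    """
    if size_gb < DEEP_BLUE_LIMIT_GB:
        return "Deep Blue"
    return "Biophysics file server"


def retention_end(
    project_end: date,
    years_after_project: int,
    years_after_request: int,
    last_request: Optional[date] = None,
) -> date:
    """Earliest date the data may be dropped: the later of the two limits."""
    by_project = add_years(project_end, years_after_project)
    if last_request is None:
        return by_project
    return max(by_project, add_years(last_request, years_after_request))


# Hypothetical values standing in for the [#] placeholders.
print(processed_data_location(50))
print(retention_end(date(2020, 6, 30), 3, 2, last_request=date(2022, 1, 15)))
```

With these example values the last request date governs, because two years after the request falls later than three years after the end of the project.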

4. Provisions for re-use, re-distribution, and the production of derivatives

  • Raw Data: Except where prohibited by the publisher, data will be available through the Biophysics file server.
  • Processed Data: Except where prohibited by the publisher, data will be available through the Biophysics file server.

Those outside the University can ask the PI (email address, phone) for access to data stored on the Biophysics file server through MCommunity friend accounts.  Those within the University can be granted access directly.  Except for the [period of time] indicated by the publisher, data stored in Deep Blue is accessible to the entire scientific community.  Published findings may be restricted by subscription.

Data is to be restricted to [groups] due to [ethical, legal, or political concerns].

The data will be particularly useful for [method(s) or use(s)] to [groups] for [purpose].

5. Methods for archiving and preserving access to data and materials

University of Michigan Biophysics File Server - Is a multi-terabyte storage unit which is continuously backed up to an external University-controlled facility.  Access is limited physically through a secured door, a lockable server rack, and lockable server hardware.  Data is archived automatically after five (5) years of inactivity.  User and group access is controlled through Kerberos, Samba and LDAP connected to the University-maintained Kerberos and Microsoft Active Directory services.  Power is conditioned and maintained through an uninterruptible power supply which is capable of safely shutting down the server in the event of an extended power outage.  Logs of security incidents and usage are retained for one year for analysis.  In the event of a total loss, services and data can be restored to backup hardware within 24 hours.

Value Storage - Is a University-provided service for individual researchers which has disaster recovery, replication and continuity plans.  Access can be requested through the PI.  A detailed description of the service is available at http://www.itcs.umich.edu/storage/value/sla.php.

Deep Blue - Is a service provided by the University of Michigan Library that serves as a permanent archive of data.  Its policies, which cover backup, disaster recovery, and data and format preservation, are outlined on its web site: http://deepblue.lib.umich.edu/about/deepbluefaq.jsp.  Access is available to the general scientific community online.  Data stored in Deep Blue will include the published scientific findings, and any processed data that may aid in further investigation.