Open Data & Data Management

What is a Data Repository?

A data repository is a collection of datasets that have been deposited for storage and discoverability. Repositories are often discipline-specific or affiliated with a research institution. Examples include ICPSR and Harvard Dataverse Network.

The Generalist Repository Comparison Chart is designed to help researchers find a generalist repository when no domain repository is available to preserve their research data.

Where to Find Data Repositories

Examples of Subject Data Repositories for Depositing Your Data

Ecology: Dryad
Chemistry: Cambridge Crystallographic Data Centre (CCDC)
DNA Sequences: GenBank
Social Sciences: Open Science Framework
General: DataONE


Questions to consider when choosing a repository

Is it one that your research community uses?

Can you restrict access to data if necessary (e.g., if it is proprietary)?

Can the data be cited and found in a unique and persistent way (e.g., with a DOI)?

Does the repository commit to preservation actions that will preserve the integrity of your research data?

How long will your data be retained in the repository?

What happens at the end of the data retention period?

Is there support provided for data documentation and/or data deposit?

Are your rights as data depositor clear?

Are there clear Terms of Use?

Are there costs associated with having your data deposited in this repository?

Source: NCSU Libraries Data Management Guide