Data repositories

Data repositories are specialized online platforms designed for storing, sharing, and managing scientific data. They enable researchers to securely store their datasets and make them accessible to the broader scientific community.

Types of repositories:  

  • Disciplinary (subject) repositories
    • Specialised in a particular discipline or field – CESSDA (social sciences) 
  • National repositories 
    • Managed at the state level, often with the support of government institutions – NRGL (National Digital Library in the Czech Republic), NDI (National Data Infrastructure) – under preparation 
  • Institutional repositories
    • Managed by the specific institution (e.g., university) – Dspace 
  • Catch-the-all (multidisciplinary) repositories
    • Include data and publications across different disciplines – Zenodo, Figshare, Dryad 

 

How to find a data repository 

re3data.org – international repository registry

CoreTrustSeal – list of certified repositories

OpenAIRE – How to find a trustworthy repository (European Commission recommendation) 

 

How to choose a trustworthy repository (what it should be able to do): 

 

  • Does the repository assign a persistent identifier (e.g., DOI) to your data? 

The data are then easier to find and cite. 

  • Does the selected repository provide open access to the stored data? 

Key feature if you want to share your data openly. 

  • Will the repository license your data? Does it make clear the conditions under which the data stored in the repository can be used? 

It is important that users of your data know how they can use it. 

  • Will the repository provide a metadata landing page for your data? 

Metadata help others find the data, find out what the data are, how to cite them. 

  • Does it allow for long-term archiving?  

It is important that the data can be stored permanently with the possibility of future access. 

  • Does it support storing different types of data? 

Key feature if you want to manage not only text publications, but also datasets, software tools, multimedia materials, etc. 

  • Does the repository allow versioning? 

If you update your dataset, you can upload the updated version as a new version to the original dataset. The new dataset gets assigned its own persistent identifier, and users can easily find out what the latest version is or what version was used in the original study.