Skip to main content Link Menu Expand (external link) Document Search Copy Copied

When should I think about data sharing?

The stage of your project will add constraints to your data sharing plans


Sharing data can be hard if you have not built in the intention to share from the outset. In each of the project stages below, we suggest actions you can take to ensure your data sharing is efficient and stress free!


StageWhat to think about now
cani-reuse PlanningWhen your project is in the planning stage, you can think think carefully about the ethical, governance and identifiability issues which might limit your ability to share your data completely openly. Once you know the limits of what is theoretically possible, you can describe how you can achieve this practically in a data management plan.
cani-reuse Data collectionWhen collecting your data you should aim to work with non-proprietary file file formats where possible. You can also begin to collate metadata and organise your data according to community standards.
cani-reuse AnalysisWhile you are analysing your data, be mindful of the restrictions which might be necessary to ensure the shared data are maximally deidentified. For example, it might be preferable to conduct your analysis using binned categorical data rather than continuous variables which make individual participants more identifiable. Consider writing your code in a way that allows for flexible and efficient re-analysis on any such factors. You should also keep an active data dictionary, where you describe the variables you create, how they are named in your code or data, and how they are derived.
cani-reuse Writing upWhile writing up your project, think about the reason you are sharing your data. If you are sharing data so another researcher can validate your figures, focus on ensuring the data (and code) required for that purpose are accessible and well described. If you are sharing a larger or more complete data set, you may wish to focus on ensuring that the value in the reuse of that data is appropriately emphasised and ‘advertised’ in your project write-up. Where the time and resources allow, you may wish to or consider publishing a separate data paper.
cani-reuse PublishingYou should aim to have all of your data available in your intended repository before you submit your project manuscript to a journal for publication. Your repository may have specific metadata requirements (much like journal formatting guidelines) and in some cases the data may need to be reviewed before it is made public (for example EBRAINS), so sharing it may take some time. By having the data available in advance of submission, you can include a persistent digital object identifier (doi) or reviewer-only access link with your manuscript. Ensure that you have carefully considered how contributors to your data will be recognised.