Data Publication


A data paper (or data publication) takes data that has been used for a research study, or deposited in a repository and expands on the "why, when and how" of its collection and processing, leaving an account of the analysis and conclusions to a conventional article, perhaps written at a different time and by different authors.

Another form of data publication is the "enhanced publication", which integrates underlying datasets into an online article so that readers can interact with the data as they read through the article. These data publications are designed to provide more contextual information about the data, so that a researcher can understand it better for their own purposes. Data publications provide an opportunity for researchers using the data publication to offer attribution and credit to data creators, as a data publication can be cited within an academic research paper.

Data may also be linked to publications by creating a link between the academic publications and their underlying associated datasets. The goal is that anyone viewing a publication will be able to locate the datasets associated with that publication and anyone looking at datasets will be able to locate the publications that were produced from that data. The activity of linking data to publications is a necessary step to improve the culture of data sharing amongst the scientific and research community. Linking data to publications also increases the opportunity for transparent research, as researchers will be able to analyze the data created in conjunction with an academic paper.

Further Resources

Altman M, Castro E, Crosas M, Durbin P, Garnett A, & Whitney J. (2015). Open Journal Systems and Dataverse Integration—Helping Journals to Upgrade Data Publication for Reusable Research(link is external). Code4Lib Journal, 30.

Ball A, Duke M. (2012). How to Cite Datasets and Link to Publications(link is external). DCC How-to Guides. Edinburgh.

Bardi A, Manghi P. (2014). Enhanced Publications: Data Models and Information Systems(link is external). LIBER Quarterly, 23(4), 240–273.

Bardi A, Manghi P. (2015). A Framework Supporting the Shift from Traditional Digital Publications to Enhanced Publications(link is external). D-Lib Magazine, 21(1/2).

Borgman CL, Wallis JC, Enyedy N. (2007). Little science confronts the data deluge: habitat ecology, embedded sensor networks, and digital libraries(link is external). International Journal of Digital Libraries, 7:17–30.

Callaghan S, Murphy F, Tedds J, Allan R, Kunze J, Lawrence R, … Whyte A. (2013). Processes and Procedures for Data Publication: A Case Study in the Geosciences(link is external). International Journal of Digital Curation, 8(1), 193–203.

Callaghan S, Tedds J, Kunze J, Khodiyar V, Lawrence R, Mayernik M, … Whyte A. (2014). Guidelines on Recommending Data Repositories as Partners in Publishing Research Data(link is external). International Journal of Digital Curation, 9(1).

Callaghan S, Tedds J, Lawrence R, Murphy F, Roberts T, & Wilcox W. (2014). Cross-Linking Between Journal Publications and Data Repositories: A Selection of Examples(link is external). International Journal of Digital Curation, 9(1).

Goodman A, Pepe A, Blocker AW, Borgman CL, Cranmer K, Crosas M, … Slavkovic A. (2014). Ten Simple Rules for the Care and Feeding of Scientific Data(link is external). PLoS Computational Biology, 10(4), e1003542.

Hrynaszkiewicz I. (2012). Citing and linking data to publications: more journals, more examples…more impact?(link is external) BioMed Central Blog.

Kervin K, Michener W, & Cook R. (2013). Common Errors in Ecological Data Sharing(link is external). Journal of eScience Librarianship.

Kratz JE, Strasser C, & PLOS ONE Staff. (2015). Researcher perspectives on publication and peer review of data(link is external). PLoS ONE, 10(2), e0123377.

Kratz J, Strasser C. (2014). Data Publication Consensus and Controversies(link is external). F1000Research, 3, 94.

Linking Data to Publications: Towards the Execution of Papers(link is external). For Attribution -- Developing Data Attribution and Citation Practices and Standards: Summary of an International Workshop.  Washington, D.C.: National Academies Press; 2012.

Leadbetter A, Raymond L, Chandler C, Pikula L, Pissierssens P, & Urban E. (2013). Ocean Data Publication Cookbook(link is external). Oostende, Belgium.

Lynch C. (2007). The shape of the scientific article in the developing cyberinfrastructure(link is external). CT Watch Quarterly, 3(3):5–10. 

Murphy F. (2014). Data and Scholarly Publishing: The Transforming Landscape(link is external). Learned Publishing, 27(5), 3–7.

Parsons M, Fox P. (2013). Is Data Publication the Right Metaphor?(link is external) Data Science Journal, 12, WDS32-WDS46.

Read KB, Sheehan JR, Huerta MF, Knecht LS, Mork JG, & Humphreys BL. (2015). Sizing the Problem of Improving Discovery and Access to NIH-Funded Data: A Preliminary Study(link is external). PloS One, 10(7), e0132735.

Reilly S, Schallier W, Schrimpf S, Smit E, Wilkinson M. (2011). Report on Integration of Data and Publications(link is external). p. 1–7.

Roche DG, Lanfear R, Binning SA, Haff TM, Schwanz LE, Cain KE, … Kruuk LEB. (2014). Troubleshooting Public Data Archiving: Suggestions to Increase Participation(link is external). PLOS Biology, 12(1), e1001779.

Smith VS. (2009). Data publication: towards a database of everything(link is external). BMC Research Notes, 2:113. 

Vlaeminck S, Wagner GG. (2014). On the Role of Research Data Centers in the Management of Publication-Related Research Data(link is external). LIBER Quarterly, 23(4), 336–357.

Whyte A. (2013). IDCC13 Data Publication: generating trust around data sharing(link is external). Digital Curation Centre.

Search for a Term

Send us your feedback or suggestions for new terms

Contact information
CAPTCHA This question is to prevent spam submissions. Contact for any accessibility issues.
4 + 5 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.