Leah McEwen and David Martinsen

The Rise of Primary Research Data

De Gruyter | Published online: May 24, 2017


As the scale of global commerce and opportunities for multidisciplinary collaboration increase, there is greater pressure on basic research to supply a quick return on investment (ROI). The emergence and development of digital information technologies in the new millennium have inspired a new look at how research outputs are managed and disseminated. The driving question in the minds of many research funders is this—will lowering the barriers for access increase the value of research for the greater society? This is a particularly interesting question to consider for measurement data, the greater amount of which are scattered across millions of separate, fixed publications (not to mention those never published and lingering in file drawers and on hard drives). Can the advent of cloud technologies, exchange standards, and provenance tracking facilitate improved access, evaluation, and use of data for both research and commerce? Can new value and discovery be realized through the greater aggregation of measured scientific data as “Big Data”?

The past five years has seen practical conversations among stakeholders increasingly focused on the publication of primary research data associated with journal articles. Data publication advocates have lobbied for the availability of data, funding agencies have issued mandates requiring funded researchers to publish their data, and repositories have been created to support researchers in fulfilling these requirements. The arguments put forth are many: it is important that science be as transparent as possible so that the community can properly assess the integrity of the research being published; it is valuable for interested scientists to have access to machine-readable data to more deeply examine and interact with the data described in a journal article; it is important that editors and reviewers have access to all of the available material to better understand the validity of the conclusions being presented, or consider whether the data themselves exhibit evidence of manipulation in a fraudulent manner.

This interest in the publication of research data, among other scholarly communication challenges, has spawned a number of new organizations (for example, FORCE11, [1] the Research Data Alliance), [2] which augment long-standing organizations (such as CODATA [3] and ICSU [4]). In addition, repositories for depositing research datasets, such as Data Dryad, [5] figshare, [6] and Mendeley Data, [7] have appeared. In chemistry, these new services may, in some sense, augment traditional curated data collections, such as the former Beilstein and Gmelin Handbooks, the Cambridge Structural Database, [8] the Protein Data Bank, [9] the Powder Diffraction File, [10] the Spectral Database for Organic Compounds (SDBS), [11] Wiley and NIST’s Mass Spectral Databases, [12, 13] BioRad’s Spectroscopy Databases, [14] and others.

 The guest editors at the University of Tokyo.

The guest editors at the University of Tokyo.

As a result of the emerging expectations for researchers to publish data, scientific publishers and research libraries are beginning to offer support services to their communities in navigating this evolving landscape. Balancing both sides of the time-cost equation for data generators and consumers will be key to how well new practices are established.

Taking a look at how the movement to publish research data more accessibly intersects the practice of research data dissemination in chemistry is the impetus behind a Special Symposium on Research Data, Big Data, and Chemistry at the 46th IUPAC World Congress, and the basis for this special issue of Chemistry International. The perspectives represented here examine a range of issues from coordinating global initiatives to workflows for publication, review, and evaluation to education to applications in industry and society. Also considered are some IUPAC digital initiatives for supporting chemistry data publication, including the International Chemical Identifier (InChI) [15] and the online Gold Book Compendium of Chemical Terminology. [16]

We hope you enjoy the reading, and look forward to meeting you at the Congress in São Paulo, Brazil, 9-14 July and the Special Symposium on 13 July 2017. [17]


1. www.force11.org Search in Google Scholar

2. www.rd-alliance.org Search in Google Scholar

3. www.codata.org Search in Google Scholar

4. www.icsu.org Search in Google Scholar

5. http://datadryad.org Search in Google Scholar

6. https://figshare.com Search in Google Scholar

7. https://data.mendeley.com Search in Google Scholar

8. www.ccdc.cam.ac.uk Search in Google Scholar

9. www.rcsb.org/pdb/home/home.do Search in Google Scholar

10. www.icdd.com Search in Google Scholar

11. http://sdbs.db.aist.go.jp Search in Google Scholar

12. www.wiley.com/WileyCDA/WileyTitle/productCd-1119171016.html Search in Google Scholar

13. http://chemdata.nist.gov Search in Google Scholar

14. www.bio-rad.com/en-us/spectroscopy Search in Google Scholar

15. http://www.inchi-trust.org/ Search in Google Scholar

16. https://goldbook.iupac.org/ Search in Google Scholar

17. www.iupac2017.org/special-symposia.php#tab2 Search in Google Scholar

Online erschienen: 2017-5-24
Erschienen im Druck: 2017-7-26

©2017 by Walter de Gruyter Berlin/Boston