Imagine a scenario in an academic research lab: you are working on a chemical system that has not yet been well characterized, perhaps looking at new catalysts, trying new reagent combinations, or scaling up a new reaction for more sample. How might you gauge the reactivity potential against the equipment and procedures in place to ensure control is maintained? How do you identify critical points in the system where subtle or even inadvertent changes in condition or operation could send things awry, possibly becoming dangerous? How do you assess the potential hazards of untried chemistry? Is there any helpful information readily available at your fingertips? Would a search engine as accessible as Google be able to turn up appropriate data for such scenarios?
Understanding this situation for research chemists has taken increased prominence over the last five years following a series of highly publicized incidents in academic laboratories in the United States and elsewhere. Government , scientific , professional , and academic  organizations have raised concerns about the planning and oversight of chemical laboratory operations in the academic setting. Academic institutions and individual chemists have been subject to criminal prosecution in response to specific incidents where the primary consideration was adequacy of planning and supervision of the work being conducted in the laboratories.
What new ideas might an intersection of chemistry service professionals—educators, hygiene officers, health and safety directors, librarians, information scientists—bring to bear on this conundrum?
The immediate impact of these events has been a rapid increase in the level of documentation expected to support chemistry research in the academic setting. For example, following the death of a researcher in the University of California system , principle investigators of chemistry research laboratories on all 10 campuses are now required to identify which of 45 different hazards are associated with the work in their laboratories, so that the appropriate Personal Protective Equipment (PPE) can be issued to lab workers . However, this approach does not address the broader range of issues that chemists face when planning experiments and managing labs that may contribute to hazardous situations.
Studying lesser-characterized chemistry is a prime focus in chemistry research and all chemical reactions involve energy changes. If too much energy is released too quickly for the immediate environment to absorb it, material damage and dangerous circumstances can result. Known reactivity and interactions of materials as well as documented hazards and handling protocols can help inform prevention and mitigation planning for runaway and even unanticipated reactions. Incident investigations also indicate that many additional factors may be involved in the escalation of adverse chemical lab events, including process conditions, equipment limitations, laboratory setup, and training of operators, among others. There is an increasing focus on assessing risks against a range of considerations in local context. It can be a significant challenge to identify and locate the relevant data and information needed to inform these analyses.
Potentially useful information is widely dispersed across a spectrum of scholarly, supplier, agency, and other institutional data collection, reporting, and publication venues. The most readily available data collections focus on a small number of well-characterized compounds and involve significant time and expense to maintain. Other initiatives, aimed more broadly, are hampered by ambiguous data provenance. The diversity of data formats intended for different audiences and applications often means that the quality of the information is difficult to evaluate, organize, and apply in assessing risks associated with using reactive chemicals. The challenge is particularly pressing in the highly decentralized academic sector, characterized by novel chemistry, novice researchers, and very little consistency of practice and resources within and among institutions. How would a research lab manager working with a new protocol apply the exposure control information on Safety Data Sheets (SDS) that was intended for large-scale hazmat clean-up situations? How would a chemistry student in a teaching lab discern the hazards associated with different physical forms of a compound, perhaps in a variety of dilutions?
What new ideas might an intersection of chemistry service professionals—educators, hygiene officers, health and safety directors, librarians, information scientists—bring to bear on this conundrum?
Braving a record snowstorm and sustained by a small exploratory grant, 25 members of the Chemical Health and Safety (CHAS), Chemical information (CINF), and Chemical Education (CHED) Divisions of the American Chemical Society (ACS) and the ACS Committee on Chemical Safety, Environmental Health and Safety Task Force teamed up at Cornell University in Ithaca, NY in the winter of 2014 to pool their expertise and characterize the opportunities and challenges facing chemical safety information in the academic context. The primary goal of this initial meeting was to identify an overarching paradigm within the chemistry domain that could provide a framework for describing the lab safety process.
The education community provided a model for risk assessment and management in the academic laboratory characterized by a series of tasks necessary to work towards a safety culture: Recognize the hazards, Assess the risks, Manage the hazards, and Prepare for emergencies (RAMP, Figure 1). The group added, “Protect the environment” and the initiative was dubbed “iRAMP” to reflect the iterative nature of both the RAMP process and the project .
To support this process, the workshop identified emerging tools from the safety profession and regulatory agencies that can be deployed to support a range of safety planning and management activities. These include the Global Harmonization System of Hazard Classification (GHS) , Safety Data Sheets (formerly ‘Material’ SDS) , systems of exposure controls , research scale laboratory standards , prudent practice guidance for academic research labs , and a variety of risk assessment methods . Librarians can support this process by focusing on the scholarly processes for critical inquiry, information evaluation techniques, and documentation best practices. Similarly, information scientists are developing new technologies for structuring and interlinking data across sources.
Late in the winter of 2014, another diverse group of chemical professionals from the education, laboratory safety, academic librarianship, and information science sectors convened at the offices of the Royal Society of Chemistry in Cambridge, UK. They reviewed information sources and specific environmental health and safety field cases in the context of current information management technologies. The field cases considered a variety of applications, including Lessons Learned-type factor analyses based on reported laboratory incidents, control banding of laboratory ventilation rates, and risk assessment of common higher education teaching laboratory processes. The discussion re-emphasized the need to consider both hazard potential and exposure potential when assessing risk. Hazards associated with chemical research involve process conditions as well as chemical reactivity potential. These considerations can help inform local decisions for integrating hazard controls and prudent laboratory management practices.
Could a collective approach be developed that would support the use of these varied tools and information sources, connecting relevant data and official guidance into varied local management systems and planning workflows?
From these meetings we were able to take away a set of general goals for the project. The diversity of academic science and laboratory settings argues for a safety culture of prudent practices where not all scenarios can be anticipated, planned for, or addressed in established policies. Such dynamic systems suggest that a formulaic risk management “wizard” is not appropriate; assessing risk is site specific and involves educated judgment. How can we support iterative information discovery and reuse for a range of use-cases in a sustainable, transferable, and scalable manner? Laboratory work evolves in a variety of local contexts under diverse regulatory requirements using loosely determined types of information management systems. This situation calls for a flexibly structured ecosystem of data, domain expertise, and workflow tools mapped to essential commonalities of chemistry research and safety planning processes. A functioning information ecosystem relies on strong data management practices, including robust description of relevant chemistry research processes; collation and indexing of currently published data; organization and visualization tools for evaluation and use of data in risk assessment; codification of output formats; cross-walks for incorporation into local systems; and archiving and markup of method and analysis documentation for future reference.
What are the opportunities for amalgamating and opening up chemical data and information relevant to hazard recognition and safety planning and connecting them to create a flexible and functional information ecosystem?
A key challenge for connecting the relationships between multiple types of factors is the predominant organization of chemical information by chemical entity. The safety literature is no exception, focusing on the hazard-related properties of individual chemicals or substances without reference to specific experimental context or to the surrounding laboratory conditions. Scale, concentration, temperature, pressure, flow rate, and many other chemical, process, operator, and environmental factors have the potential to trigger a runaway hazardous situation. The presence of other compounds in the experiment or immediate vicinity must also be considered. A more complete risk assessment process, as described by the RAMP model, involves a holistic, laboratory level approach to managing risks beyond hazard identification. Complementing the “object-based” focus on specific chemical entities with “process-based” indexing could better identify information and data buried in the published literature on how these chemicals are being used under various conditions and combinations and the potential for subsequent unintentional interactions to arise.
This situation calls for a flexibly structured ecosystem of data, domain expertise, and workflow tools mapped to essential commonalities of chemistry research and safety planning processes.
There are currently a number of exploratory efforts to develop semantic description for indexing and linking disparate information for a variety of scientific, management, and educational applications. Semantic description of data elements such as the Resource Description Framework (RDF)  can support flexible querying across a variety of information types. These technologies are just recently moving beyond describing the basic bibliographic information types into developing approaches for more specified domain data types . Chemical health and safety management is traditionally an information-intensive discipline and has created many data structures for supporting safe practices and incident analysis in chemical laboratories. Looking across the domain reveals many information management practices to build on, including the SDS format and GHS labeling system, clearinghouses of reviewed chemical hazard data such as the European Committee Agency portal, and information exchange formats and algorithms such as the International Uniform ChemicaL Information Database protocol (IUCLID)  and the IUPAC International Chemical Identifier (InChI) .
Developing safety management tools and models useful for the academic sector will necessitate tapping into digitally curated data in ways that are relevant to the decision-making processes of research chemists, safety professionals, institutional administration, and other stakeholders. For example, a researcher might be looking at two known chemicals in a proposed reaction scheme and want to know of any conditions that might trigger an adverse outcome, if there are any known procedures for minimizing the likelihood of these conditions, and how to mitigate potential harm if something untoward did occur. The relevant data and information may come from a diverse set of sources covering physical properties, synthesis protocols, and previously reviewed incidents. However, these sources vary widely, and much of the data is available only in text format, often not parsed with the necessary granularity and not recognized by computers in the implied scientific context. This “text-locked” state of the data limits the options for searching, analysis, tagging, and linking into information management systems such as electronic laboratory notebooks (ELNs). Similarly, institutional Environmental Health and Safety staff need to collect information about different chemical properties to support campus-wide laboratory operations, including coordinating emergency response planning, addressing ventilation needs, and providing waste disposal services.
How do we better identify and organize chemical hazard and risk management concepts to facilitate the discovery and use of what has been reported? Can we classify and define chemical reactivity in the context of hazard identification, evaluation, and exposure control?
The iRAMP project is currently collaborating with the PubChem program at the US National Library of Medicine to explore opportunities and challenges on a practical level. Objectives include connecting the broad range of information sources relevant to chemical safety planning, applying vocabularies of relevant concepts to index and annotate reports, and analyzing indicators to extend classification of potential hazards across lesser-characterized chemical interactions. The approach is to develop a data stream based on the community standard format of the Laboratory Chemical Safety Summary (LCSS)  described by the National Research Council in Prudent Practices . This data stream consolidates the safety and health-related data reported in open, authoritative literature, and can be downloaded or hyperlinked into institutional systems to augment local chemical information, such as chemical inventories, lab-specific personal protective equipment assessments, or ELNs. The project is not aimed at developing local chemical management systems or interfaces, but to facilitate management of the data stream through user manuals and examples of use in risk assessment protocols.
A critical component that has been identified for scalability and sustainability is the use of robust chemical identifiers to allow linking and validation of the information collected. The familiar CAS Registry Number is ubiquitous across many chemical safety information sources, but is not designed for linking and does not include important chemical designations related to concentration and purity. IUPAC projects are underway to extend the InChI algorithm to incorporate the GHS designations and support better linking and access to documentation of chemicals from manufacture through waste disposal and emergency response. In response to community input, we are also investigating the development of a chemical incompatibility classification that will group potentially similar hazardous interactions, extending the functionality of the chemical reactivity worksheets available in CAMEO .
Next steps include extending the LCSS profiles beyond hazard information about specific chemicals to include additional relevant data such as process variables and to support further annotation of reactivity hazards. It will be possible to leverage domain terminology to more systematically analyze the reported data and construct specific process scenarios, such as “chemical A + chemical B under X condition could lead to Y type of adverse event”. An ongoing effort is the development of a Chemical Safety Ontology (CSO) that establishes the relationships among chemical safety concepts, such as hazard classification and exposure control, and maps these to the semantic information structures used by computers for indexing, classification, and further organization through knowledge graphs. These approaches will rely on terminology and descriptions developed for use in the chemistry enterprise.
The chemistry domain boasts myriad glossaries and regulatory terminology developed for a wide range of chemical research and applied subdisciplines, including those represented in the IUPAC color books . Ascertaining the applicability and interconnectivity of the available vocabularies to the questions around chemical hazard management and safety planning will require a process of review and curation by chemistry research support professionals, including environmental safety officers and chemistry librarians in collaboration with domain information scientists. A successful model that interlinks and semantically enables a range of authoritative vocabularies is the AGROVOC thesaurus, a project of the Agriculture Information Management Standards (AIMS) initiative under the auspices of the Food and Agriculture Organization of the United Nations . Could such a work model be developed in the chemistry domain to integrate chemistry vocabularies more systematically into chemical information management systems, building on the efforts of the current IUPAC Color Book Data Management project ?
Everyone we have connected with in the chemical safety, chemical information, and chemical education communities has appreciated the complex nature of the challenge and the potential for a multi-faceted community approach. Safety is a global concern for our discipline and we welcome ideas and contributions from all stakeholders and locations interested in moving this effort forward. The laboratory safety data stream provided through the PubChem database is an ideal proof-of-concept to support community engagement in managing information relevant to chemical safety planning; large scale, government-based, and open to use. The initial word from the academic chemical safety community is that this approach will add much needed value to the prominent electronic information search scenario for researchers via Google. Stay tuned for the next chapter of this compelling exploration.
Leah McEwen <firstname.lastname@example.org> is the Chemistry Librarian at Cornell University in Ithaca, NY USA and Secretary of the Division of Chemical Information of the American Chemical Society.
Ralph Stuart <email@example.com> is the Chemical Hygiene Officer for Keene State College in Keene, NH USA and Secretary of the Division of Chemical Health and Safety of the American Chemical Society.
1. United States Chemical Safety Board. Texas Tech University Laboratory Explosion; Washington, DC, 2011.Search in Google Scholar
2. National Research Council. Safe Science: Promoting a Culture of Safety in Academic Chemical Research; Washington, DC: The National Academies Press, 2014. Search in Google Scholar
3. American Chemical Society, Committee on Chemical Safety. Creating Safety Cultures in Academic Institutions; Washington, DC, 2012. Search in Google Scholar
4. Association of Public and Land-grant Universities. Association of Public and Land-grant Universities Announces Creation of Task Force on Laboratory Safety. April 23, 2015. Search in Google Scholar
5. Kemsley, J. N. Learning From UCLA: Details of the experiment that led to a researcher’s death prompt evaluations of academic safety. Chem. Engr. News., 2009, 87 (31) pp 29-31, 33-34.10.1021/cen-v087n031.p029Search in Google Scholar
6. University of California San Diego, Environmental Health and Safety. Laboratory Hazard Assessment Tool (LHAT). Version 11, 2014.Search in Google Scholar
7. American Chemical Society, Divisions of Chemical Health & Safety and Chemical Information. iRAMP Innovative Project Grant.Search in Google Scholar
8. United Nations, Committee of Experts on the Transport of Dangerous Goods and on the Globally Harmonized System of Classification and Labelling of Chemicals (UNCETDG/GHS). Globally Harmonized System of Classification and Labelling of Chemicals. New York and Geneva, 2003. Search in Google Scholar
9. Occupational Safety and Health Standards. OSHA Brief no. 3514, Hazard Communication Standard: Safety Data Sheets (29 CFR 1910.1200(g)). Search in Google Scholar
10. Department of Labor, Occupational Safety and Health Administration. Preventing occupational illnesses through safer chemical management. Search in Google Scholar
11. Toxic and Hazardous Substances: National Research Council Recommendations Concerning Chemical Hygiene in Laboratories (Non-Mandatory); 29 CFR 1910.1450 App A; Occupational Safety and Health Administration, Code of Federal Regulations; U.S. Government Printing Office: Washington, DC, 2012.Search in Google Scholar
12. National Research Council. Prudent Practices in the Laboratory: Handling and Management of Chemical Hazards, Updated Version; Washington, DC: The National Academies Press, 2011.Search in Google Scholar
13. American Chemical Society, Committee on Chemical Safety, Hazard Identification and Evaluation Task Force. Identifying and Evaluating Hazards in Research Laboratories; Washington, DC, 2015.Search in Google Scholar
14. World Wide Web Consortium, RDF Core Working Group. Resources Description Framework. http://www.w3.org/2001/sw/#rdf.Search in Google Scholar
15. Borkum, M. I.; Frey, J. G. What’s in a Name? Quite a Lot, as it Happens! Chem. Int., 2015, 37(2) pp 7–9.10.1515/ci-2015-0231Search in Google Scholar
16. European Chemicals Agency. International Uniform ChemicaL Information Database, v. 5. http://iuclid.eu/.Search in Google Scholar
17. IUPAC, Chemical Nomenclature and Structure Representation Division, Subcommittee on the IUPAC International Chemical Identifier. IUPAC International Chemical Identifier (InChI).Search in Google Scholar
18. National Institutes of Health, National Library of Medicine, National Center for Biotechnology Information. PubChem Laboratory Chemical Safety Summaries.Search in Google Scholar
19. National Research Council. Prudent Practices in the Laboratory: Handling and Management of Chemical Hazards, Updated Version; Washington, DC: The National Academies Press, 2011. Search in Google Scholar
20. National Oceanic and Atmospheric Administration. CAMEO Chemicals Database of Hazardous Materials. http://cameochemicals.noaa.gov/.Search in Google Scholar
21. IUPAC. Nomenclature and Terminology publications. http://www.iupac.org/home/publications/e-resources/nomenclature-and-terminology.html.Search in Google Scholar
22. Food and Agriculture Organization of the United Nations, Agriculture Information Management Standards. AGROVOC Multilingual agricultural thesaurus. http://aims.fao.org/vest-registry/vocabularies/agrovoc-multilingual-agricultural-thesaurus.Search in Google Scholar
23. IUPAC, Committee on Publications and Cheminformatics Data Standards. IUPAC Color Book Data Management Project. http://www.iupac.org/project/2013-052-1-024.Search in Google Scholar
©2015 by Walter de Gruyter Berlin/Boston