The Cambridge structural database

8.6kCitations
Citations of this article
2.7kReaders
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The Cambridge Structural Database (CSD) contains a complete record of all published organic and metal-organic small-molecule crystal structures. The database has been in operation for over 50 years and continues to be the primary means of sharing structural chemistry data and knowledge across disciplines. As well as structures that are made public to support scientific articles, it includes many structures published directly as CSD Communications. All structures are processed both computationally and by expert structural chemistry editors prior to entering the database. A key component of this processing is the reliable association of the chemical identity of the structure studied with the experimental data. This important step helps ensure that data is widely discoverable and readily reusable. Content is further enriched through selective inclusion of additional experimental data. Entries are available to anyone through free CSD community web services. Linking services developed and maintained by the CCDC, combined with the use of standard identifiers, facilitate discovery from other resources. Data can also be accessed through CCDC and third party software applications and through an application programming interface.This paper is the definitive article describing the creation, maintenance, information content and availability of the Cambridge Structural Database (CSD), the world's repository of small molecule crystal structures.

Figures

  • Table 1 Development of the CSD over a 10-year timespan from 2006 to 2015.
  • Figure 1 Growth of the CSD since 1972, the red bar shows structures added annually.
  • Table 2 Publication sources for CSD entries.
  • Figure 2 The increase in the average molecular weight of CSD entries since 1965, with (a) average formula weight per Z0 of metal–organic structures, (b) average molecular weight of heaviest component of metal–organic structures, (c) average formula weight per Z0 of organic structures and (d) average molecular weight of the heaviest component of organic structures.
  • Figure 3 The increase in resolved disorder in CSD entries since 1965.
  • Figure 4 The validation step of the CSD Deposition process showing the integration with checkCIF (CCDC, 2016b).
  • Figure 5 Screenshot of the CSD community Access Structures results page. This is the ‘landing page’ for many referring services based, for example on DOI or CCDC number (CCDC, 2016a).
  • Figure 6 An illustration of the flow from sample to CSD entry. Solid lines are the elements of the process involving the CCDC. A crystalline sample (a) is put on a diffractometer (b) for an experiment that ultimately results in the experimentally determined coordinates of atoms in the crystal that are captured in a CIF file (c). The CIF file is deposited with the CCDC (d) and may be associated with a scientific article (e). On publication of the article, the structure is validated and enriched by CCDC editorial software and staff (f) to create an entry in the CSD (g). Structures are publicly accessible through CSD community services such as Access Structures (h). Standard identifiers such as DOIs and InChIs facilitate links between articles, structures and other resources. Software developed by the CCDC (i) enables the knowledge embedded in the CSD to be applied to a range of scientific problems including aiding in the determination and refinement of future structures through free resources such as CellCheckCSD (j). The flow of data is largely represented using lamotrigine (Sridhar & Ravikumar, 2009), the 500 000th entry in the CSD, CSD refcode: EFEMUX01; CCDC number: CCDC 749719; DOI: 10.5517/cct54hl. Image (a) is a crystal of p-aminobenzoic acid (Sullivan & Davey, 2015) courtesy of Rachel Sullivan and Roger Davey, University of Manchester, CSD refcode: AMBNAC10; CCDC 983122; DOI: 10.5517/cc1200mq. Image (b) courtesy of Andrew Bond, University of Cambridge.

References Powered by Scopus

The Protein Data Bank

32297Citations
N/AReaders
Get full text

OLEX2: A complete structure solution, refinement and analysis program

25375Citations
N/AReaders
Get full text

Crystal structure refinement with SHELXL

21392Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Mercury 4.0: From visualization to analysis, design and prediction

3470Citations
N/AReaders
Get full text

MoleculeNet: A benchmark for molecular machine learning

1870Citations
N/AReaders
Get full text

Recent advances and applications of machine learning in solid-state materials science

1639Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Groom, C. R., Bruno, I. J., Lightfoot, M. P., & Ward, S. C. (2016). The Cambridge structural database. Acta Crystallographica Section B: Structural Science, Crystal Engineering and Materials, 72(2), 171–179. https://doi.org/10.1107/S2052520616003954

Readers over time

‘12‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘24‘250150300450600

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 1017

69%

Researcher 271

18%

Professor / Associate Prof. 143

10%

Lecturer / Post doc 49

3%

Readers' Discipline

Tooltip

Chemistry 1082

82%

Materials Science 86

7%

Chemical Engineering 73

6%

Biochemistry, Genetics and Molecular Bi... 73

6%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1
References: 4

Save time finding and organizing research with Mendeley

Sign up for free
0