NLM-Chem, a new resource for chemical entity recognition in PubMed full text literature

Abstract

Automatically identifying chemical and drug names in scientific publications advances information access for this important class of entities in a variety of biomedical disciplines by enabling improved retrieval and linkage to related concepts. While current methods for tagging chemical entities were developed for the article title and abstract, their performance in the full article text is substantially lower. However, the full text frequently contains more detailed chemical information, such as the properties of chemical compounds, their biological effects and interactions with diseases, genes and other chemicals. We therefore present the NLM-Chem corpus, a full-text resource to support the development and evaluation of automated chemical entity taggers. The NLM-Chem corpus consists of 150 full-text articles, doubly annotated by ten expert NLM indexers, with ~5000 unique chemical name annotations, mapped to ~2000 MeSH identifiers. We also describe a substantially improved chemical entity tagger, with automated annotations for all of PubMed and PMC freely accessible through the PubTator web-based interface and API. The NLM-Chem corpus is freely available.

Cite

Islamaj, R., Leaman, R., Kim, S., Kwon, D., Wei, C. H., Comeau, D. C., … Lu, Z. (2021). NLM-Chem, a new resource for chemical entity recognition in PubMed full text literature. Scientific Data, 8(1). https://doi.org/10.1038/s41597-021-00875-1
