A Field Sensor: Computing the composition and intent of PubMed queries

4Citations
Citations of this article
11Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

PubMed ® is a search engine providing access to a collection of over 27 million biomedical bibliographic records as of 2017. PubMed processes millions of queries a day, and understanding these queries is one of the main building blocks for successful information retrieval. In this work, we present Field Sensor, a domain-specific tool for understanding the composition and predicting the user intent of PubMed queries. Given a query, the Field Sensor infers a field for each token or sequence of tokens in a query in multi-step process that includes syntactic chunking, rule-based tagging and probabilistic field prediction. In this work, the fields of interest are those associated with (meta-)data elements of each PubMed record such as article title, abstract, author name(s), journal title, volume, issue, page and date. We evaluate the accuracy of our algorithm on a human-annotated corpus of 10 000 PubMed queries, as well as a new machine-annotated set of 103 000 PubMed queries. The Field Sensor achieves an accuracy of 93 and 91% on the two corresponding corpora and finds that nearly half of all searches are navigational (e.g. author searches, article title searches etc.) and half are informational (e.g. topical searches). The Field Sensor has been integrated into PubMed since June 2017 to detect informational queries for which results sorted by relevance can be suggested as an alternative to those sorted by the default date sort. In addition, the composition of PubMed queries as computed by the Field Sensor proves to be essential for understanding how users query PubMed.

References Powered by Scopus

Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm

4758Citations
N/AReaders
Get full text

Comparison of PubMed, Scopus, Web of Science, and Google Scholar: Strengths and weaknesses

3286Citations
N/AReaders
Get full text

PubMed and beyond: A survey of web tools for searching biomedical literature

397Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Best Match: New relevance search for PubMed

110Citations
N/AReaders
Get full text

MedCPT: Contrastive Pre-trained Transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval

40Citations
N/AReaders
Get full text

PubMed Labs: An experimental system for improving biomedical literature search

18Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Yeganova, L., Kim, W., Comeau, D. C., Wilbur, W. J., & Lu, Z. (2018). A Field Sensor: Computing the composition and intent of PubMed queries. Database, 2018(2018). https://doi.org/10.1093/database/bay052

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 3

50%

Researcher 2

33%

Professor / Associate Prof. 1

17%

Readers' Discipline

Tooltip

Biochemistry, Genetics and Molecular Bi... 2

40%

Pharmacology, Toxicology and Pharmaceut... 1

20%

Computer Science 1

20%

Agricultural and Biological Sciences 1

20%

Save time finding and organizing research with Mendeley

Sign up for free