Learning to rank answers to non-factoid questions from web collections

138Citations
Citations of this article
250Readers
Mendeley users who have this article in their library.

Abstract

This work investigates the use of linguistically motivated features to improve search, in particular for ranking answers to non-factoid questions. We show that it is possible to exploit existing large collections of question-answer pairs (from online social Question Answering sites) to extract such features and train ranking models which combine them effectively.We investigate a wide range of feature types, some exploiting natural language processing such as coarse word sense disambiguation, named-entity identification, syntactic parsing, and semantic role labeling. Our experiments demonstrate that linguistic features, in combination, yield considerable improvements in accuracy. Depending on the system settings we measure relative improvements of 14% to 21% in Mean Reciprocal Rank and Precision@1, providing one of the most compelling evidence to date that complex linguistic features such as word senses and semantic roles can have a significant impact on large-scale information retrieval tasks. © 2011 Association for Computational Linguistics.

References Powered by Scopus

The proposition bank: An annotated corpus of semantic roles

1668Citations
N/AReaders
Get full text

Training linear SVMs in linear time

1551Citations
N/AReaders
Get full text

Finding high-quality content in social media

970Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Abusive language detection in online user content

891Citations
N/AReaders
Get full text

Learning to rank short text pairs with convolutional deep neural networks

604Citations
N/AReaders
Get full text

aNMM: Ranking short answer texts with attention-based neural matching model

163Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Surdeanu, M., Ciaramita, M., & Zaragoza, H. (2011). Learning to rank answers to non-factoid questions from web collections. Computational Linguistics, 37(2), 351–383. https://doi.org/10.1162/COLI_a_00051

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 124

72%

Researcher 33

19%

Professor / Associate Prof. 9

5%

Lecturer / Post doc 7

4%

Readers' Discipline

Tooltip

Computer Science 154

87%

Linguistics 13

7%

Engineering 7

4%

Social Sciences 3

2%

Save time finding and organizing research with Mendeley

Sign up for free