Extended language models for XML element retrieval

Rongmei Li; Theo Van Der Weide

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 6932 LNCS, pp. 89-97 (2011)

DOI: 10.1007/978-3-642-23577-1_8

Abstract

In this paper we describe our participation in the INEX 2010 ad-hoc track. We participated in three retrieval tasks (restricted focused, relevant-in-context, and restricted relevant-in-context) and report our findings based on a single set of measures for all tasks. This year, we evaluate the performance of a standard language model that focuses on a fixed number of relevant characters rather than on relevant paragraphs. Our findings are: 1) the simplest language model for document retrieval performs relatively well in the restricted focused task when using a fixed offset that is close to the average character distance from the beginning of a document to its main content; 2) a good document ranking improves the performance of snippet retrieval; 3) stemming and stopword removal can further boost performance. © 2011 Springer-Verlag.
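
The approach outlined in the abstract (rank documents with a simple language model, then return a fixed-length character window starting at a fixed offset) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not state the smoothing method, tokenization, offset, or snippet length, so the Jelinek-Mercer smoothing, the whitespace tokenizer, and the names `score_query_likelihood`, `fixed_offset_snippet`, and `retrieve` below are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the paper also
    # reports gains from stemming and stopword removal, omitted here.
    return text.lower().split()


def score_query_likelihood(query, doc_tokens, coll_counts, coll_len, lam=0.5):
    """Log query likelihood of a document.

    Assumption: Jelinek-Mercer smoothing with weight `lam`; the abstract
    does not say which smoothing the authors used.
    """
    doc_counts = Counter(doc_tokens)
    doc_len = len(doc_tokens)
    score = 0.0
    for term in tokenize(query):
        p_doc = doc_counts[term] / doc_len if doc_len else 0.0
        p_coll = coll_counts[term] / coll_len if coll_len else 0.0
        p = lam * p_doc + (1.0 - lam) * p_coll
        if p == 0.0:
            # Term unseen in the whole collection: no evidence at all.
            return float("-inf")
        score += math.log(p)
    return score


def fixed_offset_snippet(text, offset, length):
    # Return `length` characters starting `offset` characters into the text.
    return text[offset:offset + length]


def retrieve(query, docs, offset=0, length=1000, lam=0.5):
    """Rank documents, then cut one fixed-offset snippet from each.

    `docs` maps a document id to its plain text. The default `offset`
    and `length` are placeholders, not values from the paper.
    """
    tokenized = {doc_id: tokenize(text) for doc_id, text in docs.items()}
    coll_counts = Counter(t for toks in tokenized.values() for t in toks)
    coll_len = sum(coll_counts.values())
    ranked = sorted(
        tokenized,
        key=lambda d: score_query_likelihood(
            query, tokenized[d], coll_counts, coll_len, lam
        ),
        reverse=True,
    )
    return [(d, fixed_offset_snippet(docs[d], offset, length)) for d in ranked]


if __name__ == "__main__":
    docs = {
        "a": "xml element retrieval with language models",
        "b": "cooking pasta with tomato sauce",
    }
    for doc_id, snippet in retrieve("xml retrieval", docs, offset=4, length=7):
        print(doc_id, repr(snippet))
```

In this sketch the snippet boundaries depend only on the character offset, not on the query, which mirrors the abstract's point that a well-chosen fixed offset (near the average distance to a document's main content) together with a good document ranking can already perform reasonably.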

Cite

Li, R., & Van Der Weide, T. (2011). Extended language models for XML element retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6932 LNCS, pp. 89–97). https://doi.org/10.1007/978-3-642-23577-1_8
