Language modelling of constraints for text clustering

Javier Parapar; Álvaro Barreiro

Conference Proceedings

Language modelling of constraints for text clustering

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7224 LNCS 352-363

DOI: 10.1007/978-3-642-28997-2_30

N/ACitations

5Readers

Get full text

Abstract

Constrained clustering is a recently presented family of semi-supervised learning algorithms. These methods use domain information to impose constraints over the clustering output. The way in which those constraints (typically pair-wise constraints between documents) are introduced is by designing new clustering algorithms that enforce the accomplishment of the constraints. In this paper we present an alternative approach for constrained clustering where, instead of defining new algorithms or objective functions, the constraints are introduced modifying the document representation by means of their language modelling. More precisely the constraints are modelled using the well-known Relevance Models successfully used in other retrieval tasks such as pseudo-relevance feedback. To the best of our knowledge this is the first attempt to try such approach. The results show that the presented approach is an effective method for constrained clustering even improving the results of existing constrained clustering algorithms. © 2012 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Parapar, J., & Barreiro, Á. (2012). Language modelling of constraints for text clustering. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7224 LNCS, pp. 352–363). https://doi.org/10.1007/978-3-642-28997-2_30

Language modelling of constraints for text clustering

Abstract

Cite

Register to see more suggestions