Constrained co-clustering for textual documents

19Citations
Citations of this article
40Readers
Mendeley users who have this article in their library.

Abstract

In this paper, we present a constrained co-clustering approach for clustering textual documents. Our approach combines the benefits of information-theoretic co-clustering and constrained clustering. We use a two-sided hidden Markov random field (HMRF) to model both the document and word constraints. We also develop an alternating expectation maximization (EM) algorithm to optimize the constrained co-clustering model. We have conducted two sets of experiments on a benchmark data set: (1) using human-provided category labels to derive document and word constraints for semi-supervised document clustering, and (2) using automatically extracted named entities to derive document constraints for unsupervised document clustering. Compared to several representative constrained clustering and co-clustering approaches, our approach is shown to be more effective for high-dimensional, sparse text data. Copyright © 2010, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

References Powered by Scopus

Data clustering: A review

10809Citations
N/AReaders
Get full text

Cluster ensembles - A knowledge reuse framework for combining multiple partitions

4115Citations
N/AReaders
Get full text

Text classification from labeled and unlabeled documents using EM

2292Citations
N/AReaders
Get full text

Cited by Powered by Scopus

A survey of visual analytics techniques and applications: State-of-the-art research and future challenges

166Citations
N/AReaders
Get full text

Bridging Text Visualization and Mining: A Task-Driven Survey

80Citations
N/AReaders
Get full text

Feature-reduction fuzzy co-clustering approach for hyper-spectral image analysis

28Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Song, Y., Pan, S., Liu, S., Wei, F., Zhou, M. X., & Qian, W. (2010). Constrained co-clustering for textual documents. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 581–586). AI Access Foundation. https://doi.org/10.1609/aaai.v24i1.7680

Readers over time

‘10‘11‘12‘13‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘24036912

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 25

76%

Professor / Associate Prof. 5

15%

Researcher 3

9%

Readers' Discipline

Tooltip

Computer Science 24

73%

Engineering 6

18%

Mathematics 2

6%

Chemistry 1

3%

Save time finding and organizing research with Mendeley

Sign up for free
0