Methods for exploring and mining tables on Wikipedia

84Citations
Citations of this article
71Readers
Mendeley users who have this article in their library.

Abstract

Knowledge bases extracted automatically from the Web present new opportunities for data mining and exploration. Given a large, heterogeneous set of extracted relations, new tools are needed for searching the knowledge and uncovering relationships of interest. We present WikiTables, a Web application that enables users to interactively explore tabular knowledge extracted from Wikipedia. In experiments, we show that WikiTables substantially outperforms baselines on the novel task of automatically joining together disparate tables to uncover \interesting" relationships between table columns. We find that a \Semantic Relatedness"measure that leverages the Wikipedia link structure accounts for a majority of this improvement. Further, on the task of keyword search for tables, we show that WikiTables performs comparably to Google Fusion Tables despite using an order of magnitude fewer tables. Our work also includes the release of a number of public resources, including over 15 million tuples of extracted tabular data, manually annotated evaluation sets, and public APIs. © Copyright 2013 ACM.

References Powered by Scopus

DBpedia - A crystallization point for the Web of Data

1810Citations
N/AReaders
Get full text

WebTables: Exploring the power of tables on the web

521Citations
N/AReaders
Get full text

Linear feature-based models for information retrieval

282Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Semantic Web in data mining and knowledge discovery: A comprehensive survey

279Citations
N/AReaders
Get full text

Dataset search: a survey

161Citations
N/AReaders
Get full text

TabEL: Entity linking in web tables

128Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Bhagavatula, C. S., Noraset, T., & Downey, D. (2013). Methods for exploring and mining tables on Wikipedia. In Proceedings of the ACM SIGKDD 2013 Workshop on Interactive Data Exploration and Analytics, IDEA 2013 (pp. 18–26). Association for Computing Machinery. https://doi.org/10.1145/2501511.2501516

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 31

70%

Researcher 5

11%

Professor / Associate Prof. 4

9%

Lecturer / Post doc 4

9%

Readers' Discipline

Tooltip

Computer Science 46

88%

Engineering 4

8%

Social Sciences 1

2%

Physics and Astronomy 1

2%

Save time finding and organizing research with Mendeley

Sign up for free