Provenance and the price of identity

4Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

As developers acknowledge that provenance is essential, more and more datasets are attempting to keep provenance records describing how they were created. Some of these datasets are constructed using workflows, others cobble together processes and applications to manipulate the data. While the provenance needs are the same, the inputs and set of processes used must be kept, the identity needs are very different. We outline several identification strategies that can be used for data manipulation outside of workflows.We evaluate these strategies in terms of time to create and store identity, and the space needed to keep this information. Additionally, we discuss the strengths and weaknesses of each strategy.

References Powered by Scopus

NCBI Reference Sequence (RefSeq): A curated non-redundant sequence database of genomes, transcripts and proteins

1483Citations
N/AReaders
Get full text

Why and where: A characterization of data provenance?

743Citations
N/AReaders
Get full text

Tracing the lineage of view data in a warehousing environment

309Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Why Not?

167Citations
N/AReaders
Get full text

Automatically adapting source code to document provenance

4Citations
N/AReaders
Get full text

Distinguishing provenance equivalence of earth science data

2Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Chapman, A., & Jagadish, H. V. (2008). Provenance and the price of identity. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5272, pp. 106–119). Springer Verlag. https://doi.org/10.1007/978-3-540-89965-5_12

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 9

64%

Researcher 4

29%

Professor / Associate Prof. 1

7%

Readers' Discipline

Tooltip

Computer Science 13

93%

Mathematics 1

7%

Save time finding and organizing research with Mendeley

Sign up for free