A formal Account of the Open Provenance Model

Page 1

A Formal Account of the Open Provenance Model NATALIA KWASNIKOWSKA, Hasselt University and Transnational University of Limburg LUC MOREAU, University of Southampton JAN VAN DEN BUSSCHE, Hasselt University and Transnational University of Limburg

On the Web, where resources such as documents and data are published, shared, transformed, and republished, provenance is a crucial piece of metadata that would allow users to place their trust in the resources they access. The open provenance model (OPM) is a community data model for provenance that is designed to facilitate the meaningful interchange of provenance information between systems. Underpinning OPM is a notion of directed graph, where nodes represent data products and processes involved in past computations and edges represent dependencies between them; it is complemented by graphical inference rules allowing new dependencies to be derived. Until now, however, the OPM model was a purely syntactical endeavor. The present article extends OPM graphs with an explicit distinction between precise and imprecise edges. Then a formal semantics for the thus enriched OPM graphs is proposed, by viewing OPM graphs as temporal theories on the temporal events represented in the graph. The original OPM inference rules are scrutinized in view of the semantics and found to be sound but incomplete. An extended set of graphical rules is provided and proved to be complete for inference. The article concludes with applications of the formal semantics to inferencing in OPM graphs, operators on OPM graphs, and a formal notion of refinement among OPM graphs. Categories and Subject Descriptors: H.1.m [Models and Principles]: Miscellaneous General Terms: Languages, Standardization, Theory, Verification Additional Key Words and Phrases: Provenance, temporal reasoning, World Wide Web ACM Reference Format: Natalia Kwasnikowska, Luc Moreau, and Jan Van Den Bussche. 2015. A formal account of the open provenance model. ACM Trans. Web 9, 2, Article 10 (May 2015), 44 pages. DOI: http://dx.doi.org/10.1145/2734116

1. INTRODUCTION

In the context of the Web, provenance is information about entities, activities, and people involved in producing a resource (a piece of data, or any other thing) [Moreau et al. 2013]. This information can be used to form assessments about the quality, reliability, or trustworthiness of the resource. On the Web, data flows across multiple systems, implemented using different technologies and potentially hosted by different institutions. Hence, tracking the provenance of data in this context is particularly challenging, as it involves understanding flows of information in these different systems. Luc Moreau’s work is funded in part by the EPSRC SOCIAM (EP/J017728/1) and ORCHID projects (EP/I011587/1), the FP7 SmartSociety project (600854), and the ESRC ebook project (ES/K007246/1). Authors’ addresses: N. Kwasnikowska and J. Van den Bussche, Universiteit Hasselt, Agoralaan, gebouw D, 3590 Diepenbeek, Belgium; email: jan.vandenbussche@uhasselt.be; L. Moreau, School of Electronics & Computer Science, University of Southampton, Southampton SO17 1BJ, United Kingdom; email: L.Moreau@ecs.soton.ac.uk. Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies show this notice on the first page or initial screen of a display along with the full citation. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, to republish, to post on servers, to redistribute to lists, or to use any component of this work in other works requires prior specific permission and/or a fee. Permissions may be requested from Publications Dept., ACM, Inc., 2 Penn Plaza, Suite 701, New York, NY 10121-0701 USA, fax +1 (212) 869-0481, or permissions@acm.org. c 2015 ACM 1559-1131/2015/05-ART10 $15.00 DOI: http://dx.doi.org/10.1145/2734116

ACM Transactions on the Web, Vol. 9, No. 2, Article 10, Publication date: May 2015.

10


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.
A formal Account of the Open Provenance Model by Smart Society Project - Issuu