Detecting ambiguity in prioritized database repairing

Benny Kimelfeld, Ester Livshits, Liat Peterfreund

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

In its traditional definition, a repair of an inconsistent database is a consistent database that differs from the inconsistent one in a "minimal way." Often, repairs are not equally legitimate, as it is desired to prefer one over another; for example, one fact is regarded more reliable than another, or a more recent fact should be preferred to an earlier one. Motivated by these considerations, researchers have introduced and investigated the framework of preferred repairs, in the context of denial constraints and subset repairs. There, a priority relation between facts is lifted towards a priority relation between consistent databases, and repairs are restricted to the ones that are optimal in the lifted sense. Three notions of lifting (and optimal repairs) have been proposed: Pareto, global, and completion. In this paper we investigate the complexity of deciding whether the priority relation suffices to clean the database unambiguously, or in other words, whether there is exactly one optimal repair. We show that the different lifting semantics entail highly different complexities. Under Pareto optimality, the problem is coNP-complete, in data complexity, for every set of functional dependencies (FDs), except for the tractable case of (equivalence to) one FD per relation. Under global optimality, one FD per relation is still tractable, but we establish Πp2-completeness for a relation with two FDs. In contrast, under completion optimality the problem is solvable in polynomial time for every set of FDs. In fact, we present a polynomial-time algorithm for arbitrary conflict hypergraphs. We further show that under a general assumption of transitivity, this algorithm solves the problem even for global optimality. The algorithm is extremely simple, but its proof of correctness is quite intricate.

Original languageEnglish
Title of host publication20th International Conference on Database Theory, ICDT 2017
EditorsGiorgio Orsi, Michael Benedikt
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959770248
DOIs
StatePublished - 1 Mar 2017
Externally publishedYes
Event20th International Conference on Database Theory, ICDT 2017 - Venice, Italy
Duration: 21 Mar 201724 Mar 2017

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume68
ISSN (Print)1868-8969

Conference

Conference20th International Conference on Database Theory, ICDT 2017
Country/TerritoryItaly
CityVenice
Period21/03/1724/03/17

Bibliographical note

Publisher Copyright:
© Benny Kimelfeld, Ester Livshits,and Liat Peterfreund; licensed under Creative Commons License CC-BY 20th International Conference on Database Theory (ICDT 2017).

Keywords

  • Conflict hypergraph
  • Data cleaning
  • Functional dependencies
  • Inconsistent databases
  • Preferred repairs

Fingerprint

Dive into the research topics of 'Detecting ambiguity in prioritized database repairing'. Together they form a unique fingerprint.

Cite this