The complexity of dependency detection and discovery in relational databases

Blaesius, Thomas; Friedrich, Tobias; Schirneck, Friedrich Martin

doi:10.1016/j.tcs.2021.11.020

Multi-column dependencies in relational databases come associated with two different computational tasks. The detection problem is to decide whether a dependency of a certain type and size holds in a given database, the discovery problem asks to enumerate all valid dependencies of that type. We settle the complexity of both of these problems for unique column combinations (UCCs), functional dependencies (FDs), and inclusion dependencies (INDs). We show that the detection of UCCs and FDs is W[2]-complete when parameterized by the solution size. The discovery of inclusion-wise minimal UCCs is proven to be equivalent under parsimonious reductions to the transversal hypergraph problem of enumerating the minimal hitting sets of a hypergraph. The discovery of FDs is equivalent to the simultaneous enumeration of the hitting sets of multiple input hypergraphs. We further identify the detection of INDs as one of the first natural W[3]-complete problems.
The discovery of maximal INDs is shown to be equivalent to enumerating the maximal satisfying assignments of antimonotone, 3-normalized Boolean formulas.…
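
To make the link between minimal UCCs and minimal hitting sets concrete, here is a small Python sketch. It is not the authors' algorithm: the function names, the toy table, and the brute-force enumeration are assumptions chosen for illustration. It relies only on the standard observation that a set of columns is a UCC exactly when it intersects, for every pair of rows, the set of columns on which the two rows differ.

```python
from itertools import combinations


def is_ucc(rows, cols):
    """Return True if no two rows agree on all columns in `cols`."""
    seen = set()
    for row in rows:
        key = tuple(row[c] for c in cols)
        if key in seen:
            return False
        seen.add(key)
    return True


def difference_sets(rows):
    """Hyperedges: for each pair of rows, the column indices where they differ."""
    return {
        frozenset(i for i, (a, b) in enumerate(zip(r, s)) if a != b)
        for r, s in combinations(rows, 2)
    }


def minimal_hitting_sets(edges, n_cols):
    """Brute force (exponential): all inclusion-wise minimal hitting sets."""
    found = []
    # Candidates are visited by increasing size, so any hitting set that
    # contains an earlier result is not minimal and can be skipped.
    for k in range(n_cols + 1):
        for cand in combinations(range(n_cols), k):
            s = frozenset(cand)
            if any(f <= s for f in found):
                continue
            if all(s & e for e in edges):
                found.append(s)
    return found


# Hypothetical toy table with three columns (indices 0, 1, 2).
rows = [
    ("a", 1, "x"),
    ("a", 2, "y"),
    ("b", 1, "y"),
]

minimal_uccs = minimal_hitting_sets(difference_sets(rows), n_cols=3)
for ucc in minimal_uccs:
    print(sorted(ucc), is_ucc(rows, sorted(ucc)))
```

On this toy table no single column is unique, and every pair of columns is a minimal UCC. The enumeration is exponential in the number of columns and is meant only to show the correspondence, not to suggest a practical discovery method.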

Metadata
Authors:	Thomas Blaesius, Tobias Friedrich, Friedrich Martin Schirneck
DOI:	https://doi.org/10.1016/j.tcs.2021.11.020
ISSN:	0304-3975
ISSN:	1879-2294
Title of parent work (English):	Theoretical computer science
Publisher:	Elsevier
Place of publication:	Amsterdam
Publication type:	Scientific article
Language:	English
Date of first publication:	2021-12-02
Year of publication:	2021
Release date:	2023-12-07
Free keyword / tag:	Unique column combination; W[3]-completeness; data profiling; dependency; enumeration complexity; functional dependency; inclusion; parameterized complexity; parsimonious reduction; transversal hypergraph
Volume:	900
Number of pages:	18
First page:	79
Last page:	96
Organizational units:	Affiliated institutes / Hasso-Plattner-Institut für Digital Engineering gGmbH
DDC classification:	0 Computer science, information, general works / 00 Computer science, knowledge, systems / 000 Computer science, information, general works
Peer review:	Refereed
