Search CORE

11,262 research outputs found

A Call to Arms: Revisiting Database Design

Author: Antonio Badia
Berners-Lee T.
Chauduri S.
Daniel Lemire
Golab L.
Helland P.
Kiely G.
Nagarajan S.
Olivé A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2011
Field of study

Good database design is crucial to obtain a sound, consistent database, and - in turn - good database design methodologies are the best way to achieve the right design. These methodologies are taught to most Computer Science undergraduates, as part of any Introduction to Database class. They can be considered part of the "canon", and indeed, the overall approach to database design has been unchanged for years. Moreover, none of the major database research assessments identify database design as a strategic research direction. Should we conclude that database design is a solved problem? Our thesis is that database design remains a critical unsolved problem. Hence, it should be the subject of more research. Our starting point is the observation that traditional database design is not used in practice - and if it were used it would result in designs that are not well adapted to current environments. In short, database design has failed to keep up with the times. In this paper, we put forth arguments to support our viewpoint, analyze the root causes of this situation and suggest some avenues of research.Comment: Removed spurious column break. Nothing else was change

arXiv.org e-Print Archive

CiteSeerX

R-libre

Crossref

An object-oriented approach to distributed data management.

Author: Marinos L.
Papazoglou M.
Publication venue
Publication date
Field of study

Research Papers in Economics

Justification for inclusion dependency normal form

Author: Levene Mark
Vincent Millist W.
Publication venue: IEEE Computer Society
Publication date: 01/01/1999
Field of study

Functional dependencies (FDs) and inclusion dependencies (INDs) are the most fundamental integrity constraints that arise in practice in relational databases. In this paper, we address the issue of normalization in the presence of FDs and INDs and, in particular, the semantic justification for Inclusion Dependency Normal Form (IDNF), a normal form which combines Boyce-Codd normal form with the restriction on the INDs that they be noncircular and key-based. We motivate and formalize three goals of database design in the presence of FDs and INDs: noninteraction between FDs and INDs, elimination of redundancy and update anomalies, and preservation of entity integrity. We show that, as for FDs, in the presence of INDs being free of redundancy is equivalent to being free of update anomalies. Then, for each of these properties, we derive equivalent syntactic conditions on the database design. Individually, each of these syntactic conditions is weaker than IDNF and the restriction that an FD not be embedded in the righthand side of an IND is common to three of the conditions. However, we also show that, for these three goals of database design to be satisfied simultaneously, IDNF is both a necessary and sufficient condition

CiteSeerX

Birkbeck Institutional Research Online

Reasoning about Independence in Probabilistic Models of Relational Data

Author: Jensen David
Maier Marc
Marazopoulou Katerina
Publication venue
Publication date: 06/01/2014
Field of study

We extend the theory of d-separation to cases in which data instances are not independent and identically distributed. We show that applying the rules of d-separation directly to the structure of probabilistic models of relational data inaccurately infers conditional independence. We introduce relational d-separation, a theory for deriving conditional independence facts from relational models. We provide a new representation, the abstract ground graph, that enables a sound, complete, and computationally efficient method for answering d-separation queries about relational models, and we present empirical results that demonstrate effectiveness.Comment: 61 pages, substantial revisions to formalisms, theory, and related wor

arXiv.org e-Print Archive

CiteSeerX

Evolving information systems: meeting the ever-changing environment

Author: Brinkkemper S.
Falkenberg E.D.
Falkenberg E.D.
Falkenberg E.D.
Gane C.
Hofstede A.H.M.
Hofstede A.H.M.
Lundeberg M.
Nijssen G.M.
Proper H.A.
Roddick J.F.
Rolland C.
Snodgrass R.
Veenstra B.M.J.M.
Verrijn-Stuart A.A.
Wijers G.M.
Wintraecken J.J.V.R.
Yourdon E.
Publication venue: Pergamon
Publication date: 10/02/1993
Field of study

To meet the demands of organizations and their ever-changing environment, information systems are required which are able to evolve to the same extent as organizations do. Such a system has to support changes in all time-and application-dependent aspects. In this paper, requirements and a conceptual framework for evolving information systems are presented. This framework includes an architecture for such systems and a revision of the traditional notion of update. Based on this evolutionary notion of update (recording, correction and forgetting) a state transition-oriented model on three levels of abstraction (event level, recording level, correction level) is introduced. Examples are provided to illustrate the conceptual framework for evolving information systems

CiteSeerX

Crossref

University of Twente Research Information

Why is the snowflake schema a good data warehouse design?

Author: Levene Mark
Loizou George
Publication venue: 'Elsevier BV'
Publication date: 01/01/2003
Field of study

Database design for data warehouses is based on the notion of the snowflake schema and its important special case, the star schema. The snowflake schema represents a dimensional model which is composed of a central fact table and a set of constituent dimension tables which can be further broken up into subdimension tables. We formalise the concept of a snowflake schema in terms of an acyclic database schema whose join tree satisfies certain structural properties. We then define a normal form for snowflake schemas which captures its intuitive meaning with respect to a set of functional and inclusion dependencies. We show that snowflake schemas in this normal form are independent as well as separable when the relation schemas are pairwise incomparable. This implies that relations in the data warehouse can be updated independently of each other as long as referential integrity is maintained. In addition, we show that a data warehouse in snowflake normal form can be queried by joining the relation over the fact table with the relations over its dimension and subdimension tables. We also examine an information-theoretic interpretation of the snowflake schema and show that the redundancy of the primary key of the fact table is zero

CiteSeerX

Birkbeck Institutional Research Online