Repeating fields in similar database tables
Posted
by
user1738833
on Stack Overflow
See other posts from Stack Overflow
or by user1738833
Published on 2012-12-12T10:22:53Z
Indexed on
2012/12/12
11:03 UTC
Read the original article
Hit count: 302
I have been tasked with working on a database that I have never seen before and I'm looking at the DB structure. Some of the central and most heavily queried and joined tables look like virtual duplicates of each other. Here's a massively simplified representation of the situation, with business-sensitive information changed, listing hypothetical table names and fields:
TopLevelGroup:
PK_TLGroupId, DisplaysXOnBill, DisplaysYOnBill, IsInvoicedForJ, IsInvoicedForK
SubGroup:
PK_SubGroupId, FK_ParentTopLevelGroupId, DisplaysXOnBill, DisplaysYOnBill, IsInvoicedForJ, IsInvoicedForK
SubSubGroup:
PK_SubSUbGroupId, FK_ParentSubGroupId, DisplaysXOnBill, DisplaysYOnBill, IsInvoicedForJ, IsInvoicedForK
I haven't listed the types of the fields as I don't think it's particularly important to the situation. In addition, it's worth saying that rather than four repeated fields as in the example above, I'm looking at 86 repeated fields. For the most part, those fields genuinely do represent "facts" about the primary table entity, so it's not automatically wrong for that reason.
In addition, the "groups" represented here have a property inheritance relationship. If DisplaysXOnBill
is NULL in the SubSubGroup
, it takes the value of DisplaysXOnBill
from it's parent, the SubGroup
, and so-on up to the TopLevelGroup.
Further, the requirements will never require that the model extends beyond three levels, so there is no need for flexibility in that area.
Is there a design smell from several tables which describe very similar entities having almost identical fields? If so, what might be a better design of the example above?
I'm using the phrase "design smell" to indicate a possible problem. Of course, in any given situation, a particular design might well be the best solution. I'm looking for a more general answer - wondering what might be wrong with this design and what might be the better design were that the case.
Possibly related, but not primary questions:
- Is this database schema in a reasonably normal form (e.g. to 3NF), insofar as can be told from the information I've provided. I can't see a problem with the requirements of 2NF and 3NF, except in their inheriting the requirements of 1NF. Is 1NF satisfied though? Are repeating groups allowed in different tables?
- Is there a best-practice method for implementing the inheritance relationship in a database as I require? The method above feels clunky to me because any query on the
SubSubGroup
necessarily needs to join onto theSubGroup
and theTopLevelGroup
tables to collect inherited facts, which can make even trivial joins requiring facts from the SubSubGroup table rather long-winded.
There are, of course, political considerations to making a relatively large change like this. For the purpose of this question, I'm happy to ignore that fact in the interests of keeping the answers ring-fenced to the technical problem.
© Stack Overflow or respective owner