Selecting unique records in XSLT/XPath
Posted
by Daniel I-S
on Stack Overflow
See other posts from Stack Overflow
or by Daniel I-S
Published on 2010-06-10T17:36:01Z
Indexed on
2010/06/11
1:03 UTC
Read the original article
Hit count: 301
I have to select only unique records from an XML document, in the context of an <xsl:for-each>
loop. I am limited by Visual Studio to using XSL 1.0.
<availList>
<item>
<schDate>2010-06-24</schDate>
<schFrmTime>10:00:00</schFrmTime>
<schToTime>13:00:00</schToTime>
<variousOtherElements></variousOtherElements>
</item>
<item>
<schDate>2010-06-24</schDate>
<schFrmTime>10:00:00</schFrmTime>
<schToTime>13:00:00</schToTime>
<variousOtherElements></variousOtherElements>
</item>
<item>
<schDate>2010-06-25</schDate>
<schFrmTime>10:00:00</schFrmTime>
<schToTime>12:00:00</schToTime>
<variousOtherElements></variousOtherElements>
</item>
<item>
<schDate>2010-06-26</schDate>
<schFrmTime>13:00:00</schFrmTime>
<schToTime>14:00:00</schToTime>
<variousOtherElements></variousOtherElements>
</item>
<item>
<schDate>2010-06-26</schDate>
<schFrmTime>10:00:00</schFrmTime>
<schToTime>12:00:00</schToTime>
<variousOtherElements></variousOtherElements>
</item>
</availList>
The uniqueness must be based on the value of the three child elements: schDate
, schFrmTime
and schToTime
. If two item
elements have the same values for all three child elements, they are duplicates. In the above XML, items one and two are duplicates. The rest are unique. As indicated above, each item contains other elements that we do not wish to include in the comparison. 'Uniqueness' should be a factor of those three elements, and those alone.
I have attempted to accomplish this through the following:
availList/item[not(schDate = preceding:: schDate and schFrmTime = preceding:: schFrmTime and schToTime = preceding:: schToTime)]
The idea behind this is to select records where there is no preceding element with the same schDate
, schFrmTime
and schToTime
. However, its output is missing the last item. This is because my XPath is actually excluding items where all of the child element values are matched within the entire preceding document. No single item
matches all of the last item's child elements - but because each element's value is individually present in another item, the last item gets excluded.
I could get the correct result by comparing all child values as a concatenated string to the same concatenated values for each preceding item. Does anybody know of a way I could do this?
© Stack Overflow or respective owner