Algorithm to generate numerical concept hierarchy

Posted by Christophe Herreman on Stack Overflow
Published on 2010-03-25T16:54:39Z

I have a couple of numerical datasets that I need to create a concept hierarchy for. So far, I have been doing this manually by inspecting the data (and a corresponding line chart) and creating hierarchies that seemed acceptable based on my intuition.

This seems like a task that can be automated. Does anyone know if there is an algorithm to generate a concept hierarchy for numerical data?


To give an example, I have the following dataset:

Bangladesh     521
Brazil         8295
Burma          446
China          3259
Congo          2952
Egypt          2162
Ethiopia       333
France         46037
Germany        44729
India          1017
Indonesia      2239
Iran           4600
Italy          38996
Japan          38457
Mexico         10200
Nigeria        1401
Pakistan       1022
Philippines    1845
Russia         11807
South Africa   5685
Thailand       4116
Turkey         10479
UK             43734
US             47440
Vietnam        1042

for which I created the following hierarchy:

  • LOWEST (< 1000)
  • LOW (1000 - 2500)
  • MEDIUM (2501 - 7500)
  • HIGH (7501 - 30000)
  • HIGHEST (> 30000)
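
To make the manual hierarchy concrete and to show the kind of automation I have in mind, here is a minimal Python sketch. It encodes the levels above as a lookup and, for comparison, tries one simple heuristic: cut the sorted values at the largest gaps on a log scale. The function names (manual_level, gap_levels) and the choice of a log-scale gap heuristic are my own assumptions for illustration, not an established algorithm for this problem, and the lookup assumes integer values.

```python
import math
from bisect import bisect_left

# Values copied from the dataset above (the post does not state their unit).
DATA = {
    "Bangladesh": 521, "Brazil": 8295, "Burma": 446, "China": 3259,
    "Congo": 2952, "Egypt": 2162, "Ethiopia": 333, "France": 46037,
    "Germany": 44729, "India": 1017, "Indonesia": 2239, "Iran": 4600,
    "Italy": 38996, "Japan": 38457, "Mexico": 10200, "Nigeria": 1401,
    "Pakistan": 1022, "Philippines": 1845, "Russia": 11807,
    "South Africa": 5685, "Thailand": 4116, "Turkey": 10479,
    "UK": 43734, "US": 47440, "Vietnam": 1042,
}

# Manual hierarchy: inclusive upper bound of every level except the last.
# Assumes integer values, so "< 1000" becomes "<= 999".
UPPER_BOUNDS = [999, 2500, 7500, 30000]
LABELS = ["LOWEST", "LOW", "MEDIUM", "HIGH", "HIGHEST"]


def manual_level(value):
    """Return the label of the manually defined level that contains value."""
    return LABELS[bisect_left(UPPER_BOUNDS, value)]


def gap_levels(values, k):
    """Hypothetical heuristic: split the sorted values into k groups
    at the k - 1 largest gaps between neighbours on a log10 scale.
    All values must be positive."""
    ordered = sorted(values)
    # (gap size, index at which the right-hand group would start)
    gaps = [
        (math.log10(b) - math.log10(a), i + 1)
        for i, (a, b) in enumerate(zip(ordered, ordered[1:]))
    ]
    cuts = sorted(index for _, index in sorted(gaps, reverse=True)[: k - 1])
    groups, start = [], 0
    for cut in cuts + [len(ordered)]:
        groups.append(ordered[start:cut])
        start = cut
    return groups


if __name__ == "__main__":
    for country, value in sorted(DATA.items(), key=lambda item: item[1]):
        print(f"{country:<14}{value:>6}  {manual_level(value)}")
    for group in gap_levels(DATA.values(), 5):
        print(group[0], "-", group[-1], f"({len(group)} values)")
```

On this dataset the heuristic cuts between 521 and 1017, between 1042 and 1401, between 5685 and 8295, and between 11807 and 38457. That matches my LOWEST and HIGHEST levels but draws the LOW and MEDIUM boundaries differently, which is why I am looking for something more principled.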
