More information about clustering and similarity/dissimilarity
measures can be found in:
[1] Mathematical Classification and Clustering, by Boris Mirkin, published by Kluwer Academic Publishers, Netherlands, 1996.
[2] Finding Groups in Data, by Leonard Kaufman and Peter J. Rousseeuw, published by John Wiley, New York, 1994.
[3] Data Clustering: A Review, by A.K. Jain, M.N. Murty, and P.J. Flynn, published in ACM Computing Surveys, vol 31, number 3, 1999, pages 264-323.
The generalization of entropy and conditional entropy was introduced in:
[4] Impurity Measures and Applications to Classification and Clustering, by Dan Simovici, Dana Cristofor, and Laurentiu Cristofor, published in International Conference on Advances in Infrastructure for Electronic Business, Science, and Education - Scuola Superiore G. R. Romoli, L'Aquila, Italy, 2000.
The genetic algorithms implemented in GAClust were introduced in:
[5] Finding Median Partitions Using Information-Theoretical-Based Genetic Algorithms, by Dana Cristofor and Dan Simovici, published in Journal of Universal Computer Science, vol 8, number 2, 2002, pages 153-172.
[6] An Information-Theoretical Approach to Clustering Categorical Databases using Genetic Algorithms, by Dana Cristofor and Dan Simovici, published in Proceedings of the Second SIAM International Conference on Data Mining - Workshop on Clustering High Dimensional Data and its Applications, 2002.