PROTEIN Dissimilarity dataset.
    D = PROTEIN

The protein data are provided as a 213x213 dissimilarity matrix that compares protein sequences based on the concept of an evolutionary distance. The data were used for classification in [Graepel] and for clustering in [Denoeux and Masson]. There are four classes of globins: heterogeneous globin (G), hemoglobin-A (HA), hemoglobin-B (HB) and myoglobin (M).

Reference(s)
T. Graepel, R. Herbrich, P. Bollmann-Sdorra, K. Obermayer, Classification on pairwise proximity data. In Advances in Neural Information Processing Systems, vol. 11, 438-444, 1999.
T. Denoeux and M.-H. Masson, EVCLUS: Evidential clustering of proximity data. IEEE Transactions on Systems, Man and Cybernetics, vol. 34, 95-109, 2004.

See also
prtools, datasets, prdisdata
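As an illustration of how a labeled dissimilarity matrix such as the one described above can be used for classification, here is a minimal sketch of a leave-one-out nearest-neighbor error estimate. It is not the method of [Graepel] and not a toolbox routine. The function name `loo_1nn_error` and the 4x4 matrix are hypothetical, and the numbers are invented toy values, not protein data.

```python
def loo_1nn_error(dissim, labels):
    """Leave-one-out 1-NN error rate computed from a square dissimilarity matrix.

    dissim[i][j] is the dissimilarity between objects i and j;
    labels[i] is the class label of object i.
    """
    n = len(labels)
    errors = 0
    for i in range(n):
        # Find the closest other object, excluding i itself.
        nearest = min((j for j in range(n) if j != i),
                      key=lambda j: dissim[i][j])
        if labels[nearest] != labels[i]:
            errors += 1
    return errors / n


# Toy symmetric matrix with a zero diagonal (invented values, for illustration only).
toy_d = [
    [0.0, 1.0, 4.0, 5.0],
    [1.0, 0.0, 3.0, 4.5],
    [4.0, 3.0, 0.0, 0.5],
    [5.0, 4.5, 0.5, 0.0],
]
toy_labels = ["HA", "HA", "M", "M"]

error_rate = loo_1nn_error(toy_d, toy_labels)
```

The same function would apply unchanged to the full 213x213 matrix and its four class labels once they are exported as nested lists, assuming the matrix is square and every class has at least two members.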