Probabilistic clustering of interval data

Brito, Paula; Silva, A. Pedro Duarte; Dias, José G.

http://hdl.handle.net/10400.14/20309

Utilize este identificador para referenciar este registo.

Nome:	Descrição:	Tamanho:	Formato:
26873917.pdf		535.56 KB	Adobe PDF	Ver/Abrir

Contacte-nos

Autores

Brito, Paula

Silva, A. Pedro Duarte

Dias, José G.

Resumo(s)

In this paper we address the problem of clustering interval data, adopting a model-based approach. To this purpose, parametric models for interval-valued variables are used which consider configurations for the variance-covariance matrix that take the nature of the interval data directly into account. Results, both on synthetic and empirical data, clearly show the well-founding of the proposed approach. The method succeeds in finding parsimonious heterocedastic models which is a critical feature in many applications. Furthermore, the analysis of the different data sets made clear the need to explicitly consider the intrinsic variability present in interval data.

Palavras-chave

Clustering methods Finite mixture models Interval-valued variable Intrinsic variability Symbolic data

URI

http://hdl.handle.net/10400.14/20309

Citação

BRITO, Paula; DUARTE SILVA, A. P.; DIAS, José G. - Probabilistic Clustering of Interval Data. Intelligent Data Analysis. 1571-4128. Vol. 19, n.º 2 (2015), p. 293-313