Discussion on “How to find an appropriate clustering for mixed type variables with application to socio-economic stratification” / Vicari, D. - In: JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS. - ISSN 0035-9254. - 62:3(2013), pp. 359-360. [10.1111/j.1467-9876.2012.01066.x]

Discussion on “How to find an appropriate clustering for mixed type variables with application to socio-economic stratification”

Donatella Vicari

2013

Abstract

Discussion on "Data with mixed‐type (metric–ordinal–nominal) variables are typical for social stratification, i.e. partitioning a population into social classes. Approaches to cluster such data are compared, namely a latent class mixture model assuming local independence and dissimilarity‐based methods such as k‐medoids. The design of an appropriate dissimilarity measure and the estimation of the number of clusters are discussed as well, comparing the Bayesian information criterion with dissimilarity‐based criteria. The comparison is based on a philosophy of cluster analysis that connects the problem of a choice of a suitable clustering method closely to the application by considering direct interpretations of the implications of the methodology. The application of this philosophy to economic data from the 2007 US Survey of Consumer Finances demonstrates techniques and decisions required to obtain an interpretable clustering. The clustering is shown to be significantly more structured than a suitable null model. One result is that the data‐based strata are not as strongly connected to occupation categories as is often assumed in the literature."

	Year of publication
				2013
	Keywords
				average silhouette width; cluster philosophy; dissimilarity measure; interpretation of clustering; k-medoids clustering; latent class clustering; mixture model; number of clusters; social stratification
	Type
				01 Journal publication::01a Journal article

Files attached to this product

File	Size	Format
Vicari_Discussion_2013.pdf (access: archive managers only; type: publisher's version, published with the publisher's layout; license: all rights reserved)	8.04 MB	Adobe PDF

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1193807
