Catalogo dei prodotti della ricerca

Visualization recommendation is a novel and challenging field of study, whose aim is to provide non-expert users with automatic tools for insight discovery from data. Advances in this research area are hindered by the absence of reliable datasets on which to train the recommender systems. To the best of our knowledge, Plotly corpus is the only publicly available dataset, but as complained by many authors and discussed in this article, it contains many labeling errors, which greatly limits its usefulness. We release an improved version of the original dataset, named Plotly.plus, which we obtained through an automated procedure with minimal post-editing. In addition to a manual validation by a group of data science students, we demonstrate that when training two state-of-the-art abstract image classifiers on Plotly.plus, systems' performance improves more than twice as much as when the original dataset is used, showing that Plotly.plus facilitates the discovery of significant perceptual patterns.

Plotly.plus, an Improved Dataset for Visualization Recommendation / Podo, Luca; Velardi, Paola. - (2022), pp. 4384-4388. (Intervento presentato al convegno CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management tenutosi a Atlanta, Georgia, USA) [10.1145/3511808.3557669].

Plotly.plus, an Improved Dataset for Visualization Recommendation

Podo, Luca;Velardi, Paola

2022

Abstract

Visualization recommendation is a novel and challenging field of study, whose aim is to provide non-expert users with automatic tools for insight discovery from data. Advances in this research area are hindered by the absence of reliable datasets on which to train the recommender systems. To the best of our knowledge, Plotly corpus is the only publicly available dataset, but as complained by many authors and discussed in this article, it contains many labeling errors, which greatly limits its usefulness. We release an improved version of the original dataset, named Plotly.plus, which we obtained through an automated procedure with minimal post-editing. In addition to a manual validation by a group of data science students, we demonstrate that when training two state-of-the-art abstract image classifiers on Plotly.plus, systems' performance improves more than twice as much as when the original dataset is used, showing that Plotly.plus facilitates the discovery of significant perceptual patterns.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2022
			
	Nome convegno
	
				CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management
			
	Parole chiave
	
				visualization recommendation systems, visualization datasets, amchine learning for visualization
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				Plotly.plus, an Improved Dataset for Visualization Recommendation / Podo, Luca; Velardi, Paola. - (2022), pp. 4384-4388. (Intervento presentato al  convegno CIKM '22: Proceedings of the 31st ACM International Conference on Information & Knowledge Management tenutosi a Atlanta, Georgia, USA) [10.1145/3511808.3557669].

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1662588

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

ND

ND

social impact