Post-Learning Optimization of Tree Ensembles for Efficient Ranking / Lucchese, C; Nardini, FM; Orlando, S; Perego, R; Silvestri, F; Trani, S. - (2016), pp. 949-952. (Paper presented at the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016, held in Pisa, Italy) [10.1145/2911451.2914763].

Post-Learning Optimization of Tree Ensembles for Efficient Ranking

Lucchese C; Nardini FM; Orlando S; Perego R; Silvestri F; Trani S

2016

Abstract

Learning to Rank (LtR) is the machine learning method of choice for producing high-quality document ranking functions from a ground truth of training examples. In practice, efficiency and effectiveness are intertwined, and trading effectiveness off against the efficiency constraints typical of large-scale systems is one of the most urgent issues. In this paper we propose a new framework, named CLEaVER, for optimizing machine-learned ranking models based on ensembles of regression trees. The goal is to improve efficiency at document scoring time without affecting quality. Since the cost of an ensemble is linear in its size, CLEaVER first removes a subset of the trees in the ensemble and then fine-tunes the weights of the remaining trees according to any given quality measure. Experiments conducted on two publicly available LtR datasets show that CLEaVER is able to prune up to 80% of the trees and yields a speed-up of up to 2.6x without affecting the effectiveness of the model. © 2016 ACM.
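
The prune-then-re-weight idea described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the CLEaVER implementation: the pruning rule (keep the trees with the largest absolute weight), the greedy coordinate search used for re-weighting, NDCG@10 as the quality measure, the synthetic data, and every function name are hypothetical choices made for the example.

```python
import math
import random


def ndcg_at_k(scores, labels, k=10):
    """NDCG@k for one query, given per-document scores and relevance labels."""
    def dcg(rels):
        return sum((2 ** r - 1) / math.log2(i + 2) for i, r in enumerate(rels[:k]))
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    ideal = dcg(sorted(labels, reverse=True))
    return dcg([labels[i] for i in order]) / ideal if ideal > 0 else 0.0


def mean_quality(weights, queries, quality=ndcg_at_k):
    """Average quality over queries; a document's score is sum_t w_t * h_t(d)."""
    total = 0.0
    for tree_outputs, labels in queries:
        scores = [sum(w * h for w, h in zip(weights, doc)) for doc in tree_outputs]
        total += quality(scores, labels)
    return total / len(queries)


def prune_low_weight(weights, keep):
    """Assumed pruning rule: keep the `keep` trees with the largest |weight|."""
    by_weight = sorted(range(len(weights)), key=lambda t: -abs(weights[t]))
    return sorted(by_weight[:keep])


def reweight(weights, kept, queries, quality=ndcg_at_k,
             factors=(0.25, 0.5, 2.0, 4.0), rounds=2):
    """Greedy coordinate search over the weights of the kept trees.

    Pruned trees get weight 0 only to keep indices aligned; a real scorer
    would skip them entirely. A candidate weight is accepted only if it
    strictly improves the mean quality, so the result is never worse than
    the pruned ensemble with its original weights.
    """
    kept_set = set(kept)
    w = [wt if t in kept_set else 0.0 for t, wt in enumerate(weights)]
    best = mean_quality(w, queries, quality)
    for _ in range(rounds):
        for t in kept:
            original = best_weight = w[t]
            for f in factors:
                w[t] = original * f
                q = mean_quality(w, queries, quality)
                if q > best:
                    best, best_weight = q, w[t]
            w[t] = best_weight
    return w, best


def make_toy_queries(rng, n_queries=5, n_docs=8, n_trees=20):
    """Synthetic data: each tree output is a noisy function of the label."""
    queries = []
    for _ in range(n_queries):
        labels = [rng.randint(0, 4) for _ in range(n_docs)]
        outputs = [[lab + rng.gauss(0, 2.0) for _ in range(n_trees)] for lab in labels]
        queries.append((outputs, labels))
    return queries


if __name__ == "__main__":
    rng = random.Random(0)
    queries = make_toy_queries(rng)
    weights = [rng.uniform(0.05, 0.2) for _ in range(20)]
    kept = prune_low_weight(weights, keep=4)  # drop 80% of the trees
    tuned, q_tuned = reweight(weights, kept, queries)
    print("full ensemble  :", mean_quality(weights, queries))
    print("pruned + tuned :", q_tuned)
```

Because the search only accepts strict improvements, the tuned ensemble is guaranteed to be at least as good as the pruned one on the data used for tuning; whether it matches the full ensemble depends on the data and on the pruning rule.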

Publication year: 2016
Conference name: 39th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2016
Keywords: Efficiency; Learning to Rank; Pruning
Type: 04 Publication in conference proceedings::04b Conference paper in volume
			

Files attached to this product

File	Size	Format
Lucchese_Post-Learning_2016.pdf (access: archive managers only; type: publisher's version, published with the publisher's layout; license: all rights reserved)	572.83 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1476552
