Catalogo dei prodotti della ricerca

Visual recognition algorithms are required today to exhibit adaptive abilities. Given a deep model trained on a specific, given task, it would be highly desirable to be able to adapt incrementally to new tasks, preserving scalability as the number of new tasks increases, while at the same time avoiding catastrophic forgetting issues. Recent work has shown that masking the internal weights of a given original conv-net through learned binary variables is a promising strategy. We build upon this intuition and take into account more elaborated affine transformations of the convolutional weights that include learned binary masks. We show that with our generalization it is possible to achieve significantly higher levels of adaptation to new tasks, enabling the approach to compete with fine tuning strategies by requiring slightly more than 1 bit per network parameter per additional task. Experiments on two popular benchmarks showcase the power of our approach, that achieves the new state of the art on the Visual Decathlon Challenge.

Adding new tasks to a single network with weight transformations using binary masks / Mancini, Massimiliano; Ricci, Elisa; Caputo, Barbara; Rota Bulò, Samuel. - 11130:(2019), pp. 180-189. ( 15th European Conference on Computer Vision, ECCV 2018 Munich; Germany ) [10.1007/978-3-030-11012-3_14].

Adding new tasks to a single network with weight transformations using binary masks

Massimiliano Mancini;Elisa Ricci;Barbara Caputo;Samuel Rota Bulò

2019

Abstract

Visual recognition algorithms are required today to exhibit adaptive abilities. Given a deep model trained on a specific, given task, it would be highly desirable to be able to adapt incrementally to new tasks, preserving scalability as the number of new tasks increases, while at the same time avoiding catastrophic forgetting issues. Recent work has shown that masking the internal weights of a given original conv-net through learned binary variables is a promising strategy. We build upon this intuition and take into account more elaborated affine transformations of the convolutional weights that include learned binary masks. We show that with our generalization it is possible to achieve significantly higher levels of adaptation to new tasks, enabling the approach to compete with fine tuning strategies by requiring slightly more than 1 bit per network parameter per additional task. Experiments on two popular benchmarks showcase the power of our approach, that achieves the new state of the art on the Visual Decathlon Challenge.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2019
			
	Nome convegno
	
				15th European Conference on Computer Vision, ECCV 2018
			
	Parole chiave
	
				Incremental learning; Multi-task learning; Computer Vision
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				Adding new tasks to a single network with weight transformations using binary masks / Mancini, Massimiliano; Ricci, Elisa; Caputo, Barbara; Rota Bulò, Samuel. - 11130:(2019), pp. 180-189. ( 15th European Conference on Computer Vision, ECCV 2018 Munich; Germany ) [10.1007/978-3-030-11012-3_14].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Mancini_Postprint_Adding_2019.pdf solo gestori archivio Note: https://link.springer.com/chapter/10.1007/978-3-030-11012-3_14 Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 634.62 kB Formato Adobe PDF Contatta l'autore	634.62 kB	Adobe PDF	Contatta l'autore
Mancini_Adding_2019.pdf solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 2.78 MB Formato Adobe PDF Contatta l'autore	2.78 MB	Adobe PDF	Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1331876

Citazioni

ND

10

11

social impact