Catalogo dei prodotti della ricerca

Faster R-CNN has established itself as the de-facto best object detector but it remains strongly limited in two aspects: (i) it is sensitive to background clutter and its classification performance decreases when it is confronted with more noisy proposals; (ii) it suffers when the objects vary largely in scale and specifically for the small objects. We address both issues with our geometric-proposals for Faster R-CNN (GP-FRCNN), whereby we re-rank the generic object proposals with an approximate geometric estimate of the scene. But the devil is in the details: the simple extension requires involved scale adjustments (e.g. anchors, layer resolution) which we detail in this paper. Finally, our GP-FRCNN performs equally well on smaller and larger objects, a long standing challenge for any object detection algorithm. The application of GP-FRCNN to surveillance videos is straightforward and does not require an explicit geometric formulation. We extensively test the model on the UA-DETRAC dataset, where GP-FRCNN outperforms the standard Faster R-CNN by 19%.

Geometric proposals for faster R-CNN / Amin, S; Galasso, F. - (2017). (Intervento presentato al convegno IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2017 tenutosi a Lecce; Italy) [10.1109/AVSS.2017.8078518].

Geometric proposals for faster R-CNN

Amin S^Primo;Galasso F^Ultimo

2017

Abstract

Faster R-CNN has established itself as the de-facto best object detector but it remains strongly limited in two aspects: (i) it is sensitive to background clutter and its classification performance decreases when it is confronted with more noisy proposals; (ii) it suffers when the objects vary largely in scale and specifically for the small objects. We address both issues with our geometric-proposals for Faster R-CNN (GP-FRCNN), whereby we re-rank the generic object proposals with an approximate geometric estimate of the scene. But the devil is in the details: the simple extension requires involved scale adjustments (e.g. anchors, layer resolution) which we detail in this paper. Finally, our GP-FRCNN performs equally well on smaller and larger objects, a long standing challenge for any object detection algorithm. The application of GP-FRCNN to surveillance videos is straightforward and does not require an explicit geometric formulation. We extensively test the model on the UA-DETRAC dataset, where GP-FRCNN outperforms the standard Faster R-CNN by 19%.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2017
			
	Nome convegno
	
				IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2017
			
	Parole chiave
	
				computer vision; detection; recognition
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				Geometric proposals for faster R-CNN / Amin, S; Galasso, F. - (2017). (Intervento presentato al  convegno IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2017 tenutosi a Lecce; Italy) [10.1109/AVSS.2017.8078518].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

File	Dimensione	Formato
Amin_Geometric_2017.pdf solo gestori archivio Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 645.28 kB Formato Adobe PDF Contatta l'autore	645.28 kB	Adobe PDF	Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1317752

Citazioni

ND

27

6

social impact