Joint Detection and Tracking in videos with Identification Features

Munjal, Bharti; Abdul Rafey Aftab,; Amin, Sikandar; Brandlmaier, Meltem D.; Tombari, Federico; Galasso, Fabio

doi:10.1016/j.imavis.2020.103932

Recent works have shown that combining object detection and tracking tasks, in the case of video data, results in higher performance for both tasks, but they require a high frame-rate as a strict requirement for performance. This assumption is often violated in real-world applications, when models run on embedded devices, often at only a few frames per second. Videos at low frame-rate suffer from large object displacements. Here re-identification features may support to match large-displaced object detections, but current joint detection and re-identification formulations degrade the detector performance, as these two are contrasting tasks. In the real-world application having separate detector and re-id models is often not feasible, as both the memory and runtime effectively double. Towards robust long-term tracking applicable to reduced-computational-power devices, we propose the first joint optimization of detection, tracking and re-identification features for videos. Notably, our joint optimization maintains the detector performance, a typical multi-task challenge. At inference time, we leverage detections for tracking (tracking-by-detection) when the objects are visible, detectable and slowly moving in the image.Weleverage instead re-identification features to match objects which disappeared (e.g. due to occlusion) for several frames or were not tracked due to fast motion (or low-frame-rate videos). Our proposed method reaches the state-of-the-art on MOT, it ranks 1st in the UA-DETRAC’18 tracking challenge among online trackers, and 3rd overall.

Joint Detection and Tracking in videos with Identification Features / Munjal, Bharti; Rafey Aftab, Abdul; Amin, Sikandar; Brandlmaier, Meltem D.; Tombari, Federico; Galasso, Fabio. - In: IMAGE AND VISION COMPUTING. - ISSN 0262-8856. - 100:0262-8856(2020). [10.1016/j.imavis.2020.103932]

Joint Detection and Tracking in videos with Identification Features

Abdul Rafey Aftab;Sikandar Amin;Meltem D. Brandlmaier;Federico Tombari;Fabio Galasso^Ultimo

2020

Abstract

Recent works have shown that combining object detection and tracking tasks, in the case of video data, results in higher performance for both tasks, but they require a high frame-rate as a strict requirement for performance. This assumption is often violated in real-world applications, when models run on embedded devices, often at only a few frames per second. Videos at low frame-rate suffer from large object displacements. Here re-identification features may support to match large-displaced object detections, but current joint detection and re-identification formulations degrade the detector performance, as these two are contrasting tasks. In the real-world application having separate detector and re-id models is often not feasible, as both the memory and runtime effectively double. Towards robust long-term tracking applicable to reduced-computational-power devices, we propose the first joint optimization of detection, tracking and re-identification features for videos. Notably, our joint optimization maintains the detector performance, a typical multi-task challenge. At inference time, we leverage detections for tracking (tracking-by-detection) when the objects are visible, detectable and slowly moving in the image.Weleverage instead re-identification features to match objects which disappeared (e.g. due to occlusion) for several frames or were not tracked due to fast motion (or low-frame-rate videos). Our proposed method reaches the state-of-the-art on MOT, it ranks 1st in the UA-DETRAC’18 tracking challenge among online trackers, and 3rd overall.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2020
			
	Parole chiave
	
				Computer vision; Machine Learning; Detection; Recognition; Re-identification; Tracking
			
	Tipologia
	
				01 Pubblicazione su rivista::01a Articolo in rivista
			
	Citazione
	
				Joint Detection and Tracking in videos with Identification Features / Munjal, Bharti; Rafey Aftab, Abdul; Amin, Sikandar; Brandlmaier, Meltem D.; Tombari, Federico; Galasso, Fabio. - In: IMAGE AND VISION COMPUTING. - ISSN 0262-8856. - 100:0262-8856(2020). [10.1016/j.imavis.2020.103932]
			
	Appartiene alla tipologia:
	
				01a Articolo in rivista

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1407853

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

16

13

Catalogo dei prodotti della ricerca