
Low-Level feature detectors and descriptors for smart image and video analysis: a comparative study / Avola, D.; Cinque, L.; Foresti, G. L.; Martinel, N.; Pannone, D.; Piciarelli, C.. - (2018), pp. 7-29. [10.1007/978-3-319-73891-8_2].

Low-Level feature detectors and descriptors for smart image and video analysis: a comparative study

Avola D. (first author); Cinque L.; Pannone D.
2018

Abstract

Local feature detectors and descriptors (hereinafter extractors) play a key role in modern computer vision. Their purpose is to extract, from any image, a set of discriminative patterns (hereinafter keypoints) located on background and/or foreground elements of the image itself. A prerequisite of a wide range of practical applications (e.g., vehicle tracking, person re-identification) is the design and development of algorithms able to detect, recognize, and track the same keypoints within a video sequence. Smart cameras can acquire images and videos of a scenario of interest according to different intrinsic (e.g., focus, iris) and extrinsic (e.g., pan, tilt, zoom) parameters. These parameters can make the recognition of the same keypoint across consecutive images a hard task when critical factors such as scale, rotation, and translation come into play. The aim of this chapter is to provide a comparative study of the most popular and widely used low-level local feature extractors: SIFT, SURF, ORB, PHOG, WGCH, Haralick, and A-KAZE. The chapter starts by providing an overview of the different extractors, referenced in a concrete case study to show their potential and usage. Afterwards, the extractors are compared on the Freiburg-Berkeley Motion Segmentation (FBMS-59) dataset, a well-known video collection widely used by the computer vision community. Starting from a default setting of each extractor, the comparison discusses their behavior and robustness in terms of invariance with respect to the most important critical factors. The chapter also reports comparative considerations about one of the basic steps built on feature extraction: the matching process. Finally, it points out key considerations about the use of the discussed extractors in real application domains.
2018
Bridging the Semantic Gap in Image and Video Analysis
978-3-319-73891-8
features extraction and description; image analysis; video analysis
02 Publication in a volume::02a Chapter, Article or Contribution
Files attached to this product:

File: Avola_ Low-Level_2018.pdf (restricted: archive managers only)
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 1.49 MB, Adobe PDF (contact the author)

File: 2018_Bookmatter_BridgingTheSemanticGapInImageA.pdf (restricted: archive managers only)
Type: Publisher's version (published version with the publisher's layout)
License: All rights reserved
Size: 187.39 kB, Adobe PDF (contact the author)

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1075956
Citations:
  • Scopus: 12