Modern neural network models for computer vision are trained on millions of images. The underlying idea is that models generalize better when the dataset contains well-diversified images, e.g. the same objects under varied illumination and environmental conditions. Generalization is particularly relevant in object detection, especially with regard to the cross-depiction problem. In this work we explore the use of Neural Style Transfer as a novel technique to morph the original data, with the aim of enhancing model generalization. To verify the effect on the performance of object detection models, we applied the Faster R-CNN model to the Pascal VOC 2012 dataset. A number of tests were performed by varying the style applied to the images and by tuning Neural Style Transfer parameters to preserve the content of the original images. The experiments showed promising results, which provide a foundation for future studies on cross-depiction via Neural Style Transfer.
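
The abstract does not say which Neural Style Transfer formulation or parameter values were used. As a rough illustration only, the sketch below assumes the widely used formulation in which the stylized image minimizes a weighted sum of a content term and a Gram-matrix style term. The function names and the `content_weight` and `style_weight` parameters are hypothetical, and small nested lists stand in for the feature maps that a pretrained network would normally provide.

```python
from typing import List

# One row per channel, one column per spatial position (toy stand-in for CNN features).
Features = List[List[float]]


def gram_matrix(features: Features) -> List[List[float]]:
    """Channel-by-channel inner products, averaged over spatial positions."""
    n_positions = len(features[0])
    return [
        [sum(a * b for a, b in zip(row_i, row_j)) / n_positions for row_j in features]
        for row_i in features
    ]


def mean_squared_error(x: Features, y: Features) -> float:
    """Mean squared difference between two equally shaped nested lists."""
    diffs = [(a - b) ** 2 for row_x, row_y in zip(x, y) for a, b in zip(row_x, row_y)]
    return sum(diffs) / len(diffs)


def nst_loss(
    generated: Features,
    content: Features,
    style: Features,
    content_weight: float = 1.0,  # hypothetical parameter name
    style_weight: float = 1.0,    # hypothetical parameter name
) -> float:
    """Weighted sum of a content term and a Gram-matrix style term."""
    content_term = mean_squared_error(generated, content)
    style_term = mean_squared_error(gram_matrix(generated), gram_matrix(style))
    return content_weight * content_term + style_weight * style_term


if __name__ == "__main__":
    content = [[1.0, 2.0], [3.0, 4.0]]
    style = [[0.0, 1.0], [1.0, 0.0]]
    # Starting from the content features, only the style term contributes.
    print(nst_loss(content, content, style, content_weight=1.0, style_weight=1.0))
```

Under this assumption, raising `content_weight` relative to `style_weight` penalizes departures from the original features more strongly, which is one way to read "tuning parameters to preserve the content of the original images".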

Enhancing Object Detection Robustness for Cross-Depiction Through Neural Style Transfer / Fiani, F.; Puglisi, A.; Napoli, C. - 3684:(2023), pp. 15-20. (Paper presented at the 8th International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2023, held in ita).

Enhancing Object Detection Robustness for Cross-Depiction Through Neural Style Transfer

Fiani F. (co-first author; Investigation);
Puglisi A. (co-first author; Investigation);
Napoli C. (Supervision)
2023

8th International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2023
Faster R-CNN; Neural Style Transfer; Object detection; Pascal VOC 2012
04 Publication in conference proceedings::04b Conference paper in volume

Use this identifier to cite or link to this document: https://hdl.handle.net/11573/1714649