Modern neural networks models for computer vision are trained on millions of images. The idea is that models are able to increase generalization when the dataset contains well diversified images, e.g. with varied illumination and environmental conditions of the same objects. Generalization is particularly relevant in object detection, especially for what concerns the cross-depiction problem. In this work we explore the use of Neural Style Transfer as a novel technique to morph the original data, with the aim to enhance model generalization. To verify the effect on performances for object detection models, we selected the Faster R-CNN model to be applied on the Pascal VOC 2012 dataset. A number of tests were performed through style variations on images and by tuning Neural Style Transfer parameters to maintain the content of the original images. The experiments showed promising results, which effectively provide a foundation for future studies on cross-depiction via Neural Style Transfer.

Enhancing Object Detection Robustness for Cross-Depiction Through Neural Style Transfer / Fiani, F.; Puglisi, A.; Napoli, C.. - 3684:(2023), pp. 15-20. (Intervento presentato al convegno 8th International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2023 tenutosi a Napoli; Italia).

Enhancing Object Detection Robustness for Cross-Depiction Through Neural Style Transfer

Fiani F.
Co-primo
Investigation
;
Puglisi A.
Co-primo
Investigation
;
Napoli C.
Supervision
2023

Abstract

Modern neural networks models for computer vision are trained on millions of images. The idea is that models are able to increase generalization when the dataset contains well diversified images, e.g. with varied illumination and environmental conditions of the same objects. Generalization is particularly relevant in object detection, especially for what concerns the cross-depiction problem. In this work we explore the use of Neural Style Transfer as a novel technique to morph the original data, with the aim to enhance model generalization. To verify the effect on performances for object detection models, we selected the Faster R-CNN model to be applied on the Pascal VOC 2012 dataset. A number of tests were performed through style variations on images and by tuning Neural Style Transfer parameters to maintain the content of the original images. The experiments showed promising results, which effectively provide a foundation for future studies on cross-depiction via Neural Style Transfer.
2023
8th International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2023
Faster R-CNN; Neural Style Transfer; Object detection; Pascal VOC 2012
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Enhancing Object Detection Robustness for Cross-Depiction Through Neural Style Transfer / Fiani, F.; Puglisi, A.; Napoli, C.. - 3684:(2023), pp. 15-20. (Intervento presentato al convegno 8th International Conference of Yearly Reports on Informatics, Mathematics, and Engineering, ICYRIME 2023 tenutosi a Napoli; Italia).
File allegati a questo prodotto
File Dimensione Formato  
Fiani_Enhancing_2023.pdf

accesso aperto

Note: https://ceur-ws.org/Vol-3684/p03.pdf
Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Creative commons
Dimensione 4.17 MB
Formato Adobe PDF
4.17 MB Adobe PDF

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1714649
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact