A large-scale database of images and captions for automatic face naming

Özcan, Mert; Jie, Luo; Ferrari, Vittorio; Caputo, Barbara

doi:10.5244/C25.29

We present a large scale database of images and captions, designed for supporting research on how to use captioned images from the Web for training visual classifiers. It consists of more than 125,000 images of celebrities from different fields downloaded from the Web. Each image is associated to its original text caption, extracted from the html page the image comes from. We coin it FAN-Large, for Face And Names Large scale database. Its size and deliberate high level of noise makes it to our knowledge the largest and most realistic database supporting this type of research. The dataset and its annotations are publicly available and can be obtained from http://www.vision. ee.ethz.ch/∼calvin/fanlarge/. We report results on a thorough assessment of FAN-Large using several existing approaches for name-face association, and present and evaluate new contextual features derived from the caption. Our findings provide important cues on the strengths and limitations of existing approaches. © 2011. The copyright of this document resides with its authors.

A large-scale database of images and captions for automatic face naming / Özcan, M., Jie, L., Ferrari, V., Caputo, B.. - STAMPA. - (2011). (2011 22nd British Machine Vision Conference, BMVC 2011 Dundee; UK 29 August- 02 September 2011) [10.5244/C25.29].

A large-scale database of images and captions for automatic face naming

Özcan, Mert;Jie, Luo;Ferrari, Vittorio;CAPUTO, BARBARA

2011

Abstract

We present a large scale database of images and captions, designed for supporting research on how to use captioned images from the Web for training visual classifiers. It consists of more than 125,000 images of celebrities from different fields downloaded from the Web. Each image is associated to its original text caption, extracted from the html page the image comes from. We coin it FAN-Large, for Face And Names Large scale database. Its size and deliberate high level of noise makes it to our knowledge the largest and most realistic database supporting this type of research. The dataset and its annotations are publicly available and can be obtained from http://www.vision. ee.ethz.ch/∼calvin/fanlarge/. We report results on a thorough assessment of FAN-Large using several existing approaches for name-face association, and present and evaluate new contextual features derived from the caption. Our findings provide important cues on the strengths and limitations of existing approaches. © 2011. The copyright of this document resides with its authors.

Scheda breve

Scheda completa

	Anno di pubblicazione
	
				2011
			
	Nome convegno
	
				2011 22nd British Machine Vision Conference, BMVC 2011
			
	Parole chiave
	
				1707
			
	Tipologia
	
				04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
			
	Citazione
	
				A large-scale database of images and captions for automatic face naming / Özcan, M., Jie, L., Ferrari, V., Caputo, B.. - STAMPA. - (2011). (2011 22nd British Machine Vision Conference, BMVC 2011 Dundee; UK 29 August- 02 September 2011) [10.5244/C25.29].
			
	Appartiene alla tipologia:
	
				04b Atto di convegno in volume

File allegati a questo prodotto

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/951697

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

15

10

Catalogo dei prodotti della ricerca