Generating images from brain waves is gaining increasing attention due to its potential to advance brain-computer interface (BCI) systems by understanding how brain signals encode visual cues. Most of the literature has focused on fMRI-to-Image tasks as fMRI is characterized by high spatial resolution. However, fMRI is an expensive neuroimaging modality and does not allow for real-time BCI. On the other hand, electroencephalography (EEG) is a low-cost, non-invasive, and portable neuroimaging technique, making it an attractive option for future real-time applications. Nevertheless, EEG presents inherent challenges due to its low spatial resolution and susceptibility to noise and artifacts, which makes generating images from EEG more difficult. In this paper, we address these problems with a streamlined framework based on the ControlNet adapter for conditioning a latent diffusion model (LDM) through EEG signals. We conduct experiments and ablation studies on popular benchmarks to demonstrate that the proposed method beats other state-of-the-art models. Unlike these methods, which often require extensive preprocessing, pretraining, different losses, and captioning models, our approach is efficient and straightforward, requiring only minimal preprocessing and a few components. The code is available at https://github.com/LuigiSigillo/GWIT.

Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models / Lopez, Eleonora; Sigillo, Luigi; Colonnese, Federica; Panella, Massimo; Comminiello, Danilo. - (2025), pp. 1-5. ( 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) Hyderabad; India ) [10.1109/icassp49660.2025.10890059].

Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models

Lopez, Eleonora;Sigillo, Luigi;Colonnese, Federica;Panella, Massimo;Comminiello, Danilo
2025

Abstract

Generating images from brain waves is gaining increasing attention due to its potential to advance brain-computer interface (BCI) systems by understanding how brain signals encode visual cues. Most of the literature has focused on fMRI-to-Image tasks as fMRI is characterized by high spatial resolution. However, fMRI is an expensive neuroimaging modality and does not allow for real-time BCI. On the other hand, electroencephalography (EEG) is a low-cost, non-invasive, and portable neuroimaging technique, making it an attractive option for future real-time applications. Nevertheless, EEG presents inherent challenges due to its low spatial resolution and susceptibility to noise and artifacts, which makes generating images from EEG more difficult. In this paper, we address these problems with a streamlined framework based on the ControlNet adapter for conditioning a latent diffusion model (LDM) through EEG signals. We conduct experiments and ablation studies on popular benchmarks to demonstrate that the proposed method beats other state-of-the-art models. Unlike these methods, which often require extensive preprocessing, pretraining, different losses, and captioning models, our approach is efficient and straightforward, requiring only minimal preprocessing and a few components. The code is available at https://github.com/LuigiSigillo/GWIT.
2025
2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025)
Diffusion Models; EEG; Image Generation
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models / Lopez, Eleonora; Sigillo, Luigi; Colonnese, Federica; Panella, Massimo; Comminiello, Danilo. - (2025), pp. 1-5. ( 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025) Hyderabad; India ) [10.1109/icassp49660.2025.10890059].
File allegati a questo prodotto
File Dimensione Formato  
Lopez_Guess What I Think_2025.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 486.77 kB
Formato Adobe PDF
486.77 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1742870
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact