Quantum diffusion models are rapidly advancing generative remote-sensing analytics by harnessing quantum computation to deliver higher-quality, more realistic satellite images, while converging faster and with substantially fewer trainable parameters than comparable classical diffusers. In this context, we first realise a fully-quantum latent diffusion model (QLDM) whose denoiser is a variational-quantum circuit acting on a 10-dimensionaI latent code. On EuroSAT, focusing on three land-cover classes Forest, Herbaceous Vegetation and SeaLake, QLDM lowers the Fréchet Inception Distance (FID) and Kernel Inception Distance (KID) by 21.5 % and 29.9 %, respectively, relative to a parameter-matched classical latent diffuser. Yet the severe spatial compression intrinsic to latent processing limits fine-grained detail. To restore image fidelity, we develop a hybrid quantum—classical architecture—the Quanvolutional Conditioned U-Net (QCU-Net)—which inserts entangling quantum layers both at the U-Net bottleneck and as a quanvolutional filter early in the encoder. Trained on the full ten-class EuroSAT RGB set, QCU-Net reaches an FID of 2.57 and a KID of 0.0008, representing 64 % and 76 % improvements over the best classical diffusion baseline (FID = 7.22; KID = 0.0034), and boosts class-conditioning accuracy to 81.7 % versus 62.2 % for the classical model. Thèse gains confirm that embedding quantum circuits within the feature-extraction pipeline yields richer spatial— spectral representations than latent-space quantum processing alone. Our results outline a clear progression in quantum generative modelling for Earth Observation: by first demonstrating with the QLDM that fully quantum denoising is not only feasible but also highly parameter- efficient, then showing how the hybrid QCU-Net architecture restores perceptual fidelity by embedding quantum feature extractors within a classical U-Net backbone, and finally pointing toward a fully pixel-space quantum diffusion model as future work—one that would leverage deep quantum circuits to perform both the forward and reverse diffusion directly in Hilbert space, remove classical auto-encoding steps, and promise even greater image quality and efficiency on fault-tolerant hardware.
Quantum Diffusion Models in Earth Observation / Mauro, F.; De Falco, F.; Papa, L.; Ceschini, A.; Sebastianelli, A.; Meoni, G.; Panella, M.; Gamba, P.; Ullo, S. L.. - (2025), pp. 1-1. (Intervento presentato al convegno Quantum Technology Conference (QTC 2025) tenutosi a Creta, Grecia).
Quantum Diffusion Models in Earth Observation
F. De Falco;A. Ceschini;M. Panella;
2025
Abstract
Quantum diffusion models are rapidly advancing generative remote-sensing analytics by harnessing quantum computation to deliver higher-quality, more realistic satellite images, while converging faster and with substantially fewer trainable parameters than comparable classical diffusers. In this context, we first realise a fully-quantum latent diffusion model (QLDM) whose denoiser is a variational-quantum circuit acting on a 10-dimensionaI latent code. On EuroSAT, focusing on three land-cover classes Forest, Herbaceous Vegetation and SeaLake, QLDM lowers the Fréchet Inception Distance (FID) and Kernel Inception Distance (KID) by 21.5 % and 29.9 %, respectively, relative to a parameter-matched classical latent diffuser. Yet the severe spatial compression intrinsic to latent processing limits fine-grained detail. To restore image fidelity, we develop a hybrid quantum—classical architecture—the Quanvolutional Conditioned U-Net (QCU-Net)—which inserts entangling quantum layers both at the U-Net bottleneck and as a quanvolutional filter early in the encoder. Trained on the full ten-class EuroSAT RGB set, QCU-Net reaches an FID of 2.57 and a KID of 0.0008, representing 64 % and 76 % improvements over the best classical diffusion baseline (FID = 7.22; KID = 0.0034), and boosts class-conditioning accuracy to 81.7 % versus 62.2 % for the classical model. Thèse gains confirm that embedding quantum circuits within the feature-extraction pipeline yields richer spatial— spectral representations than latent-space quantum processing alone. Our results outline a clear progression in quantum generative modelling for Earth Observation: by first demonstrating with the QLDM that fully quantum denoising is not only feasible but also highly parameter- efficient, then showing how the hybrid QCU-Net architecture restores perceptual fidelity by embedding quantum feature extractors within a classical U-Net backbone, and finally pointing toward a fully pixel-space quantum diffusion model as future work—one that would leverage deep quantum circuits to perform both the forward and reverse diffusion directly in Hilbert space, remove classical auto-encoding steps, and promise even greater image quality and efficiency on fault-tolerant hardware.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


