In this paper, we investigate the exploitation of the latent space of a MelGAN architecture by the vector arithmetic for the generation of new sounds that may be appealing for musicians, similar to what has already been done in the case of words and images. Specifically, since the MelGAN uses directly the spectrogram as input to its generator, we focus our attention on the linear combination of two or three instrumental sounds. This combination is then fed to the MelGAN generator, and the produced output will be the new sound with innovative sonority. Some simulations, performed over different sounds and different combination coefficients, show the effectiveness of the proposed idea.

Generating New Sounds by Vector Arithmetic in the Latent Space of the MelGAN Architecture / Scarpiniti, M.; Massaro, E.; Comminiello, D.; Uncini, A.. - (2023), pp. 3-15. - SMART INNOVATION, SYSTEMS AND TECHNOLOGIES. [10.1007/978-981-99-3592-5_1].

Generating New Sounds by Vector Arithmetic in the Latent Space of the MelGAN Architecture

Scarpiniti M.;Massaro E.;Comminiello D.;Uncini A.
2023

Abstract

In this paper, we investigate the exploitation of the latent space of a MelGAN architecture by the vector arithmetic for the generation of new sounds that may be appealing for musicians, similar to what has already been done in the case of words and images. Specifically, since the MelGAN uses directly the spectrogram as input to its generator, we focus our attention on the linear combination of two or three instrumental sounds. This combination is then fed to the MelGAN generator, and the produced output will be the new sound with innovative sonority. Some simulations, performed over different sounds and different combination coefficients, show the effectiveness of the proposed idea.
2023
Smart Innovation, Systems and Technologies
978-981-99-3591-8
978-981-99-3592-5
audio generation; generative adversarial networks; latent space; MelGAN; vector arithmetic
02 Pubblicazione su volume::02a Capitolo o Articolo
Generating New Sounds by Vector Arithmetic in the Latent Space of the MelGAN Architecture / Scarpiniti, M.; Massaro, E.; Comminiello, D.; Uncini, A.. - (2023), pp. 3-15. - SMART INNOVATION, SYSTEMS AND TECHNOLOGIES. [10.1007/978-981-99-3592-5_1].
File allegati a questo prodotto
File Dimensione Formato  
Scarpinti_postprint_Generating_2023.pdf

embargo fino al 01/09/2025

Note: postprint
Tipologia: Documento in Post-print (versione successiva alla peer review e accettata per la pubblicazione)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 722.66 kB
Formato Adobe PDF
722.66 kB Adobe PDF   Contatta l'autore
Scarpinti_Generating_2023.pdf

solo gestori archivio

Tipologia: Versione editoriale (versione pubblicata con il layout dell'editore)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 430.71 kB
Formato Adobe PDF
430.71 kB Adobe PDF   Contatta l'autore

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1693489
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact