In this paper we describe an automatic voice-to-MIDI transcription procedure. In particular we propose a note segmentation method based on the analysis of the signal envelope and its derivative. The pitch of the segmented note is extracted with a novel generalized correlation function called Correntropy function achieving high accuracy with the same computational cost of traditional correlation based methods. The performances of our transcription system have been measured on examples extracted by some repositories available on internet; they consist in sung melodies and hummed queries. Results show the ability of our transcription procedure to cope with query-by-humming systems, as well as with monophonic singing transcription. Performances can be easily evaluated by downloading from the authors web site the original PCM files and the corresponding MIDI files produced by the proposed transcription algorithm.

A Correntropy-Based Voice to MIDI Transcription Algorithm / Antonelli, Mario; Rizzi, Antonello. - STAMPA. - (2008), pp. 978-983. ((Intervento presentato al convegno International Workshop on Multimedia Signal Processing (MMSP 2008) tenutosi a Cairns; Australia nel 8-10 Ottobre 2008 [10.1109/MMSP.2008.4665216].

A Correntropy-Based Voice to MIDI Transcription Algorithm

ANTONELLI, MARIO;RIZZI, Antonello
2008

Abstract

In this paper we describe an automatic voice-to-MIDI transcription procedure. In particular we propose a note segmentation method based on the analysis of the signal envelope and its derivative. The pitch of the segmented note is extracted with a novel generalized correlation function called Correntropy function achieving high accuracy with the same computational cost of traditional correlation based methods. The performances of our transcription system have been measured on examples extracted by some repositories available on internet; they consist in sung melodies and hummed queries. Results show the ability of our transcription procedure to cope with query-by-humming systems, as well as with monophonic singing transcription. Performances can be easily evaluated by downloading from the authors web site the original PCM files and the corresponding MIDI files produced by the proposed transcription algorithm.
International Workshop on Multimedia Signal Processing (MMSP 2008)
Computational costs; Correntropy; Generalized correlations
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
A Correntropy-Based Voice to MIDI Transcription Algorithm / Antonelli, Mario; Rizzi, Antonello. - STAMPA. - (2008), pp. 978-983. ((Intervento presentato al convegno International Workshop on Multimedia Signal Processing (MMSP 2008) tenutosi a Cairns; Australia nel 8-10 Ottobre 2008 [10.1109/MMSP.2008.4665216].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/62767
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 0
social impact