The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.
Recognition of Musical Instruments by Generalized Min-Max Classifiers / Costantini, Giovanni; Rizzi, Antonello; Casali, Daniele. - STAMPA. - (2003), pp. 555-564. (Intervento presentato al convegno Workshop on Neural Networks for Signal Processing (NNSP 2003) tenutosi a Toulouse, Francia nel 17-19 Settembre) [10.1109/NNSP.2003.1318055].
Recognition of Musical Instruments by Generalized Min-Max Classifiers
RIZZI, Antonello;
2003
Abstract
The correct classification of single musical sources is a relevant aspect for the source separation task and the automatic transcription of polyphonic music. In this paper, we deal with a classification problem concerning the recognition of six different musical instruments: violin, clarinet, flute, oboe, saxophone and piano. A satisfactory solution of such a recognition problem depends mainly on both the preprocessing procedure (set of features extracted from row data) and the adopted classification system. As concerns feature extraction, a suitable signal preprocessing based on FFT, QFT (Q-constant frequency transform) and cepstrum coefficients are employed. We adopt min-max neurofuzzy networks as the classification model, both in their classical and generalized version. The synthesis of these classifiers is performed by the adaptive resolution training technique (ARC, PARC and GPARC algorithms), since it assures good performances and an excellent automation degree.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.