In this paper, a subbands multirate architecture is presented for audio signal recovery. Audio signal recovery is a common problem in digital music signal restoration field, because of corrupted samples that must be replaced. The subband approach allows for the reconstruction of a long audio data sequence from forward-backward predicted samples. In order to improve prediction performances, neural networks with spline flexible activation function are used as narrow subband nonlinear forward-backward predictors. Previous neural-networks approaches involved a long training process. Due to the small networks needed for each subband and to the spline adaptive activation functions that speed-up the convergence time and improve the generalization performances, the proposed signal recovery scheme works in online (or in continuous learning) mode as a simple nonlinear adaptive filter. Experimental results show the mean square reconstruction error and maximum error obtained with increasing gap length, from 200 to 5000 samples for different musical genres. A subjective performances analysis is also reported. The method gives good results for the reconstruction of over 100 ms of audio signal with low audible effects in overall quality and outperforms the previous approaches.
Subbands Neural Networks Prediction for On-line Audio Signal Recovery / Cocchi, A; Uncini, Aurelio. - In: IEEE TRANSACTIONS ON NEURAL NETWORKS. - ISSN 1045-9227. - STAMPA. - 13 N.4:(2002), pp. 867-876. [10.1109/TNN.2002.1021887]
Subbands Neural Networks Prediction for On-line Audio Signal Recovery
UNCINI, Aurelio
2002
Abstract
In this paper, a subbands multirate architecture is presented for audio signal recovery. Audio signal recovery is a common problem in digital music signal restoration field, because of corrupted samples that must be replaced. The subband approach allows for the reconstruction of a long audio data sequence from forward-backward predicted samples. In order to improve prediction performances, neural networks with spline flexible activation function are used as narrow subband nonlinear forward-backward predictors. Previous neural-networks approaches involved a long training process. Due to the small networks needed for each subband and to the spline adaptive activation functions that speed-up the convergence time and improve the generalization performances, the proposed signal recovery scheme works in online (or in continuous learning) mode as a simple nonlinear adaptive filter. Experimental results show the mean square reconstruction error and maximum error obtained with increasing gap length, from 200 to 5000 samples for different musical genres. A subjective performances analysis is also reported. The method gives good results for the reconstruction of over 100 ms of audio signal with low audible effects in overall quality and outperforms the previous approaches.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.