Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach

<p/> <p>We focus the attention on the problem of audio classification in speech and music for multimedia applications. In particular, we present a comparison between two different techniques for speech/music discrimination. The first method is based on Zero crossing rate and Bayesian cla...

Full description

Bibliographic Details
Main Authors:	Bugatti Alessandro, Flammini Alessandra, Migliorati Pierangelo
Format:	Article
Language:	English
Published:	SpringerOpen 2002-01-01
Series:	EURASIP Journal on Advances in Signal Processing
Subjects:	speech/music discrimination indexing of audio-visual documents neural networks multimedia applications
Online Access:	http://dx.doi.org/10.1155/S1110865702000720

Description
Summary:	<p/> <p>We focus the attention on the problem of audio classification in speech and music for multimedia applications. In particular, we present a comparison between two different techniques for speech/music discrimination. The first method is based on Zero crossing rate and Bayesian classification. It is very simple from a computational point of view, and gives good results in case of pure music or speech. The simulation results show that some performance degradation arises when the music segment contains also some speech superimposed on music, or strong rhythmic components. To overcome these problems, we propose a second method, that uses more features, and is based on neural networks (specifically a multi-layer Perceptron). In this case we obtain better performance, at the expense of a limited growth in the computational complexity. In practice, the proposed neural network is simple to be implemented if a suitable polynomial is used as the activation function, and a real-time implementation is possible even if low-cost embedded systems are used.</p>
ISSN:	1687-6172 1687-6180

Audio Classification in Speech and Music: A Comparison between a Statistical and a Neural Approach

Similar Items