Audio Query by Example Using Similarity Measures between Probability Density Functions of Features

This paper proposes a query by example system for generic audio. We estimate the similarity of the example signal and the samples in the queried database by calculating the distance between the probability density functions (pdfs) of their frame-wise acoustic features. Since the features are continu...

Full description

Bibliographic Details
Main Authors: Marko Helén, Tuomas Virtanen
Format: Article
Language:English
Published: SpringerOpen 2010-01-01
Series:EURASIP Journal on Audio, Speech, and Music Processing
Online Access:http://dx.doi.org/10.1155/2010/179303
id doaj-cf76acea4e52409598a094dcbc8c21c7
record_format Article
spelling doaj-cf76acea4e52409598a094dcbc8c21c72020-11-25T01:48:42ZengSpringerOpenEURASIP Journal on Audio, Speech, and Music Processing1687-47141687-47222010-01-01201010.1155/2010/179303Audio Query by Example Using Similarity Measures between Probability Density Functions of FeaturesMarko HelénTuomas VirtanenThis paper proposes a query by example system for generic audio. We estimate the similarity of the example signal and the samples in the queried database by calculating the distance between the probability density functions (pdfs) of their frame-wise acoustic features. Since the features are continuous valued, we propose to model them using Gaussian mixture models (GMMs) or hidden Markov models (HMMs). The models parametrize each sample efficiently and retain sufficient information for similarity measurement. To measure the distance between the models, we apply a novel Euclidean distance, approximations of Kullback-Leibler divergence, and a cross-likelihood ratio test. The performance of the measures was tested in simulations where audio samples are automatically retrieved from a general audio database, based on the estimated similarity to a user-provided example. The simulations show that the distance between probability density functions is an accurate measure for similarity. Measures based on GMMs or HMMs are shown to produce better results than that of the existing methods based on simpler statistics or histograms of the features. A good performance with low computational cost is obtained with the proposed Euclidean distance. http://dx.doi.org/10.1155/2010/179303
collection DOAJ
language English
format Article
sources DOAJ
author Marko Helén
Tuomas Virtanen
spellingShingle Marko Helén
Tuomas Virtanen
Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
EURASIP Journal on Audio, Speech, and Music Processing
author_facet Marko Helén
Tuomas Virtanen
author_sort Marko Helén
title Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
title_short Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
title_full Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
title_fullStr Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
title_full_unstemmed Audio Query by Example Using Similarity Measures between Probability Density Functions of Features
title_sort audio query by example using similarity measures between probability density functions of features
publisher SpringerOpen
series EURASIP Journal on Audio, Speech, and Music Processing
issn 1687-4714
1687-4722
publishDate 2010-01-01
description This paper proposes a query by example system for generic audio. We estimate the similarity of the example signal and the samples in the queried database by calculating the distance between the probability density functions (pdfs) of their frame-wise acoustic features. Since the features are continuous valued, we propose to model them using Gaussian mixture models (GMMs) or hidden Markov models (HMMs). The models parametrize each sample efficiently and retain sufficient information for similarity measurement. To measure the distance between the models, we apply a novel Euclidean distance, approximations of Kullback-Leibler divergence, and a cross-likelihood ratio test. The performance of the measures was tested in simulations where audio samples are automatically retrieved from a general audio database, based on the estimated similarity to a user-provided example. The simulations show that the distance between probability density functions is an accurate measure for similarity. Measures based on GMMs or HMMs are shown to produce better results than that of the existing methods based on simpler statistics or histograms of the features. A good performance with low computational cost is obtained with the proposed Euclidean distance.
url http://dx.doi.org/10.1155/2010/179303
work_keys_str_mv AT markohelamp233n audioquerybyexampleusingsimilaritymeasuresbetweenprobabilitydensityfunctionsoffeatures
AT tuomasvirtanen audioquerybyexampleusingsimilaritymeasuresbetweenprobabilitydensityfunctionsoffeatures
_version_ 1725010593976418304