Font Size: a A A

Partitioning of prosodic features for audio similarity comparison

Posted on:2011-03-23Degree:M.SType:Thesis
University:Michigan State UniversityCandidate:Geimer, Matthew StevenFull Text:PDF
GTID:2448390002957323Subject:Computer Science
Abstract/Summary:
Multiple methods for partitioning space for use in comparing audio samples using prosodic features are examined and researched. Specific prosodic features are chosen for use within an online system that will allow for users to submit audio clips and receive matches. The audio requires processing before being input to the system which is comprised of multiple steps. Existing methodologies using classifier systems requiring classifier training are discussed and deemed unsuitable for this application. The partitioning of extracted features into representative points or regions in the search space is focused on, with 2 approaches. k-means clustering with multiple different validity measures is examined as well as vector quantization using a scalar quantizer. Experimental results show that clustering is ill-suited for use and finding a good k is unlikely. A scalar quantizer is implemented based on its ability to effectively quantize the space without changing how the space is discretized. It is also concluded that a method to trim the input data to reduce the codebook size of the quantizer is not inherently better, yielding more representative points compared to using all the input data.
Keywords/Search Tags:Prosodic features, Audio, Partitioning, Using, Space
Related items