Computational Modeling and Analysis of Multi-timbral Musical Instrument Mixtures

Posted on:2015-03-31

Degree:Ph.D

Type:Dissertation

University:Drexel University

Candidate:Scott, Jeffrey

Full Text:PDF

GTID:1478390020952284

Subject:Electrical engineering

Abstract/Summary:

In the audio domain, the disciplines of signal processing, machine learning, psychoacoustics, information theory and library science have merged into the field of Music Information Retrieval (Music-IR). Music-IR researchers attempt to extract high level information from music like pitch, meter, genre, rhythm and timbre directly from audio signals as well as semantic meta-data over a wide variety of sources. This information is then used to organize and process data for large scale retrieval and novel interfaces.;For creating musical content, access to hardware and software tools for producing music has become commonplace in the digital landscape. While the means to produce music have become widely available, significant time must be invested to attain professional results. Mixing multi-channel audio requires techniques and training far beyond the knowledge of the average music software user. As a result, there is significant growth and development in intelligent signal processing for audio, an emergent field combining audio signal processing and machine learning for producing music.;This work focuses on methods for modeling and analyzing multi-timbral musical instrument mixtures and performing automated processing techniques to improve audio quality based on quantitative and qualitative measures. The main contributions of the work involve training models to predict mixing parameters for multi-channel audio sources and developing new methods to model the component interactions of individual timbres to an overall mixture. Linear dynamical systems (LDS) are shown to be capable of learning the relative contributions of individual instruments to re-create a commercial recording based on acoustic features extracted directly from audio. Variations in the model topology are explored to make it applicable to a more diverse range of input sources and improve performance.;An exploration of relevant features for modeling timbre and identifying instruments is performed. Using various basis decomposition techniques, audio examples are reconstructed and analyzed in a perceptual listening test to evaluate their ability to capture salient aspects of timbre. These tests show that a 2-D decomposition is able to capture much more perceptually relevant information with regard to the temporal evolution of the frequency spectrum of a set of audio examples. The results indicate that joint modeling of frequencies and their evolution is essential for capturing higher level concepts in audio that we desire to leverage in automated systems.

Keywords/Search Tags:

Audio, Music, Signal processing, Modeling, Information

Related items

1	High-resolution sinusoidal analysis for resolving harmonic collisions in music audio signal processing
2	Research On Mobile Music Player System Design And Audio Processing Algorithm
3	Research On Feature Recognition And Representation Technology Of Music Signal Based On DSP
4	Music Recommendation System Based On Audio Features And Social Tags
5	Research On Music Sytle Based On Music Signal Processing
6	The Research And Optimization On The Algorithm Of The Key Module Of Audio Processing
7	Music Recommendation Method Based On Audio Characteristics
8	Research On Acoustic Feature Analysis In Audio Retrieval
9	An iterative approach to automatic music transcription and audio signal decomposition
10	Audio Directional Loudspeaker Signal Processing System Based On Fpga Design