Research On Automatic Music Transcription Technology

Posted on:2017-05-29

Degree:Master

Type:Thesis

Country:China

Candidate:L Yan

Full Text:PDF

GTID:2308330485988454

Subject:Signal and Information Processing

Abstract/Summary:

PDF Full Text Request

Automatic music transcription(AMT) is the process of converting an acoustic musical signal into some form of musical notation. It is considered to be a key enabling technology in music signal processing,And while it is also a complex and challenging problem. The performance of any transcription system for polyphony music is still significantly below that of a human expert.Automatic music transcription has numerous applications associated with the area, such as automatic music accompaniment, the technology of advanced music editing, Music retrieval, Music Teaching, and so on. Here we divide The AMT problem into two main subtasks, which include: multi-F0 s estimation and multi-F0 s streaming. We also do some research and improve on them. The following is our main research and innovations:(1) Statistical model-based multi-F0 s estimationWe address the multiple F0 estimation problems in a Maximum-Likelihood fashion, where the power spectrum of a time frame is the observation and the F0 s are the parameters to be estimated. When learning the parameters of the models, to take advantage of all areas of the spectrum, we modify the parameters of the model. We add the information of the nonpeak region. The peak region information and the non-peak region information act as a complementary pair. The former helps find F0 s that have harmonics that explain peaks, while the latter helps avoid F0 s that have harmonics in the non-peak regionWhen we estimate the multiple F0 s, to solve the problem of Polyphony estimation, we use the method of hypothetical partial sequences(HPS), calculate the harmonic amplitude and spectrum smoothness, take into account the partials of the other candidates for a given combination. Therefore, we can select the most likely combination according to their energy and smoothness.Finally, we uses a post-processing technique to refine F0 estimates in each frame using neighboring frames, Results show that the improved method shows superior F0 estimation(2) Timbre feature based multi-F0 s streamingWe use the method which based on timbre feature to address the problem of multi-F0 s streaming. We cast the problem as a constrained clustering problem. There we explore different timbre features for music, such as Mel-frequency Campestral Coefficients(MFCC), Harmonic Structure(HS), ordinary cestrum(OC), and Discrete Cestrum(DC). In order to ensure that the same source have similar timbre, and the timbres come from different sources have much more discrimination. We improve the Discrete Cestrum(DC) feature. We called the new timbre feature Uniform Discrete Cestrum(UDC), and it can achieve a better result.When we initial partitions, unlike traditional random initialization,we use the pitch-order initialization. The pitch-order initialization decrease much more the number of iterations for Algorithm while have the similar results. So it can improve the efficiency of the algorithm.

Keywords/Search Tags:

Automatic music transcription, Multi-F0s estimation, Multi-F0s streaming, Maximum likelihood, Timbre tracking

PDF Full Text Request

Related items

1	Research On Polyphonic Music Pitch Estimation
2	Maximum Likelihood Natural And Music Algorithm For Doa Estimation Of Dsp Realization
3	Research On Multiple Fundamental Frequency Estimation Of Automatic Music Transcription System
4	The End-to-end Approach For Piano Music Transcription
5	Research And Implementation On Music Transcription Technology
6	Research Of Parameter Estimation Based On Cos Method
7	A New Parameter-estimation Method Of 2PLM Based On Item Response Theory
8	Multi-target Angle Estimation And The Implementation Of The Digital Multi-beam System
9	Study On Bimodal Noises In Electronic System
10	Research And Implementation Of Humming To MIDI Automatic Music Transcription System