Research On The Related Issues Of Melody Extraction

Posted on:2019-01-30

Degree:Master

Type:Thesis

Country:China

Candidate:C Jia

Full Text:PDF

GTID:2348330545984503

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Melody extraction is an important issue in music information retrieval(MIR).The purpose of melody extraction is to convert the waveform file into a sequence of human voice fundamental frequencies.As an important acoustic feature,fundamental frequency plays an important role in many fields.Therefore,the main melody extraction plays an important role in the interaction of music information.The tradition main melody extraction method mainly includes two modules,one is the candidate fundamental frequency extraction and the other is the main fundamental frequency discrimination.At present,the candidate fundamental frequency extraction module mainly has the problem that it extracts so many false fundamental frequencies.The main fundamental frequency discrimination module mainly has two disadvantages,one is insufficient utilization of short-time features in time frame and the other is the reliance on extraction accuracy of the initial phase of the melody.This paper made improvements to the above drawbacks,and the main innovation includes the following two parts:1.This paper proposed a candidate fundamental frequency extraction module base on tandem filter system.The system consists of three parts,preprocessing part,Hyper-Fourier transform filter part and the transverse stripe filter part.First,there exists so much interfere in the original signal without preprocessing,so we use the repeatability of the music,and apply the state-of-art source separation method to process the original signal.Then we use Hyper-Fourier Transform to filter the accompaniment spectrum components.This part reduce the number of non-human voice fundamental frequency in all candidate fundamental frequencies.Finally,we believe that the STFT spectrogram have the original information of the melody frequency structure,here we used a transverse stripe filter system to eliminate the non-possible fundamental frequency position.2.This paper proposed a machine learning classifier base on Mel Frequency Cepstral Coefficient,Harmonic Feature,Spectral Shape Feature,Spectrum Contrast Feature to decision which frame is human voice frame.The experiment was conducted on two terms,single feature and combination feature.Finally,we found a feature combination that both perform well on frame precision and frame recall rate.

Keywords/Search Tags:

main melody, tandem filter system, acoustic feature, logistic regression classifier, feature combination

PDF Full Text Request

Related items

1	Research And Application Of Recommendation Technology Based On Logistic Regression
2	Research And Application Of Main Melody Extraction Algorithm
3	Pedestrian Detection On Android System
4	Design Of Face Recognition Classifier Based On Regression Algorithm
5	The Algorithm To Improve Feature Selection In Personalized Recommender Systems For Items In The Tail Of The Long Tail Theory
6	Design And Implementation Of Click Through Rating System Based On Logistic Regresion Model
7	Face Recognition Algorithm Based On Monogenic Binary Coding And Patterns Of Oriented Edge Magnitudes
8	Face Recognition Method Of Identifying Projection And Regression Classification
9	Research On A QBH System Based On Melody Information Invariance
10	Research On Stable Humming Feature Extraction