Research On Two Methods Of Single Channel Speech Separation

Posted on:2020-01-18

Degree:Master

Type:Thesis

Country:China

Candidate:X L Dong

Full Text:PDF

GTID:2428330590954688

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Artificial intelligence technology constantly updates and iterates,and permeates to the increasingly rich application scene,man-machine voice interaction technology is becoming more and more indispensable.However,the external environment is changeable,the noise interference will often seriously affect the performance of speech interaction,especially the strong noise,single channel condition,thus hindering the real application of speech technology,so a good front-end speech separation module is particularly important.In recent years,supervised speech separation technology has made important progress,among which the mainstream supervised learning algorithms include computational auditory scene analysis,non-negative matrix decomposition and deep neural network-based speech separation algorithms.This paper mainly studies supervised speech separation algorithms based on non-negative matrix decomposition and neural network.The main contents and innovations are as follows:Firstly,the speech separation method based on non-negative matrix decomposition is deeply studied and implemented in this paper,and the existing models are improved and optimized.A strong noise mono-channel speech separation algorithm based on convolutional non-negative matrix partial joint decomposition is proposed.The speech starting point of the mixed signal is obtained by pitch detection algorithm,and then the pure noise segment in the mixed signal is determined.Finally,the mixed signal spectrum and the noise spectrum are decomposed partly by convolutional nonnegative matrix,and the speech base matrix is obtained.Then the separated speech spectrum and time domain signal are obtained.The experimental results show that under the conditions of different noise types and noise intensity,the speech separation of the convolutional non-negative matrix partial joint decomposition has achieved better results.Secondly,this paper studies the supervised speech separation algorithm and network framework based on depth clustering,and then proposes a speech separation method based on threshold convolution depth clustering.It makes full use of the strong feature learning ability of convolution neural network with multi-level nonlinear structure,and is good at exploring the advantages of space-time structure information in speech time-frequency unit.The algorithm allows the context feature modeling of speech spectrum,and considers the time-frequency dependence and local characteristics of speech signal,which is beneficial to improve the performance of speech separation.The experimental results show that the method not only achieves good separation effect,but also improves the operation speed significantly on the premise of ensuring speech performance.Finally,this paper summarizes the research and points out the future research direction.

Keywords/Search Tags:

Convolutive nonnegative matrix partial co-factorization, speech separation, low SNR, monaural speech, deep clustering

PDF Full Text Request

Related items

1	Nonnegative Matrix Decomposition And Application In Mono Speech Separation
2	Speech Enhancement Using Nonnegative Matrix Factorization With The Constrained Speech Spectrum
3	Research On Underdetermined Convolutive Speech Signal Separation Methods
4	Study On Speech Separation Based On Non-negative Matrix Factorization And Deep Clustering
5	Single Channel Speech Separation Based On Nonnegative Matrix Factorization
6	Research On Monaural Speech Separation Technology Based On Deep Learning Joint Optimization And Feature Fusion
7	Study On Speech Enhancement Based On NMF Algorithm
8	Research On Auto-regressive Deep Neural Networks' Based Monaural Speech Separation
9	The Research Of Key Techniques Of Speech Separation And Speech Recognition
10	Research On Monaural Speech Separation Of Specific Speaker Based On Deep Learning