Supervised Speech Separation Using The Optimal Ratio Mask

Posted on:2018-07-17

Degree:Master

Type:Thesis

Country:China

Candidate:S S Xia

Full Text:PDF

GTID:2348330515952367

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Speech separation technique is aimed at separating interested target speech from a noisy mixture.Speech separation is widely applied in automatic speech recognition?hearing aids design and telephone communication,etc.Although there are already many achievements in the area of speech separation,while further developments are still expected.This dissertation is focused on monaural speech separation.Separation problem can be treated as a supervised learning problem and solved by adopting supervised learning algorithm.Training target is one of the most important part in a supervised speech separation algorithm,which have great impact on the performance of speech separation.The ideal binary mask and the ideal ratio mask are most popular training targets,which need the assumption that the clean speech is dependent with the noise which is hard to obtain in real environment.The complex ratio mask and phase sensitive mask are hard to estimate leads to its practical performance worse than that in theory.This dissertation utilize the optimal ratio mask which gains the maximize signal-to-noise ratio(SNR)in theory.ORM consider the relationship between clean speech and the noise in real environment.We construct a new separation target function by combining ORM with the supervised speech separation system,utilizing the ORM as training target,which is proposed as a new separation algorithm in this dissertation.We carried out experiments under various noise situation and SNRs,analyze and compare the separation results with several popular training targets.The results show that the separation algorithm we proposed outperform the others as a better solution of speech separation problem in real environments.Considering that ORM separates mainly based on the relations between clean speech and noise,we further experiment with the separation of speech from different speakers,our method also shows superiority.Monaural speech dereverberation is another key problem in the area of signal processing.As the advancing of deep learning,researchers adapt it in speech dereverberation and achieve excellent results.In this dissertation,we utilize the algorithm we proposed into the dereverberation problem.The experiment result shows ideal performance.

Keywords/Search Tags:

Deep neural networks, Speech separation, Supervised learning, Training targets

PDF Full Text Request

Related items

1	Research On Auto-regressive Deep Neural Networks' Based Monaural Speech Separation
2	Research On Supervised Speech Separation Based On Deep Learning
3	Research On Supervised Speech Separation Based On Deep Learning
4	Single Channel Speech Separation Technology Based On Deep Neural Network And Research
5	Supervised speech separation using deep neural networks
6	Speech Separation Based On Deep Learning
7	Speech Recognition Front-End Processing Based On Deep Neural Network
8	Underdetermined Speech Separation Based On Sparse Representation And Deep Learning
9	Binaural Speech Separation Research Based On Deep Learning
10	Research And Design Of Speech Separation Algorithm Based On Deep Learning