Research Of Speech Quality Assessment Based On Deep Learning

Posted on:2020-04-09

Degree:Master

Type:Thesis

Country:China

Candidate:G S Zhang

Full Text:PDF

GTID:2428330575991197

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Speech is indispensable in people's information communication.The quality of speech directly determines whether the information communication is smooth.Therefore,how to effectively evaluate the quality of the output speech is the goal of researchers at home and abroad.Aiming at the problem that the existing nonintrusive speech quality evaluation has low correlation with the subjective evaluation result and cannot meet the market demand,this paper studies the non-intrusive speech quality evaluation.This paper researchs a speech quality evaluation method combining stack autoencoder(SAE)and back propagation(BP)neural network.The method is implemented by BP neural network and a deep neural network composed of SAE.The essential features of speech are extracted by the SAE and then the feature is mapped to the subjective MOS score through the BP neural network.The simulation results show that the proposed method reduces the mean square error and improves the correlation coefficient with subjective evaluation compared with the existing itu-t p.563 and speech quality evaluation method based on fuzzy directed graph support vector machine(FDGSVM),but increases the evaluation time.Aiming at the problem of the long evaluation time of the above methods,this paper adopts the elite selection and adaptive step-and-step improved firefly algorithm to integrate into BP neural network to solve the problem.And this paper proposes a new speech quality assessment method based on the combination of SAE and improved backpropagation.The essential features of speech are extracted by the SAE and this feature is mapped to the subjective MOS score by improved backpropagation.The simulation results show that compared with the speech quality evaluation method combined with SAE and BP,the evaluation time is reduced by 67.30%.In the speech quality evaluation of the in-set language,the correlation coefficient with subjective evaluation is increased by 16.57%.The mean square error was reduced by 10.06%,but in the speech quality evaluation of the out-set languages,the correlation coefficient with subjective evaluation deteriorated by 31.23%,and the mean square error deteriorated by 22.85%.This method is suitable for the speech quality evaluation of the in-set language,but not for the speech quality evaluation of out-set languages.

Keywords/Search Tags:

speech quality assessment, deep learning, stacked autoencoder, backpropagation, glowworm swarm optimization algorithm

PDF Full Text Request

Related items

1	Classification Of Power Quality Disturbances Based On Deep Learning
2	Optimizing Deep Learning Algorithm Based On Noisy Autoencoder
3	Research On Stacked Capsule Autoencoder Optimization Algorithm Based On Manifold Regularization
4	Study On Speech Quality Assessment Based On Deep Learning
5	Reserch On Artificial Glowworm Swarm Optimization Algorithm Analysis Improve And Application
6	Improved And Applied Research Of The Glowworm Swarm Optimization Algorithm
7	Research On Swarm Intelligent Optimization Algorithm Of Glowworm And Its Application
8	K-center Algorithm Based On Improved Glowworm Swarm Optimization
9	An Improved Glowworm Swarm Optimization Algorithm And Its Application
10	Improved And Application Of Glowworm Swarm Optimization Algorithm