Speaker Recognition Based On Independent Vector Analysis And Deep Convolutional Neural Network

Posted on:2023-04-30

Degree:Master

Type:Thesis

Country:China

Candidate:B Ma

Full Text:PDF

GTID:2568306800952269

Subject:Electronic and communication engineering

Abstract/Summary:

Speaker recognition aims to identify a speaker from the features of that speaker’s speech signal. It is widely used in forensic identification, voice assistants, and similar applications, and it is an active research area in speech signal processing. This thesis proposes speech feature fusion algorithms for speaker recognition based on independent vector analysis (IVA) and parallel convolutional neural networks. The main work is as follows:

1. A speech feature fusion algorithm based on IVA is proposed for speaker recognition. First, time-domain (TD) features and frequency-domain (FD) features are extracted from the speaker’s speech signal and arranged into a TD feature matrix and an FD feature matrix, respectively. Placing the TD and FD feature matrices in parallel yields a feature tensor. IVA is then used to estimate the independent feature component (IFC) matrices of the TD features and the FD features. The fusion feature of the speaker’s speech is obtained by placing the IFC matrices of the TD and FD features in parallel. A speaker model can be obtained by using IVA. Finally, the fusion feature is used as the input of a deep convolutional neural network, which extracts the deep feature of the speaker’s speech. The deep feature is fed to the fully connected (FC) layers, and the output of the FC layers is fed to the Softmax layer for speaker recognition.

2. A speech feature fusion algorithm based on a parallel convolutional neural network is proposed for speaker recognition. First, the IFC matrices of the TD and FD features are estimated from the speaker’s speech by using IVA. Then, the IFC matrix of the TD features and the IFC matrix of the FD features are used as the inputs of the parallel convolutional neural network, which extracts the deep features of the TD features and the FD features, respectively. The fusion feature of the speaker’s speech is obtained by concatenating the deep features of the TD and FD features. Finally, the fusion feature is fed to the FC layers, and the output of the FC layers is fed to the Softmax layer for speaker recognition.
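
The difference between the two fusion schemes is where the TD and FD information is combined: before the convolutional network (scheme 1) or after two parallel branches (scheme 2). The NumPy sketch below only illustrates that data flow and is not the thesis implementation. Everything in it is an assumption made for illustration: the feature sizes, the random untrained weights, the toy one-layer `conv_branch`, and in particular `standin_ifc`, which uses plain PCA whitening as a placeholder for the IVA step and is not IVA.

```python
import numpy as np

rng = np.random.default_rng(0)

def standin_ifc(features):
    """Placeholder for the IVA step (assumption): PCA whitening only, NOT real IVA."""
    centered = features - features.mean(axis=1, keepdims=True)
    cov = centered @ centered.T / centered.shape[1]
    eigvals, eigvecs = np.linalg.eigh(cov)
    whitener = eigvecs @ np.diag(1.0 / np.sqrt(eigvals + 1e-8)) @ eigvecs.T
    return whitener @ centered

def conv_branch(x, kernels):
    """Toy deep-feature extractor: one valid 2-D convolution, ReLU, global average pooling.

    x: (channels, height, width); kernels: (n_out, channels, kh, kw); returns (n_out,).
    """
    kh, kw = kernels.shape[2:]
    # windows has shape (channels, height-kh+1, width-kw+1, kh, kw)
    windows = np.lib.stride_tricks.sliding_window_view(x, (kh, kw), axis=(1, 2))
    maps = np.einsum("chwij,ocij->ohw", windows, kernels)
    return np.maximum(maps, 0.0).mean(axis=(1, 2))

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

# Hypothetical sizes: 8 feature dimensions, 40 frames, 5 enrolled speakers.
n_dims, n_frames, n_speakers = 8, 40, 5
td = rng.standard_normal((n_dims, n_frames))  # stand-in TD feature matrix
fd = rng.standard_normal((n_dims, n_frames))  # stand-in FD feature matrix
ifc_td, ifc_fd = standin_ifc(td), standin_ifc(fd)

# Scheme 1: place the two IFC matrices in parallel, then run a single CNN.
fused_input = np.stack([ifc_td, ifc_fd])                  # (2, 8, 40)
k_single = 0.1 * rng.standard_normal((16, 2, 3, 3))
deep_single = conv_branch(fused_input, k_single)          # (16,)
w_fc1 = 0.1 * rng.standard_normal((n_speakers, 16))       # FC layer, untrained
probs_scheme1 = softmax(w_fc1 @ deep_single)

# Scheme 2: one CNN branch per IFC matrix, then concatenate the deep features.
k_td = 0.1 * rng.standard_normal((8, 1, 3, 3))
k_fd = 0.1 * rng.standard_normal((8, 1, 3, 3))
deep_td = conv_branch(ifc_td[None], k_td)                 # (8,)
deep_fd = conv_branch(ifc_fd[None], k_fd)                 # (8,)
fused_deep = np.concatenate([deep_td, deep_fd])           # (16,)
w_fc2 = 0.1 * rng.standard_normal((n_speakers, 16))       # FC layer, untrained
probs_scheme2 = softmax(w_fc2 @ fused_deep)

print(probs_scheme1.round(3), probs_scheme2.round(3))
```

Because the weights are random, the printed probabilities carry no meaning; the point is only that scheme 1 fuses at the network input while scheme 2 fuses at the deep-feature level, and that both end in FC and Softmax stages.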

Keywords/Search Tags:

Speaker recognition, Independent vector analysis, Feature fusion, Deep convolutional neural network

Related items

1	Research On Text-independent Speaker Identification Technology Based On SVM
2	Speaker Recognition Based On Multi-information Fusion
3	The Research Of The Speaker Recognition System Using Low-Dimensional Vector Representations
4	Speaker Recognition Based On Fusion Features And Deep Neural Networks
5	Research On Speaker Recognition Method Based On Deep Learning
6	Research Of Speaker Recognition Technology Based On Fusion Features
7	Research On Speaker Adaptation Of Neural Network Acoustic Models For Speech Recognition
8	Research Of Speaker Identification Technology Based On Deep Features
9	Research On Speaker Recognition Algorithm Based On Deep Convolutional Neural Network
10	Speech Emotion Recognition Based On Deep Feature And Multi-kernel PCA Feature Fusion