Speech Lie Detection Research Based On Signal Decomposition Reconstruction And Deep Learning

Posted on:2024-01-23

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Jiang

Full Text:PDF

GTID:2568306917997529

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Some lying behavior can be harmful to human society,how to detect lie is always an important question,especially for fields such as law,military and forensic science.Speech contains a great deal of information that reflects the psychophysiological changes in a person,so using speech for lie detection has scientific basis.What’s more,speech lie detection is lowcost,easy to operate,less likely to cause rejection and fear in the person being tested,and the detection result is more objective.With the advancement of technology,researchers have used machine learning algorithms for speech lie detection research,and the rise of deep learning in recent years has led to a new level of lie detection research.Therefore,in this thesis,lie detection techniques based on speech signals are studied,and the specific work is as follows:Firstly,the theoretical basis of lie detection and speech processing is introduced.For the lack of corpus in speech lie detection field,the thesis draws on foreign corpus collection methods,collects resources from Chinese video websites,and got a Chinese lie corpus which is named CWG-LD(Chinese Wolf Game Lie Dataset),enriching the Chinese lie corpus.Secondly,since speech lie detection studies focus less on the speech itself,the CC-SDR(Correlation Coefficient-based Signal Decomposition and Reconstruction)model is proposed using signal decomposition algorithms in signal processing techniques.The signal is decomposed by a signal decomposition algorithm,and a threshold value is calculated based on the correlation coefficient between the sub-signal and the original signal,so as to filter the useful components of the speech to reconstruct the speech and improve the performance of speech lie detection.This thesis verifies the effect of CC-SDR with four different signal decomposition algorithms,EMD,LMD,VMD and EWT.The results show that except for VMD,all three algorithms,EMD,LMD and EWT,can make CC-SDR work for speech lie detection.Among them EMD performs the best,on average,the accuracy and F1 scores in Real-life Trail are improved by 1.08%and 1.30%respectively,and in CWG-LD they are improved by 1.70%and 1.92%.Finally,since the temporal characteristics of speech are more neglected in speech lie detection studies,this thesis proposes a CAMT-TCN-LSTM(Channel attention and Muti-taskbased TCN-LSTM)network based on channel attention and multi-task learning for speech lie detection.Both TCN and LSTM can handle temporal data,attention is used to focus on the more important dimensions of the features,while multi-task learning is used to obtain better performance.The experiments show the best results with a 4-layer TCN,a 1-layer bidirectional LSTM and the use of the ECANet attention module,achieving 89.8%and 68.6%accuracy and 89.3%and 68.0%F1 scores under the two datasets,respectively.Ablation experiments verify the contribution of each part.The organic combination of TCN and LSTM is the core,which complements each other’s shortcomings,promotes their advantages and greatly enhances the results.The attention mechanism and multi-task learning strategy playing a role in optimizing the performance of the TCN-LSTM model.Finally,the results are compared with others,and the superiority of proposed method is analyzed.

Keywords/Search Tags:

speech lie detection, signal decomposition, temporal convolutional networks, channel attention, multi-task learning

PDF Full Text Request

Related items

1	Research On Speech Enhancement Technology Based On Deep Learning
2	Research On Long Time Sequence Speech Enhancement Based On Multi-task Learning
3	Research On Characteristics Of Speech Signal For Single Channel Speech Enhancement
4	Research On Detection And Recognization Of Network Offensive Speech Based On Multi-task Learning
5	Research On CTC-based And Attention-based End-to-end Speech Recognition
6	Research On End-to-End Speech Recognition Method Based On Self-Attention Mechanism
7	Research On Speech Emotion Recognition Based On Multi-Attention Mechanism And Multi-Task Learning
8	Research On Multi-dimensional Speech Recognition Technology Based On Multi-task Neural Network
9	Research On Key Issues Of Temporal Relation Between Events
10	Research On Single Channel Speech Enhancement Based On Multi-head Attention Mechanism