
Multi-subspace Attention Neural Machine Translation

Posted on: 2021-01-26
Degree: Master
Type: Thesis
Country: China
Candidate: W X Wang
Full Text: PDF
GTID: 2428330611951424
Subject: Software engineering

Abstract/Summary:
With the rapid development of Internet technology and the widespread use of computers, machine translation has gradually spread from natural language processing into many other fields, such as industry and education. Driven by artificial intelligence, machine translation and neural networks have been combined, and traditional machine translation methods have gradually given way to neural machine translation. Although most existing neural machine translation models are used together with an attention mechanism, current attention mechanisms employ only a single attention calculation function, which causes the model to overlook important information. This paper therefore first proposes a new attention mechanism, focusing on how to combine multiple attention calculation functions so that the advantages of each are fully exploited, and builds a neural machine translation model that uses multiple attention calculation functions.

The bidirectional long short-term memory network is a recurrent neural network widely used in natural language processing. This paper proposes a fast-converging long short-term memory network and shows that fusing future information with historical information extracts richer contextual semantic information. On this basis, the paper proposes a multi-subspace attention mechanism that integrates multiple attention calculation functions: it first maps the hidden states of the bidirectional long short-term memory network into multiple subspaces, then uses several attention calculation functions to compute attention scores in those subspaces, and finally applies the resulting context to the neural machine translation model.

The proposed multi-subspace attention neural machine translation model is compared with several neural machine translation models on the WMT14 dataset. The experimental results show that the proposed model effectively improves translation quality.
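To make the mechanism concrete, the following is a minimal PyTorch sketch of multi-subspace attention over bidirectional LSTM encoder states. The subspace count, the choice of scoring functions (scaled dot-product and additive attention are assumed here), and all names such as `MultiSubspaceAttention`, `num_subspaces`, and the projection layers are illustrative assumptions, not the thesis's actual formulation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class MultiSubspaceAttention(nn.Module):
    """Illustrative multi-subspace attention: encoder states are projected
    into several subspaces, each scored with a different attention function,
    and the per-subspace contexts are recombined into one context vector."""

    def __init__(self, hidden_dim: int, num_subspaces: int = 4):
        super().__init__()
        assert hidden_dim % num_subspaces == 0
        self.num_subspaces = num_subspaces
        self.sub_dim = hidden_dim // num_subspaces
        # Linear maps that split queries/keys into subspaces (assumed form).
        self.query_proj = nn.Linear(hidden_dim, hidden_dim)
        self.key_proj = nn.Linear(hidden_dim, hidden_dim)
        # Parameters for the additive (Bahdanau-style) scoring function.
        self.add_w = nn.Linear(2 * self.sub_dim, self.sub_dim)
        self.add_v = nn.Linear(self.sub_dim, 1)
        self.out_proj = nn.Linear(hidden_dim, hidden_dim)

    def _dot_score(self, q, k):
        # Scaled dot-product score -> (batch, src_len)
        return (k * q.unsqueeze(1)).sum(-1) / self.sub_dim ** 0.5

    def _additive_score(self, q, k):
        # Additive score -> (batch, src_len)
        q = q.unsqueeze(1).expand_as(k)
        return self.add_v(torch.tanh(self.add_w(torch.cat([q, k], dim=-1)))).squeeze(-1)

    def forward(self, decoder_state, encoder_states):
        # decoder_state: (batch, hidden_dim); encoder_states: (batch, src_len, hidden_dim)
        batch, src_len, _ = encoder_states.shape
        keys = self.key_proj(encoder_states).view(batch, src_len, self.num_subspaces, self.sub_dim)
        queries = self.query_proj(decoder_state).view(batch, self.num_subspaces, self.sub_dim)
        score_fns = [self._dot_score, self._additive_score]  # alternate functions across subspaces
        contexts = []
        for i in range(self.num_subspaces):
            q, k = queries[:, i], keys[:, :, i]
            weights = F.softmax(score_fns[i % len(score_fns)](q, k), dim=-1)
            contexts.append(torch.bmm(weights.unsqueeze(1), k).squeeze(1))
        return self.out_proj(torch.cat(contexts, dim=-1))  # (batch, hidden_dim)
```

In a sequence-to-sequence setup, `encoder_states` would be the bidirectional LSTM outputs for each source token and `decoder_state` the current decoder hidden state; the returned context vector then feeds the decoder at each step, as in a standard attention-based translation model.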
Keywords/Search Tags: Neural Machine Translation, Attention Mechanism, Sequence-to-Sequence Model, Long Short-Term Memory Network