
Minimal Gated Unit For Recurrent Neural Networks

Posted on: 2017-03-20    Degree: Master    Type: Thesis
Country: China    Candidate: G B Zhou    Full Text: PDF
GTID: 2308330485966385    Subject: Computer technology
Abstract/Summary:
Countless learning tasks require dealing with sequential data. Some problems require a model to produce outputs that are sequences; in other domains, a model must learn from inputs that are sequences; interactive tasks often demand both capabilities. Compared to traditional models, recurrent neural networks are better suited to all three categories of problems. In practice, recurrent neural networks have been successfully applied in many areas, such as speech recognition, video motion analysis, handwriting recognition, and image captioning, achieving good results. After years of development, the recurrent neural network has spawned many variants; LSTM and GRU are the most widely used structures, and both are gated units. Benefiting from evaluation results on LSTM and GRU in the literature, we propose a gated unit for RNNs, named the Minimal Gated Unit (MGU). It contains only one gate, which is a minimal design among all gated hidden units. Compared to previous variants, MGU's contributions are as follows.

First, MGU minimizes the number of gates in the structure, so it has far fewer parameters than LSTM and GRU. Its training complexity and training speed also benefit from this property. In some of our experiments, MGU is much faster than GRU: MGU reaches a good result within an acceptable time cost, while GRU does not.

Second, its simple architecture means that it is easier to evaluate and tune, and in principle it is easier to study MGU's properties theoretically and empirically.

Third, we evaluated the effectiveness of MGU on four problems (the adding problem, sentiment analysis, image classification, and language modeling), and MGU achieved accuracy comparable to GRU for input sequence lengths that are short (35, 50-55), moderate (128), and long (784).
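The abstract does not spell out MGU's update equations, so the following is a minimal NumPy sketch of one MGU step, assuming the coupled single-gate formulation the abstract describes: one forget gate both masks the previous state inside the candidate activation and interpolates the new hidden state. All parameter names (W_f, U_f, b_f, W_h, U_h, b_h) are illustrative, not taken from the thesis.

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mgu_step(x_t, h_prev, W_f, U_f, b_f, W_h, U_h, b_h):
    """One step of a Minimal Gated Unit (sketch).

    The single forget gate f_t plays both roles that GRU splits
    between its update and reset gates.
    """
    # Forget gate: the only gate in the unit.
    f_t = sigmoid(W_f @ x_t + U_f @ h_prev + b_f)
    # Candidate state: the previous state is masked by the same gate.
    h_tilde = np.tanh(W_h @ x_t + U_h @ (f_t * h_prev) + b_h)
    # Interpolate between the old state and the candidate, again with f_t.
    return (1.0 - f_t) * h_prev + f_t * h_tilde

# Example: input size 3, hidden size 4, random parameters.
rng = np.random.default_rng(0)
n_in, n_hid = 3, 4
W_f = rng.standard_normal((n_hid, n_in))
W_h = rng.standard_normal((n_hid, n_in))
U_f = rng.standard_normal((n_hid, n_hid))
U_h = rng.standard_normal((n_hid, n_hid))
b_f = np.zeros(n_hid)
b_h = np.zeros(n_hid)

h = np.zeros(n_hid)
for x in rng.standard_normal((5, n_in)):  # a length-5 input sequence
    h = mgu_step(x, h, W_f, U_f, b_f, W_h, U_h, b_h)
print(h)

With only two input-to-hidden and two hidden-to-hidden matrices, such a unit uses roughly two thirds of GRU's recurrent parameters and half of LSTM's, which is consistent with the abstract's claim that MGU is smaller and faster to train.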
Keywords/Search Tags: RNN, Machine Learning, LSTM, GRU