An Improved Automatic Math Problem Solver Based On Temporal Convolutional Networks And Multi-head Attention

Posted on:2022-05-22

Degree:Master

Type:Thesis

Country:China

Candidate:B Yang

Full Text:PDF

GTID:2518306497979299

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Math word problems are any numerical problems written in natural languages,based on any subject domain(mathematics,physics,chemistry,biology etc.)Due to the variety of types and complexity of math word problems,it is necessary to make appropriate reasoning for the expressions and answers to solve these problems described by natural language.Therefore,the design of automatic math word problem solvers has been a hot topic in applying artificial intelligence to education.In recent years,most top-level math word problem solvers have adopted deep learning methods.However,they generally ignore the importance of data preprocessing and lack the consideration of the time sequential problem of the text of the math word problems.Therefore,this paper aims to explore different ways to improve the performance of the current popular automatic math word problem solver.The core of designing automatic math word problem solver is to apply the advanced natural language processing technology and computer application technology to the whole model of automatic math word problem solver.It mainly includes two modules: data preprocessing and model training.First of all,this paper comprehensively analyzes the characteristics of the datasets of math word problems.Its application value in the automatic math word problem solver model is explored.Secondly,this paper analyzes and summarizes the related algorithms based on deep learning model.The performance advantages of different deep learning algorithms in building automatic math word problem solvers are explored.Finally,this paper proposes an automatic math word problem solver model TMASeq2 seq,which integrates the Seq2 seq model,Multi-Head Attention Mechanism and Temporal Convolution Network(TCN).Three improvement schemes are proposed for the establishment of automatic math word problem solver.Experiments are conducted on two common public datasets.All of them improve the accuracy of the automatic mathematics application problem solver.The main work of this paper is as follows:(1)TMASeq2seq adopts the better performance of word segmentation tool(Fool NLTK)and word embedding method(ELMO)to form the data preprocessing module.More attention is paid to the effect of data preprocessing on the performance of math word problem solver.(2)TMASeq2seq has added a Multi-Head Attention Mechanism.When calculating the context attention,it adopts the method of partial analysis first,and then whole reasoning.The original context attention is divided into several sub domains for calculation,and then combined reasoning.This method can not only achieve good parallelism,but also capture longer distance dependencies.(3)TMASeq2seq combines the traditional Seq2 seq model with the time convolutional network TCN for the first time to solve math word problems automatically.It enlarges the receptive field of the decoder and enhances the processing ability of sequential data to a certain extent.(4)On large-scale datasets,TMASeq2 seq achieves better performance than the current best integrated seq2 seq model(Ensemble model),with accuracy increasing from 68.4% to 70.5%.On a small dataset,TMASeq2 seq also achieved performance improvements,with an accuracy rate of 91.3%.

Keywords/Search Tags:

Math Word Problem Solver, Deep learning, Seq2seq model, Temporal Convolution Network, Multi-Head Attention Mechanism

PDF Full Text Request

Related items

1	Mathematical Word Problem Solver Based On Deep Adaptive Graph Convolutional Networ
2	Short Text Classification Algorithm Based On Temporal Convolution And Attention Mechanism
3	History-based Attention In Seq2seq Model For Multi-label Text Classification
4	Futures Trend Forecasting Based On Dilated Causal Convolution And Multi Head Self Attention
5	Research On Chinese Word Segmentation Methods Based On Deep Learning
6	Research On Text Sentiment Analysis Method Based On Deep Learning
7	Research On English-Chinese Translation Based On Improved Seq2seq Model
8	Research On Intrusion Detection Method Based On Semantic Re-Encoding And Multi-Head Attention Mechanism
9	Research On Aspect-based Sentiment Analysis Based On Multi-head Attention Mechanism
10	Research On Advertising Click Rate Prediction Based On Deep Network Model Compression