A Research Of Classifying The Compatibility Between Foods

Posted on:2018-08-25

Degree:Master

Type:Thesis

Country:China

Candidate:J Hu

Full Text:PDF

GTID:2348330533966803

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

With the development of computer technology and artificial intelligence technology,more and more fields begin to apply the technology of machine learning.Among them,traditional Chinese medicine proposed the theory of that medicine equals food and there is compatibility between foods.In this paper,the machine learning algorithm will be applied to this field,we collect some information on the corpus on internet in order to develop a model for the classification of the compatibility between food and food.In principle,the classification of compatibility between food is also a kind of knowledge representation method.The rise in recent years,a lot of knowledge representation learning method,one of the most famous new and widely used is the transE algorithm and its derivative algorithm,transE algorithm and its derivative algorithm but the principle is not suitable for processing a kind of relationship is reflexive,while compatibility between food is just a reflexive relationship.A and B are compatible means B compatible A.In order to solve this problem,this paper presents a new feature relation representation method,and puts forward two methods of data augmentation for this feature,we can make these data augmentation methods to produce more relation representation on the same pair of food.In training stage,we can through the extended data set to get more samples to improve the classification accuracy;in the prediction stage,we can use the same food to generate multiple feature representation then obtain multiple answers and voting among them,so as to improve the accuracy of the classifier.Because the studies are very rare,so there is no public dataset on internet now.The word embedding model used in the paper and the dataset of compatibility between foods used by the algorithm was built by our own,generating the dataset of compatibility between foods is by fusing the data from crawling multiple sites through the network.The corpus we using to build word embedding is from food in wikipedia and health blog,etc.We devise experiments and do it on the dataset in order to find out the influence of two data augmentation methods and which method is better.And study the relationship of accuracy with dim of word embedding,different classifier model,etc.After all,we get many classifiers and try to aggregate them together,we devise three method of aggregation and make things better.

Keywords/Search Tags:

Data augmentation, word embedding, aggregation

PDF Full Text Request

Related items

1	Research On Word Spotting Technology In Handwritten Historical Document Images
2	Research On Mongolian-Chinese Cross-Lingual Word Embedding Learning Based On BERT
3	Image Semantic Understanding Introducing Word Embedding And Attention Augmentation Mechanisms
4	Research On Chinese Word Segmentation Method Based On Word Embedding
5	Research On Word Sense Disambiguation Method Based On Word Embedding
6	Research On The Representation Of Word Embedding Based On Knowledge Fusion
7	Research And Application On Word Embedding Of Low Frequency Words
8	Research On Personalized Recommendation System Based On Word Embedding
9	Word Embedding Revision Based On Morphological Information And Semantic Lexicons
10	Research On Ontology Alignment Based On Word Embedding