Detection And Study Of Spam APP Based On Metadata

Posted on:2018-09-16

Degree:Master

Type:Thesis

Country:China

Candidate:S S Si

Full Text:PDF

GTID:2428330596989219

Subject:Electronic and communication engineering

Abstract/Summary:

PDF Full Text Request

In recent years,due to the openness and ease of use of the Android platform,based on the Android smart phones become popular for more and more consumers,therefore,increasing developers turned to Android application development.Some enterprises,groups or individuals in order to promote their own apps,they will upload a number of function similar apps or add some non-related descriptions and keywords about their apps to improve the probability of being found,there are some malicious attackers who add some violating user privacy codes to some popular applications,and then re-uploaded to the application market.As the application market not only lack of effective detection,and lower upload threshold for uploading the app,so criminals use the related vulnerability to upload some spam app.These spam apps not only reduces the user experience,but also increase the burden of application market staff.Spam apps have many obvious features,for example,no specific function,not related app description information,not related keywords,similar functional applications and so on.Application market staff through the manual method in accordance with the spam app identification rules detection app,and remove the apps which are identified as spam,this method is inefficient and cannot found and delete spam app timely.In this paper,a new spam app detection method is designed.At the same time,the results of different classifier models are compared.The precision,accuracy and recall of spam app are all very good.This is the work of this paper and their predecessors' different places.This paper first uses some obvious features as the checkpoints,and collects the data sets used in previous experiments,and then uses the decision tree,support vector machine,random forest and Boosting classifier model to experiment.The decision tree classifier model is widely used for its good readability,human analysis,high efficiency,and repeated construction.SVM has many advantages in solving small sample,non-linear and high-dimensional pattern recognition.The random forest is a classifier that uses multiple trees to train and forecast the samples.The output of the random forest is determined by the number of individual output results.The Boosting algorithm is a method of integrating several classifiers into a classifier.Finally,the experimental results are compared,and the difference of experimental results of different classifiers model is compared,and the reason of the experimental results is analyzed.On the one hand,this paper proves the validity and correctness of the selected features in the paper,and on the other hand,we can use the classifier model to design the spam app.

Keywords/Search Tags:

Spam app, decision tree, random forests, support vector machine, Boosting

PDF Full Text Request

Related items

1	Recommendation Algorithm Based On Random Forests And Boosting Thought Research
2	Research On Classification Method Of Random Support Vector Machine And Its Application
3	The Detection Method Of SPAM Based On Data Driven
4	Research On Structure-Activity Relationship Of Semiconductor Materials Based On Machine Learning
5	Studies On Question Classification Technology In Chinese Question Answering System
6	Random Forests Expression Recognition Algorithm Based On Sequence Features
7	Research On Content Based Spam Short Messages Identifying
8	Research On Code Plagiarism Detection Model Based On Random Forest And Gradient Boosting Decision Tree
9	Analysis And Application Of Telecommunications Data Based On Support Vector Machine And Decision Tree
10	Design And Implementation Of Support Vector Machine Monitoring System Based On Spam SMS