Research On Information Sensitivity Based On Dependency Parsing

Posted on:2014-07-23

Degree:Master

Type:Thesis

Country:China

Candidate:C Wang

Full Text:PDF

GTID:2268330401986729

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the fast development of informatization, the sensitive information related to individual privacy, trade secret and states secrete have more and more existence forms and is increasing in quantity and have different secret classifications. Currently, thereâ€™s little research on automatic classification while traditional manual marking or classification is often inefficient and the result is not good. Therefore using computer for automatic analysis of sensitive information and automatic classification has become an important and practical research topic.A lot of information sensitivity studies need multi-granularity sensitive information analysis. First, this paper presents an algorithm SSAD, sentence sensitivity analysis based on dependency parsing. After dependency parsing analysis in sentences, extract core structure of sentence that contains sensitive information. Then analyze the semantic distance and the position of sentence. And considering sensitive words the sentence contained calculate the sensitive values of whole sentence. Finally, store the sentence frame for further processing.In order to analyze of the sensitivity of the document, based on effective analysis of sentence-level information sensitive, this paper presents an algorithm DSAD, document sensitivity analysis based on dependency parsing. After segment and dependency parsing, according to the similarity with storied sentence frame, calculate the sensitivity value of the document. If the document has been classified, calculate document sensitive values considering the sentencesâ€™ sensitivity and document secret classification. If the document has not been classified, calculate security classification of the document by the distribution of sensitive words and sentence frame, and then combine with the sensitive sentences information the document contains to calculate the document sensitive values.To solve the problem that the same sensitive information has different sensitivities at different times, this paper proposes a strategy of dynamic updating the sensitivity of sensitive words.Experimental results show that the algorithms above can calculate the sensitivity of the sentence-level and document-level information effectively, and can classify and sort the document without secret classification rightly.

Keywords/Search Tags:

Sensitive Information, Sentence Sensitivity, DocumentSensitivity, Dependency Parsing, Auto Classification, Dynamic Updating

PDF Full Text Request

Related items

1	Study And Application Of Chinese Sentence Structure Clustering
2	Improving Dependency Parsing Using Sentence Splitting
3	Research On Chinese Dependency Parsing Based On Statistical Methods
4	Reaserch On Dependency Parsing Of Chinese Simple Sentence
5	Research On Dependency Parsing Based Semantic Computing Method For Chinese Sentence
6	Research And Implement On Chinese Dependency Parsing
7	Exploiting Dependency Parsing As An Auxiliary Task To Enhance AMR Parsing
8	Tourism Scenic Spot Evaluation System Based On Dependency Parsing
9	Study On Aspect-level Sentiment Classification Algorithm Based On Dependency Parsing
10	Cost-sensitive Rough Set Theory: Model And Application