Font Size: a A A

Research And Implementation Of Text Comprehensive Processing Platform

Posted on:2015-11-15Degree:MasterType:Thesis
Country:ChinaCandidate:M M WangFull Text:PDF
GTID:2298330431978655Subject:Computer technology
Abstract/Summary:PDF Full Text Request
With the development of Internet, the information explosion era comes, people enjoy therich information on the Internet and also endure the "information explosion" confusion,and asurge of information makes people facing a severe test to obtain the information they need fromthe inexhaustible information in the sea. At present, facing with such large information, wehandle with it mainly from two aspects of technology and management o address. From themanagement,countries have issued the corresponding information management rules andmeasures, but in view of the differences in customs, morality and system,therefore theformation of aunified global standard is very difficult. From the aspects of management,method of forming a unified control "information rubbish" is difficult to achieve. Because ofthe difficult management of the "information rubbish", people start to prepare technicallyseeking method. From the beginning of the middle of ninety’s,information workers started theresearch of data mining, the standardization of information technology, to achieve theresearch upsurge. However, most of the studies of information staff around the world stay in thetheory stage, data mining, knowledge discovery, application in practice still relativelybackward compared to theoretical researchis. Researchers in various fields and companiesurgently need system according to their needs with text analysis, text retrieval and canabsorb their requirement text..According to market researchers and domain requirements,this platform can create aseparate database for different users, and the user can maintain the personal database platform,using eigenvalue analysis of text to make the text analysis comparing to relative single featureof previous values greatly improve the accuracy of analysis.The platform adopts B/S (Browser/server) mode, using SSH open framework for thedevelopment of technology to achieve the creation of personal database for differentusers,(and) the user can maintain personal database and timely update database.(At the sametime, platform provides domain thesaurus generating, similarity calculating, textclassification and other functions. Users have their own databases, and you can text andassociated with their only pick in the amount of text data, so that the work can omit dataselection. In addition, text information through comprehensive analysis of multiple eigenvalues is more comprehensive.This platform has the basic function of text analysis andcan be extended, not only saves the time text selected, further improves the accuracy of textanalysis,(but also) promoted the development of the text analysis system.
Keywords/Search Tags:Feature Extraction, Multiple Feature, Text Analysis, SimilarityCalculation, Text Classification
PDF Full Text Request
Related items