Font Size: a A A

Platform Service And Application Research For Text Extraction Based On Geographic Information

Posted on:2015-08-06Degree:MasterType:Thesis
Country:ChinaCandidate:R ZhouFull Text:PDF
GTID:2298330467466117Subject:Computer technology
Abstract/Summary:PDF Full Text Request
No doubt, we have now entered the information age, data age. People need toaccess and retrieve information quantity also grow with each passing day, then howmultitude data world to find the required information has become an increasinglyimportant research topic. Nowadays, to grow with each passing day information, onlyby artificial and simple database method is time-consuming and not too realistic. Weneed a more intelligent and reliable way, more intelligent help people find information,processing data, to solve the information rich and poor knowledge of contradiction.Indeed, there have been many intelligent tools such as automatic abstract, automaticdocument retrieval language processing technology, a key technology in these is thetheme words, helps to simplify the work for extracting topic words, and how to findthe topic word to word technique.This is also the current search engine, the coretechnology of intelligent translation tools.Word segmentation technology, as the name suggests, by means of automatic textclassification subject to the computer, so that it can correctly express the meaning ofthe expression. At the same time, notably, Chinese is different from the western,thereis no space the delimiter, at the same time in the Chinese filled with synonyms, a lotof similar words, so how to Chinese participle is a very complicated problem. At thesame time a involves linguistics, logic, computer science, Natural LanguageProcessing, cognitive science, psychology and many other fields of technology.The technology of data mining, is the analysis of data from different angles, andsummarized into useful information, is a potential new technology, can help theenterprise to collect the customer they want or potential customer information.Thevast majority of network applications are based on the database, the user data isincreasingly tired and technological update, finally let us into the age of big data, ifthe secret hidden links there seemed to be no relationship between data and data byexposing, measuring what could happen in the future by focusing on the past data, animportant mission to mine the value of is data mining is given. Spatial data mining, also known as spatial data mining and knowledgediscovery,is a new data in order to solve the mass characteristics of spatial datamining research and extension of the branch, is refers to the process of pattern andcommon feature extraction implied from spatial database, the user interested spatial ornon spatial. The object of spatial data mining is the main spatial database, spatialdatabase not only store geometry, spatial objects or object shape data, attribute data,but also the spatial topological relations between spatial objects or objects;Geographic visualization technology, the use of specific visualexpression(performance medium is paper, computer or other medium) visualization tospace environment and the problems, so as to maximize the use of the ability ofinformation processing "associated with the human visual ability, through acombination of scientific visualization, research direction of cartography and thedevelopment of the GIS, is through a series of visualization technology allows theuser to a better understanding of the spatial data, is conducive to the furtherexploration and analysis of spatial data. So far, the computer recognition ability is stillworse than visual observation ability of human beings, human beings can quickly andaccurately from image found in specific data distribution mode.Especially in thegeographical environment, people are accustomed to in the related problem analysisand space in a visualization environment. Due to the combination of the ability toobserve people keen and user knowledge possible,interactive visualization of SDMcan make the data mining process into an interactive, visual, easy to understand torepeat the process, rather than fully automatic black box operation. Analysis of thispoint is particularly important for the spatial data exploration. In general, thehuman-computer interaction is one of the most important visual technology, real-timeinteraction makes spatial data analysis and knowledge discovery has become morehumane and professional.Therefore, geographic visualization to help us analyze thedata and problems,strategy method to solve the problem of thinking, expression andinterpretation of spatial analysis results have a very special significance.The text extraction of geographic information, is the organic combination of wordsegmentation technology and spatial information derived, reflect a specific applicationis the spatial data mining technology in the field of geographic information. Firstly,the word segmentation technology, data mining and spatial data mining concept,characteristics of geographic visualization technology introduced. Then based onthese technologies, derived from the text extraction of geographic informationtechnology and the detailed technical route and the realization process analysis. The main research work as follows:(1) the segmentation technology research, with the help of the open sourcealgorithm, integrated development environment, lightweight Chinese participle API,geographic information system to establish the simple data processing model, furtheroptimization for the application of geographic information,geographic informationextraction from text.(2) the research of data mining technology, especially for spatial data miningtechnology, the spatial database, spatial topology association to study the relationshipbetween spatial objects or objects. To explore the internal relations among objects, asimple spatial data model.(3) the research of geographical information visualization technology, theresearch results of visualization of the data, constructing a simple interaction model.(4) the text of geographic information construction of content service and thefeasibility of the application of inquiry.The points of innovation as follows:This text information extraction. Based on the segmentationtechnology,especially Chinese segmentation technology characteristic of Chineselanguage,the text of geographic information specific to the optimization ofsegmentation technology. To extract the text of geographic information, make the textgeographic segmentation accuracy and processing speed to achieve optimal.Implementation of electronic map marker clustering algorithm based ondistance.The current mainstream algorithm is based on the clustering of gridmarks,algorithm is fast, simple, but the accuracy is not high enough, the errordistribution.Construction of specific to spatial geographic information data miningmodel.Data mining is a new technology in the present study more is to study sometheoretical properties, especially the achievements in the few geographic informationdomain, this from a practical point of view to explore and explain its significance.
Keywords/Search Tags:Chinese Segmentation, Spatial Data Mining, Visualization ofGeographic Information, Geographic Information Extraction, Electronic Map MarkerClustering Based on Distance
PDF Full Text Request
Related items