Font Size: a A A

Design And Optimization Of Chinese Address Matching System

Posted on:2019-06-06Degree:MasterType:Thesis
Country:ChinaCandidate:W H ZhangFull Text:PDF
GTID:2348330542455549Subject:Information and communications systems
Abstract/Summary:PDF Full Text Request
The purpose of Chinese address matching research is to aggregate non-standard and non-normalized addresses through the technology of Chinese address matching.Linking information with the Chinese address is an important way for big data analysis.Address is a natural language character string that describes the space coordinates,and is also the spatial coordinate for identifying human beings' living,working and so on,and is very to people's life.There are a lot of records with address in the logistics,telecommunications registration,household registration,taxation,real estate,business and other fields.Analyzing these data can have a positive impact on national economic and social security.At this stage,the use of Chinese address is still in its junior stage.Chinese addresses are not structured data and have a lot of forms.There are difficult for computer to understand and can not be directly used for aggregation.Chinese Addresses are not conducive to analyze the data.The study of Chinese address matching can solve the problems of standardization and exact matching of Chinese addresses and provide effective support for the intercommunication of address data which from different sources.Although foreign countries have very mature research on address matching,there are many problems in the prior art for Chinese address matching due to the complexity of Chinese and the late progress of our country in overall planning of addresses and standards.Based on the above points,this article will chose the Chinese address matching as a topic of the research,which is a study of Chinese address standardization,matching and other issues.The main content of the thesis includes:1.The research of Chinese address standardization.The Chinese address string consists of Chinese characters,English characters,numeric characters and other special characters.Firstly,this paper analyzes the complexity of Chinese addresses and the difficulty of standardization.And then it analyzes the composition of address elements required by standard Chinese addresses and how to obtain standardized Chinese Address.It puts forward a Chinese word segmentation method,and a word address element recognition method.2.Research on Chinese Address Efficient Matching Algorithm.Based on the standardization of Chinese addresses,this paper studies how to efficiently match Chinese addresses.3.The design of Chinese address matching system.In response to the needs of Chinese address matching,this paper designs and implements a practical Chinese address matching system.The innovations of this article are:1.A word segmentation algorithm based on LSTM network is proposed.2.After word segmentation an address resolution algorithm is proposed based on rules and understanding.
Keywords/Search Tags:Chinese address matching, Chinese address segmentation, Address elements, Chinese address standardization
PDF Full Text Request
Related items