Font Size: a A A

Study Of The Evaluation System Of Web TCM Information Resource Based On Hadoop

Posted on:2017-04-20Degree:DoctorType:Dissertation
Country:ChinaCandidate:X B LiFull Text:PDF
GTID:1318330491460695Subject:TCM History and Literature
Abstract/Summary:PDF Full Text Request
With the development of computer and communication technology, the Internet has gradually penetrated into people's production and life in even all areas. The Internet has become an important source of knowledge and people obtain information for internet to guide their work and lives. Modern social life cannot make progress without the internet.The information of TCM(Tradition Chinese Medicine) on Web is growing every day and existing information resources are constantly changing and updating. The rapid development of information technology makes the TCM information on the Internet is growing explosively, but TCM information quality does not match its growing, and in the current situation it is very difficult to have the objective evaluation of the quality to guide people to find correct and useful information from the Web.We need to find a way to evaluate the TCM information resources of the Web. From the characteristics of information resources of Web, we use Hadoop distributed computing technology, put forward the mass data aided AHP method to establish evaluation index system of Web TCM information resources, and makes an empirical research on Chinese medicine health service website. The research results include the following aspects:(1) Design of TCM theme crawler.(Chapter No.3)In this Chapter, we discussed the characteristics of Web TCM information resources such as fast growth, wide distribution, and easy to change. If you want to analyze and evaluate the TCM information resource on the Web, the premise is that we can obtain information fast and with high quality. So we should use automated Web information retrieval method, namely web crawler. The crawler and general search engine crawlers should be different. The way we use crawler of TCM theme to get the web information avoid wasting time to improve the accuracy rate of the crawler. In view of the above requirements, we determined to use distributed the TCM theme crawler, and it can get information from the Web with high quality, and to use the Java programming language for the development of crawler.(2) Construction of TCM resources based on Hadoop.(Chapter No.3 and No.6)The TCM information resources get by the crawler updating, and page analysis and data mining bring high demands on the machine performance, so the storage mode of traditional relational database, can not meet the high performance computing requirements. After collecting information by the crawler, we use the Hadoop HDFS to store it. At the same time, the text mining and the text statistical analysis, can ensure the high performance and low overload.(3) Construction of the evaluation index system of Web TCM resources.(Chapter No.4 and No.5)In this chapter we started from the characteristics analysis of Web information resources of TCM. We discussed that the evaluation of information resources should insist some principles. The evaluation index system is divided into four parts, namely, evaluation of the content, evaluation of the website, evaluation of the usage and other evaluation aspects. Each part is divided the specific evaluation index, total is 24 items, and made a detailed description of the meaning and function of them. The evaluation of Chinese medicine information resource analysis method based on AHP levels were analyzed, establish judgment matrix and determine the weight specific index. According to the weight of the comparison, determined the degree of importance of each index.(4)Implementation of the Web evaluation of TCM(Chapter No.6)Based on the specific TCM evaluation practice, we made the expatiation of constructing development environment, the software and hardware configuration requirements, system architecture, Hadoop cluster building and so on. We explained the realization of MapReduce algorithm, described the specific implementation process of the website classification and evaluation, and pointed out what the website should do based on this evaluation result.
Keywords/Search Tags:TCM, Web, Information, Evaluation
PDF Full Text Request
Related items