Research And Realization Of The Access And Management Technology For The Web Resources Quality Metadata

Posted on: 2011-07-19
Degree: Master
Type: Thesis
Country: China
Candidate: L Liu
Full Text: PDF
GTID: 2218330338466763
Subject: Computer application technology
Abstract/Summary:
With the rapid development of the Internet, the information resources available online have become increasingly rich. Web data takes many forms, no specific model is used to describe it, and it is self-descriptive and dynamically changing, so obtaining high-quality information from such complex resources is a major challenge. In recent years, scholars in China and abroad have made considerable progress in web resource quality assessment. However, the classic methods evaluate the quality of web resources qualitatively against quality indicators; this is highly subjective, easily leads to bias, and reduces the accuracy of the evaluation results.

To analyze the quality of web resources objectively and accurately, this thesis studies a method based on web resource quality metadata. The web resource/information quality evaluation model WebQM is treated as a quality metadata model, the Dublin Core Metadata Element Set (DC) is used as the design standard for web resource metadata, and the DC elements are extended. Methods for quantifying web resource quality metadata are studied, data extraction technologies are analyzed, and extraction based on regular expressions is used to obtain the quality metadata. Then, by analyzing the relationship between web resource quality and its sub-dimensions, fact tables and dimension tables are designed, a star model is established, and an elementary web resource quality metadata warehouse is constructed.

The ultimate goal of this thesis is to build a web resource quality metadata warehouse and to manage and analyze web resource quality metadata on top of it. To that end, the analysis method for factless fact tables is studied, and data analysis operations such as statistical calculations are carried out on the warehouse.
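As a rough illustration of the regular-expression extraction step mentioned above, the following minimal Python sketch pulls Dublin Core `<meta>` elements out of an HTML page. The sample page, the pattern, and the function name `extract_dc_metadata` are assumptions made for illustration; the abstract does not state which patterns or extended DC elements the thesis actually uses.

```python
import re

# Hypothetical sample page; the abstract does not publish real input data.
SAMPLE_HTML = """
<html><head>
<meta name="DC.title" content="Example Page">
<meta name="DC.creator" content="A. Author">
<meta name="DC.date" content="2011-07-19">
<meta name="description" content="not a DC element">
</head><body></body></html>
"""

# Assumed pattern: matches <meta name="DC.xxx" content="..."> tags only.
DC_META = re.compile(
    r'<meta\s+name="DC\.(?P<element>[A-Za-z]+)"\s+content="(?P<value>[^"]*)"\s*/?>',
    re.IGNORECASE,
)

def extract_dc_metadata(html):
    """Return a dict mapping lower-cased DC element names to their content."""
    return {
        match.group("element").lower(): match.group("value")
        for match in DC_META.finditer(html)
    }

metadata = extract_dc_metadata(SAMPLE_HTML)
print(metadata)
```

A pattern this simple depends on attribute order and double-quoted values, so a real extractor would likely need more tolerant patterns or an HTML parser.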
Experimental results show that managing and analyzing web resource quality metadata in this way makes it possible to analyze the quality of web resources easily and quickly and to estimate trends in that quality, which meets the core objective of this thesis.
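The star model and the factless fact table analysis described in the abstract can be sketched with an in-memory SQLite database, as below. All table names, column names, sub-dimension names, and rows are hypothetical; the abstract does not give the thesis's actual schema. Here each fact row only records that a resource satisfied a quality sub-dimension on a given date, so the analysis is done by counting rows rather than summing a measure.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_resource (resource_id INTEGER PRIMARY KEY, url TEXT);
CREATE TABLE dim_quality  (quality_id  INTEGER PRIMARY KEY, dimension TEXT);
CREATE TABLE dim_date     (date_id     INTEGER PRIMARY KEY, day TEXT);
-- Factless fact table: foreign keys only, no numeric measure columns.
CREATE TABLE fact_quality_event (
    resource_id INTEGER REFERENCES dim_resource(resource_id),
    quality_id  INTEGER REFERENCES dim_quality(quality_id),
    date_id     INTEGER REFERENCES dim_date(date_id)
);
""")

# Hypothetical dimension rows.
conn.executemany("INSERT INTO dim_resource VALUES (?, ?)",
                 [(1, "http://example.org/a"), (2, "http://example.org/b")])
conn.executemany("INSERT INTO dim_quality VALUES (?, ?)",
                 [(1, "accuracy"), (2, "timeliness")])
conn.executemany("INSERT INTO dim_date VALUES (?, ?)",
                 [(1, "2011-01-01"), (2, "2011-02-01")])

# Each row means: this resource satisfied this sub-dimension on this date.
conn.executemany("INSERT INTO fact_quality_event VALUES (?, ?, ?)",
                 [(1, 1, 1), (1, 2, 1), (2, 1, 1),
                  (1, 1, 2), (2, 1, 2), (2, 2, 2), (1, 2, 2)])

# Statistical calculation on the factless fact table: count events per date.
trend = conn.execute("""
    SELECT d.day, COUNT(*)
    FROM fact_quality_event AS f
    JOIN dim_date AS d ON d.date_id = f.date_id
    GROUP BY d.day
    ORDER BY d.day
""").fetchall()
print(trend)
```

Comparing the counts across dates gives a simple view of how the number of satisfied quality sub-dimensions changes over time, which is one way the trend estimate mentioned above could be read off such a warehouse.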
Keywords/Search Tags: DC specifications, WebQM, quality metadata access, quality metadata management and analysis