
Study On Application Of Web Hyperlink Analysis

Posted on: 2006-12-27
Degree: Master
Type: Thesis
Country: China
Candidate: Y J Hu
GTID: 2178360182467929
Subject: Information Science
Abstract/Summary:
In the coming information age, the web is becoming an important platform on which people publish and obtain information. However, the web is also a body of heterogeneous data that grows rapidly and without control. As a result, classic information mining techniques and data modeling methods are difficult to apply effectively in information utilization and web topology research. The introduction and application of hyperlink analysis provide a wholly new approach to problems that classic methods handle poorly. Building on an examination of the ideas behind hyperlink analysis, this dissertation studies in depth its application in three areas: web information retrieval, web resource discovery, and web topology research.

Because hyperlinks resemble co-citation in structure and function, many co-citation methods and ideas are applied in hyperlink analysis. Hyperlink analysis, however, faces a more complicated environment, one that is dynamic, heterogeneous, and uncontrolled. It is therefore not simply a mechanical borrowing of algorithms; it is combined with Social Network Analysis, Complex System Theory, and Topology Modeling to study the relationship between the hyperlink structure of the web and the behavior of web information entities.

For a given information retrieval system, the main performance factors are recall and precision. On the web, however, recall is no longer a main factor; filtering out noise and obtaining the required information become the main concerns. Hyperlink analysis provides a natural method for quantifying the "similarity" among web pages. Chapter 2 analyzes applications of hyperlink analysis in web information retrieval, such as web page crawling, finding related pages, optimizing page ranking, and clustering retrieval results.
The chapter also discusses the development of hyperlink analysis through the combination of content analysis with hyperlink analysis, the combination of user behavior with hyperlink analysis, and the unification of hyperlink algorithms. The research shows that hyperlink analysis not only enriches the theory of modern information retrieval but can also improve the efficiency of web retrieval and the quality of web search engines.

The creation of hyperlinks between linked web pages is neither random nor uncontrolled; it gives page authors a platform on which to expand the space of information communication. Pages with similar social or scientific backgrounds are linked together and organized into many different topics, and in the end thousands of millions of virtual web communities are built. Hyperlink analysis provides a natural mechanism for quantifying the "authority" of web resources. Chapter 3 discusses applications of hyperlink analysis in web resource discovery through three kinds of discovery: common topic discovery, authority topic discovery, and web community discovery. The research shows that hyperlink analysis can improve not only the quality of web retrieval but also the quality of web resource evaluation, and can expand the scope of web topic and web community discovery.

Web pages and hyperlinks are the two basic elements of the web. Pages represent heterogeneous information entities, and hyperlinks show the relationships among those entities. Web topology research does not focus on the attributes of individual pages or the concrete form of particular hyperlinks. It focuses on how the structure of hyperlinks actually influences the behavior of pages. Chapter 4 discusses the application of hyperlink analysis in the field of web topology.
The research shows that hyperlink analysis can advance web topology research and can help us discover and understand the laws embedded in the web.

In short, although the theory of hyperlink analysis is not yet complete and many of its applications are empirical, hyperlink analysis plays an important role in extending the scope of similarity retrieval, expanding the space for mining web knowledge, and discovering valuable structural information.
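
As a rough illustration of what quantifying "similarity" and "authority" from links alone can look like, the Python sketch below counts co-citations and runs a HITS-style hub/authority iteration on a toy link graph. The graph, the function names, and the choice of a HITS-style iteration are assumptions made for this example; the abstract does not name the specific algorithms used in the thesis.

```python
from itertools import combinations

# Hypothetical toy link graph: each page maps to the pages it links to.
LINKS = {
    "a": ["c", "d"],
    "b": ["c", "d"],
    "c": ["d"],
    "d": [],
    "e": ["c", "d"],
}


def cocitation_counts(links):
    """For each pair of pages, count how many pages link to both of them."""
    counts = {}
    for targets in links.values():
        for u, v in combinations(sorted(set(targets)), 2):
            counts[(u, v)] = counts.get((u, v), 0) + 1
    return counts


def _normalize(scores):
    """Scale scores to unit Euclidean length (left unchanged if all zero)."""
    norm = sum(v * v for v in scores.values()) ** 0.5 or 1.0
    return {page: v / norm for page, v in scores.items()}


def hits(links, iterations=50):
    """Return (hub, authority) scores from a HITS-style power iteration."""
    pages = set(links) | {t for targets in links.values() for t in targets}
    hub = {page: 1.0 for page in pages}
    auth = {page: 1.0 for page in pages}
    for _ in range(iterations):
        # A page's authority is the sum of the hub scores of pages linking to it.
        auth = {page: 0.0 for page in pages}
        for source, targets in links.items():
            for target in targets:
                auth[target] += hub[source]
        auth = _normalize(auth)
        # A page's hub score is the sum of the authority scores it links to.
        hub = _normalize(
            {page: sum(auth[t] for t in links.get(page, [])) for page in pages}
        )
    return hub, auth


if __name__ == "__main__":
    print(cocitation_counts(LINKS))
    hub_scores, auth_scores = hits(LINKS)
    for page in sorted(auth_scores, key=auth_scores.get, reverse=True):
        print(page, round(auth_scores[page], 3))
```

In this toy graph, pages "c" and "d" are co-cited by three pages ("a", "b", and "e"), which is the link-based notion of similarity, and "d" receives the highest authority score because it has the most in-links from good hubs. Real web graphs would need sparse matrix representations and convergence checks, which this sketch omits.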
Keywords/Search Tags:hyperlink analysis, web retrieval, topic discovery, web community, web topology