A Java-Based Search Engine for the Zhejiang Textile & Fashion College Campus Network

Posted on: 2011-05-20
Degree: Master
Type: Thesis
Country: China
Candidate: S L Zhang
GTID: 2208330332986909
Subject: Software engineering
Abstract/Summary:
The Internet is a vast store of information resources, and nearly every user wants access to more of it. The volume of online resources is growing geometrically, yet each resource resides on its own host machine, and no one can read everything the Internet holds. Finding specific information quickly and efficiently among thousands of Web sites requires a search engine.

A campus network search engine differs from a general Internet search engine: it has a compact structure, is easy to build, and runs efficiently. The IP address range of a campus network is limited. Information retrieval is an important application of full-text search technology. The search algorithm we built for the campus network has practical value: it can be applied to most campus networks and returns concise, accurate results.

This thesis describes a search engine system based on an inverted index, with emphasis on the web spider and on Chinese word segmentation. Designing the search engine required learning the Java language, database design and maintenance, and Chinese linguistics, as well as understanding distributed computing concepts, and the system was then completed.

The thesis is divided into six chapters. Chapter 1 covers the research background, research aims, main research contents, and the structure of the thesis. Chapter 2 discusses the platforms related to the system. Chapter 3 examines the structure of the web spider. Chapter 4 covers the implementation of Chinese word segmentation. Chapter 5 discusses the implementation of indexing and search. Finally, the thesis summarizes the system design process, considers how the system is likely to develop, and points out directions for future research.
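
The inverted index named in the abstract maps each term to the set of documents that contain it, so a query is answered by looking up terms instead of scanning pages. The following is a minimal in-memory sketch of that idea, not the thesis's implementation and not Lucene's index format; the class name `InvertedIndex` and its methods are hypothetical, and documents are assumed to arrive already split into terms.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical illustration: term -> sorted set of document IDs.
final class InvertedIndex {
    private final Map<String, SortedSet<Integer>> postings = new HashMap<>();

    // Record that every term in `terms` occurs in document `docId`.
    void add(int docId, Iterable<String> terms) {
        for (String term : terms) {
            postings.computeIfAbsent(term, k -> new TreeSet<Integer>()).add(docId);
        }
    }

    // Documents containing a single term (empty set if the term is unknown).
    SortedSet<Integer> lookup(String term) {
        SortedSet<Integer> hits = postings.get(term);
        if (hits == null) {
            return Collections.<Integer>emptySortedSet();
        }
        return Collections.unmodifiableSortedSet(hits);
    }

    // Documents containing every query term (boolean AND).
    SortedSet<Integer> search(List<String> terms) {
        SortedSet<Integer> result = null;
        for (String term : terms) {
            SortedSet<Integer> hits = lookup(term);
            if (result == null) {
                result = new TreeSet<Integer>(hits); // copy the first posting list
            } else {
                result.retainAll(hits);              // intersect with the next one
            }
        }
        return result == null ? new TreeSet<Integer>() : result;
    }
}
```

A production index, such as the one Lucene maintains, also stores term positions and frequencies and persists to disk; this sketch keeps only document IDs to show the core mapping.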
Keywords/Search Tags: search engine, Java, Lucene, reverse maximum matching
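
Reverse maximum matching, listed in the keywords, segments Chinese text by scanning from the end of the string and, at each step, taking the longest dictionary word that ends at the current position. The sketch below illustrates the general technique under stated assumptions; it is not taken from the thesis. The class name `ReverseMaxMatch`, the `maxLen` parameter, and the in-memory `Set` dictionary are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Set;

// Hypothetical illustration of reverse maximum matching (RMM).
final class ReverseMaxMatch {
    // Split `text` into words using `dict`; `maxLen` is the longest word length to try.
    // Characters not covered by any dictionary word are emitted as single-character tokens.
    static List<String> segment(String text, Set<String> dict, int maxLen) {
        List<String> words = new ArrayList<>();
        int end = text.length();
        while (end > 0) {
            // Start with the widest window ending at `end`.
            int start = Math.max(0, end - maxLen);
            // Shrink from the left until the window is a dictionary word
            // or only one character remains.
            while (start < end - 1 && !dict.contains(text.substring(start, end))) {
                start++;
            }
            words.add(text.substring(start, end));
            end = start; // continue with the text to the left of the match
        }
        Collections.reverse(words); // words were collected right to left
        return words;
    }
}
```

For example, with an assumed dictionary containing 研究, 研究生, 生命, and 起源 and `maxLen` set to 3, `ReverseMaxMatch.segment("研究生命起源", dict, 3)` yields 研究 / 生命 / 起源, whereas a forward scan would first take 研究生 and leave 命 stranded.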