Font Size: a A A

Design And Implementation Of Specific Electronic Products Crawler Based On Amazon Web Site

Posted on:2013-07-25Degree:MasterType:Thesis
Country:ChinaCandidate:H W WangFull Text:PDF
GTID:2248330371985207Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The development of communication technology and computer greatly promotethe popularization and development of the network. In recent years, various types ofe-commerce sites have been developing rapidly. With a large number of B2C or C2Cmode shopping sites appeared, the consumers have a wide choice. Online shoppingbreak the record of sale again and again. With the development of online shopping,product category and the number increase gradually, a large number of productinformation often makes consumers unable to obtain the best goods. Preferentialnetwork system is proposed under this context. Preferential network system isdesigned to provide users with timely and effectively merchandise information andshopping reference. In Europe and America, the site’s similar function has beenrunning very successfully, but such large push shopping site in our country has noprecedent. In factual perspective, such site meets most consumers’ needs and has verygood prospects.At first there was no crawler in search engine, with the development of theInternet, more and more pages and information generated. The Web Crawler canautomatically obtain information on the website so it displays advantages. The WebCrawler is the program that one can use search-engines search information thoughkeyword on the Internet. In a catalog or index database, search engines searchspecified fields (author, title, subject headings, etc.) of each record in the database.The search engines can be broadly divided into two parts: gather information,organize information. The inquiry and the main role of the Web crawler is the firstpart. Begin with a few initial pages the crawler crawl information until the URL queueis empty or satisfies the closing conditions. Web crawler can also be used as a websitelink checker tool. And it has unique advantages in linking activity check. The crawler in this paper is designed for searching favorable goods. Favorablegoods search release system is used to bring convenience to users who shoppingonline. Users can use product information that the browse system recommended andbe easy to get preferential commodity information, save search time, the system canalso increase the shopping website’s sales. The main crawler design’s purpose is toprovide information for classification system. This paper takes electronic commodityclassification for example. Through analyzing AMAZON API, we can get XMLdocuments, obtain the tree structure after analyze XML file, and use crawlerknowledge to get commodity classification seeds and obtain corresponding list ofcommodity information list.In general this paper includes: access commodity classification seeds list basedon crawler, favorable goods search release system framework’s design andimplementation. This paper mainly introduces the process that call for AMAZON APIto get XML documents, extract the key word then get commodity information and theoverall design of system architecture and the function of the system.At last, this paper designs based on Amazon web specific electronic productscrawler and applied it to the favorable goods recommend system which based on thissystem to realize the function of each module, but considering the system’s security,stability and operational aspects remains to be strengthened, and the method extractkey word can also be further study, the interface art design also has the deficiency, inthe future work will be improved step by step.
Keywords/Search Tags:E-Commerce, Amazon, Web Crawler, S2SH Framework
PDF Full Text Request
Related items