Research And Implementation Of Federated Retrieval Platform Based On Service Oriented Architecture

Posted on:2015-01-19

Degree:Master

Type:Thesis

Country:China

Candidate:H C Li

Full Text:PDF

GTID:2298330422490887

Subject:Computer Science and Technology

Abstract/Summary:

In recent years, the rapid development and popularization of computertechnology has changed the enterprise information management model,Multi-sectoral and trans-regional enterprises are constantly emerging. Thetraditional centralized data management model and peer-to-peer data interactionmodel not only canâ€™t meet the needs of enterprise information management andsharing, but also its business architecture canâ€™t meet the needs of the dynamicbusiness expansion. In addition, the isolation between departments often make usershardly have access to information of all departments efficiently and timely.Getting resources and information of multiple distributed heterogeneousdatabase at one-time has become the purpose of this study. Based on this demand,this article builds a federated information retrieval platform oriented on servicearchitecture in the VMware enterprise private cloud environment. In the realizationof the original information management and sharing, using VMware Iaas cloudcomputing service also improve the utilization rate of software and hardware, datesecurity and the quality of service.First of all, this paper introduces the concept of SOA and uses the distributedWeb service technology to implements the flexible and loosely coupled SOAarchitecture, meeting the needs of the dynamic expansion of enterprise business.And by introducing the concept of metadata, we design the uniform metadatastandards for distributed heterogeneous and unstructured data to facilitate thecentralized management of data resources in a single resource center. Thestandardized design of metadata makes the distributed Web search services have thesame interface specification. For the federated search results of multiple data centers,after researching of various sorting algorithms and testing its recall ratio and sortingefficiency with the TREC testing set, we design the adaptive synthesis sortingalgorithm which is suitable for the federated information retrieval platform. Inaddition, combined with that the resource utilization ratio of server and the virtualmachines on it can be monitored in real time, we design the feature of loadbalancing based on VMware cloud platform in order to improve running stability ofthe platform under high load. Finally, we design and implement the feature ofsemantic conflict mediation to improve the recall rate in information retrieval, at thesame time by using the abstract data management model of dataspaces and buildingthe relationship between objects, we make the users have a more comprehensiveunderstanding of the relevant information through correlation search. At last, inorder to identify bottlenecks and deficiencies of the platform, this paper constructs the LoadRunner cluster to test the performance of the platform from a plurality ofangles and analyses the testing results.

Keywords/Search Tags:

Federated Search, Cloud Platform, Service Oriented Architecture, Merging Results

Related items

1	The Research And Implementation Of The Key Technologies On Federated Search Systems
2	Research And Implementation Of Distributed Search Diversity Based On Vertical
3	Research Of Microblog Retrieval Based On The Thought Of Federated Search
4	Research On Personalized Meta Search Results Merging In Information Retrieval
5	Research And Application Of Microservice Failure Avoidance Technology In Federated Cloud Platform
6	Merging multiple search results approach for meta-search engines
7	Research And Implementation Of Result Merging In Distributed Search
8	Research On Service Composition Technology Based On CPN And SOA And Its Application In Supercomputing Simulation Cloud Platform
9	Service Oriented Architecture for Mobile Cloud Computing
10	Design And Implementation Of Aisino Information Service Cloud Platform