The Research On Scalable Metadata Service Of Parallel File System For High Performance Computing Systems

Posted on:2014-03-27

Degree:Master

Type:Thesis

Country:China

Candidate:Q Chen

Full Text:PDF

GTID:2268330422463463

Subject:Computer system architecture

Abstract/Summary:

PDF Full Text Request

With the increasing of the computational abilities of supercomputers, problem sizeand complexity targeted by applications has scaled, which required higher performance ofI/O subsystems. While the throughput of single metadata server limited the performanceof the parallel file system in high concurrent access and high-frequency filecreating/deleting scenarios. A high scalable metadata service based on metadata delegation,applied to I/O forwarding architecture, is proposed. Depending on job scheduling system,it distributes the requests for metadata from the file system to multiple metadatadelegations to accelerate metadata operations.Parallel I/O is a typical scenario in high performance computing. It can becharacterized to two categories file per process mode and shared file mode. The formerrenders performance on metadata, and we mainly focus on this I/O scenario. The keycontribution of this dissertation is implementing Meta-Data Delegation Service(MDDS) inLustre file system and proposing a scalable distributed metadata management scheme.MDDS is based on Lustre Cluster MetaData(CMD) design. We use loose coupling to keepthe high availability of the cluster, organize the MDDS namespace by directory subtree toavoid the complex and inefficiency of distributed atomic operations introduced bycross-node operations, and developed a metadata migration mechanism to avoid objectsdata moving between data servers. Also, the system is allowed to add and remove aMDDS server dynamically. Two job-scheduling strategies have been proposed---one jobscheduled on single MDDS and jobs sharing multiple MDDSes. The former is suitable fortraditional job’s I/O access patterns, and can avoid competition of metadata operationsamong jobs; while the latter is able to distribute the metadata’s operations to multipleMDDSes to achieve load balancing of the requests for metadata inside a job.We analyzed the performance of MDDS on116storage servers, and simulated theapplications’ metadata access load in I/O forward architecture to evaluate the performanceof the two schedulers. The initial experimental results show that quasi-linear scalable metadata performance is achieved by MDDS, and even show better scalability than LustreCMD in large-scale cluster. The two job-scheduling strategies distribute the applications’metadata access load effectively, and overcome performance bottlenecks in accessing filemetadata in HPC.

Keywords/Search Tags:

Parallel file system, scalable metadata service, Metadata delegation, loadbalance, high performance computing

PDF Full Text Request

Related items

1	Metadata Management For Parallel File Systems
2	Research On Key Issues Of Scalable Distributed File System Metadata Service In Large-scale Networked Storage Systems
3	Parallel Computation For File Metadata Cube In Cloud Computing Environments
4	The Design And Implementation Of High Performance Metadata Service In Distributed File System
5	Client-oriented Highly Available And Scalable Metadata Service
6	Research On Metadata Management Of Parallel File System
7	Research On Metadata Management Of Parallel File System Build On Shared Object-based Storage Device
8	Metadata Management Optimization In Distributed File Systems
9	Research On Performance Improvementmethod Of File System Metadata Service Based On Shared Log
10	Research On Memory Safe Embedded Processor Architecture Based On Metadata Parallel Processing