earticle

논문검색

An Efficient Replicated System for the Metadata of HDFS

초록

영어

Hadoop HDFS is an open source project from Apache Software Foundation for scalable, distributed computing and data storage. HDFS has become a critical component in today's cloud computing environment and a wide range of applications built on top of it. However, the initial design of HDFS has introduced a single-point-of-failure, since HDFS contains only one active namenode, if this namenode experiences software or hardware failures, the whole HDFS cluster is unusable, this is a reason why people are reluctant to deploy HDFS for an application whose requirement is high availability. In this paper, we present a solution to enable the high availability for HDFS's namenode through efficient metadata replication. Our solution has 3 major advantages than existing ones: We utilize multiple active namenodes, instead of one, to build a cluster to serve requests of metadata simultaneously; We implement a pub/sub system to handle the metadata replication process across these active namonodes efficiently; We also propose a novel replication algorithm to deal with the network delay when the namonodes are deployed in different areas. Based on the solution we build a prototype called NCluster and integrate it with HDFS. We evaluate NCluster to exhibit its feasibility and effectiveness. The experimental results show that our solution performs well with low replication cost, good throughput and scalability.

목차

Abstract
 1. Introduction
 2. Background and Related Work
  2.1. Metadata of HDFS
  2.2. Metadata Availability
  2.3. Replication
 3. System Design
  3.1 Architecture
  3.2 What to Replicate
  3.3. How to Replicate
  3.4 Consistency
  3.5 Replication Optimization
  3.6 Failover
 4. Implemention
 5. Evaluations
  5.1. Experimental Setup
  5.2. Replication Cost
  5.3. NCluster vs Zookeeper
  5.4 Throughput
  5.5 Scalability
  5.6 Geo-Replication
 6. Conclusions and Future Work
 References

저자정보

  • Zhanye Wang Department of Computer Science and Technology Tsinghua University Beijing, 100084 P.R. China
  • Tao Xu Department of Computer Science and Technology Tsinghua University Beijing, 100084 P.R. China
  • Dongsheng Wang Research Institute of Information Technology Tsinghua University Beijing, 100084 P.R. China

참고문헌

자료제공 : 네이버학술정보

    ※ 원문제공기관과의 협약기간이 종료되어 열람이 제한될 수 있습니다.

    0개의 논문이 장바구니에 담겼습니다.