Design and Implementation of Metadata De-Duplication Controller Systems Based on Server Storage

서버 이중화는 일반적으로 대형 서버의 신뢰성에 의존하는 IT 환경에서 볼 수 있다. 서버의 경우, 애플리케이션이 병렬 하드웨어의 양을 정당화하기 위해 사용할 수 있는 병렬화가 고려되어야 한다. 4차 산업혁명의 도래와 함께, 디지털 데이터는 사물인터넷의 확산으로 인해 포화 상태에 이르렀다. 데이터양이 증가함에 따라 서버를 기반으로 확장되는 플랫폼에서 정형 및 비정형 데이터를 관리하는 퍼블릭 클라우드 플랫폼이 성장하고 있다. 객체 스토리지를 기반으로 하는 플랫폼은 디렉터리 트리 구조가 아닌 개별 객체를 기반으로 한 메타데이터 확장으로 인해 입출력 오버헤드가 발생한다. 본 논문에서는 업데이트 빈도에 따라 삭제된 서버 간의 메타데이터 중복 제거 컨트롤러 시스템을 제안한다. 제안된 시스템은 리눅스 시스템 파일의 메타데이터를 확장하여 사용자 파일의 중복 파일을 고정 길이의 데이터 청크로 변환한다. 업데이트 빈도가 낮은 청크만 삭제해 성능을 유지하는 향상된 용량 효율성을 실현한다. 또한 시스템의 파일 메타데이터는 서버 간 확장 및 복제를 수행할 수 있도록 유연해야 한다. 제안된 시스템은 커널 드라이버에서 사용자 데이터를 처리하고 액세스하는 프로세스와 중복 제거 메타데이터를 관리하는 프로세스로 구성하였다. 실험에 따르면 제안된 방법은 기존 객체 스토리지 기반 시스템보다 성능 및 중복제거율이 12.3% 더 높아짐을 확인하였다.

Server redundancy is typically seen in IT environments that rely on large-scale server reliability. For servers, parallelization that applications can use to justify the amount of parallel hardware should be considered. Independent computers, standard local area networks, and computer clusters formed by switches may be considered. With the advent of the Fourth Industrial Revolution, digital data has reached saturation due to the spread of the Internet of Things. As the amount of data increases, public cloud platforms that manage structured and unstructured data are growing on platforms that expand based on servers. Platforms based on object storage incur I/O overhead due to metadata extensions based on individual objects rather than directory tree structures. In this paper, we propose a metadata deduplication controller system between servers that are deleted according to the metadata update frequency. The proposed system converts duplicate files of user files into fixed-length data chunks by extending the metadata of Linux system files.It realizes improved capacity efficiency that maintains performance by deleting only chunks with low update frequency. Additionally, the system's file metadata is flexible for scaling and replication between servers. The proposed system consists of the process of processing and accessing user data in the kernel driver and the process of managing deduplication metadata. Experiments have shown that the proposed method has 12.3% higher performance and deduplication rates than traditional object storage-based systems.

인용현황

서버 스토리지 기반의 메타데이터 중복 제거 컨트롤러 시스템 설계 및 구현 Design and Implementation of Metadata De-Duplication Controller Systems Based on Server Storage

초록 열기/닫기 버튼

키워드열기/닫기 버튼

피인용 횟수

인용현황

KCI에서 이 논문을 인용한 논문의 수는 1건입니다. 열기/닫기 버튼

참고문헌(15) 열기/닫기 버튼 * 2023년 이후 발행 논문의 참고문헌은 현재 구축 중입니다.