Search CORE

497 research outputs found

Optimal Locally Repairable Codes and Connections to Matroid Theory

Author: Dimakis Alexandros G.
Papailiopoulos Dimitris S.
Tamo Itzhak
Publication venue
Publication date: 06/11/2013
Field of study

Petabyte-scale distributed storage systems are currently transitioning to erasure codes to achieve higher storage efficiency. Classical codes like Reed-Solomon are highly sub-optimal for distributed environments due to their high overhead in single-failure events. Locally Repairable Codes (LRCs) form a new family of codes that are repair efficient. In particular, LRCs minimize the number of nodes participating in single node repairs during which they generate small network traffic. Two large-scale distributed storage systems have already implemented different types of LRCs: Windows Azure Storage and the Hadoop Distributed File System RAID used by Facebook. The fundamental bounds for LRCs, namely the best possible distance for a given code locality, were recently discovered, but few explicit constructions exist. In this work, we present an explicit and optimal LRCs that are simple to construct. Our construction is based on grouping Reed-Solomon (RS) coded symbols to obtain RS coded symbols over a larger finite field. We then partition these RS symbols in small groups, and re-encode them using a simple local code that offers low repair locality. For the analysis of the optimality of the code, we derive a new result on the matroid represented by the code generator matrix.Comment: Submitted for publication, a shorter version was presented at ISIT 201

arXiv.org e-Print Archive

CiteSeerX

On Error Decoding of Locally Repairable and Partial MDS Codes

Author: Holzbaur Lukas
Puchinger Sven
Wachter-Zeh Antonia
Publication venue
Publication date: 01/01/2019
Field of study

We consider error decoding of locally repairable codes (LRC) and partial MDS (PMDS) codes through interleaved decoding. For a specific class of LRCs we investigate the success probability of interleaved decoding. For PMDS codes we show that there is a wide range of parameters for which interleaved decoding can increase their decoding radius beyond the minimum distance with the probability of successful decoding approaching

1

, when the code length goes to infinity

arXiv.org e-Print Archive

Crossref

Online Research Database In Technology

A family of optimal locally recoverable codes

Author: Barg Alexander
Tamo Itzhak
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/07/2014
Field of study

A code over a finite alphabet is called locally recoverable (LRC) if every symbol in the encoding is a function of a small number (at most

r

) other symbols. We present a family of LRC codes that attain the maximum possible value of the distance for a given locality parameter and code cardinality. The codewords are obtained as evaluations of specially constructed polynomials over a finite field, and reduce to a Reed-Solomon code if the locality parameter

r

is set to be equal to the code dimension. The size of the code alphabet for most parameters is only slightly greater than the code length. The recovery procedure is performed by polynomial interpolation over

r

points. We also construct codes with several disjoint recovering sets for every symbol. This construction enables the system to conduct several independent and simultaneous recovery processes of a specific symbol by accessing different parts of the codeword. This property enables high availability of frequently accessed data ("hot data").Comment: Minor changes. This is the final published version of the pape

arXiv.org e-Print Archive

Crossref

A Class of MSR Codes for Clustered Distributed Storage

Author: Choi Beongjun
Moon Jaekyun
Sohn Jy-yong
Publication venue
Publication date: 06/01/2018
Field of study

Clustered distributed storage models real data centers where intra- and cross-cluster repair bandwidths are different. In this paper, exact-repair minimum-storage-regenerating (MSR) codes achieving capacity of clustered distributed storage are designed. Focus is given on two cases:

\epsilon=0

and

\epsilon=1/(n-k)

, where

\epsilon

is the ratio of the available cross- and intra-cluster repair bandwidths,

n

is the total number of distributed nodes and

k

is the number of contact nodes in data retrieval. The former represents the scenario where cross-cluster communication is not allowed, while the latter corresponds to the case of minimum cross-cluster bandwidth that is possible under the minimum storage overhead constraint. For the

\epsilon=0

case, two types of locally repairable codes are proven to achieve the MSR point. As for

\epsilon=1/(n-k)

, an explicit MSR coding scheme is suggested for the two-cluster situation under the specific condition of

n = 2k

.Comment: 9 pages, a part of this paper is submitted to IEEE ISIT201

arXiv.org e-Print Archive

Crossref