Search CORE

3,514 research outputs found

Capacity Upper Bounds for Deletion-Type Channels

Author: Belazzougui D.
Brakensiek J.
Capacity
Coding
Diggavi S.
Dobrushin R. L.
Kalai A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 11/06/2018
Field of study

We develop a systematic approach, based on convex programming and real analysis, for obtaining upper bounds on the capacity of the binary deletion channel and, more generally, channels with i.i.d. insertions and deletions. Other than the classical deletion channel, we give a special attention to the Poisson-repeat channel introduced by Mitzenmacher and Drinea (IEEE Transactions on Information Theory, 2006). Our framework can be applied to obtain capacity upper bounds for any repetition distribution (the deletion and Poisson-repeat channels corresponding to the special cases of Bernoulli and Poisson distributions). Our techniques essentially reduce the task of proving capacity upper bounds to maximizing a univariate, real-valued, and often concave function over a bounded interval. We show the following: 1. The capacity of the binary deletion channel with deletion probability

d

is at most

(1-d)\log\varphi

for

d\geq 1/2

, and, assuming the capacity function is convex, is at most

1-d\log(4/\varphi)

for

d<1/2

, where

\varphi=(1+\sqrt{5})/2

is the golden ratio. This is the first nontrivial capacity upper bound for any value of

d

outside the limiting case

d\to 0

that is fully explicit and proved without computer assistance. 2. We derive the first set of capacity upper bounds for the Poisson-repeat channel. 3. We derive several novel upper bounds on the capacity of the deletion channel. All upper bounds are maximums of efficiently computable, and concave, univariate real functions over a bounded domain. In turn, we upper bound these functions in terms of explicit elementary and standard special functions, whose maximums can be found even more efficiently (and sometimes, analytically, for example for

d=1/2

). Along the way, we develop several new techniques of potentially independent interest in information theory, probability, and mathematical analysis.Comment: Minor edits, In Proceedings of 50th Annual ACM SIGACT Symposium on the Theory of Computing (STOC), 201

arXiv.org e-Print Archive

Crossref

An Upper Bound on the Capacity of non-Binary Deletion Channels

Author: Duman Tolga M.
Rahmati Mojtaba
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

We derive an upper bound on the capacity of non-binary deletion channels. Although binary deletion channels have received significant attention over the years, and many upper and lower bounds on their capacity have been derived, such studies for the non-binary case are largely missing. The state of the art is the following: as a trivial upper bound, capacity of an erasure channel with the same input alphabet as the deletion channel can be used, and as a lower bound the results by Diggavi and Grossglauser are available. In this paper, we derive the first non-trivial non-binary deletion channel capacity upper bound and reduce the gap with the existing achievable rates. To derive the results we first prove an inequality between the capacity of a 2K-ary deletion channel with deletion probability

d

, denoted by

C_{2K}(d)

, and the capacity of the binary deletion channel with the same deletion probability,

C_2(d)

, that is,

C_{2K}(d)\leq C_2(d)+(1-d)\log(K)

. Then by employing some existing upper bounds on the capacity of the binary deletion channel, we obtain upper bounds on the capacity of the 2K-ary deletion channel. We illustrate via examples the use of the new bounds and discuss their asymptotic behavior as

d \rightarrow 0

.Comment: accepted for presentation in ISIT 201

arXiv.org e-Print Archive

Bilkent University Institutional Repository

A Note on the Deletion Channel Capacity

Author: Duman Tolga M.
Rahmati Mojtaba
Publication venue
Publication date: 11/11/2012
Field of study

Memoryless channels with deletion errors as defined by a stochastic channel matrix allowing for bit drop outs are considered in which transmitted bits are either independently deleted with probability

d

or unchanged with probability

1-d

. Such channels are information stable, hence their Shannon capacity exists. However, computation of the channel capacity is formidable, and only some upper and lower bounds on the capacity exist. In this paper, we first show a simple result that the parallel concatenation of two different independent deletion channels with deletion probabilities

d_1

and

d_2

, in which every input bit is either transmitted over the first channel with probability of

\lambda

or over the second one with probability of

1-\lambda

, is nothing but another deletion channel with deletion probability of

d=\lambda d_1+(1-\lambda)d_2

. We then provide an upper bound on the concatenated deletion channel capacity

C(d)

in terms of the weighted average of

C(d_1)

C(d_2)

and the parameters of the three channels. An interesting consequence of this bound is that

C(\lambda d_1+(1-\lambda))\leq \lambda C(d_1)

which enables us to provide an improved upper bound on the capacity of the i.i.d. deletion channels, i.e.,

C(d)\leq 0.4143(1-d)

for

d\geq 0.65

. This generalizes the asymptotic result by Dalai as it remains valid for all

d\geq 0.65

. Using the same approach we are also able to improve upon existing upper bounds on the capacity of the deletion/substitution channel.Comment: Submitted to the IEEE Transactions on Information Theor

arXiv.org e-Print Archive

CiteSeerX

Write Channel Model for Bit-Patterned Media Recording

Author: Iyengar Aravind R.
Siegel Paul H.
Wolf Jack K.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 21/10/2010
Field of study

We propose a new write channel model for bit-patterned media recording that reflects the data dependence of write synchronization errors. It is shown that this model accommodates both substitution-like errors and insertion-deletion errors whose statistics are determined by an underlying channel state process. We study information theoretic properties of the write channel model, including the capacity, symmetric information rate, Markov-1 rate and the zero-error capacity.Comment: 11 pages, 12 figures, journa

arXiv.org e-Print Archive

Crossref

Models and information-theoretic bounds for nanopore sequencing

Author: Diggavi Suhas
Kannan Sreeram
Mao Wei
Publication venue
Publication date: 17/02/2018
Field of study

Nanopore sequencing is an emerging new technology for sequencing DNA, which can read long fragments of DNA (~50,000 bases) in contrast to most current short-read sequencing technologies which can only read hundreds of bases. While nanopore sequencers can acquire long reads, the high error rates (20%-30%) pose a technical challenge. In a nanopore sequencer, a DNA is migrated through a nanopore and current variations are measured. The DNA sequence is inferred from this observed current pattern using an algorithm called a base-caller. In this paper, we propose a mathematical model for the "channel" from the input DNA sequence to the observed current, and calculate bounds on the information extraction capacity of the nanopore sequencer. This model incorporates impairments like (non-linear) inter-symbol interference, deletions, as well as random response. These information bounds have two-fold application: (1) The decoding rate with a uniform input distribution can be used to calculate the average size of the plausible list of DNA sequences given an observed current trace. This bound can be used to benchmark existing base-calling algorithms, as well as serving a performance objective to design better nanopores. (2) When the nanopore sequencer is used as a reader in a DNA storage system, the storage capacity is quantified by our bounds

arXiv.org e-Print Archive

Crossref

Fundamental Bounds and Approaches to Sequence Reconstruction from Nanopore Sequencers

Author: Duda Jarek
Grama Ananth
Szpankowski Wojciech
Publication venue
Publication date: 11/01/2016
Field of study

Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) what is the number of `typical' sequences within the distortion bound induced by indel errors; (iii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to reduce the distortion bound so that only one typical sequence exists within the distortion bound. Our results provide a number of important insights: (i) the maximum length of a sequence that can be accurately reconstructed in the presence of indel and substitution errors is relatively small; (ii) the number of typical sequences within the distortion bound is large; and (iii) replicated extrusion is an effective technique for unique reconstruction. In particular, we show that the number of replicas is a slow function (logarithmic) of sequence length -- implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Our model considers indel and substitution errors separately. In this sense, it can be viewed as providing (tight) bounds on reconstruction lengths and repetitions for accurate reconstruction when the two error modes are considered in a single model.Comment: 12 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX