2,038 research outputs found
Improved Lower Bounds for Constant GC-Content DNA Codes
The design of large libraries of oligonucleotides having constant GC-content
and satisfying Hamming distance constraints between oligonucleotides and their
Watson-Crick complements is important in reducing hybridization errors in DNA
computing, DNA microarray technologies, and molecular bar coding. Various
techniques have been studied for the construction of such oligonucleotide
libraries, ranging from algorithmic constructions via stochastic local search
to theoretical constructions via coding theory. We introduce a new stochastic
local search method which yields improvements up to more than one third of the
benchmark lower bounds of Gaborit and King (2005) for n-mer oligonucleotide
libraries when n <= 14. We also found several optimal libraries by computing
maximum cliques on certain graphs.Comment: 4 page
Bounds for DNA codes with constant GC-content
We derive theoretical upper and lower bounds on the maximum size of DNA codes
of length n with constant GC-content w and minimum Hamming distance d, both
with and without the additional constraint that the minimum Hamming distance
between any codeword and the reverse-complement of any codeword be at least d.
We also explicitly construct codes that are larger than the best
previously-published codes for many choices of the parameters n, d and w.Comment: 13 pages, no figures; a few references added and typos correcte
Linear constructions for DNA codes
AbstractIn this paper we translate in terms of coding theory constraints that are used in designing DNA codes for use in DNA computing or as bar-codes in chemical libraries. We propose new constructions for DNA codes satisfying either a reverse-complement constraint, a GC-content constraint, or both, that are derived from additive and linear codes over four-letter alphabets. We focus in particular on codes over GF(4), and we construct new DNA codes that are in many cases better (sometimes far better) than previously known codes. We provide updated tables up to length 20 that include these codes as well as new codes constructed using a combination of lexicographic techniques and stochastic search
On Conflict Free DNA Codes
DNA storage has emerged as an important area of research. The reliability of
DNA storage system depends on designing the DNA strings (called DNA codes) that
are sufficiently dissimilar. In this work, we introduce DNA codes that satisfy
a special constraint. Each codeword of the DNA code has a specific property
that any two consecutive sub-strings of the DNA codeword will not be the same
(a generalization of homo-polymers constraint). This is in addition to the
usual constraints such as Hamming, reverse, reverse-complement and
-content. We believe that the new constraint will help further in reducing
the errors during reading and writing data into the synthetic DNA strings. We
also present a construction (based on a variant of stochastic local search
algorithm) to calculate the size of the DNA codes with all the above
constraints, which improves the lower bounds from the existing literature, for
some specific cases. Moreover, a recursive isometric map between binary vectors
and DNA strings is proposed. Using the map and the well known binary codes we
obtain few classes of DNA codes with all the constraints including the property
that the constructed DNA codewords are free from the hairpin-like secondary
structures.Comment: 12 pages, Draft (Table VI and Table VII are updated
- …