52,666 research outputs found
Mutually Uncorrelated Primers for DNA-Based Data Storage
We introduce the notion of weakly mutually uncorrelated (WMU) sequences,
motivated by applications in DNA-based data storage systems and for
synchronization of communication devices. WMU sequences are characterized by
the property that no sufficiently long suffix of one sequence is the prefix of
the same or another sequence. WMU sequences used for primer design in DNA-based
data storage systems are also required to be at large mutual Hamming distance
from each other, have balanced compositions of symbols, and avoid primer-dimer
byproducts. We derive bounds on the size of WMU and various constrained WMU
codes and present a number of constructions for balanced, error-correcting,
primer-dimer free WMU codes using Dyck paths, prefix-synchronized and cyclic
codes.Comment: 14 pages, 3 figures, 1 Table. arXiv admin note: text overlap with
arXiv:1601.0817
Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases
Background: The secondary structure and complexity of mRNA influences its
accessibility to regulatory molecules (proteins, micro-RNAs), its stability and
its level of expression. The mobile elements of the RNA sequence, the wobble
bases, are expected to regulate the formation of structures encompassing coding
sequences.
Results: The sequence/folding energy (FE) relationship was studied by
statistical, bioinformatic methods in 90 CDS containing 26,370 codons. I found
that the FE (dG) associated with coding sequences is significant and negative
(407 kcal/1000 bases, mean +/- S.E.M.) indicating that these sequences are able
to form structures. However, the FE has only a small free component, less than
10% of the total. The contribution of the 1st and 3rd codon bases to the FE is
larger than the contribution of the 2nd (central) bases. It is possible to
achieve a ~ 4-fold change in FE by altering the wobble bases in synonymous
codons. The sequence/FE relationship can be described with a simple algorithm,
and the total FE can be predicted solely from the sequence composition of the
nucleic acid. The contributions of different synonymous codons to the FE are
additive and one codon cannot replace another. The accumulated contributions of
synonymous codons of an amino acid to the total folding energy of an mRNA is
strongly correlated to the relative amount of that amino acid in the translated
protein.
Conclusion: Synonymous codons are not interchangable with regard to their
role in determining the mRNA FE and the relative amounts of amino acids in the
translated protein, even if they are indistinguishable in respect of amino acid
coding.Comment: 14 pages including 6 figures and 1 tabl
Temperature dependence of normal mode reconstructions of protein dynamics
Normal mode analysis is a widely used technique for reconstructing
conformational changes of proteins from the knowledge of native structures. In
this Letter, we investigate to what extent normal modes capture the salient
features of the dynamics over a range of temperatures from close to T = 0 to
above unfolding. We show that on the one hand, the use of normal modes at
physiological temperatures is justified provided proteins are cooperative. On
the other hand, it is imperative to consider several modes in order to
eliminate the unpredictable temperature dependence of single- mode
contributions to the protein fluctuations
Molecular access to multi-dimensionally encoded information
Polymer scientist have only recently realized that information storage on the molecular level is not only restricted to DNA-based systems. Similar encoding and decoding of data have been demonstrated on synthetic polymers that could overcome some of the drawbacks associated with DNA, such as the ability to make use of a larger monomer alphabet. This feature article describes some of the recent data storage strategies that were investigated, ranging from writing information on linear sequence-defined macromolecules up to layer-by-layer casted surfaces and QR codes. In addition, some strategies to increase storage density are elaborated and some trends regarding future perspectives on molecular data storage from the literature are critically evaluated. This work ends with highlighting the demand for new strategies setting up reliable solutions for future data management technologies
- …