Search CORE

52,666 research outputs found

Mutually Uncorrelated Primers for DNA-Based Data Storage

Author: Gabrys Ryan
Kiah Han Mao
Milenkovic Olgica
Yazdi S. M. Hossein Tabatabaei
Publication venue
Publication date: 13/09/2017
Field of study

We introduce the notion of weakly mutually uncorrelated (WMU) sequences, motivated by applications in DNA-based data storage systems and for synchronization of communication devices. WMU sequences are characterized by the property that no sufficiently long suffix of one sequence is the prefix of the same or another sequence. WMU sequences used for primer design in DNA-based data storage systems are also required to be at large mutual Hamming distance from each other, have balanced compositions of symbols, and avoid primer-dimer byproducts. We derive bounds on the size of WMU and various constrained WMU codes and present a number of constructions for balanced, error-correcting, primer-dimer free WMU codes using Dyck paths, prefix-synchronized and cyclic codes.Comment: 14 pages, 3 figures, 1 Table. arXiv admin note: text overlap with arXiv:1601.0817

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Correlation between nucleotide composition and folding energy of coding sequences with special attention to wobble bases

Author: AA Komar
AA Komar
C Kimchi-Sarfaty
C Soares
C Workman
DE Draper
DJ Patterson
EG Shaper
EG Shpaer
F Pagani
GD Stormo
IM Meyer
J Duan
J Konecny
Jan C Biro
JC Biro
JC Biro
JC Biro
JC Biro
JC Biro
JD Lieb
JR Powell
JV Chamary
K Mita
K Robison
KB Nielsen
L Cartegni
L Katz
M Jia
M Kellis
M Oresic
M Oresic
M Zama
M Zuker
M Zuker
MD Ermolaeva
NR Markham
P Cortazzo
P Mukhopadhyay
S Itzkovitz
SA Shabalina
W Gu
W Gu
W Seffens
W Seffens
WC Winkler
ZE Sauna
Publication venue
Publication date: 01/07/2008
Field of study

Background: The secondary structure and complexity of mRNA influences its accessibility to regulatory molecules (proteins, micro-RNAs), its stability and its level of expression. The mobile elements of the RNA sequence, the wobble bases, are expected to regulate the formation of structures encompassing coding sequences. Results: The sequence/folding energy (FE) relationship was studied by statistical, bioinformatic methods in 90 CDS containing 26,370 codons. I found that the FE (dG) associated with coding sequences is significant and negative (407 kcal/1000 bases, mean +/- S.E.M.) indicating that these sequences are able to form structures. However, the FE has only a small free component, less than 10% of the total. The contribution of the 1st and 3rd codon bases to the FE is larger than the contribution of the 2nd (central) bases. It is possible to achieve a ~ 4-fold change in FE by altering the wobble bases in synonymous codons. The sequence/FE relationship can be described with a simple algorithm, and the total FE can be predicted solely from the sequence composition of the nucleic acid. The contributions of different synonymous codons to the FE are additive and one codon cannot replace another. The accumulated contributions of synonymous codons of an amino acid to the total folding energy of an mRNA is strongly correlated to the relative amount of that amino acid in the translated protein. Conclusion: Synonymous codons are not interchangable with regard to their role in determining the mRNA FE and the relative amounts of amino acids in the translated protein, even if they are indistinguishable in respect of amino acid coding.Comment: 14 pages including 6 figures and 1 tabl

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

Temperature dependence of normal mode reconstructions of protein dynamics

Author: Cecconi Fabio
Piazza Francesco
Rios Paolo De Los
Publication venue: 'American Physical Society (APS)'
Publication date: 27/10/2008
Field of study

Normal mode analysis is a widely used technique for reconstructing conformational changes of proteins from the knowledge of native structures. In this Letter, we investigate to what extent normal modes capture the salient features of the dynamics over a range of temperatures from close to T = 0 to above unfolding. We show that on the one hand, the use of normal modes at physiological temperatures is justified provided proteins are cooperative. On the other hand, it is imperative to consider several modes in order to eliminate the unpredictable temperature dependence of single- mode contributions to the protein fluctuations

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne

Crossref

Molecular access to multi-dimensionally encoded information

Author: Aksakal Resat
Du Prez Filip
Steinkönig Jan
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Polymer scientist have only recently realized that information storage on the molecular level is not only restricted to DNA-based systems. Similar encoding and decoding of data have been demonstrated on synthetic polymers that could overcome some of the drawbacks associated with DNA, such as the ability to make use of a larger monomer alphabet. This feature article describes some of the recent data storage strategies that were investigated, ranging from writing information on linear sequence-defined macromolecules up to layer-by-layer casted surfaces and QR codes. In addition, some strategies to increase storage density are elaborated and some trends regarding future perspectives on molecular data storage from the literature are critically evaluated. This work ends with highlighting the demand for new strategies setting up reliable solutions for future data management technologies

Ghent University Academic Bibliography