261 research outputs found
Word Blending and Other Formal Models of Bio-operations
As part of ongoing efforts to view biological processes as computations, several formal models of DNA-based processes have been proposed and studied in the formal language literature. In this thesis, we survey some classical formal language word and language operations, as well as several bio-operations, and we propose a new operation inspired by a DNA recombination lab protocol known as Cross-pairing Polymerase Chain Reaction, or XPCR. More precisely, we define and study a word operation called word blending which models a special case of XPCR, where two words x w p and q w y sharing a non-empty overlap part w generate the word x w y. Properties of word blending that we study include closure properties of the Chomsky families of languages under this operation and its iterated version, existence of solution to equations involving this operation, and its state complexity
Computation and programmability at the nano-bio interface
PhD ThesisThe manipulation of physical reality on the molecular level and construction of devices
operating on the nanoscale has been the focal point of nanotechnology. In particular,
nanotechnology based on DNA and RNA has a potential to nd applications in the
eld of Synthetic Biology thanks to the inherent compatibility of nucleic acids with
biological systems. Sca olded DNA origami, proposed by P. Rothemund, is one of
the leading and most successful methods in which nanostructures are realised through
rational programming of short 'staple' oligomers which fold a long single-stranded
DNA called the 'sca old' strand into a variety of desired shapes. DNA origami already
has many applications; including intelligent drug delivery, miniaturisation of logic
circuits and computation in vivo. However, one of the factors that are limiting the
complexity, applicability and scalability of this approach is the source of the sca old
which commonly originates from viruses or phages. Furthermore, developing a robust
and orthogonal interface between DNA nanotechnology and biological parts remains
a signi cant challenge.
The rst part of this thesis tackles these issues by challenging the fundamental as-
sumption in the eld, namely that a viral sequence is to be used as the DNA origami
sca old. A method is introduced for de novo generation of long synthetic sequences
based on De Bruijn sequence, which has been previously proposed in combinatorics.
The thesis presents a collection of algorithms which allow the construction of custom-
made sequences that are uniquely addressable and biologically orthogonal (i.e. they
do not code for any known biological function). Synthetic sca olds generated by these
algorithms are computationally analysed and compared with their natural counter-
parts with respect to: repetition in sequence, secondary structure and thermodynamic
addressability. This also aids the design of wet lab experiments pursuing justi cation
and veri cation of this novel approach by empirical evidence.
The second part of this thesis discusses the possibility of applying evolutionary op-
timisation to synthetic DNA sequences under constraints dictated by the biological
interface. A multi-strand system is introduced based on an alternative approach to
DNA self-assembly, which relies on strand-displacement cascades, for molecular data
storage. The thesis demonstrates how a genetic algorithm can be used to generate
viable solutions to this sequence optimisation problem which favours the target self-
assembly con guration. Additionally, the kinetics of strand-displacement reactions
are analysed with existing coarse-grained DNA models (oxDNA).
This thesis is motivated by the application of scienti c computing to problems which
lie on the boundary of Computer Science and the elds of DNA Nanotechnology, DNA
Computing and Synthetic Biology, and thus I endeavour to the best of my ability to
establish this work within the context of these disciplines
A primer on molecular biology
Modern molecular biology provides a rich source of challenging machine learning problems. This tutorial chapter aims to provide the necessary biological background knowledge required to communicate with biologists and to understand and properly formalize a number of most interesting problems in this application domain. The largest part of the chapter (its first section) is devoted to the cell as the basic unit of life. Four aspects of cells are reviewed in sequence: (1) the molecules that cells make use of (above all, proteins, RNA, and DNA); (2) the spatial organization of cells (``compartmentalization''); (3) the way cells produce proteins (``protein expression''); and (4) cellular communication and evolution (of cells and organisms). In the second section, an overview is provided of the most frequent measurement technologies, data types, and data sources. Finally, important open problems in the analysis of these data (bioinformatics challenges) are briefly outlined
Toward precision medicine with nanopore technology
Currently, when patients are diagnosed with cancer, they often receive a treatment based on the type and stage of the tumor. However, different patients may respond to the same treatment differently, due to the variation in their genomic alteration profile. Thus, it is essential to understand the effect of genomic alterations on cancer drug efficiency and engineer devices to monitor these changes for therapeutic response prediction. Nanopore-based detection technology features devices containing a nanometer-scale pore embedded in a thin membrane that can be utilized for DNA sequencing, biosensing, and detection of biological or chemical modifications on single molecules. Overall, this project aims to evaluate the capability of the biological nanopore, alpha-hemolysin, as a biosensor for genetic and epigenetic biomarkers of cancer. Specifically, we utilized the nanopore to (1) study the effect of point mutations on C-kit1 G-quadruplex formation and its response to CX-5461 cancer drug; (2) evaluate the nanopore\u27s ability to detect cytosine methylation in label-dependent and label-independent manners; and (3) detect circulating-tumor DNA collected from lung cancer patients\u27 plasma for disease detection and treatment response monitoring. Compared to conventional techniques, nanopore assays offer increased flexibility and much shorter processing time
A systematic and extensible approach to DNA primer design for whole gene synthesis
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005.Includes bibliographical references (leaves 93-96).The future of synthetic biology research hinges upon the development of accurate and inexpensive whole gene synthesis technologies. Recent advances in the purification of solid-phase manufactured oligonucleotides make it possible to manufacture whole genes by polymerase chain reaction methods. Yet, despite the improvement in laboratory methods, whole gene synthesis is not rapidly progressing because most gene design software takes an excessively naive approach to the complex problem of designing component oligonucleotides for whole gene synthesis. The synthetic biology community needs a flexible, robust and optimal primer design tool. We present the software design for a tool which designs oligonucleotides that are compatible with a wide variety of oligo purification and whole gene assembly protocols. Our design strategy uses physical sequence feature identification, optimal artificial intelligence search techniques, and sequence optimisation via intelligent codon substitution to produce near-optimal oligonucleotide arrays. We address all aspects of the oligonucleotide design problem, from physical constraints to the computational overhead involved in searching for an optimal solution, and provide an extensive set of data structures and algorithms.by Amanda Victrix Allen Wozniak.M.Eng
- …