261 research outputs found

    Word Blending and Other Formal Models of Bio-operations

    Get PDF
    As part of ongoing efforts to view biological processes as computations, several formal models of DNA-based processes have been proposed and studied in the formal language literature. In this thesis, we survey some classical formal language word and language operations, as well as several bio-operations, and we propose a new operation inspired by a DNA recombination lab protocol known as Cross-pairing Polymerase Chain Reaction, or XPCR. More precisely, we define and study a word operation called word blending which models a special case of XPCR, where two words x w p and q w y sharing a non-empty overlap part w generate the word x w y. Properties of word blending that we study include closure properties of the Chomsky families of languages under this operation and its iterated version, existence of solution to equations involving this operation, and its state complexity

    Computation and programmability at the nano-bio interface

    Get PDF
    PhD ThesisThe manipulation of physical reality on the molecular level and construction of devices operating on the nanoscale has been the focal point of nanotechnology. In particular, nanotechnology based on DNA and RNA has a potential to nd applications in the eld of Synthetic Biology thanks to the inherent compatibility of nucleic acids with biological systems. Sca olded DNA origami, proposed by P. Rothemund, is one of the leading and most successful methods in which nanostructures are realised through rational programming of short 'staple' oligomers which fold a long single-stranded DNA called the 'sca old' strand into a variety of desired shapes. DNA origami already has many applications; including intelligent drug delivery, miniaturisation of logic circuits and computation in vivo. However, one of the factors that are limiting the complexity, applicability and scalability of this approach is the source of the sca old which commonly originates from viruses or phages. Furthermore, developing a robust and orthogonal interface between DNA nanotechnology and biological parts remains a signi cant challenge. The rst part of this thesis tackles these issues by challenging the fundamental as- sumption in the eld, namely that a viral sequence is to be used as the DNA origami sca old. A method is introduced for de novo generation of long synthetic sequences based on De Bruijn sequence, which has been previously proposed in combinatorics. The thesis presents a collection of algorithms which allow the construction of custom- made sequences that are uniquely addressable and biologically orthogonal (i.e. they do not code for any known biological function). Synthetic sca olds generated by these algorithms are computationally analysed and compared with their natural counter- parts with respect to: repetition in sequence, secondary structure and thermodynamic addressability. This also aids the design of wet lab experiments pursuing justi cation and veri cation of this novel approach by empirical evidence. The second part of this thesis discusses the possibility of applying evolutionary op- timisation to synthetic DNA sequences under constraints dictated by the biological interface. A multi-strand system is introduced based on an alternative approach to DNA self-assembly, which relies on strand-displacement cascades, for molecular data storage. The thesis demonstrates how a genetic algorithm can be used to generate viable solutions to this sequence optimisation problem which favours the target self- assembly con guration. Additionally, the kinetics of strand-displacement reactions are analysed with existing coarse-grained DNA models (oxDNA). This thesis is motivated by the application of scienti c computing to problems which lie on the boundary of Computer Science and the elds of DNA Nanotechnology, DNA Computing and Synthetic Biology, and thus I endeavour to the best of my ability to establish this work within the context of these disciplines

    A primer on molecular biology

    No full text
    Modern molecular biology provides a rich source of challenging machine learning problems. This tutorial chapter aims to provide the necessary biological background knowledge required to communicate with biologists and to understand and properly formalize a number of most interesting problems in this application domain. The largest part of the chapter (its first section) is devoted to the cell as the basic unit of life. Four aspects of cells are reviewed in sequence: (1) the molecules that cells make use of (above all, proteins, RNA, and DNA); (2) the spatial organization of cells (``compartmentalization''); (3) the way cells produce proteins (``protein expression''); and (4) cellular communication and evolution (of cells and organisms). In the second section, an overview is provided of the most frequent measurement technologies, data types, and data sources. Finally, important open problems in the analysis of these data (bioinformatics challenges) are briefly outlined

    Toward precision medicine with nanopore technology

    Get PDF
    Currently, when patients are diagnosed with cancer, they often receive a treatment based on the type and stage of the tumor. However, different patients may respond to the same treatment differently, due to the variation in their genomic alteration profile. Thus, it is essential to understand the effect of genomic alterations on cancer drug efficiency and engineer devices to monitor these changes for therapeutic response prediction. Nanopore-based detection technology features devices containing a nanometer-scale pore embedded in a thin membrane that can be utilized for DNA sequencing, biosensing, and detection of biological or chemical modifications on single molecules. Overall, this project aims to evaluate the capability of the biological nanopore, alpha-hemolysin, as a biosensor for genetic and epigenetic biomarkers of cancer. Specifically, we utilized the nanopore to (1) study the effect of point mutations on C-kit1 G-quadruplex formation and its response to CX-5461 cancer drug; (2) evaluate the nanopore\u27s ability to detect cytosine methylation in label-dependent and label-independent manners; and (3) detect circulating-tumor DNA collected from lung cancer patients\u27 plasma for disease detection and treatment response monitoring. Compared to conventional techniques, nanopore assays offer increased flexibility and much shorter processing time

    A systematic and extensible approach to DNA primer design for whole gene synthesis

    Get PDF
    Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005.Includes bibliographical references (leaves 93-96).The future of synthetic biology research hinges upon the development of accurate and inexpensive whole gene synthesis technologies. Recent advances in the purification of solid-phase manufactured oligonucleotides make it possible to manufacture whole genes by polymerase chain reaction methods. Yet, despite the improvement in laboratory methods, whole gene synthesis is not rapidly progressing because most gene design software takes an excessively naive approach to the complex problem of designing component oligonucleotides for whole gene synthesis. The synthetic biology community needs a flexible, robust and optimal primer design tool. We present the software design for a tool which designs oligonucleotides that are compatible with a wide variety of oligo purification and whole gene assembly protocols. Our design strategy uses physical sequence feature identification, optimal artificial intelligence search techniques, and sequence optimisation via intelligent codon substitution to produce near-optimal oligonucleotide arrays. We address all aspects of the oligonucleotide design problem, from physical constraints to the computational overhead involved in searching for an optimal solution, and provide an extensive set of data structures and algorithms.by Amanda Victrix Allen Wozniak.M.Eng
    • …
    corecore