
    Active Learning of Points-To Specifications

    When analyzing programs, large libraries pose significant challenges to static points-to analysis. A popular solution is to have a human analyst provide points-to specifications that summarize relevant behaviors of library code, which can substantially improve precision and handle missing code such as native code. We propose ATLAS, a tool that automatically infers points-to specifications. ATLAS synthesizes unit tests that exercise the library code, and then infers points-to specifications based on observations from these executions. ATLAS automatically infers specifications for the Java standard library, and produces better results for a client static information flow analysis on a benchmark of 46 Android apps compared to using existing handwritten specifications.

    Algorithms for automated DNA assembly

    Generating a defined set of genetic constructs within a large combinatorial space provides a powerful method for engineering novel biological functions. However, the process of assembling more than a few specific DNA sequences can be costly, time-consuming and error-prone. Even if a correct theoretical construction scheme is developed manually, it is likely to be suboptimal by any number of cost metrics. Modular, robust and formal approaches are needed for exploring these vast design spaces. By automating the design of DNA fabrication schemes using computational algorithms, we can eliminate human error while reducing redundant operations, thus minimizing the time and cost required for conducting biological engineering experiments. Here, we provide algorithms that optimize the simultaneous assembly of a collection of related DNA sequences. We compare our algorithms to an exhaustive search on a small synthetic dataset and our results show that our algorithms can quickly find an optimal solution. Comparison with random search approaches on two real-world datasets shows that our algorithms can also quickly find lower-cost solutions for large datasets.

    Rapid hierarchical assembly of medium-size DNA cassettes

    Synthetic biology applications call for efficient methods to generate large gene cassettes that encode complex gene circuits in order to avoid simultaneous delivery of multiple plasmids encoding individual genes. Multiple methods have been proposed to achieve this goal. Here, we describe a novel protocol that allows one-step cloning of up to four gene-size DNA fragments, followed by a second assembly of these concatenated sequences into large circular DNA. The protocols described here comprise a simple, cheap and fast solution for routine construction of cassettes with up to 10 gene-size components.

    ICRPfinder: a fast pattern design algorithm for coding sequences and its application in finding potential restriction enzyme recognition sites

    Background: Restriction enzymes can produce easily definable segments from DNA sequences by using a variety of cut patterns. There are, however, no software tools that can aid in gene building, that is, modifying wild-type DNA sequences to express the same wild-type amino acid sequences but with enhanced codons, specific cut sites, unique post-translational modifications, and other engineered-in components for recombinant applications. A fast DNA pattern design algorithm, ICRPfinder, is provided in this paper and applied to find or create potential recognition sites in target coding sequences. Results: ICRPfinder is applied to find or create restriction enzyme recognition sites by introducing silent mutations. The algorithm is shown to be capable of mapping existing cut sites, but importantly it can also generate specified new unique cut sites within a specified region that are guaranteed not to be present elsewhere in the DNA sequence. Conclusion: ICRPfinder is a powerful tool for finding or creating specific DNA patterns in a given target coding sequence. ICRPfinder finds or creates patterns, which can include restriction enzyme recognition sites, without changing the translated protein sequence. ICRPfinder is a browser-based JavaScript application and it can run on any platform, in on-line or off-line mode.
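The idea of creating a recognition site through silent mutations can be illustrated with a minimal sketch. This is not ICRPfinder's algorithm or code (the tool itself is described as a JavaScript application). The function names `translate` and `silent_site_positions` are hypothetical, the standard genetic code is assumed, and the sketch does not check that a new site is unique in the sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: NCBI table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; trailing partial codons are ignored."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def silent_site_positions(cds: str, site: str) -> list[int]:
    """Return 0-based positions where writing `site` into `cds` keeps the protein unchanged.

    Positions where the site already exists are included, because overwriting
    them changes nothing.
    """
    cds, site = cds.upper(), site.upper()
    protein = translate(cds)
    hits = []
    for pos in range(len(cds) - len(site) + 1):
        mutated = cds[:pos] + site + cds[pos + len(site):]
        if translate(mutated) == protein:
            hits.append(pos)
    return hits


# Hypothetical example: an EcoRI site (GAATTC) fits at position 3 of this toy
# sequence, because GAG->GAA and TTT->TTC are both synonymous changes.
print(silent_site_positions("ATGGAGTTTTAA", "GAATTC"))
```

Brute-force substitution keeps the sketch short. A real tool would also need to handle degenerate recognition sequences and confirm that the new site occurs only once.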

    TmPrime: fast, flexible oligonucleotide design software for gene synthesis

    Herein we present TmPrime, a computer program to design oligonucleotide sets for gene assembly by both ligase chain reaction (LCR) and polymerase chain reaction (PCR). TmPrime offers much flexibility with no constraints on the gene and oligonucleotide lengths. The program divides the long input DNA sequence based on the input desired melting temperature, and dynamically optimizes the length of oligonucleotides to achieve homologous melting temperatures. The output reports the melting temperatures, oligonucleotide sequences and potential formation of secondary structures. Our program also provides functions on sequence pooling to separate long genes into smaller pieces for multi-pool assembly and codon optimization for expression. The software has been successfully used in the design and synthesis of green fluorescent protein fragment (GFPuv) (760 bp), human protein kinase B-2 (PKB2) (1446 bp) and the promoter of human calcium-binding protein A4 (S100A4) (752 bp) using real-time PCR assembly with LCGreen I, which offers a novel approach to compare the efficiency of gene synthesis. The purity of assembled products is successfully estimated with the use of melting curve analysis, which would potentially eliminate the necessity for agarose gel electrophoresis. This program is freely available at http://prime.ibn.a-star.edu.sg
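The step of dividing a sequence by a desired melting temperature can be illustrated with a rough sketch. This is not TmPrime's implementation. The function names `basic_tm` and `split_by_tm` and the `min_len` parameter are hypothetical, and the GC-content formula is a common textbook approximation that stands in for whatever thermodynamic model the program actually uses.

```python
def basic_tm(seq: str) -> float:
    """Approximate melting temperature (deg C) from GC content.

    Assumption: the simple formula Tm = 64.9 + 41 * (GC - 16.4) / N,
    a rough estimate only, not a nearest-neighbor calculation.
    """
    gc = sum(base in "GC" for base in seq)
    return 64.9 + 41.0 * (gc - 16.4) / len(seq)


def split_by_tm(seq: str, target_tm: float, min_len: int = 15) -> list[str]:
    """Greedily cut `seq` into consecutive segments.

    Each segment starts at `min_len` bases and is extended one base at a time
    until its estimated Tm reaches `target_tm`. The final segment may fall
    short because the sequence runs out.
    """
    seq = seq.upper()
    segments = []
    start = 0
    while start < len(seq):
        end = min(start + min_len, len(seq))
        while end < len(seq) and basic_tm(seq[start:end]) < target_tm:
            end += 1
        segments.append(seq[start:end])
        start = end
    return segments


# Hypothetical usage: segment lengths vary so that GC-poor regions get longer pieces.
example = "ATGCGTACCGGTTAACGATCGATTTAAATTTGGCCGCGCGATATATATCGCGGCTAGCTAGG"
for piece in split_by_tm(example, target_tm=55.0):
    print(len(piece), round(basic_tm(piece), 1), piece)
```

Varying the length instead of fixing it is what evens out the melting temperatures: AT-rich stretches are extended further than GC-rich ones before a cut is made.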

    Pairwise selection assembly for sequence-independent construction of long-length DNA

    The engineering of biological components has been facilitated by de novo synthesis of gene-length DNA. Biological engineering at the level of pathways and genomes, however, requires a scalable and cost-effective assembly of DNA molecules that are longer than ∼10 kb, and this remains a challenge. Here we present the development of pairwise selection assembly (PSA), a process that involves hierarchical construction of long-length DNA through the use of a standard set of components and operations. In PSA, activation tags at the termini of assembly sub-fragments are reused throughout the assembly process to activate vector-encoded selectable markers. Marker activation enables stringent selection for a correctly assembled product in vivo, often obviating the need for clonal isolation. Importantly, construction via PSA is sequence-independent, and does not require primary sequence modification (e.g. the addition or removal of restriction sites). The utility of PSA is demonstrated in the construction of a completely synthetic 91-kb chromosome arm from Saccharomyces cerevisiae.

    Design and Synthesis of a Quintessential Self-Transmissible IncX1 Plasmid, pX1.0

    DNA exchange in bacteria via conjugative plasmids is believed to be among the most important contributing factors to the rapid evolution and diversification rates observed in bacterial species. The IncX1 plasmids are particularly interesting in relation to enteric bacteria, and typically carry genetic loads like antibiotic resistance genes and virulence factors. So far, however, a “pure” version of these molecular parasites, without genetic loads, has yet to be isolated from the environment. Here we report the construction of pX1.0, a fully synthesized IncX1 plasmid capable of horizontal transfer between different enteric bacteria. The designed pX1.0 sequence was derived from the consensus gene content of five IncX1 plasmids and three other, more divergent, members of the same phylogenetic group. The pX1.0 plasmid was shown to replicate stably in E. coli with a plasmid DNA per total DNA ratio corresponding to approximately 3–9 plasmids per chromosome depending on the growth phase of the host. Through conjugation, pX1.0 was able to self-transfer horizontally into an isogenic strain of E. coli as well as into two additional species belonging to the family Enterobacteriaceae. Our results demonstrate the immediate applicability of recent advances made within the field of synthetic biology for designing and constructing DNA systems that previously existed only in silico.

    High-quality gene assembly directly from unpurified mixtures of microarray-synthesized oligonucleotides

    To meet the growing demand for synthetic genes, more robust, scalable and inexpensive gene assembly technologies must be developed. Here, we present a protocol for high-quality gene assembly directly from low-cost marginal-quality microarray-synthesized oligonucleotides. Significantly, we eliminated the time- and money-consuming oligonucleotide purification steps through the use of hybridization-based selection embedded in the assembly process. The protocol was tested on mixtures of up to 2000 oligonucleotides eluted directly from microarrays obtained from three different chip manufacturers. These mixtures contained <5% perfect oligos and were used directly for assembly of 27 test genes of different sizes. Gene quality was assessed by sequencing, and their activity was tested in coupled in vitro transcription/translation reactions. Genes assembled from the microarray-eluted material using the new protocol matched the quality of the genes assembled from >95% pure column-synthesized oligonucleotides by the standard protocol. Both averaged only 2.7 errors/kb, and genes assembled from microarray-eluted material without clonal selection produced only 30% less protein than sequence-confirmed clones. This report represents the first demonstration of cost-efficient gene assembly from microarray-synthesized oligonucleotides. The overall cost of assembly by this method approaches 5¢ per base, making gene synthesis more affordable than traditional cloning.

    A theoretical model for template-free synthesis of long DNA sequence

    This theoretical scheme is intended to formulate a potential method for high-fidelity synthesis of nucleic acid molecules of up to a few thousand bases using an enzyme system. Terminal Deoxyribonucleotidyl Transferase, which adds a nucleotide to the 3′OH end of a nucleic acid molecule, may be used in combination with a controlled method for nucleotide addition and degradation to synthesize a predefined nucleic acid sequence. A pH control system is suggested to regulate the sequential activity switching of different enzymes in the synthetic scheme. Current practice of synthetic biology is cumbersome, expensive and often error-prone owing to the dependence on the ligation of short oligonucleotides to fabricate functional genetic parts. The projected scheme is likely to render synthetic genomics appreciably more convenient and economical by providing longer DNA molecules to start with.