Sequence verification of synthetic DNA by assembly of sequencing reads



This is the publisher’s final PDF. The published article is copyrighted by Oxford University Press and can be found at:

Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality-control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine whether a clone matches its reference sequence. Its reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnoses. GenoREAD is available at
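The second step described in the abstract, comparing an assembled contig with its reference, can be illustrated with a minimal sketch. This is not GenoREAD's implementation: the function name `compare_to_reference` is hypothetical, and Python's standard `difflib.SequenceMatcher` stands in for a proper sequence aligner. The sketch assumes the contig is already assembled, is in the same orientation as the reference, and is linear.

```python
from difflib import SequenceMatcher


def compare_to_reference(contig: str, reference: str):
    """Hypothetical helper: report whether a contig matches its reference.

    Returns (matches, discrepancies), where each discrepancy is a tuple
    (operation, reference_position, reference_bases, contig_bases).
    """
    contig, reference = contig.upper(), reference.upper()
    # autojunk=False disables difflib's "popular element" heuristic, which
    # would otherwise discard bases on sequences of 200 or more characters.
    matcher = SequenceMatcher(None, reference, contig, autojunk=False)
    discrepancies = [
        (tag, i1, reference[i1:i2], contig[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
    return len(discrepancies) == 0, discrepancies


if __name__ == "__main__":
    reference = "ATGGCCAAGTTTGA"
    contig = "ATGGCTAAGTTTGA"  # one substitution at reference position 5
    ok, diffs = compare_to_reference(contig, reference)
    print("match" if ok else "mismatch", diffs)
```

A production workflow would also need to handle reverse-complemented reads, base-call quality scores, and circular sequences such as plasmids, none of which this sketch attempts.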

Topics: Gene synthesis, Genome, Biology, Bioinformatics, Retrieval, Alignment
Publisher: Oxford University Press
Year: 2012
DOI identifier: 10.1093/nar
Provided by: ScholarsArchive@OSU

