91 research outputs found

    Whole genome sequencing of enriched chloroplast DNA using the Illumina GAII platform

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Complete chloroplast genome sequences provide a valuable source of molecular markers for studies in molecular ecology and evolution of plants. To obtain complete genome sequences, recent studies have made use of the polymerase chain reaction to amplify overlapping fragments from conserved gene loci. However, this approach is time consuming and can be more difficult to implement where gene organisation differs among plants. An alternative approach is to first isolate chloroplasts and then use the capacity of high-throughput sequencing to obtain complete genome sequences. We report our findings from studies of the latter approach, which used a simple chloroplast isolation procedure, multiply-primed rolling circle amplification of chloroplast DNA, Illumina Genome Analyzer II sequencing, and de novo assembly of paired-end sequence reads.</p> <p>Results</p> <p>A modified rapid chloroplast isolation protocol was used to obtain plant DNA that was enriched for chloroplast DNA, but nevertheless contained nuclear and mitochondrial DNA. Multiply-primed rolling circle amplification of this mixed template produced sufficient quantities of chloroplast DNA, even when the amount of starting material was small, and improved the template quality for Illumina Genome Analyzer II (hereafter Illumina GAII) sequencing. We demonstrate, using independent samples of karaka (<it>Corynocarpus laevigatus</it>), that there is high fidelity in the sequence obtained from this template. Although less than 20% of our sequenced reads could be mapped to chloroplast genome, it was relatively easy to assemble complete chloroplast genome sequences from the mixture of nuclear, mitochondrial and chloroplast reads.</p> <p>Conclusions</p> <p>We report successful whole genome sequencing of chloroplast DNA from karaka, obtained efficiently and with high fidelity.</p

    Protocol for the COG-UK hospital onset COVID-19 infection (HOCI) multicentre interventional clinical study: evaluating the efficacy of rapid genome sequencing of SARS-CoV-2 in limiting the spread of COVID-19 in United Kingdom NHS hospitals

    Get PDF
    Introduction: Nosocomial transmission of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been a significant cause of mortality in National Health Service (NHS) hospitals during the coronavirus disease 2019 (COVID-19) pandemic. The aim of this study is to evaluate the impact of rapid whole genome sequencing of SARS-CoV-2, supported by a novel probabilistic reporting methodology, to inform infection prevention and control (IPC) practice within NHS hospital settings. / Methods and analysis: COG-UK HOCI (COG-UK Consortium Hospital-Onset COVID-19 Infections study) is a multicentre, prospective, interventional, superiority study. Eligible patients must be admitted to hospital with first confirmed SARS-CoV-2 PCR positive test result >48h from time of admission, where COVID-19 diagnosis was not suspected upon admission. The projected sample size for 14 participating sites covering all study phases over winter-spring 2020/2021 in the United Kingdom is 2,380 patients. The intervention is the return of a sequence report, within 48 hours in one phase (rapid local lab) and within 5-10 days in a second phase (mimicking central lab use), comparing the viral genome from an eligible study participant with others within and outside the hospital site. The primary outcomes are the incidence of Public Health England (PHE)/IPC-defined SARS-CoV-2 hospital-acquired infection during the baseline and two interventional phases, and proportion of hospital-onset cases with genomic evidence of transmission linkage following implementation of the intervention where such linkage was not suspected by initial IPC investigation. Secondary outcomes include incidence of hospital outbreaks, with and without sequencing data; actual and desirable changes to IPC actions; periods of healthcare worker (HCW) absence. A process evaluation using qualitative interviews with HCWs will be conducted alongside the study and analysis, underpinned by iterative programme theory of the sequence report. Health economic analysis will be conducted to determine cost-benefit of the intervention, and whether this leads to economic advantages within the NHS setting. / Ethics and dissemination: The protocol has been approved by the National Research Ethics Service Committee (Cambridge South 20/EE/0118). This manuscript is based on version 5.0 of the protocol. The study findings will be disseminated through peer-reviewed publications

    The Association Between Vitamin D and Multiple Sclerosis Risk: 1,25(OH)2D3 Induces Super-Enhancers Bound by VDR

    Get PDF
    A super-enhancer (SE) is a cluster of enhancers with a relatively high density of particular chromatin features. SEs typically regulate key genes that can determine cell identity and differentiation. Identifying SEs and their effects may be critical in predicting key regulatory genes, such as master transcription factor genes or oncogenes. Signal inducible SEs are dense stretches of signal terminal transcription factor (TF) binding regions, and may modulate the interaction between environmental factors (e.g., Vitamin D) and genetic factors (i.e., risk variants) in complex diseases such as multiple sclerosis (MS). As a complex autoimmune disease, the etiology and progression of MS, including the interaction between Vitamin D and MS risk variants, is still unclear and can be explored from the aspect of signal SEs. Vitamin D [with its active form: 1,25(OH)2D3], is an environmental risk factor for MS. It binds the Vitamin D receptor (VDR) and regulates gene expression. This study explores the association between VDR super-enhancers (VSEs) and MS risk variants. Firstly, we reanalyse public ChIP-seq and RNA-seq data to classify VSEs into three categories according to their combinations of persistent and secondary VDR binding. Secondly, we indicate the genes with VSE regions that are near MS risk variants. Furthermore, we find that MS risk variants are enriched in VSE regions, and we indicate some genes with a VSE overlapping MS risk variant for further exploration. We also find two clusters of genes from the set of genes showing correlation of expression patterns with the MS risk gene ZMIZ1 that appear to be regulated by VSEs in THP-1 cells. It is the first time that VSEs have been analyzed, and we directly connect the genetic risk factors for MS risk with Vitamin D based on VSEs

    Index-Free De Novo Assembly and Deconvolution of Mixed Mitochondrial Genomes

    Get PDF
    Second-generation sequencing technology has allowed a very large increase in sequencing throughput. In order to make use of this high throughput, we have developed a pipeline for sequencing and de novo assembly of multiple mitochondrial genomes without the costs of indexing. Simulation studies on a mixture of diverse animal mitochondrial genomes showed that mitochondrial genomes could be reassembled from a high coverage of short (35 nt) reads, such as those generated by a second-generation Illumina Genome Analyzer. We then assessed this experimentally with long-range polymerase chain reaction products from mitochondria of a human, a rat, a bird, a frog, an insect, and a mollusc. Comparison with reference genomes was used for deconvolution of the assembled contigs rather than for mapping of sequence reads. As proof of concept, we report the complete mollusc mitochondrial genome of an olive shell (Amalda northlandica). It has a very unusual putative control region, which contains a structure that would probably only be detectable by next-generation sequencing. The general approach has considerable potential, especially when combined with indexed sequencing of different groups of genomes

    Association of Genetic Variation with Keratoconus

    Get PDF
    Importance: Keratoconus is a condition in which the cornea progressively thins and protrudes in a conical shape, severely affecting refraction and vision. It is a major indication for corneal transplant. To discover new genetic loci associated with keratoconus and better understand the causative mechanism of this disease, we performed a genome-wide association study on patients with keratoconus.Objective: To identify genetic susceptibility regions for keratoconus in the human genome.Design, Setting, and Participants: This study was conducted with data from eye clinics in Australia, the United States, and Northern Ireland. The discovery cohort of individuals with keratoconus and control participants from Australia was genotyped using the Illumina HumanCoreExome single-nucleotide polymorphism array. After quality control and data cleaning, genotypes were imputed against the 1000 Genomes Project reference panel (phase III; version 5), and association analyses were completed using PLINK version 1.90. Single-nucleotide polymorphisms with P -6 were assessed for replication in 3 additional cohorts. Control participants were drawn from the cohorts of the Blue Mountains Eye Study and a previous study of glaucoma. Replication cohorts were from a previous keratoconus genome-wide association study data set from the United States, a cohort of affected and control participants from Australia and Northern Ireland, and a case-control cohort from Victoria, Australia. Data were collected from January 2006 to March 2019.Main Outcomes and Measures: Associations between keratoconus and 6 252 612 genetic variants were estimated using logistic regression after adjusting for ancestry using the first 3 principal components.Results: The discovery cohort included 522 affected individuals and 655 control participants, while the replication cohorts included 818 affected individuals (222 from the United States, 331 from Australia and Northern Ireland, and 265 from Victoria, Australia) and 3858 control participants (2927 from the United States, 229 from Australia and Northern Ireland, and 702 from Victoria, Australia). Two novel loci reached genome-wide significance (defined as P -8), with a P value of 7.46 × 10-9 at rs61876744 in patatin-like phospholipase domain-containing 2 gene (PNPLA2) on chromosome 11 and a P value of 6.35 × 10-12 at rs138380, 2.2 kb upstream of casein kinase I isoform epsilon gene (CSNK1E) on chromosome 22. One additional locus was identified with a P value less than 1.00 × 10-6 in mastermind-like transcriptional coactivator 2 (MAML2) on chromosome 11 (P = 3.91 × 10-7). The novel locus in PNPLA2 reached genome-wide significance in an analysis of all 4 cohorts (P = 2.45 × 10-8).Conclusions and Relevance: In this relatively large keratoconus genome-wide association study, we identified a genome-wide significant locus for keratoconus in the region of PNPLA2 on chromosome 11

    Evolutionary relationships and divergence times among the native rats of Australia

    Get PDF
    Background The genus Rattus is highly speciose and has a complex taxonomy that is not fully resolved. As shown previously there are two major groups within the genus, an Asian and an Australo-Papuan group. This study focuses on the Australo-Papuan group and particularly on the Australian rats. There are uncertainties regarding the number of species within the group and the relationships among them. We analysed 16 mitochondrial genomes, including seven novel genomes from six species, to help elucidate the evolutionary history of the Australian rats. We also demonstrate, from a larger dataset, the usefulness of short regions of the mitochondrial genome in identifying these rats at the species level. Results Analyses of 16 mitochondrial genomes representing species sampled from Australo-Papuan and Asian clades of Rattus indicate divergence of these two groups ~2.7 million years ago (Mya). Subsequent diversification of at least 4 lineages within the Australo-Papuan clade was rapid and occurred over the period from ~ 0.9-1.7 Mya, a finding that explains the difficulty in resolving some relationships within this clade. Phylogenetic analyses of our 126 taxon, but shorter sequence (1952 nucleotides long), Rattus database generally give well supported species clades. Conclusions Our whole mitochondrial genome analyses are concordant with a taxonomic division that places the native Australian rats into the Rattus fuscipes species group. We suggest the following order of divergence of the Australian species. R. fuscipes is the oldest lineage among the Australian rats and is not part of a New Guinean radiation. R. lutreolus is also within this Australian clade and shallower than R. tunneyi while the R. sordidus group is the shallowest lineage in the clade. The divergences within the R. sordidus and R. leucopus lineages occurring about half a million years ago support the hypotheses of more recent interchanges of rats between Australia and New Guinea. While problematic for inference of deeper divergences, we report that the analysis of shorter mitochondrial sequences is very useful for species identification in rats

    Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics

    Get PDF
    Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags (‘barcodes’). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three ‘bait’ sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species ‘barcodes’ that currently use the cox1 gene only

    A multi-ethnic genome-wide association study implicates collagen matrix integrity and cell differentiation pathways in keratoconus

    Get PDF
    Keratoconus is characterised by reduced rigidity of the cornea with distortion and focal thinning that causes blurred vision, however, the pathogenetic mechanisms are unknown. It can lead to severe visual morbidity in children and young adults and is a common indication for corneal transplantation worldwide. Here we report the first large scale genome-wide association study of keratoconus including 4,669 cases and 116,547 controls. We have identified significant association with 36 genomic loci that, for the first time, implicate both dysregulation of corneal collagen matrix integrity and cell differentiation pathways as primary disease-causing mechanisms. The results also suggest pleiotropy, with some disease mechanisms shared with other corneal diseases, such as Fuchs endothelial corneal dystrophy. The common variants associated with keratoconus explain 12.5% of the genetic variance, which shows potential for the future development of a diagnostic test to detect susceptibility to disease
    corecore