317 research outputs found

    The Association of Virulence Factors with Genomic Islands

    Get PDF
    Background: It has been noted that many bacterial virulence factor genes are located within genomic islands (GIs; clusters of genes in a prokaryotic genome of probable horizontal origin). However, such studies have been limited to single genera or isolated observations. We have performed the first large-scale analysis of multiple diverse pathogens to examine this association. We additionally identified genes found predominantly in pathogens, but not non-pathogens, across multiple genera using 631 complete bacterial genomes, and we identified common trends in virulence for genes in GIs. Furthermore, we examined the relationship between GIs and clustered regularly interspaced palindromic repeats (CRISPRs) proposed to confer resistance to phage. Methodology/Principal Findings: We show quantitatively that GIs disproportionately contain more virulence factors than the rest of a given genome (p,1E-40 using three GI datasets) and that CRISPRs are also over-represented in GIs. Virulence factors in GIs and pathogen-associated virulence factors are enriched for proteins having more ‘‘offensive’ ’ functions, e.g. active invasion of the host, and are disproportionately components of type III/IV secretion systems or toxins. Numerous hypothetical pathogen-associated genes were identified, meriting further study. Conclusions/Significance: This is the first systematic analysis across diverse genera indicating that virulence factors are disproportionately associated with GIs. ‘‘Offensive’ ’ virulence factors, as opposed to host-interaction factors, may more ofte

    Pseudomonas aeruginosa Genome Database and PseudoCAP: facilitating community-based, continually updated, genome annotation

    Get PDF
    Using the Pseudomonas aeruginosa Genome Project as a test case, we have developed a database and submission system to facilitate a community-based approach to continually updated genome annotation (http://www.pseudomonas.com). Researchers submit proposed annotation updates through one of three web-based form options which are then subjected to review, and if accepted, entered into both the database and log file of updates with author acknowledgement. In addition, a coordinator continually reviews literature for suitable updates, as we have found such reviews to be the most efficient. Both the annotations database and updates-log database have Boolean search capability with the ability to sort results and download all data or search results as tab-delimited files. To complement this peer-reviewed genome annotation, we also provide a linked GBrowse view which displays alternate annotations. Additional tools and analyses are also integrated, including PseudoCyc, and knockout mutant information. We propose that this database system, with its focus on facilitating flexible queries of the data and providing access to both peer-reviewed annotations as well as alternate annotation information, may be a suitable model for other genome projects wishing to use a continually updated, community-based annotation approach. The source code is freely available under GNU General Public Licence

    A Systematic Analysis of Fe II Emission in Quasars: Evidence for Inflow to the Central Black Hole

    Full text link
    Broad Fe II emission is a prominent feature of the optical and ultraviolet spectra of quasars. We report on a systematical investigation of optical Fe II emission in a large sample of 4037 z < 0.8 quasars selected from the Sloan Digital Sky Survey. We have developed and tested a detailed line-fitting technique, taking into account the complex continuum and narrow and broad emission-line spectrum. Our primary goal is to quantify the velocity broadening and velocity shift of the Fe II spectrum in order to constrain the location of the Fe II-emitting region and its relation to the broad-line region. We find that the majority of quasars show Fe II emission that is redshifted, typically by ~ 400 km/s but up to 2000 km/s, with respect to the systemic velocity of the narrow-line region or of the conventional broad-line region as traced by the Hbeta line. Moreover, the line width of Fe II is significantly narrower than that of the broad component of Hbeta. We show that the magnitude of the Fe II redshift correlates inversely with the Eddington ratio, and that there is a tendency for sources with redshifted Fe II emission to show red asymmetry in the Hbeta line. These characteristics strongly suggest that Fe II originates from a location different from, and most likely exterior to, the region that produces most of Hbeta. The Fe II-emitting zone traces a portion of the broad-line region of intermediate velocities whose dynamics may be dominated by infall.Comment: 20 pages, 14 figures, accepted for publication in Ap

    MicrobeDB: a locally maintainable database of microbial genomic sequences

    Get PDF
    Summary: Analysis of microbial genomes often requires the general organization and comparison of tens to thousands of genomes both from public repositories and unpublished sources. MicrobeDB provides a foundation for such projects by the automation of downloading published, completed bacterial and archaeal genomes from key sources, parsing annotations of all genomes (both public and private) into a local database, and allowing interaction with the database through an easy to use programming interface. MicrobeDB creates a simple to use, easy to maintain, centralized local resource for various large-scale comparative genomic analyses and a back-end for future microbial application design

    Integrated Genomic Analysis of Diverse Induced Pluripotent Stem Cells from the Progenitor Cell Biology Consor tium

    Get PDF
    The rigorous characterization of distinct induced pluripotent stem cells (iPSC) derived from multiple reprogramming technologies, somatic sources, and donors is required to understand potential sources of variability and downstream potential. To achieve this goal, the Progenitor Cell Biology Consortium performed comprehensive experimental and genomic analyses of 58 iPSC from ten laboratories generated using a variety of reprogramming genes, vectors, and cells. Associated global molecular characterization studies identified functionally informative correlations in gene expression, DNA methylation, and/or copy-number variation among key developmental and oncogenic regulators as a result of donor, sex, line stability, reprogramming technology, and cell of origin. Furthermore, X-chromosome inactivation in PSC produced highly correlated differences in teratoma-lineage staining and regulator expression upon differentiation. All experimental results, and raw, processed, and metadata from these analyses, including powerful tools, are interactively accessible from a new online portal at https://www.synapse.org to serve as a reusable resource for the stem cell community

    Molecular, phenotypic, and sample-associated data to describe pluripotent stem cell lines and derivatives

    Get PDF
    The use of induced pluripotent stem cells (iPSC) derived from independent patients and sources holds considerable promise to improve the understanding of development and disease. However, optimized use of iPSC depends on our ability to develop methods to efficiently qualify cell lines and protocols, monitor genetic stability, and evaluate self-renewal and differentiation potential. To accomplish these goals, 57 stem cell lines from 10 laboratories were differentiated to 7 different states, resulting in 248 analyzed samples. Cell lines were differentiated and characterized at a central laboratory using standardized cell culture methodologies, protocols, and metadata descriptors. Stem cell and derived differentiated lines were characterized using RNA-seq, miRNA-seq, copy number arrays, DNA methylation arrays, flow cytometry, and molecular histology. All materials, including raw data, metadata, analysis and processing code, and methodological and provenance documentation are publicly available for re-use and interactive exploration at https://www.synapse.org/pcbc. The goal is to provide data that can improve our ability to robustly and reproducibly use human pluripotent stem cells to understand development and disease

    Unique reporter-based sensor platforms to monitor signalling in cells

    Get PDF
    Introduction: In recent years much progress has been made in the development of tools for systems biology to study the levels of mRNA and protein, and their interactions within cells. However, few multiplexed methodologies are available to study cell signalling directly at the transcription factor level. &lt;p/&gt;Methods: Here we describe a sensitive, plasmid-based RNA reporter methodology to study transcription factor activation in mammalian cells, and apply this technology to profiling 60 transcription factors in parallel. The methodology uses two robust and easily accessible detection platforms; quantitative real-time PCR for quantitative analysis and DNA microarrays for parallel, higher throughput analysis. &lt;p/&gt;Findings: We test the specificity of the detection platforms with ten inducers and independently validate the transcription factor activation. &lt;p/&gt;Conclusions: We report a methodology for the multiplexed study of transcription factor activation in mammalian cells that is direct and not theoretically limited by the number of available reporters

    Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

    Get PDF
    We introduce a novel method to screen the promoters of a set of genes with shared biological function, against a precompiled library of motifs, and find those motifs which are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold, independently for every location window (measured with respect to the TSS), taking into account the location dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high throughput analysis, searching the promoters (from 200bp downstream to 1000bp upstream the TSS), of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location specific search method with conservation between human and mouse, we are able to identify reliably functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast.Comment: 31 pages, including Supplementary Information and figure

    RSAT: regulatory sequence analysis tools

    Get PDF
    The regulatory sequence analysis tools (RSAT, http://rsat.ulb.ac.be/rsat/) is a software suite that integrates a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. The suite includes programs for sequence retrieval, pattern discovery, phylogenetic footprint detection, pattern matching, genome scanning and feature map drawing. Random controls can be performed with random gene selections or by generating random sequences according to a variety of background models (Bernoulli, Markov). Beyond the original word-based pattern-discovery tools (oligo-analysis and dyad-analysis), we recently added a battery of tools for matrix-based detection of cis-acting elements, with some original features (adaptive background models, Markov-chain estimation of P-values) that do not exist in other matrix-based scanning tools. The web server offers an intuitive interface, where each program can be accessed either separately or connected to the other tools. In addition, the tools are now available as web services, enabling their integration in programmatic workflows. Genomes are regularly updated from various genome repositories (NCBI and EnsEMBL) and 682 organisms are currently supported. Since 1998, the tools have been used by several hundreds of researchers from all over the world. Several predictions made with RSAT were validated experimentally and published

    Metastable Atrial State Underlies the Primary Genetic Substrate for MYL4 Mutation-Associated Atrial Fibrillation

    Get PDF
    Background:Atrial fibrillation (AF) is the most common clinical arrhythmia and is associated with heart failure, stroke, and increased mortality. The myocardial substrate for AF is poorly understood because of limited access to primary human tissue and mechanistic questions around existing in vitro or in vivo models.Methods:Using an MYH6:mCherry knock-in reporter line, we developed a protocol to generate and highly purify human pluripotent stem cell–derived cardiomyocytes displaying physiological and molecular characteristics of atrial cells. We modeled human MYL4 mutants, one of the few definitive genetic causes of AF. To explore non–cell-autonomous components of AF substrate, we also created a zebrafish Myl4 knockout model, which exhibited molecular, cellular, and physiologic abnormalities that parallel those in humans bearing the cognate mutations.Results:There was evidence of increased retinoic acid signaling in both human embryonic stem cells and zebrafish mutant models, as well as abnormal expression and localization of cytoskeletal proteins, and loss of intracellular nicotinamide adenine dinucleotide and nicotinamide adenine dinucleotide + hydrogen. To identify potentially druggable proximate mechanisms, we performed a chemical suppressor screen integrating multiple human cellular and zebrafish in vivo endpoints. This screen identified Cx43 (connexin 43) hemichannel blockade as a robust suppressor of the abnormal phenotypes in both models of MYL4 (myosin light chain 4)–related atrial cardiomyopathy. Immunofluorescence and coimmunoprecipitation studies revealed an interaction between MYL4 and Cx43 with altered localization of Cx43 hemichannels to the lateral membrane in MYL4 mutants, as well as in atrial biopsies from unselected forms of human AF. The membrane fraction from MYL4-/- human embryonic stem cell derived atrial cells demonstrated increased phospho-Cx43, which was further accentuated by retinoic acid treatment and by the presence of risk alleles at the Pitx2 locus. PKC (protein kinase C) was induced by retinoic acid, and PKC inhibition also rescued the abnormal phenotypes in the atrial cardiomyopathy models.Conclusions:These data establish a mechanistic link between the transcriptional, metabolic and electrical pathways previously implicated in AF substrate and suggest novel avenues for the prevention or therapy of this common arrhythmia.</p
    • 

    corecore