3,839 research outputs found
Reconstruction of the regulatory network for Bacillus subtilis and reconciliation with gene expression data
The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fmicb. 2016.00275We introduce a manually constructed and curated regulatory network model that describes the current state of knowledge of transcriptional regulation of B. subtilis. The model corresponds to an updated and enlarged version of the regulatory model of central metabolism originally proposed in 2008. We extended the original network to the whole genome by integration of information from DBTBS, a compendium of regulatory data that includes promoters, transcription factors (TFs), binding sites, motifs and regulated operons. Additionally, we consolidated our network with all the information on regulation included in the SporeWeb and Subtiwiki community-curated resources on B. subtilis. Finally, we reconciled our network with data from RegPrecise, which recently released their own less comprehensive reconstruction of the regulatory network for B. subtilis. Our model describes 275 regulators and their target genes, representing 30 different mechanisms of regulation such as TFs, RNA switches, Riboswitches and small regulatory RNAs. Overall, regulatory information is included in the model for approximately 2500 of the ~4200 genes in B. subtilis 168. In an effort to further expand our knowledge of B. subtilis regulation, we reconciled our model with expression data. For this process, we reconstructed the Atomic Regulons (ARs) for B. subtilis, which are the sets of genes that share the same ON and OFF gene expression profiles across multiple samples of experimental data. We show how atomic regulons for B. subtilis are able to capture many sets of genes corresponding to regulated operons in our manually curated network. Additionally, we demonstrate how atomic regulons can be used to help expand or validate the knowledge of the regulatory networks by looking at highly correlated genes in the ARs for which regulatory information is lacking. During this process, we were also able to infer novel stimuli for hypothetical genes by exploring the genome expression metadata relating to experimental conditions, gaining insights into novel biology.AG and VF acknowledge funding from the European Union Basynthecproject(BaSynthecFP7-244093)
Recommended from our members
Knowledge of cervical cancer risk factors and symptoms among women in a refugee settlement: a cross-sectional study in northern Uganda.
BACKGROUND: There are limited data on awareness of cervical cancer risk factors and symptoms among refugee populations living in Uganda. In this study, we sought to determine the awareness and knowledge of cervical cancer risk factors and symptoms among women in Palabek refugee settlement, northern Uganda. METHODS: We conducted a cross-sectional study. 815 women (aged 18-60 years) were randomly selected using multistage sampling in Palabek refugee settlement. Data were collected using pre-tested, structured questionnaires. Logistic regression models were used to determine magnitudes of association between socio-demographic and health system factors, and knowledge on cervical cancer risk factors and symptoms. RESULTS: The majority of participants (53%, n = 433) were young (18-29 years), married (68%, n = 553), and did not have formal employment (93%, n = 759). Less than half (40%, n = 325) had heard of cervical cancer. Of those who had heard, most recognized multiple male sexual partners, early onset of sexual intercourse and HPV infections as risk factors for cervical cancer (93%, n = 295; 89%, n = 283; and 86%, n = 271 respectively). Median knowledge score for risk factor recognition = 7 (IQR: 3-9). Median knowledge score for symptoms recognition = 7 (IQR: 1-10). Half of women (50%, n = 409) correctly recognized 7 to 11 symptoms of cervical cancer, with vaginal bleeding between menstrual periods, pelvic pain, and vaginal bleeding during/after sexual intercourse recognized by 58, 52 and 54% respectively. Single women (OR = 0.59 (95%CI: 0.38-0.94), and women that lived farther than 1 kilo meter from nearest health facility in South Sudan (OR = 0.36-0.49 (95%CI: 0.26-0.84) were less likely to be knowledgeable of symptoms of cervical cancer. CONCLUSION: A significant proportion of women in Palabek refugee settlement had not heard about cervical cancer. Refugee health services providers could increase awareness of cervical cancer risk factors and symptoms through health education in order to promote risk reduction behaviours and guide women during symptoms appraisal. Single women and those who lived more than one kilo metre from nearest health facility in home country could be a priority group for awareness intervention in the settlement
Computing and applying atomic regulons to understand gene expression and regulation
The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fmicb.2016.01819/full#supplementary-materialUnderstanding gene function and regulation is essential for the interpretation prediction and ultimate design of cell responses to changes in the environment. An important step toward meeting the challenge of understanding gene function and regulation is the identification of sets of genes that are always co-expressed. These gene sets Atomic Regulons ARs represent fundamental units of function within a cell and could be used to associate genes of unknown function with cellular processes and to enable rational genetic engineering of cellular systems. Here we describe an approach for inferring ARs that leverages large-scale expression data sets gene context and functional relationships among genes. We computed ARs for Escherichia coli based on 907 gene expression experiments and compared our results with gene clusters produced by two prevalent data-driven methods: hierarchical clustering and k-means clustering. We compared ARs and purely data-driven gene clusters to the curated set of regulatory interactions for E. coli found in RegulonDB showing that ARs are more consistent with gold standard regulons than are data-driven gene clusters. We further examined the consistency of ARs and data-driven gene clusters in the context of gene interactions predicted by Context Likelihood of Relatedness CLR analysis finding that the ARs show better agreement with CLR predicted interactions. We determined the impact of increasing amounts of expression data on AR construction and find that while more data improve ARs it is not necessary to use the full set of gene expression experiments available for E. coli to produce high quality ARs. In order to explore the conservation of co-regulated gene sets across different organisms we computed ARs for Shewanella oneidensis Pseudomonas aeruginosa Thermus thermophilus and Staphylococcus aureus each of which represents increasing degrees of phylogenetic distance from E. coli. Comparison of the organism-specific ARs showed that the consistency of AR gene membership correlates with phylogenetic distance but there is clear variability in the regulatory networks of closely related organisms. As large scale expression data sets become increasingly common for model and non-model organisms comparative analyses of atomic regulons will provide valuable insights into fundamental regulatory modules used across the bacterial domain.JF acknowledges funding from [SFRH/BD/70824/2010] of the FCT (Portuguese Foundation for Science and Technology) PhD program. CH and PW were supported by the National Science Foundation under grant number EFRI-MIKS-1137089.
RT was supported by the Genomic Science Program (GSP), Office of Biological and Environmental Research (OBER), U.S. Department of Energy(DOE),and his work is a contribution of the Pacific North west National Laboratory (PNNL) Foundational Scientific Focus Area. This work was partially supported by an award from the National Science Foundation to MD, AB, NT, and RO (NSFABI-0850546).
This work was also supported by the United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Service [Contract No. HHSN272201400027C]
Over half of the far-infrared background light comes from galaxies at z >= 1.2
Submillimetre surveys during the past decade have discovered a population of
luminous, high-redshift, dusty starburst galaxies. In the redshift range 1 <= z
<= 4, these massive submillimetre galaxies go through a phase characterized by
optically obscured star formation at rates several hundred times that in the
local Universe. Half of the starlight from this highly energetic process is
absorbed and thermally re-radiated by clouds of dust at temperatures near 30 K
with spectral energy distributions peaking at 100 microns in the rest frame. At
1 <= z <= 4, the peak is redshifted to wavelengths between 200 and 500 microns.
The cumulative effect of these galaxies is to yield extragalactic optical and
far-infrared backgrounds with approximately equal energy densities. Since the
initial detection of the far-infrared background (FIRB), higher-resolution
experiments have sought to decompose this integrated radiation into the
contributions from individual galaxies. Here we report the results of an
extragalactic survey at 250, 350 and 500 microns. Combining our results at 500
microns with those at 24 microns, we determine that all of the FIRB comes from
individual galaxies, with galaxies at z >= 1.2 accounting for 70 per cent of
it. As expected, at the longest wavelengths the signal is dominated by
ultraluminous galaxies at z > 1.Comment: Accepted to Nature. Maps available at http://blastexperiment.info
Tuberculosis Treatment in HIV Infected Ugandans with CD4 Counts >350 Cells/mm3 Reduces Immune Activation with No Effect on HIV Load or CD4 Count
Both HIV and TB cause a state of heightened immune activation. Immune activation in HIV is associated with progression to AIDS. Prior studies, focusing on persons with advanced HIV, have shown no decline in markers of cellular activation in response to TB therapy alone.) and pulmonary TB. HIV load, CD4 count, and markers of immune activation (CD38 and HLA-DR on CD4 and CD8 T cells) were measured prior to starting, during, and for 6 months after completion of standard 6 month anti-tuberculosis (TB) therapy in 38 HIV infected Ugandans with smear and culture confirmed pulmonary TB.Expression of CD38, and co-expression of CD38 and HLA-DR, on CD8 cells declined significantly within 3 months of starting standard TB therapy in the absence of anti-retroviral therapy, and remained suppressed for 6 months after completion of therapy. In contrast, HIV load and CD4 count remained unchanged throughout the study period.
Structure of a putative NTP pyrophosphohydrolase: YP_001813558.1 from Exiguobacterium sibiricum 255-15.
The crystal structure of a putative NTPase, YP_001813558.1 from Exiguobacterium sibiricum 255-15 (PF09934, DUF2166) was determined to 1.78 Å resolution. YP_001813558.1 and its homologs (dimeric dUTPases, MazG proteins and HisE-encoded phosphoribosyl ATP pyrophosphohydrolases) form a superfamily of all-α-helical NTP pyrophosphatases. In dimeric dUTPase-like proteins, a central four-helix bundle forms the active site. However, in YP_001813558.1, an unexpected intertwined swapping of two of the helices that compose the conserved helix bundle results in a `linked dimer' that has not previously been observed for this family. Interestingly, despite this novel mode of dimerization, the metal-binding site for divalent cations, such as magnesium, that are essential for NTPase activity is still conserved. Furthermore, the active-site residues that are involved in sugar binding of the NTPs are also conserved when compared with other α-helical NTPases, but those that recognize the nucleotide bases are not conserved, suggesting a different substrate specificity
An ALMA survey of submillimetre galaxies in the COSMOS field: The extent of the radio-emitting region revealed by 3 GHz imaging with the Very Large Array
We determine the radio size distribution of a large sample of 152 SMGs in
COSMOS that were detected with ALMA at 1.3 mm. For this purpose, we used the
observations taken by the VLA-COSMOS 3 GHz Large Project. One hundred and
fifteen of the 152 target SMGs were found to have a 3 GHz counterpart. The
median value of the major axis FWHM at 3 GHz is derived to be kpc.
The radio sizes show no evolutionary trend with redshift, or difference between
different galaxy morphologies. We also derived the spectral indices between 1.4
and 3 GHz, and 3 GHz brightness temperatures for the sources, and the median
values were found to be and K. Three of the
target SMGs, which are also detected with the VLBA, show clearly higher
brightness temperatures than the typical values. Although the observed radio
emission appears to be predominantly powered by star formation and supernova
activity, our results provide a strong indication of the presence of an AGN in
the VLBA and X-ray-detected SMG AzTEC/C61. The median radio-emitting size we
have derived is 1.5-3 times larger than the typical FIR dust-emitting sizes of
SMGs, but similar to that of the SMGs' molecular gas component traced through
mid- line emission of CO. The physical conditions of SMGs probably render
the diffusion of cosmic-ray electrons inefficient, and hence an unlikely
process to lead to the observed extended radio sizes. Instead, our results
point towards a scenario where SMGs are driven by galaxy interactions and
mergers. Besides triggering vigorous starbursts, galaxy collisions can also
pull out the magnetised fluids from the interacting disks, and give rise to a
taffy-like synchrotron-emitting bridge. This provides an explanation for the
spatially extended radio emission of SMGs, and can also cause a deviation from
the well-known IR-radio correlation.Comment: 32 pages (incl. 5 appendices), 17 figures, 7 tables; accepted for
publication in A&A; abstract abridged for arXi
Structure of the first representative of Pfam family PF04016 (DUF364) reveals enolase and Rossmann-like folds that combine to form a unique active site with a possible role in heavy-metal chelation.
The crystal structure of Dhaf4260 from Desulfitobacterium hafniense DCB-2 was determined by single-wavelength anomalous diffraction (SAD) to a resolution of 2.01 Å using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG) as part of the NIGMS Protein Structure Initiative (PSI). This protein structure is the first representative of the PF04016 (DUF364) Pfam family and reveals a novel combination of two well known domains (an enolase N-terminal-like fold followed by a Rossmann-like domain). Structural and bioinformatic analyses reveal partial similarities to Rossmann-like methyltransferases, with residues from the enolase-like fold combining to form a unique active site that is likely to be involved in the condensation or hydrolysis of molecules implicated in the synthesis of flavins, pterins or other siderophores. The genome context of Dhaf4260 and homologs additionally supports a role in heavy-metal chelation
Network Discovery Pipeline Elucidates Conserved Time-of-Day–Specific cis-Regulatory Modules
Correct daily phasing of transcription confers an adaptive advantage to almost all organisms, including higher plants. In this study, we describe a hypothesis-driven network discovery pipeline that identifies biologically relevant patterns in genome-scale data. To demonstrate its utility, we analyzed a comprehensive matrix of time courses interrogating the nuclear transcriptome of Arabidopsis thaliana plants grown under different thermocycles, photocycles, and circadian conditions. We show that 89% of Arabidopsis transcripts cycle in at least one condition and that most genes have peak expression at a particular time of day, which shifts depending on the environment. Thermocycles alone can drive at least half of all transcripts critical for synchronizing internal processes such as cell cycle and protein synthesis. We identified at least three distinct transcription modules controlling phase-specific expression, including a new midnight specific module, PBX/TBX/SBX. We validated the network discovery pipeline, as well as the midnight specific module, by demonstrating that the PBX element was sufficient to drive diurnal and circadian condition-dependent expression. Moreover, we show that the three transcription modules are conserved across Arabidopsis, poplar, and rice. These results confirm the complex interplay between thermocycles, photocycles, and the circadian clock on the daily transcription program, and provide a comprehensive view of the conserved genomic targets for a transcriptional network key to successful adaptation
- …