69 research outputs found

    Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

    Get PDF
    We introduce a novel method to screen the promoters of a set of genes with shared biological function, against a precompiled library of motifs, and find those motifs which are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold, independently for every location window (measured with respect to the TSS), taking into account the location dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high throughput analysis, searching the promoters (from 200bp downstream to 1000bp upstream the TSS), of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location specific search method with conservation between human and mouse, we are able to identify reliably functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast.Comment: 31 pages, including Supplementary Information and figure

    Identification of a Transcription Factor Controlling pH-Dependent Organic Acid Response in Aspergillus niger.

    Get PDF
    Acid formation in Aspergillus niger is known to be subjected to tight regulation, and the acid production profiles are fine-tuned to respond to the ambient pH. Based on transcriptome data, putative trans-acting pH responding transcription factors were listed and through knock out studies, mutants exhibiting an oxalate overproducing phenotype were identified. The yield of oxalate was increased up to 158% compared to the wild type and the corresponding transcription factor was therefore entitled Oxalic Acid repression Factor, OafA. Detailed physiological characterization of one of the ΔoafA mutants, compared to the wild type, showed that both strains produced substantial amounts of gluconic acid, but the mutant strain was more efficient in re-uptake of gluconic acid and converting it to oxalic acid, particularly at high pH (pH 5.0). Transcriptional profiles showed that 241 genes were differentially expressed due to the deletion of oafA and this supported the argument of OafA being a trans-acting transcription factor. Furthermore, expression of two phosphoketolases was down-regulated in the ΔoafA mutant, one of which has not previously been described in fungi. It was argued that the observed oxalate overproducing phenotype was a consequence of the efficient re-uptake of gluconic acid and thereby a higher flux through glycolysis. This results in a lower flux through the pentose phosphate pathway, demonstrated by the down-regulation of the phosphoketolases. Finally, the physiological data, in terms of the specific oxygen consumption, indicated a connection between the oxidative phosphorylation and oxalate production and this was further substantiated through transcription analysis

    Internet-based Self-Assessment after the Tsunami: lessons learned

    Get PDF
    BACKGROUND: In the aftermath of the Tsunami disaster in 2004, an online psychological self-assessment (ONSET) was developed and made available by the University of Zurich in order to provide an online screening instrument for Tsunami victims to test if they were traumatized and in need of mental health care. The objective of the study was to report the lessons learnt that were made using an Internet-based, self-screening instrument after a large-scale disaster and to discuss its outreach and usefulness. METHODS: Users of the online self-assessment decided after finishing the procedure whether their dataset could be used for quality control and scientific evaluation Their answers were stored anonymously only if they consented (which was the case in 88% of the sample), stratified analyses according to level of exposure were conducted. RESULTS: A total of 2,914 adult users gave their consent for analysis of the screenings. Almost three quarter of the sample filled out the ONSET questionnaire within the first four weeks. Forty-one percent of the users reported direct exposure to the Tsunami disaster. Users who were injured by the Tsunami and users who reported dead or injured family members showed the highest degree of PTSD symptoms. CONCLUSION: ONSET was used by a large number of subjects who thought to be affected by the catastrophe in order to help them decide if they needed to see a mental health professional. Furthermore, men more frequently accessed the instrument than women, indicating that Internet-based testing facilitates reaching out to a different group of people than "ordinary" public mental health strategies

    How a Diverse Research Ecosystem Has Generated New Rehabilitation Technologies: Review of NIDILRR’s Rehabilitation Engineering Research Centers

    Get PDF
    Over 50 million United States citizens (1 in 6 people in the US) have a developmental, acquired, or degenerative disability. The average US citizen can expect to live 20% of his or her life with a disability. Rehabilitation technologies play a major role in improving the quality of life for people with a disability, yet widespread and highly challenging needs remain. Within the US, a major effort aimed at the creation and evaluation of rehabilitation technology has been the Rehabilitation Engineering Research Centers (RERCs) sponsored by the National Institute on Disability, Independent Living, and Rehabilitation Research. As envisioned at their conception by a panel of the National Academy of Science in 1970, these centers were intended to take a “total approach to rehabilitation”, combining medicine, engineering, and related science, to improve the quality of life of individuals with a disability. Here, we review the scope, achievements, and ongoing projects of an unbiased sample of 19 currently active or recently terminated RERCs. Specifically, for each center, we briefly explain the needs it targets, summarize key historical advances, identify emerging innovations, and consider future directions. Our assessment from this review is that the RERC program indeed involves a multidisciplinary approach, with 36 professional fields involved, although 70% of research and development staff are in engineering fields, 23% in clinical fields, and only 7% in basic science fields; significantly, 11% of the professional staff have a disability related to their research. We observe that the RERC program has substantially diversified the scope of its work since the 1970’s, addressing more types of disabilities using more technologies, and, in particular, often now focusing on information technologies. RERC work also now often views users as integrated into an interdependent society through technologies that both people with and without disabilities co-use (such as the internet, wireless communication, and architecture). In addition, RERC research has evolved to view users as able at improving outcomes through learning, exercise, and plasticity (rather than being static), which can be optimally timed. We provide examples of rehabilitation technology innovation produced by the RERCs that illustrate this increasingly diversifying scope and evolving perspective. We conclude by discussing growth opportunities and possible future directions of the RERC program

    Measuring Global Credibility with Application to Local Sequence Alignment

    Get PDF
    Computational biology is replete with high-dimensional (high-D) discrete prediction and inference problems, including sequence alignment, RNA structure prediction, phylogenetic inference, motif finding, prediction of pathways, and model selection problems in statistical genetics. Even though prediction and inference in these settings are uncertain, little attention has been focused on the development of global measures of uncertainty. Regardless of the procedure employed to produce a prediction, when a procedure delivers a single answer, that answer is a point estimate selected from the solution ensemble, the set of all possible solutions. For high-D discrete space, these ensembles are immense, and thus there is considerable uncertainty. We recommend the use of Bayesian credibility limits to describe this uncertainty, where a (1−α)%, 0≤α≤1, credibility limit is the minimum Hamming distance radius of a hyper-sphere containing (1−α)% of the posterior distribution. Because sequence alignment is arguably the most extensively used procedure in computational biology, we employ it here to make these general concepts more concrete. The maximum similarity estimator (i.e., the alignment that maximizes the likelihood) and the centroid estimator (i.e., the alignment that minimizes the mean Hamming distance from the posterior weighted ensemble of alignments) are used to demonstrate the application of Bayesian credibility limits to alignment estimators. Application of Bayesian credibility limits to the alignment of 20 human/rodent orthologous sequence pairs and 125 orthologous sequence pairs from six Shewanella species shows that credibility limits of the alignments of promoter sequences of these species vary widely, and that centroid alignments dependably have tighter credibility limits than traditional maximum similarity alignments

    A combinatorial optimization approach for diverse motif finding applications

    Get PDF
    BACKGROUND: Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied. RESULTS: We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem. CONCLUSION: Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications

    WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

    Get PDF
    BACKGROUND: This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. RESULTS: We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. CONCLUSION: Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes

    Transcriptomic epidemiology of smoking: the effect of smoking on gene expression in lymphocytes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>This investigation offers insights into system-wide pathological processes induced in response to cigarette smoke exposure by determining its influences at the gene expression level.</p> <p>Methods</p> <p>We obtained genome-wide quantitative transcriptional profiles from 1,240 individuals from the San Antonio Family Heart Study, including 297 current smokers. Using lymphocyte samples, we identified 20,413 transcripts with significantly detectable expression levels, including both known and predicted genes. Correlation between smoking and gene expression levels was determined using a regression model that allows for residual genetic effects.</p> <p>Results</p> <p>With a conservative false-discovery rate of 5% we identified 323 unique genes (342 transcripts) whose expression levels were significantly correlated with smoking behavior. These genes showed significant over-representation within a range of functional categories that correspond well with known smoking-related pathologies, including immune response, cell death, cancer, natural killer cell signaling and xenobiotic metabolism.</p> <p>Conclusions</p> <p>Our results indicate that not only individual genes but entire networks of gene interaction are influenced by cigarette smoking. This is the largest <it>in vivo </it>transcriptomic epidemiological study of smoking to date and reveals the significant and comprehensive influence of cigarette smoke, as an environmental variable, on the expression of genes. The central importance of this manuscript is to provide a summary of the relationships between gene expression and smoking in this exceptionally large cross-sectional data set.</p

    Team climate, intention to leave and turnover among hospital employees: Prospective cohort study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In hospitals, the costs of employee turnover are substantial and intentions to leave among staff may manifest as lowered performance. We examined whether team climate, as indicated by clear and shared goals, participation, task orientation and support for innovation, predicts intention to leave the job and actual turnover among hospital employees.</p> <p>Methods</p> <p>Prospective study with baseline and follow-up surveys (2–4 years apart). The participants were 6,441 (785 men, 5,656 women) hospital employees under the age of 55 at the time of follow-up survey. Logistic regression with generalized estimating equations was used as an analysis method to include both individual and work unit level predictors in the models.</p> <p>Results</p> <p>Among stayers with no intention to leave at baseline, lower self-reported team climate predicted higher likelihood of having intentions to leave at follow-up (odds ratio per 1 standard deviation decrease in team climate was 1.6, 95% confidence interval 1.4–1.8). Lower co-worker assessed team climate at follow-up was also association with such intentions (odds ratio 1.8, 95% confidence interval 1.4–2.4). Among all participants, the likelihood of actually quitting the job was higher for those with poor self-reported team climate at baseline. This association disappeared after adjustment for intention to leave at baseline suggesting that such intentions may explain the greater turnover rate among employees with low team climate.</p> <p>Conclusion</p> <p>Improving team climate may reduce intentions to leave and turnover among hospital employees.</p

    Using C. elegans to discover therapeutic compounds for ageing-associated neurodegenerative diseases

    Get PDF
    Age-associated neurodegenerative disorders such as Alzheimer’s disease are a major public health challenge, due to the demographic increase in the proportion of older individuals in society. However, the relatively few currently approved drugs for these conditions provide only symptomatic relief. A major goal of neurodegeneration research is therefore to identify potential new therapeutic compounds that can slow or even reverse disease progression, either by impacting directly on the neurodegenerative process or by activating endogenous physiological neuroprotective mechanisms that decline with ageing. This requires model systems that can recapitulate key features of human neurodegenerative diseases that are also amenable to compound screening approaches. Mammalian models are very powerful, but are prohibitively expensive for high-throughput drug screens. Given the highly conserved neurological pathways between mammals and invertebrates, Caenorhabditis elegans has emerged as a powerful tool for neuroprotective compound screening. Here we describe how C. elegans has been used to model various human ageing-associated neurodegenerative diseases and provide an extensive list of compounds that have therapeutic activity in these worm models and so may have translational potential
    corecore