    Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

    We introduce a novel method to screen the promoters of a set of genes with shared biological function against a precompiled library of motifs, and to find those motifs that are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold independently for every location window (measured with respect to the TSS), taking into account the location-dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high-throughput analysis, searching the promoters (from 200 bp downstream to 1000 bp upstream of the TSS) of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location-specific search method with conservation between human and mouse, we are able to reliably identify functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast. Comment: 31 pages, including Supplementary Information and figure
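
The abstract does not say which statistic is used to call a motif over-represented in a gene set. As a rough illustration only, the sketch below uses a one-sided hypergeometric test, a common choice for this kind of enrichment question; the function name and all counts are hypothetical and are not taken from the paper.

```python
from math import comb


def hypergeom_enrichment_pvalue(population, population_hits, sample, sample_hits):
    """Return P(X >= sample_hits) for a hypergeometric draw.

    population      -- total number of genes screened
    population_hits -- genes in the population whose promoter contains the motif
    sample          -- size of the functional gene set
    sample_hits     -- genes in the gene set whose promoter contains the motif
    """
    upper = min(sample, population_hits)
    total = comb(population, sample)
    # Sum the probability mass of every outcome at least as extreme as observed.
    tail = sum(
        comb(population_hits, k) * comb(population - population_hits, sample - k)
        for k in range(sample_hits, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Hypothetical counts: 8000 genes, 400 with the motif, a 50-gene GO class, 10 hits.
    print(hypergeom_enrichment_pvalue(8000, 400, 50, 10))
```

A small p-value here would only suggest enrichment at one fixed score threshold; the method described above additionally tunes the threshold per location window, which this sketch does not attempt.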

    Using quantile regression to investigate racial disparities in medication non-adherence

    Background: Many studies have investigated racial/ethnic disparities in medication non-adherence in patients with type 2 diabetes using common measures such as medication possession ratio (MPR) or gaps between refills. All these measures including MPR are quasi-continuous and bounded and their distribution is usually skewed. Analysis of such measures using traditional regression methods that model mean changes in the dependent variable may fail to provide a full picture about differential patterns in non-adherence between groups. Methods: A retrospective cohort of 11,272 veterans with type 2 diabetes was assembled from Veterans Administration datasets from April 1996 to May 2006. The main outcome measure was MPR with quantile cutoffs Q1-Q4 taking values of 0.4, 0.6, 0.8 and 0.9. Quantile-regression (QReg) was used to model the association between MPR and race/ethnicity after adjusting for covariates. Comparison was made with commonly used ordinary-least-squares (OLS) and generalized linear mixed models (GLMM). Results: Quantile-regression showed that Non-Hispanic-Black (NHB) had statistically significantly lower MPR compared to Non-Hispanic-White (NHW) holding all other variables constant across all quantiles with estimates and p-values given as -3.4% (p = 0.11), -5.4% (p = 0.01), -3.1% (p = 0.001), and -2.00% (p = 0.001) for Q1 to Q4, respectively. Other racial/ethnic groups had lower adherence than NHW only in the lowest quantile (Q1) of about -6.3% (p = 0.003). In contrast, OLS and GLMM only showed differences in mean MPR between NHB and NHW while the mean MPR difference between other racial groups and NHW was not significant. Conclusion: Quantile regression is recommended for analysis of data that are heterogeneous such that the tails and the central location of the conditional distributions vary differently with the covariates. QReg provides a comprehensive view of the relationships between independent and dependent variables (i.e. not just centrally but also in the tails of the conditional distribution of the dependent variable). Indeed, without performing QReg at different quantiles, an investigator would have no way of assessing whether a difference in these relationships might exist.
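
To make the idea behind quantile regression concrete, the sketch below shows the check (pinball) loss that quantile regression minimizes, applied to the simplest possible model: a single constant with no covariates. It is a minimal illustration, not the study's analysis; the function names and the sample values are hypothetical.

```python
def pinball_loss(residual, tau):
    """Check (pinball) loss for one residual (observed minus predicted) at level tau."""
    # Under-predictions are weighted by tau, over-predictions by (1 - tau).
    return residual * tau if residual >= 0 else residual * (tau - 1)


def best_constant(values, tau):
    """Return the observed value that minimizes total pinball loss.

    For a constant-only model this is an empirical tau-quantile of `values`.
    """
    return min(values, key=lambda c: sum(pinball_loss(v - c, tau) for v in values))


if __name__ == "__main__":
    # Hypothetical MPR-like values, bounded and skewed toward 1.
    mpr = [0.35, 0.55, 0.70, 0.82, 0.88, 0.91, 0.95, 0.97, 1.00]
    for tau in (0.25, 0.5, 0.75):
        print(tau, best_constant(mpr, tau))
```

Changing `tau` shifts the fitted value toward the lower or upper tail, which is why fitting at several quantiles can reveal group differences that a model of the mean would average away.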

    Identification of Direct Target Genes Using Joint Sequence and Expression Likelihood with Application to DAF-16

    A major challenge in the post-genome era is to reconstruct regulatory networks from the biological knowledge accumulated to date. The development of tools for identifying direct target genes of transcription factors (TFs) is critical to this endeavor. Given a set of microarray experiments, a probabilistic model called TRANSMODIS has been developed which can infer the direct targets of a TF by integrating sequence motif, gene expression and ChIP-chip data. The performance of TRANSMODIS was first validated on a set of transcription factor perturbation experiments (TFPEs) involving Pho4p, a well-studied TF in Saccharomyces cerevisiae. TRANSMODIS removed elements of arbitrariness in the manual target gene selection process and produced results that concur with one's intuition. TRANSMODIS was further validated on a genome-wide scale by comparing it with two other methods in Saccharomyces cerevisiae. The usefulness of TRANSMODIS was then demonstrated by applying it to the identification of direct targets of DAF-16, a critical TF regulating ageing in Caenorhabditis elegans. We found that 189 genes were tightly regulated by DAF-16. In addition, DAF-16 has differential preference for motifs when acting as an activator or repressor, which awaits experimental verification. TRANSMODIS is computationally efficient and robust, making it a useful probabilistic framework for finding immediate targets

    The N-myristoylome of Trypanosoma cruzi

    Protein N-myristoylation is catalysed by N-myristoyltransferase (NMT), an essential and druggable target in Trypanosoma cruzi, the causative agent of Chagas’ disease. Here we have employed whole cell labelling with azidomyristic acid and click chemistry to identify N-myristoylated proteins in different life cycle stages of the parasite. Only minor differences in fluorescent labelling were observed between the dividing forms (the insect epimastigote and mammalian amastigote stages) and the non-dividing trypomastigote stage. Using a combination of label-free and stable isotope labelling of cells in culture (SILAC) based proteomic strategies in the presence and absence of the NMT inhibitor DDD85646, we identified 56 proteins enriched in at least two out of the three experimental approaches. Of these, 6 were likely to be false positives, with the remaining 50 commencing with amino acids MG at the N-terminus in one or more of the T. cruzi genomes. Most of these are proteins of unknown function (32), with the remainder (18) implicated in a diverse range of critical cellular and metabolic functions such as intracellular transport, cell signalling and protein turnover. In summary, we have established that 0.43–0.46% of the proteome is N-myristoylated in T. cruzi, approaching that of other eukaryotic organisms (0.5–1.7%).

    A genetic algorithm for the one-dimensional cutting stock problem with setups

    This paper investigates the one-dimensional cutting stock problem considering two conflicting objective functions: minimization of both the number of objects and the number of different cutting patterns used. A new heuristic method based on the concepts of genetic algorithms is proposed to solve the problem. This heuristic is empirically analyzed by solving randomly generated instances and also practical instances from a chemical-fiber company. The computational results show that the method is efficient and obtains positive results when compared to other methods from the literature. © 2014 Brazilian Operations Research Society
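
The two objectives named in the abstract, the number of stock objects used and the number of distinct cutting patterns (setups), can be computed for any candidate solution. The sketch below builds a candidate with a simple first-fit decreasing heuristic, which is a stand-in chosen for brevity and not the genetic algorithm proposed in the paper, and then evaluates both objectives. All names and the example instance are hypothetical.

```python
from collections import Counter


def first_fit_decreasing(item_lengths, stock_length):
    """Greedily assign items to stock objects; returns one list of item lengths per object."""
    objects = []
    for length in sorted(item_lengths, reverse=True):
        if length > stock_length:
            raise ValueError(f"item of length {length} exceeds stock length {stock_length}")
        for obj in objects:
            # Place the item in the first object with enough remaining length.
            if sum(obj) + length <= stock_length:
                obj.append(length)
                break
        else:
            # No existing object fits, so open a new one.
            objects.append([length])
    return objects


def evaluate(objects):
    """Return (number of objects used, number of distinct cutting patterns)."""
    # Two objects share a pattern when they are cut into the same multiset of lengths.
    patterns = Counter(tuple(sorted(obj)) for obj in objects)
    return len(objects), len(patterns)


if __name__ == "__main__":
    plan = first_fit_decreasing([5, 5, 5, 5, 3, 3, 2, 2], stock_length=10)
    print(plan, evaluate(plan))
```

A multi-objective search such as the one described would compare many candidate plans on this pair of values, since reducing the object count can increase the number of setups and vice versa.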

    Genome-Wide Association Study of Plasma Polyunsaturated Fatty Acids in the InCHIANTI Study

    Polyunsaturated fatty acids (PUFA) have a role in many physiological processes, including energy production, modulation of inflammation, and maintenance of cell membrane integrity. High plasma PUFA concentrations have been shown to have beneficial effects on cardiovascular disease and mortality. To identify genetic contributors of plasma PUFA concentrations, we conducted a genome-wide association study of plasma levels of six omega-3 and omega-6 fatty acids in 1,075 participants in the InCHIANTI study on aging. The strongest evidence for association was observed in a region of chromosome 11 that encodes three fatty acid desaturases (FADS1, FADS2, FADS3). The SNP with the most significant association was rs174537 near FADS1 in the analysis of arachidonic acid (AA; p = 5.95×10^−46). Minor allele homozygotes had lower AA compared to the major allele homozygotes and rs174537 accounted for 18.6% of the additive variance in AA concentrations. This SNP was also associated with levels of eicosadienoic acid (EDA; p = 6.78×10^−9) and eicosapentanoic acid (EPA; p = 1.07×10^−14). Participants carrying the allele associated with higher AA, EDA, and EPA also had higher low-density lipoprotein (LDL-C) and total cholesterol levels. Outside the FADS gene cluster, the strongest region of association mapped to chromosome 6 in the region encoding an elongase of very long fatty acids 2 (ELOVL2). In this region, association was observed with EPA (rs953413; p = 1.1×10^−6). The effects of rs174537 were confirmed in an independent sample of 1,076 subjects participating in the GOLDN study. The ELOVL2 SNP was associated with docosapentanoic and DHA but not with EPA in GOLDN. These findings show that polymorphisms of genes encoding enzymes in the metabolism of PUFA contribute to plasma concentrations of fatty acids

    The Role of Host Genetics in Susceptibility to Influenza: A Systematic Review

    Background: The World Health Organization has identified studies of the role of host genetics on susceptibility to severe influenza as a priority. A systematic review was conducted to summarize the current state of evidence on the role of host genetics in susceptibility to influenza (PROSPERO registration number: CRD42011001380). Methods and Findings: PubMed, Web of Science, the Cochrane Library, and OpenSIGLE were searched using a pre-defined strategy for all entries up to the date of the search. Two reviewers independently screened the title and abstract of 1,371 unique articles, and 72 full text publications were selected for inclusion. Mouse models clearly demonstrate that host genetics plays a critical role in susceptibility to a range of human and avian influenza viruses. The Mx genes encoding interferon inducible proteins are the best studied but their relevance to susceptibility in humans is unknown. Although the MxA gene should be considered a candidate gene for further study in humans, over 100 other candidate genes have been proposed. There are however no data associating any of these candidate genes to susceptibility in humans, with the only published study in humans being under-powered. One genealogy study presents moderate evidence of a heritable component to the risk of influenza-associated death, and while the marked familial aggregation of H5N1 cases is suggestive of host genetic factors, this remains unproven. Conclusion: The fundamental question “Is susceptibility to severe influenza in humans heritable?” remains unanswered. No