1,017 research outputs found

    RepSeq-A database of amino acid repeats present in lower eukaryotic pathogens

    Get PDF
    BACKGROUND Amino acid repeat-containing proteins have a broad range of functions and their identification is of relevance to many experimental biologists. In human-infective protozoan parasites (such as the Kinetoplastid and Plasmodium species), they are implicated in immune evasion and have been shown to influence virulence and pathogenicity. RepSeq http://repseq.gugbe.com is a new database of amino acid repeat-containing proteins found in lower eukaryotic pathogens. The RepSeq database is accessed via a web-based application which also provides links to related online tools and databases for further analyses. RESULTS The RepSeq algorithm typically identifies more than 98% of repeat-containing proteins and is capable of identifying both perfect and mismatch repeats. The proportion of proteins that contain repeat elements varies greatly between different families and even species (3 - 35% of the total protein content). The most common motif type is the Sequence Repeat Region (SRR) - a repeated motif containing multiple different amino acid types. Proteins containing Single Amino Acid Repeats (SAARs) and Di-Peptide Repeats (DPRs) typically account for 0.5 - 1.0% of the total protein number. Notable exceptions are P. falciparum and D. discoideum, in which 33.67% and 34.28% respectively of the predicted proteomes consist of repeat-containing proteins. These numbers are due to large insertions of low complexity single and multi-codon repeat regions. CONCLUSION The RepSeq database provides a repository for repeat-containing proteins found in parasitic protozoa. The database allows for both individual and cross-species proteome analyses and also allows users to upload sequences of interest for analysis by the RepSeq algorithm. Identification of repeat-containing proteins provides researchers with a defined subset of proteins which can be analysed by expression profiling and functional characterisation, thereby facilitating study of pathogenicity and virulence factors in the parasitic protozoa. While primarily designed for kinetoplastid work, the RepSeq algorithm and database retain full functionality when used to analyse other species

    Wide-Scale Analysis of Human Functional Transcription Factor Binding Reveals a Strong Bias towards the Transcription Start Site

    Get PDF
    We introduce a novel method to screen the promoters of a set of genes with shared biological function, against a precompiled library of motifs, and find those motifs which are statistically over-represented in the gene set. The gene sets were obtained from the functional Gene Ontology (GO) classification; for each set and motif we optimized the sequence similarity score threshold, independently for every location window (measured with respect to the TSS), taking into account the location dependent nucleotide heterogeneity along the promoters of the target genes. We performed a high throughput analysis, searching the promoters (from 200bp downstream to 1000bp upstream the TSS), of more than 8000 human and 23,000 mouse genes, for 134 functional Gene Ontology classes and for 412 known DNA motifs. When combined with binding site and location conservation between human and mouse, the method identifies with high probability functional binding sites that regulate groups of biologically related genes. We found many location-sensitive functional binding events and showed that they clustered close to the TSS. Our method and findings were put to several experimental tests. By allowing a "flexible" threshold and combining our functional class and location specific search method with conservation between human and mouse, we are able to identify reliably functional TF binding sites. This is an essential step towards constructing regulatory networks and elucidating the design principles that govern transcriptional regulation of expression. The promoter region proximal to the TSS appears to be of central importance for regulation of transcription in human and mouse, just as it is in bacteria and yeast.Comment: 31 pages, including Supplementary Information and figure

    Genetic Diversity Enhances Restoration Success by Augmenting Ecosystem Services

    Get PDF
    Disturbance and habitat destruction due to human activities is a pervasive problem in near-shore marine ecosystems, and restoration is often used to mitigate losses. A common metric used to evaluate the success of restoration is the return of ecosystem services. Previous research has shown that biodiversity, including genetic diversity, is positively associated with the provision of ecosystem services. We conducted a restoration experiment using sources, techniques, and sites similar to actual large-scale seagrass restoration projects and demonstrated that a small increase in genetic diversity enhanced ecosystem services (invertebrate habitat, increased primary productivity, and nutrient retention). In our experiment, plots with elevated genetic diversity had plants that survived longer, increased in density more quickly, and provided more ecosystem services (invertebrate habitat, increased primary productivity, and nutrient retention). We used the number of alleles per locus as a measure of genetic diversity, which, unlike clonal diversity used in earlier research, can be applied to any organism. Additionally, unlike previous studies where positive impacts of diversity occurred only after a large disturbance, this study assessed the importance of diversity in response to potential environmental stresses (high temperature, low light) along a water–depth gradient. We found a positive impact of diversity along the entire depth gradient. Taken together, these results suggest that ecosystem restoration will significantly benefit from obtaining sources (transplants or seeds) with high genetic diversity and from restoration techniques that can maintain that genetic diversity

    Advances in the treatment of chronic myeloid leukemia

    Get PDF
    Although imatinib is firmly established as an effective therapy for newly diagnosed patients with chronic myeloid leukemia (CML), the field continues to advance on several fronts. In this minireview we cover recent results of second generation tyrosine kinase inhibitors in newly diagnosed patients, investigate the state of strategies to discontinue therapy and report on new small molecule inhibitors to tackle resistant disease, focusing on agents that target the T315I mutant of BCR-ABL. As a result of these advances, standard of care in frontline therapy has started to gravitate toward dasatinib and nilotinib, although more observation is needed to fully support this. Stopping therapy altogether remains a matter of clinical trials, and more must be learned about the mechanisms underlying the persistence of leukemic cells with treatment. However, there is good news for patients with the T315I mutation, as effective drugs such as ponatinib are on their way to regulatory approval. Despite these promising data, accelerated or blastic phase disease remains a challenge, possibly due to BCR-ABL-independent resistance

    The vertebrate phylotypic stage and an early bilaterian-related stage in mouse embryogenesis defined by genomic information

    Get PDF
    BACKGROUND: Embryos of taxonomically different vertebrates are thought to pass through a stage in which they resemble one another morphologically. This "vertebrate phylotypic stage" may represent the basic vertebrate body plan that was established in the common ancestor of vertebrates. However, much controversy remains about when the phylotypic stage appears, and whether it even exists. To overcome the limitations of studies based on morphological comparison, we explored a comprehensive quantitative method for defining the constrained stage using expressed sequence tag (EST) data, gene ontologies (GO), and available genomes of various animals. If strong developmental constraints occur during the phylotypic stage of vertebrate embryos, then genes conserved among vertebrates would be highly expressed at this stage. RESULTS: We established a novel method for evaluating the ancestral nature of mouse embryonic stages that does not depend on comparative morphology. The numerical "ancestor index" revealed that the mouse indeed has a highly conserved embryonic period at embryonic day 8.0–8.5, the time of appearance of the pharyngeal arch and somites. During this period, the mouse prominently expresses GO-determined developmental genes shared among vertebrates. Similar analyses revealed the existence of a bilaterian-related period, during which GO-determined developmental genes shared among bilaterians are markedly expressed at the cleavage-to-gastrulation period. The genes associated with the phylotypic stage identified by our method are essential in embryogenesis. CONCLUSION: Our results demonstrate that the mid-embryonic stage of the mouse is indeed highly constrained, supporting the existence of the phylotypic stage. Furthermore, this candidate stage is preceded by a putative bilaterian ancestor-related period. These results not only support the developmental hourglass model, but also highlight the hierarchical aspect of embryogenesis proposed by von Baer. Identification of conserved stages and tissues by this method in various animals would be a powerful tool to examine the phylotypic stage hypothesis, and to understand which kinds of developmental events and gene sets are evolutionarily constrained and how they limit the possible variations of animal basic body plans

    Systematic identification of conserved motif modules in the human genome

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The identification of motif modules, groups of multiple motifs frequently occurring in DNA sequences, is one of the most important tasks necessary for annotating the human genome. Current approaches to identifying motif modules are often restricted to searches within promoter regions or rely on multiple genome alignments. However, the promoter regions only account for a limited number of locations where transcription factor binding sites can occur, and multiple genome alignments often cannot align binding sites with their true counterparts because of the short and degenerative nature of these transcription factor binding sites.</p> <p>Results</p> <p>To identify motif modules systematically, we developed a computational method for the entire non-coding regions around human genes that does not rely upon the use of multiple genome alignments. First, we selected orthologous DNA blocks approximately 1-kilobase in length based on discontiguous sequence similarity. Next, we scanned the conserved segments in these blocks using known motifs in the TRANSFAC database. Finally, a frequent pattern mining technique was applied to identify motif modules within these blocks. In total, with a false discovery rate cutoff of 0.05, we predicted 3,161,839 motif modules, 90.8% of which are supported by various forms of functional evidence. Compared with experimental data from 14 ChIP-seq experiments, on average, our methods predicted 69.6% of the ChIP-seq peaks with TFBSs of multiple TFs. Our findings also show that many motif modules have distance preference and order preference among the motifs, which further supports the functionality of these predictions.</p> <p>Conclusions</p> <p>Our work provides a large-scale prediction of motif modules in mammals, which will facilitate the understanding of gene regulation in a systematic way.</p

    Radio emission from Supernova Remnants

    Get PDF
    The explosion of a supernova releases almost instantaneously about 10^51 ergs of mechanic energy, changing irreversibly the physical and chemical properties of large regions in the galaxies. The stellar ejecta, the nebula resulting from the powerful shock waves, and sometimes a compact stellar remnant, constitute a supernova remnant (SNR). They can radiate their energy across the whole electromagnetic spectrum, but the great majority are radio sources. Almost 70 years after the first detection of radio emission coming from a SNR, great progress has been achieved in the comprehension of their physical characteristics and evolution. We review the present knowledge of different aspects of radio remnants, focusing on sources of the Milky Way and the Magellanic Clouds, where the SNRs can be spatially resolved. We present a brief overview of theoretical background, analyze morphology and polarization properties, and review and critical discuss different methods applied to determine the radio spectrum and distances. The consequences of the interaction between the SNR shocks and the surrounding medium are examined, including the question of whether SNRs can trigger the formation of new stars. Cases of multispectral comparison are presented. A section is devoted to reviewing recent results of radio SNRs in the Magellanic Clouds, with particular emphasis on the radio properties of SN 1987A, an ideal laboratory to investigate dynamical evolution of an SNR in near real time. The review concludes with a summary of issues on radio SNRs that deserve further study, and analyzing the prospects for future research with the latest generation radio telescopes.Comment: Revised version. 48 pages, 15 figure

    c-REDUCE: Incorporating sequence conservation to detect motifs that correlate with expression

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Computational methods for characterizing novel transcription factor binding sites search for sequence patterns or "motifs" that appear repeatedly in genomic regions of interest. Correlation-based motif finding strategies are used to identify motifs that correlate with expression data and do not rely on promoter sequences from a pre-determined set of genes.</p> <p>Results</p> <p>In this work, we describe a method for predicting motifs that combines the correlation-based strategy with phylogenetic footprinting, where motifs are identified by evaluating orthologous sequence regions from multiple species. Our method, c-REDUCE, can account for variability at a motif position inferred from evolutionary information. c-REDUCE has been tested on ChIP-chip data for yeast transcription factors and on gene expression data in <it>Drosophila</it>.</p> <p>Conclusion</p> <p>Our results indicate that utilizing sequence conservation information in addition to correlation-based methods improves the identification of known motifs.</p

    Azimuthal anisotropy and correlations at large transverse momenta in p+pp+p and Au+Au collisions at sNN\sqrt{s_{_{NN}}}= 200 GeV

    Get PDF
    Results on high transverse momentum charged particle emission with respect to the reaction plane are presented for Au+Au collisions at sNN\sqrt{s_{_{NN}}}= 200 GeV. Two- and four-particle correlations results are presented as well as a comparison of azimuthal correlations in Au+Au collisions to those in p+pp+p at the same energy. Elliptic anisotropy, v2v_2, is found to reach its maximum at pt3p_t \sim 3 GeV/c, then decrease slowly and remain significant up to pt7p_t\approx 7 -- 10 GeV/c. Stronger suppression is found in the back-to-back high-ptp_t particle correlations for particles emitted out-of-plane compared to those emitted in-plane. The centrality dependence of v2v_2 at intermediate ptp_t is compared to simple models based on jet quenching.Comment: 4 figures. Published version as PRL 93, 252301 (2004
    corecore