138 research outputs found

    Discovering Biological Progression Underlying Microarray Samples

    In biological systems that undergo processes such as differentiation, a clear concept of progression exists. We present a novel computational approach, called Sample Progression Discovery (SPD), to discover patterns of biological progression underlying microarray gene expression data. SPD assumes that individual samples of a microarray dataset are related by an unknown biological process (e.g., differentiation, development, cell cycle, disease progression), and that each sample represents one unknown point along the progression of that process. SPD aims to organize the samples in a manner that reveals the underlying progression and to simultaneously identify subsets of genes that are responsible for that progression. We demonstrate the performance of SPD on a variety of microarray datasets that were generated by sampling a biological process at different points along its progression, without providing SPD any information about the underlying process. When applied to a cell cycle time series microarray dataset, SPD was not provided any prior knowledge of the samples' time order or of which genes are cell-cycle regulated, yet SPD recovered the correct time order and identified many genes that have been associated with the cell cycle. When applied to B-cell differentiation data, SPD recovered the correct order of stages of normal B-cell differentiation and the linkage between preB-ALL tumor cells and their cell of origin, preB. When applied to mouse embryonic stem cell differentiation data, SPD uncovered a landscape of ESC differentiation into various lineages and genes that represent both generic and lineage-specific processes. When applied to a prostate cancer microarray dataset, SPD identified gene modules that reflect a progression consistent with disease stages. SPD may be best viewed as a novel tool for synthesizing biological hypotheses because it provides a likely biological progression underlying a microarray dataset and, perhaps more importantly, the candidate genes that regulate that progression.

    Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor

    BACKGROUND: Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Updating and maintaining the database requires specialized tools, which we have created and made available for use with Repbase, and which may be useful as a template for other curated databases. RESULTS: We describe the software tools RepbaseSubmitter and Censor, which are designed to facilitate updating and screening the content of Repbase. RepbaseSubmitter is a Java-based interface for formatting and annotating Repbase entries. It eliminates many common formatting errors and automates actions such as calculation of sequence lengths and composition, thus facilitating curation of Repbase sequences. In addition, it has several features for predicting protein-coding regions in sequences, searching and including PubMed references in Repbase entries, and searching the NCBI taxonomy database for correct inclusion of species information and taxonomic position. Censor is a tool to rapidly identify repetitive elements by comparison to known repeats. It uses WU-BLAST for speed and sensitivity, and can conduct DNA-DNA, DNA-protein, or translated DNA-translated DNA searches of genomic sequence. Defragmented output includes a map of repeats present in the query sequence, with the options to report masked query sequence(s), repeat sequences found in the query, and alignments. CONCLUSION: Censor and RepbaseSubmitter are available as both web-based services and downloadable versions. They can be found at (RepbaseSubmitter) and (Censor).

    Boolean implication networks derived from large scale, whole genome microarray datasets

    A method for analysis of microarray data is presented that extracts statistically significant Boolean implication relationships between pairs of genes.

    SINEs, evolution and genome structure in the opossum

    Short INterspersed Elements (SINEs) are non-autonomous retrotransposons, usually between 100 and 500 base pairs (bp) in length, which are ubiquitous components of eukaryotic genomes. Their activity, distribution, and evolution can be highly informative on genomic structure and evolutionary processes. To determine recent activity, we amplified more than one hundred SINE1 loci in a panel of 43 M. domestica individuals derived from five diverse geographic locations. The SINE1 family has expanded recently enough that many loci were polymorphic, and the SINE1 insertion-based genetic distances among populations reflected geographic distance. Genome-wide comparisons of SINE1 densities and GC content revealed that high SINE1 density is associated with high GC content in a few long and many short spans. Young SINE1s, whether fixed or polymorphic, showed an unbiased GC content preference for insertion, indicating that the GC preference accumulates over long time periods, possibly in periodic bursts. SINE1 evolution is thus broadly similar to human Alu evolution, although it has an independent origin. High GC content adjacent to SINE1s is strongly correlated with bias towards higher AT to GC substitutions and lower GC to AT substitutions. This is consistent with biased gene conversion, and also indicates that like chickens, but unlike eutherian mammals, GC content heterogeneity (isochore structure) is reinforced by substitution processes in the M. domestica genome. Nevertheless, both high and low GC content regions are apparently headed towards lower GC content equilibria, possibly due to a relative shift to lower recombination rates in the recent Monodelphis ancestral lineage. Like eutherians, metatherian (marsupial) mammals have evolved high CpG substitution rates, but this is apparently a convergence in process rather than a shared ancestral state. © 2007 Elsevier B.V. All rights reserved

    Maternal Anti-Dengue IgG Fucosylation Predicts Susceptibility to Dengue Disease in Infants

    Infant mortality from dengue disease is a devastating global health burden that could be minimized with the ability to identify susceptibility for severe disease prior to infection. Although most primary infant dengue infections are asymptomatic, maternally derived anti-dengue immunoglobulin G (IgGs) present during infection can trigger progression to severe disease through antibody-dependent enhancement mechanisms. Importantly, specific characteristics of maternal IgGs that herald progression to severe infant dengue are unknown. Here, we define ≥10% afucosylation of maternal anti-dengue IgGs as a risk factor for susceptibility of infants to symptomatic dengue infections. Mechanistic experiments show that afucosylation of anti-dengue IgGs promotes FcγRIIIa signaling during infection, in turn enhancing dengue virus replication in FcγRIIIa+ monocytes. These studies identify a post-translational modification of anti-dengue IgGs that correlates with risk for symptomatic infant dengue infections and define a mechanism by which afucosylated antibodies and FcγRIIIa enhance dengue infections.

    Conditional expression of HGAL leads to the development of diffuse large B-cell lymphoma in mice

    Diffuse large B-cell lymphomas (DLBCLs) are clinically and genetically heterogeneous tumors. Deregulation of diverse biological processes specific to B cells, such as B-cell receptor (BCR) signaling and motility regulation, contributes to lymphomagenesis. Human germinal center associated lymphoma (HGAL) is a B-cell–specific adaptor protein controlling BCR signaling and B lymphocyte motility. In normal B cells, it is expressed in germinal center (GC) B lymphocytes and promptly downregulated upon further differentiation. The majority of DLBCL tumors, primarily GC B-cell types, but also activated types, express HGAL. To investigate the consequences of constitutive expression of HGAL in vivo, we generated mice that conditionally express human HGAL at different stages of hematopoietic development using 3 restricted Cre-mediated approaches to initiate expression of HGAL in hematopoietic stem cells, pro-B cells, or GC B cells. Following immune stimulation, we observed larger GCs in mice in which HGAL expression was initiated in GC B cells. All 3 mouse strains developed DLBCL at a frequency of 12% to 30% starting at age 13 months, leading to shorter survival. Immunohistochemical studies showed that all analyzed tumors were of the GC B-cell type. Exon sequencing revealed mutations reported in human DLBCL. Our data demonstrate that constitutive enforced expression of HGAL leads to DLBCL development.

    Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences

    We report a high-quality draft of the genome sequence of the grey, short-tailed opossum (Monodelphis domestica). As the first metatherian ('marsupial') species to be sequenced, the opossum provides a unique perspective on the organization and evolution of mammalian genomes. Distinctive features of the opossum chromosomes provide support for recent theories about genome evolution and function, including a strong influence of biased gene conversion on nucleotide sequence composition, and a relationship between chromosomal characteristics and X chromosome inactivation. Comparison of opossum and eutherian genomes also reveals a sharp difference in evolutionary innovation between protein-coding and non-coding functional elements. True innovation in protein-coding genes seems to be relatively rare, with lineage-specific differences being largely due to diversification and rapid turnover in gene families involved in environmental interactions. In contrast, about 20% of eutherian conserved non-coding elements (CNEs) are recent inventions that postdate the divergence of Eutheria and Metatheria. A substantial proportion of these eutherian-specific CNEs arose from sequence inserted by transposable elements, pointing to transposons as a major creative force in the evolution of mammalian gene regulation. ©2007 Nature Publishing Group

    Multidisciplinary investigations of the diets of two post-medieval populations from London using stable isotopes and microdebris analysis

    This paper presents the first multi-tissue study of diet in post-medieval London using both the stable light isotope analysis of carbon and nitrogen and analysis of microdebris in dental calculus. Dietary intake was explored over short and long timescales. Bulk bone collagen was analysed from humans from the Queen’s Chapel of the Savoy (QCS) (n = 66) and the St Barnabas/St Mary Abbots (SB) (n = 25). Incremental dentine analysis was performed on the second molar of individual QCS1123 to explore childhood dietary intake. Bulk hair samples (n = 4) were sampled from adults from QCS, and dental calculus was analysed from four other individuals using microscopy. In addition, bone collagen from a total of 46 animals from QCS (n = 11) and the additional site of Prescot Street (n = 35) was analysed, providing the first animal dietary baseline for post-medieval London. Overall, isotopic results suggest a largely C3-based terrestrial diet for both populations, with the exception of QCS1123 who exhibited values consistent with the consumption of C4 food sources throughout childhood and adulthood. The differences exhibited in δ15Ncoll across both populations likely reflect variations in diet due to social class and occupation, with individuals from SB likely representing wealthier individuals consuming larger quantities of animal and marine fish protein. Microdebris analysis results were limited but indicate the consumption of domestic cereals. This paper demonstrates the utility of a multidisciplinary approach to investigate diet across long and short timescales to further our understanding of variations in social status and mobility