49 research outputs found

    FlyBase: genomes by the dozen

    Get PDF
    FlyBase () is the primary database of genetic and genomic data for the insect family Drosophilidae. Historically, Drosophila melanogaster has been the most extensively studied species in this family, but recent determination of the genomic sequences of an additional 11 Drosophila species opens up new avenues of research for other Drosophila species. This extensive sequence resource, encompassing species with well-defined phylogenetic relationships, provides a model system for comparative genomic analyses. FlyBase has developed tools to facilitate access to and navigation through this invaluable new data collection

    REC, Drosophila MCM8, Drives Formation of Meiotic Crossovers

    Get PDF
    Crossovers ensure the accurate segregation of homologous chromosomes from one another during meiosis. Here, we describe the identity and function of the Drosophila melanogaster gene recombination defective (rec), which is required for most meiotic crossing over. We show that rec encodes a member of the mini-chromosome maintenance (MCM) protein family. Six MCM proteins (MCM2–7) are essential for DNA replication and are found in all eukaryotes. REC is the Drosophila ortholog of the recently identified seventh member of this family, MCM8. Our phylogenetic analysis reveals the existence of yet another family member, MCM9, and shows that MCM8 and MCM9 arose early in eukaryotic evolution, though one or both have been lost in multiple eukaryotic lineages. Drosophila has lost MCM9 but retained MCM8, represented by REC. We used genetic and molecular methods to study the function of REC in meiotic recombination. Epistasis experiments suggest that REC acts after the Rad51 ortholog SPN-A but before the endonuclease MEI-9. Although crossovers are reduced by 95% in rec mutants, the frequency of noncrossover gene conversion is significantly increased. Interestingly, gene conversion tracts in rec mutants are about half the length of tracts in wild-type flies. To account for these phenotypes, we propose that REC facilitates repair synthesis during meiotic recombination. In the absence of REC, synthesis does not proceed far enough to allow formation of an intermediate that can give rise to crossovers, and recombination proceeds via synthesis-dependent strand annealing to generate only noncrossover products

    Annotating genes and genomes with DNA sequences extracted from biomedical articles

    Get PDF
    Motivation: Increasing rates of publication and DNA sequencing make the problem of finding relevant articles for a particular gene or genomic region more challenging than ever. Existing text-mining approaches focus on finding gene names or identifiers in English text. These are often not unique and do not identify the exact genomic location of a study

    FlyBase at 25: looking to the future.

    Get PDF
    Since 1992, FlyBase (flybase.org) has been an essential online resource for the Drosophila research community. Concentrating on the most extensively studied species, Drosophila melanogaster, FlyBase includes information on genes (molecular and genetic), transgenic constructs, phenotypes, genetic and physical interactions, and reagents such as stocks and cDNAs. Access to data is provided through a number of tools, reports, and bulk-data downloads. Looking to the future, FlyBase is expanding its focus to serve a broader scientific community. In this update, we describe new features, datasets, reagent collections, and data presentations that address this goal, including enhanced orthology data, Human Disease Model Reports, protein domain search and visualization, concise gene summaries, a portal for external resources, video tutorials and the FlyBase Community Advisory Group

    Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species.</p> <p>Results</p> <p>We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (<it>MLH1</it>, <it>PMS2</it>, <it>EPHB4 </it>) and could confirm more than 73% of them based on evidence in the literature.</p> <p>Conclusions</p> <p>The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.</p

    The FlyBase database of the Drosophila genome projects and community literature

    No full text
    corecore