967,505 research outputs found

    Polytraits : a database on biological traits of marine polychaetes

    Get PDF
    The study of ecosystem functioning – the role which organisms play in an ecosystem – is becoming increasingly important in marine ecological research. The functional structure of a community can be represented by a set of functional traits assigned to behavioural, reproductive and morphological characteristics. The collection of these traits from the literature is however a laborious and time-consuming process, and gaps of knowledge and restricted availability of literature are a common problem. Trait data are not yet readily being shared by research communities, and even if they are, a lack of trait data repositories and standards for data formats leads to the publication of trait information in forms which cannot be processed by computers. This paper describes Polytraits (http://polytraits.lifewatchgreece.eu), a database on biological traits of marine polychaetes (bristle worms, Polychaeta: Annelida). At present, the database contains almost 20,000 records on morphological, behavioural and reproductive characteristics of more than 1,000 marine polychaete species, all referenced by literature sources. All data can be freely accessed through the project website in different ways and formats, both human-readable and machine-readable, and have been submitted to the Encyclopedia of Life for archival and integration with trait information from other sources

    Unbiased protein association study on the public human proteome reveals biological connections between co-occurring protein pairs

    Get PDF
    Mass-spectrometry-based, high-throughput proteomics experiments produce large amounts of data. While typically acquired to answer specific biological questions, these data can also be reused in orthogonal wayS to reveal new biological knowledge. We here present a novel method for such orthogonal data reuse of public proteomics data. Our method elucidates biological relationships between proteins based on the co-occurrence of these proteins across human experiments in the PRIDE database. The majority of the significantly co-occurring protein pairs that were detected by our method have been successfully mapped to existing biological knowledge. The validity of our novel method is substantiated by the extremely few pairs that can be mapped to existing knowledge based on random associations between the same set of proteins. Moreover, using literature searches and the STRING database, we were able to derive meaningful biological associations for unannotated protein pairs that were detected using our method, further illustrating that as-yet unknown associations present highly interesting targets for follow-up analysis

    bdbms -- A Database Management System for Biological Data

    Full text link
    Biologists are increasingly using databases for storing and managing their data. Biological databases typically consist of a mixture of raw data, metadata, sequences, annotations, and related data obtained from various sources. Current database technology lacks several functionalities that are needed by biological databases. In this paper, we introduce bdbms, an extensible prototype database management system for supporting biological data. bdbms extends the functionalities of current DBMSs to include: (1) Annotation and provenance management including storage, indexing, manipulation, and querying of annotation and provenance as first class objects in bdbms, (2) Local dependency tracking to track the dependencies and derivations among data items, (3) Update authorization to support data curation via content-based authorization, in contrast to identity-based authorization, and (4) New access methods and their supporting operators that support pattern matching on various types of compressed biological data types. This paper presents the design of bdbms along with the techniques proposed to support these functionalities including an extension to SQL. We also outline some open issues in building bdbms.Comment: This article is published under a Creative Commons License Agreement (http://creativecommons.org/licenses/by/2.5/.) You may copy, distribute, display, and perform the work, make derivative works and make commercial use of the work, but, you must attribute the work to the author and CIDR 2007. 3rd Biennial Conference on Innovative Data Systems Research (CIDR) January 710, 2007, Asilomar, California, US

    Apellicon: a web-based tool for constructing and curating Textpresso databases.

    Get PDF
    As more research literature in the biological sciences is made available in electronic format, text mining systems are increasingly being used to improve the ability of investigators to retrieve relevant information. Through the use of advanced indexing techniques that utilize biological ontologies, semantic databases, and other formal representations of biological concepts text mining systems have been able to effectively parse biological literature. While text mining systems are increasingly effective at creating the linkages required to provide context-specific search results, the systems themselves are difficult to set up and use by novice computer users due to the highly technical nature of the applications. Because most researchers in the biological sciences do not have a strong computer science background we have focused on improving the quality of existing, proven text mining systems by implementing a web-based GUI that greatly improves the workflow of these systems. Textpresso in particular has an excellent web-based interface for searching literature but does not have an easy to use administrative interface. We developed the Apellicon interface to enable a wide range of users to build and manage a Textpresso database. An important feature of Apellicon is that it can enable groups to collaborate in building a Textpresso database

    HTRIdb: an open-access database for experimentally verified human transcriptional regulation interactions

    Get PDF
    Background: The modeling of interactions among transcription factors (TFs) and their respective target genes (TGs) into transcriptional regulatory networks is important for the complete understanding of regulation of biological processes. In the case of human TF-TG interactions, there is no database at present that explicitly provides such information even though many databases containing human TF-TG interaction data have been available. In an effort to provide researchers with a repository of TF-TG interactions from which such interactions can be directly extracted, we present here the Human Transcriptional Regulation Interactions database (HTRIdb).
Description: The HTRIdb is an open-access database of experimentally validated interactions among human TFs and their TGs. HTRIdb can be searched via a user-friendly web interface and the retrieved TF-TG interactions data and the associated protein-protein interactions can be downloaded or interactively visualized as a network using the Cytoscape Web software. Moreover, users can improve the database quality by uploading their own interactions and indicating inconsistencies in the data. So far, HTRIdb has been populated with 283 TFs that regulate 11886 genes, totaling 18160 TF-TG interactions. HTRIdb is freely available at http://www.lbbc.ibb.unesp.br/htri.
Conclusions: HTRIdb is a powerful user-friendly tool from which human experimentally validated TF-TG interactions can be easily extracted and used to construct transcriptional regulation interaction networks enabling researchers to decipher the regulation of biological processes

    Leaf venation networks of Bornean trees: images and hand-traced segmentations.

    Get PDF
    The data set contains images of leaf venation networks obtained from tree species in Malaysian Borneo. The data set contains 726 leaves from 295 species comprising 50 families, sampled from eight forest plots in Sabah. Image extents are approximately 1 × 1 cm, or 50 megapixels. All images contain a region of interest in which all veins have been hand traced. The complete data set includes over 30 billion pixels, of which more than 600 million have been validated by hand tracing. These images are suitable for morphological characterization of these species, as well as for training of machine-learning algorithms that segment biological networks from images. Data are made available under the Open Data Commons Attribution License. You are free to copy, distribute, and use the database; to produce works from the database; and to modify, transform, and build upon the database. You must attribute any public use of the database, or works produced from the database, in the manner specified in the license. For any use or redistribution of the database, or works produced from it, you must make clear to others the license of the database and keep intact any notices on the original database

    Automatic landmarking for building biological shape models

    Get PDF
    We present a new method for automatic landmark extraction from the contours of biological specimens. Our ultimate goal is to enable automatic identification of biological specimens in photographs and drawings held in a database. We propose to use active appearance models for visual indexing of both photographs and drawings. Automatic landmark extraction will assist us in building the models. We describe the results of using our method on drawings and photographs of examples of diatoms, and present an active shape model built using automatically extracted data
    corecore