411 research outputs found
SpBase: the sea urchin genome database and web site
SpBase is a system of databases focused on the genomic information from sea urchins and related echinoderms. It is exposed to the public through a web site served with open source software (http://spbase.org/). The enterprise was undertaken to provide an easily used collection of information to directly support experimental work on these useful research models in cell and developmental biology. The information served from the databases emerges from the draft genomic sequence of the purple sea urchin, Strongylocentrotus purpuratus and includes sequence data and genomic resource descriptions for other members of the echinoderm clade which in total span 540 million years of evolutionary time. This version of the system contains two assemblies of the purple sea urchin genome, associated expressed sequences, gene annotations and accessory resources. Search mechanisms for the sequences and the gene annotations are provided. Because the system is maintained along with the Sea Urchin Genome resource, a database of sequenced clones is also provided
An approach to describing and analysing bulk biological annotation quality: a case study using UniProtKB
Motivation: Annotations are a key feature of many biological databases, used
to convey our knowledge of a sequence to the reader. Ideally, annotations are
curated manually, however manual curation is costly, time consuming and
requires expert knowledge and training. Given these issues and the exponential
increase of data, many databases implement automated annotation pipelines in an
attempt to avoid un-annotated entries. Both manual and automated annotations
vary in quality between databases and annotators, making assessment of
annotation reliability problematic for users. The community lacks a generic
measure for determining annotation quality and correctness, which we look at
addressing within this article. Specifically we investigate word reuse within
bulk textual annotations and relate this to Zipf's Principle of Least Effort.
We use UniProt Knowledge Base (UniProtKB) as a case study to demonstrate this
approach since it allows us to compare annotation change, both over time and
between automated and manually curated annotations.
Results: By applying power-law distributions to word reuse in annotation, we
show clear trends in UniProtKB over time, which are consistent with existing
studies of quality on free text English. Further, we show a clear distinction
between manual and automated analysis and investigate cohorts of protein
records as they mature. These results suggest that this approach holds distinct
promise as a mechanism for judging annotation quality.
Availability: Source code is available at the authors website:
http://homepages.cs.ncl.ac.uk/m.j.bell1/annotation.
Contact: [email protected]: Paper accepted at The European Conference on Computational Biology
2012 (ECCB'12). Subsequently will be published in a special issue of the
journal Bioinformatics. Paper consists of 8 pages, made up of 5 figure
Alpha-particle-induced complex chromosome exchanges transmitted through extra-thymic lymphopoiesis in vitro show evidence of emerging genomic instability
Human exposure to high-linear energy transfer α-particles includes environmental (e.g. radon gas and its decay progeny), medical (e.g. radiopharmaceuticals) and occupational (nuclear industry) sources. The associated health risks of α-particle exposure for lung cancer are well documented however the risk estimates for leukaemia remain uncertain. To further our understanding of α-particle effects in target cells for leukaemogenesis and also to seek general markers of individual exposure to α-particles, this study assessed the transmission of chromosomal damage initially-induced in human haemopoietic stem and progenitor cells after exposure to high-LET α-particles. Cells surviving exposure were differentiated into mature T-cells by extra-thymic T-cell differentiation in vitro. Multiplex fluorescence in situ hybridisation (M-FISH) analysis of naïve T-cell populations showed the occurrence of stable (clonal) complex chromosome aberrations consistent with those that are characteristically induced in spherical cells by the traversal of a single α-particle track. Additionally, complex chromosome exchanges were observed in the progeny of irradiated mature T-cell populations. In addition to this, newly arising de novo chromosome aberrations were detected in cells which possessed clonal markers of α-particle exposure and also in cells which did not show any evidence of previous exposure, suggesting ongoing genomic instability in these populations. Our findings support the usefulness and reliability of employing complex chromosome exchanges as indicators of past or ongoing exposure to high-LET radiation and demonstrate the potential applicability to evaluate health risks associated with α-particle exposure.This work was supported by the Department of Health, UK. Contract RRX95 (RMA NSDTG)
The Orthologue of Sjögren's Syndrome Nuclear Autoantigen 1 (SSNA1) in Trypanosoma brucei Is an Immunogenic Self-Assembling Molecule
Primary Sjögren's Syndrome (PSS) is a highly prevalent autoimmune disease, typically manifesting as lymphocytic infiltration of the exocrine glands leading to chronically impaired lacrimal and salivary secretion. Sjögren's Syndrome nuclear autoantigen 1 (SSNA1 or NA14) is a major specific target for autoantibodies in PSS but the precise function and clinical relevance of this protein are largely unknown. Orthologues of the gene are absent from many of the commonly used model organisms but are present in Chlamyodomonas reinhardtii (in which it has been termed DIP13) and most protozoa. We report the functional characterisation of the orthologue of SSNA1 in the kinetoplastid parasite, Trypanosoma brucei. Both TbDIP13 and human SSNA1 are small coiled-coil proteins which are predicted to be remote homologues of the actin-binding protein tropomyosin. We use comparative proteomic methods to identify potential interacting partners of TbDIP13. We also show evidence that TbDIP13 is able to self-assemble into fibril-like structures both in vitro and in vivo, a property which may contribute to its immunogenicity. Endogenous TbDIP13 partially co-localises with acetylated α-tubulin in the insect procyclic stage of the parasite. However, deletion of the DIP13 gene in cultured bloodstream and procyclic stages of T. brucei has little effect on parasite growth or morphology, indicating either a degree of functional redundancy or a function in an alternative stage of the parasite life cycle
Improved annotation of 3' untranslated regions and complex loci by combination of strand-specific direct RNA sequencing, RNA-seq and ESTs
The reference annotations made for a genome sequence provide the framework
for all subsequent analyses of the genome. Correct annotation is particularly
important when interpreting the results of RNA-seq experiments where short
sequence reads are mapped against the genome and assigned to genes according to
the annotation. Inconsistencies in annotations between the reference and the
experimental system can lead to incorrect interpretation of the effect on RNA
expression of an experimental treatment or mutation in the system under study.
Until recently, the genome-wide annotation of 3-prime untranslated regions
received less attention than coding regions and the delineation of intron/exon
boundaries. In this paper, data produced for samples in Human, Chicken and A.
thaliana by the novel single-molecule, strand-specific, Direct RNA Sequencing
technology from Helicos Biosciences which locates 3-prime polyadenylation sites
to within +/- 2 nt, were combined with archival EST and RNA-Seq data. Nine
examples are illustrated where this combination of data allowed: (1) gene and
3-prime UTR re-annotation (including extension of one 3-prime UTR by 5.9 kb);
(2) disentangling of gene expression in complex regions; (3) clearer
interpretation of small RNA expression and (4) identification of novel genes.
While the specific examples displayed here may become obsolete as genome
sequences and their annotations are refined, the principles laid out in this
paper will be of general use both to those annotating genomes and those seeking
to interpret existing publically available annotations in the context of their
own experimental dataComment: 44 pages, 9 figure
Enzymatic Shaving of the Tegument Surface of Live Schistosomes for Proteomic Analysis: A Rational Approach to Select Vaccine Candidates
Adult schistosome parasites can reside in the host bloodstream for decades surrounded by components of the immune system. It was originally proposed that their survival depended on the secretion of an inert bilayer, the membranocalyx, to protect the underlying plasma membrane from attack. We have investigated whether any proteins were exposed on the surface of live worms using incubation with selected hydrolases, in combination with mass spectrometry to identify released proteins. We show that a small number of parasite proteins are accessible to the enzymes and so could represent constituents of the membranocalyx. We also identified several proteins acquired by the parasite on contact with host cells. In addition, components of the cytolytic complement pathway were detected, but these appeared not to harm the worm, indicating that some of its own surface proteins could inhibit the lytic pathway. We suggest that, collectively, the ‘superficial’ parasite proteins may provide good candidates for a schistosome vaccine
- …