149 research outputs found
Germin-like proteins (GLPs) in cereal genomes: gene clustering and dynamic roles in plant defence
The recent release of the genome sequences of a number of crop and model plant species has made it possible to define the genome organisation and functional characteristics of specific genes and gene families of agronomic importance. For instance, Sorghum bicolor, maize (Zea mays) and Brachypodium distachyon genome sequences along with the model grass species rice (Oryza sativa) enable the comparative analysis of genes involved in plant defence. Germin-like proteins (GLPs) are a small, functionally and taxonomically diverse class of cupin-domain containing proteins that have recently been shown to cluster in an area of rice chromosome 8. The genomic location of this gene cluster overlaps with a disease resistance QTL that provides defence against two rice fungal pathogens (Magnaporthe oryzae and Rhizoctonia solani). Studies showing the involvement of GLPs in basal host resistance against powdery mildew (Blumeria graminis ssp.) have also been reported in barley and wheat. In this mini-review, we compare the close proximity of GLPs in publicly available cereal crop genomes and discuss the contribution that these proteins, and their genome sequence organisation, play in plant defenc
An Open Framework for Extensible Multi-Stage Bioinformatics Software
In research labs, there is often a need to customise software at every step
in a given bioinformatics workflow, but traditionally it has been difficult to
obtain both a high degree of customisability and good performance.
Performance-sensitive tools are often highly monolithic, which can make
research difficult. We present a novel set of software development principles
and a bioinformatics framework, Friedrich, which is currently in early
development. Friedrich applications support both early stage experimentation
and late stage batch processing, since they simultaneously allow for good
performance and a high degree of flexibility and customisability. These
benefits are obtained in large part by basing Friedrich on the multiparadigm
programming language Scala. We present a case study in the form of a basic
genome assembler and its extension with new functionality. Our architecture has
the potential to greatly increase the overall productivity of software
developers and researchers in bioinformatics.Comment: 12 pages, 1 figure, to appear in proceedings of PRIB 201
Yabi: An online research environment for grid, high performance and cloud computing
Background
There is a significant demand for creating pipelines or workflows in the life science discipline that chain a number of discrete compute and data intensive analysis tasks into sophisticated analysis procedures. This need has led to the development of general as well as domain-specific workflow environments that are either complex desktop applications or Internet-based applications. Complexities can arise when configuring these applications in heterogeneous compute and storage environments if the execution and data access models are not designed appropriately. These complexities manifest themselves through limited access to available HPC resources, significant overhead required to configure tools and inability for users to simply manage files across heterogenous HPC storage infrastructure.
Results
In this paper, we describe the architecture of a software system that is adaptable to a range of both pluggable execution and data backends in an open source implementation called Yabi. Enabling seamless and transparent access to heterogenous HPC environments at its core, Yabi then provides an analysis workflow environment that can create and reuse workflows as well as manage large amounts of both raw and processed data in a secure and flexible way across geographically distributed compute resources. Yabi can be used via a web-based environment to drag-and-drop tools to create sophisticated workflows. Yabi can also be accessed through the Yabi command line which is designed for users that are more comfortable with writing scripts or for enabling external workflow environments to leverage the features in Yabi. Configuring tools can be a significant overhead in workflow environments. Yabi greatly simplifies this task by enabling system administrators to configure as well as manage running tools via a web-based environment and without the need to write or edit software programs or scripts. In this paper, we highlight Yabi's capabilities through a range of bioinformatics use cases that arise from large-scale biomedical data analysis.
Conclusion
The Yabi system encapsulates considered design of both execution and data models, while abstracting technical details away from users who are not skilled in HPC and providing an intuitive drag-and-drop scalable web-based workflow environment where the same tools can also be accessed via a command line. Yabi is currently in use and deployed at multiple institutions and is available at http://ccg.murdoch.edu.au/yabi
A highly conserved gene island of three genes on chromosome 3B of hexaploid wheat: diverse gene function and genomic structure maintained in a tightly linked block
The complexity of the wheat genome has resulted from waves of retrotransposable element insertions. Gene deletions and disruptions generated by the fast replacement of repetitive elements in wheat have resulted in disruption of colinearity at a micro (sub-megabase) level among the cereals. In view of genomic changes that are possible within a given time span, conservation of genes between species tends to imply an important functional or regional constraint that does not permit a change in genomic structure. The ctg1034 contig completed in this paper was initially studied because it was assigned to the Sr2 resistance locus region, but detailed mapping studies subsequently assigned it to the long arm of 3B and revealed its unusual features
Reassociation kinetics-based approach for partial genome sequencing of the cattle tick, Rhipicephalus (Boophilus) microplus
Background: The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence fiscally and technically problematic. To selectively obtain gene-enriched regions of this tick's genome, Cot filtration was performed, and Cot-filtered DNA was sequenced via 454 FLX pyrosequencing.Results: The sequenced Cot-filtered genomic DNA was assembled with an EST-based gene index of 14,586 unique entries where each EST served as a potential "seed" for scaffold formation. The new sequence assembly extended the lengths of 3,913 of the 14,586 gene index entries. Over half of the extensions corresponded to extensions of over 30 amino acids. To survey the repetitive elements in the tick genome, the complete sequences of five BAC clones were determined. Both Class I and II transposable elements were found. Comparison of the BAC and Cot filtration data indicates that Cot filtration was highly successful in filtering repetitive DNA out of the genomic DNA used in 454 sequencing.Conclusion: Cot filtration is a very useful strategy to incorporate into genome sequencing projects on organisms with large genome sizes and which contain high percentages of repetitive, difficult to assemble, genomic DNA. Combining the Cot selection approach with 454 sequencing and assembly with a pre-existing EST database as seeds resulted in extensions of 27% of the members of the EST database
Design of a framework for the deployment of collaborative independent rare disease-centric registries: Gaucher disease registry model
Orphan drug clinical trials often are adversely affected by a lack of high quality treatment efficacy data that can be reliably compared across large patient cohorts derived from multiple governmental and country jurisdictions. It is critical that these patient data be captured with limited corporate involvement. For some time, there have been calls to develop collaborative, non-proprietary, patient-centric registries for post-market surveillance of aspects related to orphan drug efficacy. There is an urgent need for the development and sustainable deployment of these ‘independent’ registries that can capture comprehensive clinical, genetic and therapeutic information on patients with rare diseases. We therefore extended an open-source registry platform, the Rare Disease Registry Framework (RDRF) to establish an Independent Rare Disease Registry (IRDR). We engaged with an established rare disease community for Gaucher disease to determine system requirements, methods of data capture, consent, and reporting. A non-proprietary IRDR model is presented that can serve as autonomous data repository, but more importantly ensures that the relevant data can be made available to appropriate stakeholders in a secure, timely and efficient manner to improve clinical decision-making and the lives of those with a rare diseas
Design of a framework for the deployment of collaborative independent rare disease-centric registries: Gaucher disease registry model
Orphan drug clinical trials often are adversely affected by a lack of high quality treatment efficacy data that can be reliably compared across large patient cohorts derived from multiple governmental and country jurisdictions. It is critical that these patient data be captured with limited corporate involvement. For some time, there have been calls to develop collaborative, non-proprietary, patient-centric registries for post-market surveillance of aspects related to orphan drug efficacy. There is an urgent need for the development and sustainable deployment of these ‘independent’ registries that can capture comprehensive clinical, genetic and therapeutic information on patients with rare diseases. We therefore extended an open-source registry platform, the Rare Disease Registry Framework (RDRF) to establish an Independent Rare Disease Registry (IRDR). We engaged with an established rare disease community for Gaucher disease to determine system requirements, methods of data capture, consent, and reporting. A non-proprietary IRDR model is presented that can serve as autonomous data repository, but more importantly ensures that the relevant data can be made available to appropriate stakeholders in a secure, timely and efficient manner to improve clinical decision-making and the lives of those with a rare disease
Comparative microarray analysis of Rhipicephalus (Boophilus) microplus expression profiles of larvae pre-attachment and feeding adult female stages on Bos indicus and Bos taurus cattle
Background: Rhipicephalus (Boophilus) microplus is an obligate blood feeder which is host specific to cattle. Existing knowledge pertaining to the host or host breed effects on tick transcript expression profiles during the tick - host interaction is poor.
Results: Global analysis of gene expression changes in whole R. microplus ticks during larval, pre-attachment and early adult stages feeding on Bos indicus and Bos taurus cattle were compared using gene expression microarray analysis. Among the 13,601 R. microplus transcripts from BmiGI Version 2 we identified 297 high and 17 low expressed transcripts that were significantly differentially expressed between R. microplus feeding on tick resistant cattle [Bos indicus (Brahman)] compared to R. microplus feeding on tick susceptible cattle [Bos taurus (Holstein-Friesian)] (p <= 0.001). These include genes encoding enzymes involved in primary metabolism, and genes related to stress, defence, cell wall modification, cellular signaling, receptor, and cuticle formation. Microarrays were validated by qRT-PCR analysis of selected transcripts using three housekeeping genes as normalization controls.
Conclusion: The analysis of all tick stages under survey suggested a coordinated regulation of defence proteins, proteases and protease inhibitors to achieve successful attachment and survival of R. microplus on different host breeds, particularly Bos indicus cattle. R. microplus ticks demonstrate different transcript expression patterns when they encounter tick resistant and susceptible breeds of cattle. In this study we provide the first transcriptome evidence demonstrating the influence of tick resistant and susceptible cattle breeds on transcript expression patterns and the molecular physiology of ticks during host attachment and feeding
- …