150 research outputs found

    Development of a cloud-based Bioinformatics Training Platform

    Get PDF
    First published online: April 15, 2016The Bioinformatics Training Platform (BTP) has been developed to provide access to the computational infrastructure required to deliver sophisticated hands-on bioinformatics training courses. The BTP is a cloud-based solution that is in active use for delivering next-generation sequencing training to Australian researchers at geographically dispersed locations. The BTP was built to provide an easy, accessible, consistent and cost-effective approach to delivering workshops at host universities and organizations with a high demand for bioinformatics training but lacking the dedicated bioinformatics training suites required. To support broad uptake of the BTP, the platform has been made compatible with multiple cloud infrastructures. The BTP is an open-source and open-access resource. To date, 20 training workshops have been delivered to over 700 trainees at over 10 venues across Australia using the BTP.Jerico Revote, Nathan S. Watson-Haigh, Steve Quenette, Blair Bethwaite, Annette McGrath and Catherine A. Shan

    Towards an open, collaborative, reusable framework for sharing hands-on bioinformatics training workshops

    Get PDF
    There is a clear demand for hands-on bioinformatics training. The development of bioinformatics workshop content is both time-consuming and expensive. Therefore, enabling trainers to develop bioinformatics workshops in a way that facilitates reuse is becoming increasingly important. The most widespread practice for sharing workshop content is through making PDF, PowerPoint and Word documents available online. While this effort is to be commended, such content is usually not so easy to reuse or repurpose and does not capture all the information required for a third party to rerun a workshop. We present an open, collaborative framework for developing and maintaining, reusable and shareable hands-on training workshop content.Nathan S. Watson-Haigh, Jerico Revote, Radosław Suchecki, Sonika Tyagi, Susan M. Corley, Catherine A. Shang, Annette McGrat

    Identifying the genetic control of salinity tolerance in the bread wheat landrace Mocho de Espiga Branca

    Get PDF
    Salinity tolerance in bread wheat is frequently reported to be associated with low leaf sodium (Na+) concentrations. However, the Portuguese landrace, Mocho de Espiga Branca, accumulates significantly higher leaf Na+ but has comparable salinity tolerance to commercial bread wheat cultivars. To determine the genetic loci associated with the salinity tolerance of this landrace, an F2 mapping population was developed by crossing Mocho de Espiga Branca with the Australian cultivar Gladius. The population was phenotyped for 19 salinity tolerance subtraits using both non-destructive and destructive techniques. Genotyping was performed using genotyping-by-sequencing (GBS). Genomic regions associated with salinity tolerance were detected on chromosomes 1A, 1D, 4B and 5A for the subtraits of relative and absolute growth rate (RGR, AGR respectively), and on chromosome 2A, 2B, 4D and 5D for Na+, potassium (K+) and chloride (Cl−) accumulation. Candidate genes that encode proteins associated with salinity tolerance were identified within the loci including Na+/H+ antiporters, K+ channels, H+-ATPase, calcineurin B-like proteins (CBLs), CBL-interacting protein kinases (CIPKs), calcium dependent protein kinases (CDPKs) and calcium-transporting ATPase. This study provides a new insight into the genetic control of salinity tolerance in a Na+ accumulating bread wheat to assist with the future development of salt tolerant cultivars.Chana Borjigin, Rhiannon K. Schilling, Nathaniel Jewell, Chris Brien, Juan Carlos Sanchez-Ferrero, Paul J. Eckermann ... et al

    Next-generation sequencing: A challenge to meet the increasing demand for training workshops in Australia

    Get PDF
    The widespread adoption of high-throughput next-generation sequencing (NGS) technology among the Australian life science research community is highlighting an urgent need to up-skill biologists in tools required for handling and analysing their NGS data. There is currently a shortage of cutting-edge bioinformatics training courses in Australia as a consequence of a scarcity of skilled trainers with time and funding to develop and deliver training courses. To address this, a consortium of Australian research organizations, including Bioplatforms Australia, the Commonwealth Scientific and Industrial Research Organisation and the Australian Bioinformatics Network, have been collaborating with EMBL-EBI training team. A group of Australian bioinformaticians attended the train-the-trainer workshop to improve training skills in developing and delivering bioinformatics workshop curriculum. A 2-day NGS workshop was jointly developed to provide hands-on knowledge and understanding of typical NGS data analysis workflows. The road show–style workshop was successfully delivered at five geographically distant venues in Australia using the newly established Australian NeCTAR Research Cloud. We highlight the challenges we had to overcome at different stages from design to delivery, including the establishment of an Australian bioinformatics training network and the computing infrastructure and resource development. A virtual machine image, workshop materials and scripts for configuring a machine with workshop contents have all been made available under a Creative Commons Attribution 3.0 Unported License. This means participants continue to have convenient access to an environment they had become familiar and bioinformatics trainers are able to access and reuse these resources.Nathan S.Watson-Haigh, Catherine A. Shang, Matthias Haimel, Myrto Kostadima, Remco Loos, Nandan Deshpande, Konsta Duesing, Xi Li, Annette McGrath, Sean McWilliam, Simon Michnowicz, Paula Moolhuijzen, Steve Quenette, Jerico Nico De Leon Revote, SonikaTyagi and Maria V. Schneide

    Transcriptome profiling of grapevine seedless segregants during berry development reveals candidate genes associated with berry weight

    Get PDF
    Indexación: Web of Science; PubMedBackground Berry size is considered as one of the main selection criteria in table grape breeding programs. However, this is a quantitative and polygenic trait, and its genetic determination is still poorly understood. Considering its economic importance, it is relevant to determine its genetic architecture and elucidate the mechanisms involved in its expression. To approach this issue, an RNA-Seq experiment based on Illumina platform was performed (14 libraries), including seedless segregants with contrasting phenotypes for berry weight at fruit setting (FST) and 6–8 mm berries (B68) phenological stages. Results A group of 526 differentially expressed (DE) genes were identified, by comparing seedless segregants with contrasting phenotypes for berry weight: 101 genes from the FST stage and 463 from the B68 stage. Also, we integrated differential expression, principal components analysis (PCA), correlations and network co-expression analyses to characterize the transcriptome profiling observed in segregants with contrasting phenotypes for berry weight. After this, 68 DE genes were selected as candidate genes, and seven candidate genes were validated by real time-PCR, confirming their expression profiles. Conclusions We have carried out the first transcriptome analysis focused on table grape seedless segregants with contrasting phenotypes for berry weight. Our findings contributed to the understanding of the mechanisms involved in berry weight determination. Also, this comparative transcriptome profiling revealed candidate genes for berry weight which could be evaluated as selection tools in table grape breeding programs.http://bmcplantbiol.biomedcentral.com/articles/10.1186/s12870-016-0789-

    Long-Baseline Neutrino Facility (LBNF) and Deep Underground Neutrino Experiment (DUNE) Conceptual Design Report Volume 2: The Physics Program for DUNE at LBNF

    Full text link
    The Physics Program for the Deep Underground Neutrino Experiment (DUNE) at the Fermilab Long-Baseline Neutrino Facility (LBNF) is described

    Evaluation and improvement of the regulatory inference for large co-expression networks with limited sample size

    Get PDF
    Abstract Background Co-expression has been widely used to identify novel regulatory relationships using high throughput measurements, such as microarray and RNA-seq data. Evaluation studies on co-expression network analysis methods mostly focus on networks of small or medium size of up to a few hundred nodes. For large networks, simulated expression data usually consist of hundreds or thousands of profiles with different perturbations or knock-outs, which is uncommon in real experiments due to their cost and the amount of work required. Thus, the performances of co-expression network analysis methods on large co-expression networks consisting of a few thousand nodes, with only a small number of profiles with a single perturbation, which more accurately reflect normal experimental conditions, are generally uncharacterized and unknown. Methods We proposed a novel network inference methods based on Relevance Low order Partial Correlation (RLowPC). RLowPC method uses a two-step approach to select on the high-confidence edges first by reducing the search space by only picking the top ranked genes from an intial partial correlation analysis and, then computes the partial correlations in the confined search space by only removing the linear dependencies from the shared neighbours, largely ignoring the genes showing lower association. Results We selected six co-expression-based methods with good performance in evaluation studies from the literature: Partial correlation, PCIT, ARACNE, MRNET, MRNETB and CLR. The evaluation of these methods was carried out on simulated time-series data with various network sizes ranging from 100 to 3000 nodes. Simulation results show low precision and recall for all of the above methods for large networks with a small number of expression profiles. We improved the inference significantly by refinement of the top weighted edges in the pre-inferred partial correlation networks using RLowPC. We found improved performance by partitioning large networks into smaller co-expressed modules when assessing the method performance within these modules. Conclusions The evaluation results show that current methods suffer from low precision and recall for large co-expression networks where only a small number of profiles are available. The proposed RLowPC method effectively reduces the indirect edges predicted as regulatory relationships and increases the precision of top ranked predictions. Partitioning large networks into smaller highly co-expressed modules also helps to improve the performance of network inference methods. The RLowPC R package for network construction, refinement and evaluation is available at GitHub: https://github.com/wyguo/RLowPC

    Four simple recommendations to encourage best practices in research software [version 1; referees: awaiting peer review]

    Get PDF
    Scientific research relies on computer software, yet software is not always developed following practices that ensure its quality and sustainability. This manuscript does not aim to propose new software development best practices, but rather to provide simple recommendations that encourage the adoption of existing best practices. Software development best practices promote better quality software, and better quality software improves the reproducibility and reusability of research. These recommendations are designed around Open Source values, and provide practical suggestions that contribute to making research software and its source code more discoverable, reusable and transparent. This manuscript is aimed at developers, but also at organisations, projects, journals and funders that can increase the quality and sustainability of research software by encouraging the adoption of these recommendations. Keyword

    Genetic architecture of gene expression in ovine skeletal muscle

    Get PDF
    In livestock populations the genetic contribution to muscling is intensively monitored in the progeny of industry sires and used as a tool in selective breeding programs. The genes and pathways conferring this genetic merit are largely undefined. Genetic variation within a population has potential, amongst other mechanisms, to alter gene expression via cis- or trans-acting mechanisms in a manner that impacts the functional activities of specific pathways that contribute to muscling traits. By integrating sire-based genetic merit information for a muscling trait with progeny-based gene expression data we directly tested the hypothesis that there is genetic structure in the gene expression program in ovine skeletal muscle. Results The genetic performance of six sires for a well defined muscling trait, longissimus lumborum muscle depth, was measured using extensive progeny testing and expressed as an Estimated Breeding Value by comparison with contemporary sires. Microarray gene expression data were obtained for longissimus lumborum samples taken from forty progeny of the six sires (4-8 progeny/sire). Initial unsupervised hierarchical clustering analysis revealed strong genetic architecture to the gene expression data, which also discriminated the sire-based Estimated Breeding Value for the trait. An integrated systems biology approach was then used to identify the major functional pathways contributing to the genetics of enhanced muscling by using both Estimated Breeding Value weighted gene co-expression network analysis and a differential gene co-expression network analysis. The modules of genes revealed by these analyses were enriched for a number of functional terms summarised as muscle sarcomere organisation and development, protein catabolism (proteosome), RNA processing, mitochondrial function and transcriptional regulation. Conclusions This study has revealed strong genetic structure in the gene expression program within ovine longissimus lumborum muscle. The balance between muscle protein synthesis, at the levels of both transcription and translation control, and protein catabolism mediated by regulated proteolysis is likely to be the primary determinant of the genetic merit for the muscling trait in this sheep population. There is also evidence that high genetic merit for muscling is associated with a fibre type shift toward fast glycolytic fibres. This study provides insight into mechanisms, presumably subject to strong artificial selection, that underpin enhanced muscling in sheep populations
    • …
    corecore