186 research outputs found

    Alone in the Universe

    Get PDF

    Using 2k + 2 bubble searches to find single nucleotide polymorphisms in k-mer graphs

    Get PDF
    Motivation: Single nucleotide polymorphism (SNP) discovery is an important preliminary for understanding genetic variation. With current sequencing methods, we can sample genomes comprehensively. SNPs are found by aligning sequence reads against longer assembled references. De Bruijn graphs are efficient data structures that can deal with the vast amount of data from modern technologies. Recent work has shown that the topology of these graphs captures enough information to allow the detection and characterization of genetic variants, offering an alternative to alignment-based methods. Such methods rely on depth-first walks of the graph to identify closing bifurcations. These methods are conservative or generate many false-positive results, particularly when traversing highly inter-connected (complex) regions of the graph or in regions of very high coverage. Results: We devised an algorithm that calls SNPs in converted De Bruijn graphs by enumerating 2k + 2 cycles. We evaluated the accuracy of predicted SNPs by comparison with SNP lists from alignment-based methods. We tested accuracy of the SNP calling using sequence data from 16 ecotypes of Arabidopsis thaliana and found that accuracy was high. We found that SNP calling was even across the genome and genomic feature types. Using sequence-based attributes of the graph to train a decision tree allowed us to increase accuracy of SNP calls further. Together these results indicate that our algorithm is capable of finding SNPs accurately in complex sub-graphs and potentially comprehensively from whole genome graphs

    Improved K-mer Based Prediction of Protein-Protein Interactions With Chaos Game Representation, Deep Learning and Reduced Representation Bias

    Full text link
    Protein-protein interactions drive many biological processes, including the detection of phytopathogens by plants' R-Proteins and cell surface receptors. Many machine learning studies have attempted to predict protein-protein interactions but performance is highly dependent on training data; models have been shown to accurately predict interactions when the proteins involved are included in the training data, but achieve consistently poorer results when applied to previously unseen proteins. In addition, models that are trained using proteins that take part in multiple interactions can suffer from representation bias, where predictions are driven not by learned biological features but by learning of the structure of the interaction dataset. We present a method for extracting unique pairs from an interaction dataset, generating non-redundant paired data for unbiased machine learning. After applying the method to datasets containing _Arabidopsis thaliana_ and pathogen effector interations, we developed a convolutional neural network model capable of learning and predicting interactions from Chaos Game Representations of proteins' coding genes

    Andragogy in the 21st century: Applying the Assumptions of Adult Learning Online

    Get PDF
    Regardless of whether their motivation is intrinsic or extrinsic, adults undertakea course of learning with much more sophisticated needs and expectations thanyounger learners, and this will strongly influence their persistence. The sixassumptions of Knowles’ Andragogical Model provide insight into this psychomotivationalcocktail that we will use to make practical recommendations forinstructors about how to fully activate adults’ imperative to articulate andaccomplish their online educational goals—an essential variable toward theirsuccess. Given an attrition rate of up to 80% for some online learning contexts,it is vital that the educational approach of instructional design for online learningaligns with the learning objectives that correspond to learners’ real-world needs.If educational technology is to live up to the promise of enhancing onlinelearning outcomes, a different paradigm for instructional design and delivery ofcontent is needed. This paper will provide guidelines and techniques forincorporating adult learning principles into the structure, delivery, andmentoring/administration of online courses of study

    Using false discovery rates to benchmark SNP-callers in next-generation sequencing projects

    Get PDF
    Funding: R.A.F. was funded by the Natural Environment Research Council (NERC). D.A.H. and M.C.F. were supported by the Wellcome Trust. No additional external funding received for this study.Peer reviewedPublisher PD

    Protein engineering expands the effector recognition profile of a rice NLR immune receptor

    Get PDF
    Plant nucleotide binding, leucine-rich repeat (NLR) receptors detect pathogen effectors and initiate an immune response. Since their discovery, NLRs have been the focus of protein engineering to improve disease resistance. However, this approach has proven challenging, in part due to their narrow response specificity. Previously, we revealed the structural basis of pathogen recognition by the integrated heavy metal associated (HMA) domain of the rice NLR Pikp (Maqbool et al., 2015). Here, we used structure-guided engineering to expand the response profile of Pikp to variants of the rice blast pathogen effector AVR-Pik. A mutation located within an effector-binding interface of the integrated Pikp-HMA domain increased the binding affinity for AVR-Pik variants in vitro and in vivo. This translates to an expanded cell-death response to AVR-Pik variants previously unrecognized by Pikp in planta. The structures of the engineered Pikp-HMA in complex with AVR-Pik variants revealed the mechanism of expanded recognition. These results provide a proof-of-concept that protein engineering can improve the utility of plant NLR receptors where direct interaction between effectors and NLRs is established, particularly where this interaction occurs via integrated domains

    Morphological and Physiological Changes of Brassica oleracea Acephala Group Seedlings as Affected by Ion and Salt Stress

    Get PDF
    The aim of this study was to determine the effect of salt stress on morphological and physiological changes of Brassica oleracea acephala group seedlings. Seedlings of kale cultivar Red Russian (RR) and collard Croatian population Konavle 2 (K2) were grown in a floating hydroponic system in Tifton, Georgia, USA. Seedlings were treated with seven different nutrient solutions (NS). The control NS (EC 2 dS m-1) was concentrated to achieve EC 4, 6 or 8 dS m-1. Three additional salt treatments included addition of NaCl solution to the control NS to get: EC 4 NaCl (2 NS + 2 NaCl), EC 6 NaCl (2 NS + 4 NaCl) and EC 8 NaCl (2 NS + 6 NaCl) dSm-1. Leaf gas exchange parameters decreased with increased EC. Seedlings treated with EC 6 NaCl and 8 NaCl dS m-1 had the lowest leaf relative water content (less than 59%). Seedlings treated with 2 dS m-1 had the greatest (187 cm2) leaf area (LA). Cultivar RR had greater LA (131 cm2) than population K2 (84 cm2). Increased percentage of shoot (14.1%) and root (10.4%) dry weight (DW) was recorded in seedlings treated with EC 8 dS m-1, c. Population K2 had higher shoot (10.9%) and root (10.4%) DW percentage compared with cv. RR. In conclusion, the nutrient solution of EC 4 NaCl had negative effect on morphological characteristics, compared to the same solution without NaCl. Increased concentrations of NS significantly affected the leaf thickness (SLA) of B. oleracea acephala group seedlings. This can be used as production tool for seedlings hardening

    High-resolution expression profiling of selected gene sets during plant immune activation

    Get PDF
    The plant immune system involves detection of pathogens via both cell-surface and intracellular receptors. Both receptor classes can induce transcriptional reprogramming that elevates disease resistance. To assess differential gene expression during plant immunity, we developed and deployed quantitative sequence capture (CAP-I). We designed and synthesized biotinylated single-strand RNA bait libraries targeted to a subset of defense genes, and generated sequence capture data from 99 RNA-seq libraries. We built a data processing pipeline to quantify the RNA-CAP-I-seq data, and visualize differential gene expression. Sequence capture in combination with quantitative RNA-seq enabled cost-effective assessment of the expression profile of a specified subset of genes. Quantitative sequence capture is not limited to RNA-seq or any specific organism and can potentially be incorporated into automated platforms for high-throughput sequencing

    Crowdsourcing genomic analyses of ash and ash dieback – power to the people

    Get PDF
    Ash dieback is a devastating fungal disease of ash trees that has swept across Europe and recently reached the UK. This emergent pathogen has received little study in the past and its effect threatens to overwhelm the ash population. In response to this we have produced some initial genomics datasets and taken the unusual step of releasing them to the scientific community for analysis without first performing our own. In this manner we hope to ‘crowdsource’ analyses and bring the expertise of the community to bear on this problem as quickly as possible. Our data has been released through our website at oadb.tsl.ac.uk and a public GitHub repository

    Transgenic goats producing an improved version of cetuximab in milk [preprint]

    Get PDF
    Therapeutic monoclonal antibodies (mAbs) represent one of the most important classes of pharmaceutical proteins to treat human diseases. Most are produced in cultured mammalian cells which is expensive, limiting their availability. Goats, striking a good balance between a relatively short generation time and copious milk yield, present an alternative platform for the cost-effective, flexible, large-scale production of therapeutic mAbs. Here, we focused on cetuximab, a mAb against epidermal growth factor receptor, that is commercially produced under the brand name Erbitux and approved for anti-cancer treatments. We generated several transgenic goat lines that produce cetuximab in their milk. Two lines were selected for detailed characterization. Both showed stable genotypes and cetuximab production levels of up to 10g/L. The mAb could be readily purified and showed improved characteristics compared to Erbitux. The goat-produced cetuximab (gCetuximab) lacked a highly immunogenic epitope that is part of Erbitux. Moreover, it showed enhanced binding to CD16 and increased antibody-dependent cell-dependent cytotoxicity compared to Erbitux. This indicates that these goats produce an improved cetuximab version with the potential for enhanced effectiveness and better safety profile compared to treatments with Erbitux. In addition, our study validates transgenic goats as an excellent platform for large-scale production of therapeutic mAbs
    • 

    corecore