23 research outputs found

    Reconstructing (super)trees from data sets with missing distances: Not all is lost

    Get PDF
    The wealth of phylogenetic information accumulated over many decades of biological research, coupled with recent technological advances in molecular sequence generation, present significant opportunities for researchers to investigate relationships across and within the kingdoms of life. However, to make best use of this data wealth, several problems must first be overcome. One key problem is finding effective strategies to deal with missing data. Here, we introduce Lasso, a novel heuristic approach for reconstructing rooted phylogenetic trees from distance matrices with missing values, for datasets where a molecular clock may be assumed. Contrary to other phylogenetic methods on partial datasets, Lasso possesses desirable properties such as its reconstructed trees being both unique and edge-weighted. These properties are achieved by Lasso restricting its leaf set to a large subset of all possible taxa, which in many practical situations is the entire taxa set. Furthermore, the Lasso approach is distance-based, rendering it very fast to run and suitable for datasets of all sizes, including large datasets such as those generated by modern Next Generation Sequencing technologies. To better understand the performance of Lasso, we assessed it by means of artificial and real biological datasets, showing its effectiveness in the presence of missing data. Furthermore, by formulating the supermatrix problem as a particular case of the missing data problem, we assessed Lasso's ability to reconstruct supertrees. We demonstrate that, although not specifically designed for such a purpose, Lasso performs better than or comparably with five leading supertree algorithms on a challenging biological data set. Finally, we make freely available a software implementation of Lasso so that researchers may, for the first time, perform both rooted tree and supertree reconstruction with branch lengths on their own partial datasets

    Improving Student Engagement in Veterinary Business Studies

    Get PDF
    In a densely packed veterinary curriculum, students may find it particularly challenging to engage in the less overtly clinical subjects, yet pressure from industry and an increasingly competitive employment market necessitate improved veterinary student education in business and management skills. We describe a curriculum intervention (formative reflective assignment) that optimizes workplace learning opportunities and aims to provide better student scaffolding for their in-context business learning. Students were asked to analyze a business practice they experienced during a period of extra-mural studies (external work placement). Following return to the college, they were then instructed to discuss their findings in their study group, and produce a group reflection on their learning. To better understand student engagement in this area, we analyzed individual and group components of the assignment. Thematic analysis revealed evidence of various depths of student engagement, and provided indications of the behaviors they used when engaging at different levels. Interactive and social practices (discussing business strategies with veterinary employees and student peers) appeared to facilitate student engagement, assist the perception of relevance of these skills, and encourage integration with other curriculum elements such as communication skills and clinical problem solving

    Identification of six new susceptibility loci for invasive epithelial ovarian cancer.

    Get PDF
    Genome-wide association studies (GWAS) have identified 12 epithelial ovarian cancer (EOC) susceptibility alleles. The pattern of association at these loci is consistent in BRCA1 and BRCA2 mutation carriers who are at high risk of EOC. After imputation to 1000 Genomes Project data, we assessed associations of 11 million genetic variants with EOC risk from 15,437 cases unselected for family history and 30,845 controls and from 15,252 BRCA1 mutation carriers and 8,211 BRCA2 mutation carriers (3,096 with ovarian cancer), and we combined the results in a meta-analysis. This new study design yielded increased statistical power, leading to the discovery of six new EOC susceptibility loci. Variants at 1p36 (nearest gene, WNT4), 4q26 (SYNPO2), 9q34.2 (ABO) and 17q11.2 (ATAD5) were associated with EOC risk, and at 1p34.3 (RSPO1) and 6p22.1 (GPX6) variants were specifically associated with the serous EOC subtype, all with P < 5 × 10(-8). Incorporating these variants into risk assessment tools will improve clinical risk predictions for BRCA1 and BRCA2 mutation carriers.COGS project is funded through a European Commission's Seventh Framework Programme grant (agreement number 223175 ] HEALTH ]F2 ]2009 ]223175). The CIMBA data management and data analysis were supported by Cancer Research.UK grants 12292/A11174 and C1287/A10118. The Ovarian Cancer Association Consortium is supported by a grant from the Ovarian Cancer Research Fund thanks to donations by the family and friends of Kathryn Sladek Smith (PPD/RPCI.07). The scientific development and funding for this project were in part supported by the US National Cancer Institute GAME ]ON Post ]GWAS Initiative (U19 ]CA148112). This study made use of data generated by the Wellcome Trust Case Control consortium. Funding for the project was provided by the Wellcome Trust under award 076113. The results published here are in part based upon data generated by The Cancer Genome Atlas Pilot Project established by the National Cancer Institute and National Human Genome Research Institute (dbGap accession number phs000178.v8.p7). The cBio portal is developed and maintained by the Computational Biology Center at Memorial Sloan ] Kettering Cancer Center. SH is supported by an NHMRC Program Grant to GCT. Details of the funding of individual investigators and studies are provided in the Supplementary Note. This study made use of data generated by the Wellcome Trust Case Control consortium, funding for which was provided by the Wellcome Trust under award 076113. The results published here are, in part, based upon data generated by The Cancer Genome Atlas Pilot Project established by the National Cancerhttp://dx.doi.org/10.1038/ng.3185This is the Author Accepted Manuscript of 'Identification of six new susceptibility loci for invasive epithelial ovarian cancer' which was published in Nature Genetics 47, 164–171 (2015) © Nature Publishing Group - content may only be used for academic research

    Cardiovascular magnetic resonance phase contrast imaging

    Get PDF

    Algorithms for Identification Key Generation and Optimization with Application to Yeast Identification

    No full text
    Algorithms for the automated creation of low cost identification keys are described and theoretical and empirical justifications are provided. The algorithms are shown to handle differing test costs, prior probabilities for each potential diagnosis and tests that produce uncertain results. The approach is then extended to cover situations where more than one measure of cost is of importance, by allowing tests to be performed in batches. Experiments are performed on a real-world case study involving the identification of yeasts

    GERMINATE. a generic database for integrating genotypic and phenotypic information for plant genetic resource collections

    No full text
    The extensive germplasm resource collections that are now available for major crop plants and their wild relatives will increasingly provide valuable biological and bioinformatics resources for plant physiologists and geneticists to dissect the molecular basis of key traits and to develop highly adapted plant material to sustain future breeding programs. A key to the efficient deployment of these resources is the development of information systems that will enable the collection and storage of biological information for these plant lines to be integrated with the molecular information that is now becoming available through the use of high-throughput genomics and post-genomics technologies. The GERMINATE database has been designed to hold a diverse variety of data types, ranging from molecular to phenotypic, and to allow querying between such data for any plant species. Data are stored in GERMINATE in a technology-independent manner, such that new technologies can be accommodated in the database as they emerge, without modification of the underlying schema. Users can access data in GERMINATE databases either via a lightweight Perl-CGI Web interface or by the more complex Genomic Diversity and Phenotype Connection software. GERMINATE is released under the GNU General Public License and is available at http://germinate.scri.sari.ac.uk/germinate/

    No clinical utility of KRAS variant rs61764370 for ovarian or breast cancer

    Get PDF
    Objective. Clinical genetic testing is commercially available for rs61764370, an inherited variant residing in a KRAS 3' UTR microRNA binding site, based on suggested associations with increased ovarian and breast cancer risk as well as with survival time. However, prior studies, emphasizing particular subgroups, were relatively small. Therefore, we comprehensively evaluated ovarian and breast cancer risks as well as clinical outcome associated with rs61764370. Methods. Centralized genotyping and analysis were performed for 140,012 women enrolled in the Ovarian Cancer Association Consortium (15,357 ovarian cancer patients; 30,816 controls), the Breast Cancer Association Consortium (33,530 breast cancer patients; 37,640 controls), and the Consortium of Modifiers of BRCA1 and BRCA2 (14,765 BRCA1 and 7904 BRCA2 mutation carriers). Results. We found no association with risk of ovarian cancer (OR = 0.99, 95% CI 0.94-1.04, p = 0.74) or breast cancer (OR = 0.98, 95% CI 0.94-1.01, p = 0.19) and results were consistent among mutation carriers (BRCA1, ovarian cancer HR = 1.09, 95% CI 0.97-1.23, p = 0.14, breast cancer HR = 1.04, 95% CI 0.97-1.12, p = 0.27; BRCA2, ovarian cancer HR = 0.89, 95% CI 0.71-1.13, p = 034, breast cancer HR = 1.06, 95% CI 0.94-1.19, p = 0.35). Null results were also obtained for associations with overall survival following ovarian cancer (HR = 0.94, 95% CI 0.83-1.07, p = 0.38), breast cancer (HR = 0.96, 95% CI 0.87-1.06, p = 0.38), and all other previously-reported associations. Conclusions. rs61764370 is not associated with risk of ovarian or breast cancer nor with clinical outcome for patients with these cancers. Therefore, genotyping this variant has no clinical utility related to the prediction or management of these cancers. (C) 2015 Elsevier Inc. All rights reserved.Peer reviewe

    Ovarian and Breast Cancer Risks Associated With Pathogenic Variants in RAD51C and RAD51D

    Get PDF
    BACKGROUND: The purpose of this study was to estimate precise age-specific tubo-ovarian carcinoma (TOC) and breast cancer (BC) risks for carriers of pathogenic variants in RAD51C and RAD51D. METHODS: We analyzed data from 6178 families, 125 with pathogenic variants in RAD51C, and 6690 families, 60 with pathogenic variants in RAD51D. TOC and BC relative and cumulative risks were estimated using complex segregation analysis to model the cancer inheritance patterns in families while adjusting for the mode of ascertainment of each family. All statistical tests were two-sided. RESULTS: Pathogenic variants in both RAD51C and RAD51D were associated with TOC (RAD51C: relative risk [RR] = 7.55, 95% confidence interval [CI] = 5.60 to 10.19; P = 5 × 10-40; RAD51D: RR = 7.60, 95% CI = 5.61 to 10.30; P = 5 × 10-39) and BC (RAD51C: RR = 1.99, 95% CI = 1.39 to 2.85; P = 1.55 × 10-4; RAD51D: RR = 1.83, 95% CI = 1.24 to 2.72; P = .002). For both RAD51C and RAD51D, there was a suggestion that the TOC relative risks increased with age until around age 60 years and decreased thereafter. The estimated cumulative risks of developing TOC to age 80 years were 11% (95% CI = 6% to 21%) for RAD51C and 13% (95% CI = 7% to 23%) for RAD51D pathogenic variant carriers. The estimated cumulative risks of developing BC to 80 years were 21% (95% CI = 15% to 29%) for RAD51C and 20% (95% CI = 14% to 28%) for RAD51D pathogenic variant carriers. Both TOC and BC risks for RAD51C and RAD51D pathogenic variant carriers varied by cancer family history and could be as high as 32-36% for TOC, for carriers with two first-degree relatives diagnosed with TOC, or 44-46% for BC, for carriers with two first-degree relatives diagnosed with BC. CONCLUSIONS: These estimates will facilitate the genetic counseling of RAD51C and RAD51D pathogenic variant carriers and justify the incorporation of RAD51C and RAD51D into cancer risk prediction models
    corecore