203 research outputs found

    Measuring homoplasy I: comprehensive measures of maximum and minimum cost under parsimony across discrete cost matrix character types.

    Get PDF
    Here, we propose, prove mathematically and discuss maximum and minimum measures of maximum parsimony evolution across 12 discrete phylogenetic character types, classified across 4467 morphological and molecular datasets. Covered character types are: constant, binary symmetric, multistate unordered (non-additive) symmetric, multistate linear ordered symmetric, multistate non-linear ordered symmetric, binary irreversible, multistate irreversible, binary Dollo, multistate Dollo, multistate custom symmetric, binary custom asymmetric and multistate custom asymmetric characters. We summarize published solutions and provide and prove a range of new formulae for the algebraic calculation of minimum (m), maximum (g) and maximum possible (gmax) character cost for applicable character types. Algorithms for exhaustive calculation of m, g and gmax applicable to all classified character types (within computational limits on the numbers of taxa and states) are also provided. The general algorithmic solution for minimum steps (m) is identical to a minimum spanning tree on the state graph or minimum weight spanning arborescence on the state digraph. Algorithmic solutions for character g and gmax are based on matrix mathematics equivalent to optimization on the star tree, respectively for given state frequencies and all possible state frequencies meeting specified numbers of taxa and states. We show that maximizing possible cost (gmax) with given transition costs can be equivalent to maximizing, across all possible state frequency combinations, the lowest implied cost of state transitions if any one state is ancestral on the star tree, via the solution of systems of linear equations. The methods we present, implemented in the Claddis R package, extend to a comprehensive range, the fundamental character types for which homoplasy may be measured under parsimony using m, g and gmax, including extra cost (h), consistency index (ci), retention index (ri) or indices based thereon

    CAHM, a long non-coding RNA gene hypermethylated in colorectal neoplasia

    Get PDF
    Copyright © 2014 Landes Bioscience This is an open-access article licensed under a Creative Commons Attribution 3.0 Unported License. The article may be redistributed, reproduced, and reused for non-commercial purposes, provided the original source is properly cited. Permission is granted subject to the terms of the License under which the work was published. Please check the License conditions for the work which you wish to reuse. Full and appropriate attribution must be given. This permission does not cover any third party copyrighted material which may appear in the work requested.The CAHM gene (Colorectal Adenocarcinoma HyperMethylated), previously LOC 100526820, is located on chromosome 6, hg19 chr6:163 834 097–163 834 982. It lacks introns, encodes a long non-coding RNA (lncRNA) and is located adjacent to the gene QKI, which encodes an RNA binding protein. Deep bisulphite sequencing of ten colorectal cancer (CRC ) and matched normal tissues demonstrated frequent hypermethylation within the CAHM gene in cancer. A quantitative methylation-specific PCR (qMSP ) was used to characterize additional tissue samples. With a threshold of 5% methylation, the CAHM assay was positive in 2/26 normal colorectal tissues (8%), 17/21 adenomas (81%), and 56/79 CRC samples (71%). A reverse transcriptase-qPCR assay showed that CAHM RNA levels correlated negatively with CAHM % methylation, and therefore CAHM gene expression is typically decreased in CRC . The CAHM qMSP assay was applied to DNA isolated from plasma specimens from 220 colonoscopy-examined patients. Using a threshold of 3 pg methylated genomic DNA per mL plasma, methylated CAHM sequences were detected in the plasma DNA of 40/73 (55%) of CRC patients compared with 3/73 (4%) from subjects with adenomas and 5/74 (7%) from subjects without neoplasia. Both the frequency of detection and the amount of methylated CAHM DNA released into plasma increased with increasing cancer stage. Methylated CAHM DNA shows promise as a plasma biomarker for use in screening for CRC

    Disparities in the analysis of morphological disparity

    Get PDF
    Analyses of morphological disparity have been used to characterize and investigate the evolution of variation in the anatomy, function and ecology of organisms since the 1980s. While a diversity of methods have been employed, it is unclear whether they provide equivalent insights. Here, we review the most commonly used approaches for characterizing and analysing morphological disparity, all of which have associated limitations that, if ignored, can lead to misinterpretation. We propose best practice guidelines for disparity analyses, while noting that there can be no ‘one-size-fits-all’ approach. The available tools should always be used in the context of a specific biological question that will determine data and method selection at every stage of the analysis

    Recurrent Coding Sequence Variation Explains only A Small Fraction of the Genetic Architecture of Colorectal Cancer

    Get PDF
    Whilst common genetic variation in many non-coding genomic regulatory regions are known to impart risk of colorectal cancer (CRC), much of the heritability of CRC remains unexplained. To examine the role of recurrent coding sequence variation in CRC aetiology, we genotyped 12,638 CRCs cases and 29,045 controls from six European populations. Single-variant analysis identified a coding variant (rs3184504) in SH2B3 (12q24) associated with CRC risk (OR = 1.08, P = 3.9 × 10-7), and novel damaging coding variants in 3 genes previously tagged by GWAS efforts; rs16888728 (8q24) in UTP23 (OR = 1.15, P = 1.4 × 10-7); rs6580742 and rs12303082 (12q13) in FAM186A (OR = 1.11, P = 1.2 × 10-

    Common variation near CDKN1A, POLD3 and SHROOM2 influences colorectal cancer risk

    Get PDF
    We performed a meta-analysis of five genome-wide association studies to identify common variants influencing colorectal cancer (CRC) risk comprising 8,682 cases and 9,649 controls. Replication analysis was performed in case-control sets totaling 21,096 cases and 19,555 controls. We identified three new CRC risk loci at 6p21 (rs1321311, near CDKN1A; P = 1.14 × 10(-10)), 11q13.4 (rs3824999, intronic to POLD3; P = 3.65 × 10(-10)) and Xp22.2 (rs5934683, near SHROOM2; P = 7.30 × 10(-10)) This brings the number of independent loci associated with CRC risk to 20 and provides further insight into the genetic architecture of inherited susceptibility to CRC.Swedish Research Council et al.Manuscrip

    Well-Annotated microRNAomes Do Not Evidence Pervasive miRNA Loss

    Get PDF
    microRNAs are conserved noncoding regulatory factors implicated in diverse physiological and developmental processes in multicellular organisms, as causal macroevolutionary agents and for phylogeny inference. However, the conservation and phylogenetic utility of microRNAs has been questioned on evidence of pervasive loss. Here, we show that apparent widespread losses are, largely, an artefact of poorly sampled and annotated microRNAomes. Using a curated data set of animal microRNAomes, we reject the view that miRNA families are never lost, but they are rarely lost (92% are never lost). A small number of families account for a majority of losses (1.7% of families account for >45% losses), and losses are associated with lineages exhibiting phenotypic simplification. Phylogenetic analyses based on the presence/absence of microRNA families among animal lineages, and based on microRNA sequences among Osteichthyes, demonstrate the power of these small data sets in phylogenetic inference. Perceptions of widespread evolutionary loss of microRNA families are due to the uncritical use of public archives corrupted by spurious microRNA annotations, and failure to discriminate false absences that occur because of incomplete microRNAome annotation

    Cause of Death and Predictors of All-Cause Mortality in Anticoagulated Patients With Nonvalvular Atrial Fibrillation : Data From ROCKET AF

    Get PDF
    M. Kaste on työryhmän ROCKET AF Steering Comm jäsen.Background-Atrial fibrillation is associated with higher mortality. Identification of causes of death and contemporary risk factors for all-cause mortality may guide interventions. Methods and Results-In the Rivaroxaban Once Daily Oral Direct Factor Xa Inhibition Compared with Vitamin K Antagonism for Prevention of Stroke and Embolism Trial in Atrial Fibrillation (ROCKET AF) study, patients with nonvalvular atrial fibrillation were randomized to rivaroxaban or dose-adjusted warfarin. Cox proportional hazards regression with backward elimination identified factors at randomization that were independently associated with all-cause mortality in the 14 171 participants in the intention-to-treat population. The median age was 73 years, and the mean CHADS(2) score was 3.5. Over 1.9 years of median follow-up, 1214 (8.6%) patients died. Kaplan-Meier mortality rates were 4.2% at 1 year and 8.9% at 2 years. The majority of classified deaths (1081) were cardiovascular (72%), whereas only 6% were nonhemorrhagic stroke or systemic embolism. No significant difference in all-cause mortality was observed between the rivaroxaban and warfarin arms (P=0.15). Heart failure (hazard ratio 1.51, 95% CI 1.33-1.70, P= 75 years (hazard ratio 1.69, 95% CI 1.51-1.90, P Conclusions-In a large population of patients anticoagulated for nonvalvular atrial fibrillation, approximate to 7 in 10 deaths were cardiovascular, whereasPeer reviewe

    Global, regional, and national incidence and mortality for HIV, tuberculosis, and malaria during 1990–2013: a systematic analysis for the Global Burden of Disease Study 2013

    Get PDF
    BACKGROUND: The Millennium Declaration in 2000 brought special global attention to HIV, tuberculosis, and malaria through the formulation of Millennium Development Goal (MDG) 6. The Global Burden of Disease 2013 study provides a consistent and comprehensive approach to disease estimation for between 1990 and 2013, and an opportunity to assess whether accelerated progress has occured since the Millennium Declaration. METHODS: To estimate incidence and mortality for HIV, we used the UNAIDS Spectrum model appropriately modified based on a systematic review of available studies of mortality with and without antiretroviral therapy (ART). For concentrated epidemics, we calibrated Spectrum models to fit vital registration data corrected for misclassification of HIV deaths. In generalised epidemics, we minimised a loss function to select epidemic curves most consistent with prevalence data and demographic data for all-cause mortality. We analysed counterfactual scenarios for HIV to assess years of life saved through prevention of mother-to-child transmission (PMTCT) and ART. For tuberculosis, we analysed vital registration and verbal autopsy data to estimate mortality using cause of death ensemble modelling. We analysed data for corrected case-notifications, expert opinions on the case-detection rate, prevalence surveys, and estimated cause-specific mortality using Bayesian meta-regression to generate consistent trends in all parameters. We analysed malaria mortality and incidence using an updated cause of death database, a systematic analysis of verbal autopsy validation studies for malaria, and recent studies (2010-13) of incidence, drug resistance, and coverage of insecticide-treated bednets. FINDINGS: Globally in 2013, there were 1·8 million new HIV infections (95% uncertainty interval 1·7 million to 2·1 million), 29·2 million prevalent HIV cases (28·1 to 31·7), and 1·3 million HIV deaths (1·3 to 1·5). At the peak of the epidemic in 2005, HIV caused 1·7 million deaths (1·6 million to 1·9 million). Concentrated epidemics in Latin America and eastern Europe are substantially smaller than previously estimated. Through interventions including PMTCT and ART, 19·1 million life-years (16·6 million to 21·5 million) have been saved, 70·3% (65·4 to 76·1) in developing countries. From 2000 to 2011, the ratio of development assistance for health for HIV to years of life saved through intervention was US$4498 in developing countries. Including in HIV-positive individuals, all-form tuberculosis incidence was 7·5 million (7·4 million to 7·7 million), prevalence was 11·9 million (11·6 million to 12·2 million), and number of deaths was 1·4 million (1·3 million to 1·5 million) in 2013. In the same year and in only individuals who were HIV-negative, all-form tuberculosis incidence was 7·1 million (6·9 million to 7·3 million), prevalence was 11·2 million (10·8 million to 11·6 million), and number of deaths was 1·3 million (1·2 million to 1·4 million). Annualised rates of change (ARC) for incidence, prevalence, and death became negative after 2000. Tuberculosis in HIV-negative individuals disproportionately occurs in men and boys (versus women and girls); 64·0% of cases (63·6 to 64·3) and 64·7% of deaths (60·8 to 70·3). Globally, malaria cases and deaths grew rapidly from 1990 reaching a peak of 232 million cases (143 million to 387 million) in 2003 and 1·2 million deaths (1·1 million to 1·4 million) in 2004. Since 2004, child deaths from malaria in sub-Saharan Africa have decreased by 31·5% (15·7 to 44·1). Outside of Africa, malaria mortality has been steadily decreasing since 1990. INTERPRETATION: Our estimates of the number of people living with HIV are 18·7% smaller than UNAIDS's estimates in 2012. The number of people living with malaria is larger than estimated by WHO. The number of people living with HIV, tuberculosis, or malaria have all decreased since 2000. At the global level, upward trends for malaria and HIV deaths have been reversed and declines in tuberculosis deaths have accelerated. 101 countries (74 of which are developing) still have increasing HIV incidence. Substantial progress since the Millennium Declaration is an encouraging sign of the effect of global action. FUNDING: Bill & Melinda Gates Foundation

    CMB-S4: Forecasting Constraints on Primordial Gravitational Waves

    Full text link
    CMB-S4---the next-generation ground-based cosmic microwave background (CMB) experiment---is set to significantly advance the sensitivity of CMB measurements and enhance our understanding of the origin and evolution of the Universe, from the highest energies at the dawn of time through the growth of structure to the present day. Among the science cases pursued with CMB-S4, the quest for detecting primordial gravitational waves is a central driver of the experimental design. This work details the development of a forecasting framework that includes a power-spectrum-based semi-analytic projection tool, targeted explicitly towards optimizing constraints on the tensor-to-scalar ratio, rr, in the presence of Galactic foregrounds and gravitational lensing of the CMB. This framework is unique in its direct use of information from the achieved performance of current Stage 2--3 CMB experiments to robustly forecast the science reach of upcoming CMB-polarization endeavors. The methodology allows for rapid iteration over experimental configurations and offers a flexible way to optimize the design of future experiments given a desired scientific goal. To form a closed-loop process, we couple this semi-analytic tool with map-based validation studies, which allow for the injection of additional complexity and verification of our forecasts with several independent analysis methods. We document multiple rounds of forecasts for CMB-S4 using this process and the resulting establishment of the current reference design of the primordial gravitational-wave component of the Stage-4 experiment, optimized to achieve our science goals of detecting primordial gravitational waves for r>0.003r > 0.003 at greater than 5σ5\sigma, or, in the absence of a detection, of reaching an upper limit of r<0.001r < 0.001 at 95%95\% CL.Comment: 24 pages, 8 figures, 9 tables, submitted to ApJ. arXiv admin note: text overlap with arXiv:1907.0447

    Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans

    Get PDF
    Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in 25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16 regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP, while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium (LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region. Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa, an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent signals within the same regio
    corecore