65 research outputs found

    Multiscale Dictionary Learning for Estimating Conditional Distributions

    Full text link
    Nonparametric estimation of the conditional distribution of a response given high-dimensional features is a challenging problem. It is important to allow not only the mean but also the variance and shape of the response density to change flexibly with features, which are massive-dimensional. We propose a multiscale dictionary learning model, which expresses the conditional response density as a convex combination of dictionary densities, with the densities used and their weights dependent on the path through a tree decomposition of the feature space. A fast graph partitioning algorithm is applied to obtain the tree decomposition, with Bayesian methods then used to adaptively prune and average over different sub-trees in a soft probabilistic manner. The algorithm scales efficiently to approximately one million features. State of the art predictive performance is demonstrated for toy examples and two neuroscience applications including up to a million features

    Structured Bayesian learning through mixture models

    Get PDF
    <p>In this thesis, we develop some Bayesian mixture density estimation for univariate and multivariate data. We start proposing a repulsive process favoring mixture components further apart. While conducting inferences on the cluster-specific parameters, current frequentist and Bayesian methods often encounter problems when clusters are placed too close together to be scientifically meaningful. Current Bayesian practice generates component-specific parameters independently from a common prior, which tends to favor similar components and often leads to substantial probability assigned to redundant components that are not needed to fit the data. As an alternative, we propose to generate components from a repulsive process, which leads to fewer, better separated and more interpretable clusters. </p><p>In the second part of the thesis, we face the problem of modeling the conditional distribution of a response variable given a high dimensional vector of predictors potentially concentrated near a lower dimensional subspace or manifold. In many settings it is important to allow not only the mean but also the variance and shape of the response density to change flexibly with features, which are massive-dimensional. We propose a multiresolution model that scales efficiently to massive numbers of features, and can be implemented efficiently with slice sampling.</p><p> In the third part of the thesis, we deal with the problem of characterizing the conditional density of a multivariate vector of response given a potentially high dimensional vector of predictors. The proposed model flexibly characterizes the density of the response variable by hierarchically coupling a collection of factor models, each one defined on a different scale of resolution. As it is illustrated in Chapter 4, our proposed method achieves good predictive performance compared to competitive models while efficiently scaling to high dimensional predictors.</p>Dissertatio

    Integrated proteogenomic characterization across major histological types of pediatric brain cancer

    Get PDF
    We report a comprehensive proteogenomics analysis, including whole-genome sequencing, RNA sequencing, and proteomics and phosphoproteomics profiling, of 218 tumors across 7 histological types of childhood brain cancer: low-grade glioma (n = 93), ependymoma (32), high-grade glioma (25), medulloblastoma (22), ganglioglioma (18), craniopharyngioma (16), and atypical teratoid rhabdoid tumor (12). Proteomics data identify common biological themes that span histological boundaries, suggesting that treatments used for one histological type may be applied effectively to other tumors sharing similar proteomics features. Immune landscape characterization reveals diverse tumor microenvironments across and within diagnoses. Proteomics data further reveal functional effects of somatic mutations and copy number variations (CNVs) not evident in transcriptomics data. Kinase-substrate association and co-expression network analysis identify important biological mechanisms of tumorigenesis. This is the first large-scale proteogenomics analysis across traditional histological boundaries to uncover foundational pediatric brain tumor biology and inform rational treatment selection

    DCE-MRI and DWI Integration for Breast Lesions Assessment and Heterogeneity Quantification

    Get PDF
    In order to better predict and follow treatment responses in cancer patients, there is growing interest in noninvasively characterizing tumor heterogeneity based on MR images possessing different contrast and quantitative information. This requires mechanisms for integrating such data and reducing the data dimensionality to levels amenable to interpretation by human readers. Here we propose a two-step pipeline for integrating diffusion and perfusion MRI that we demonstrate in the quantification of breast lesion heterogeneity. First, the images acquired with the two modalities are aligned using an intermodal registration. Dissimilarity-based clustering is then performed exploiting the information coming from both modalities. To this end an ad hoc distance metric is developed and tested for tuning the weighting for the two modalities. The distributions of the diffusion parameter values in subregions identified by the algorithm are extracted and compared through nonparametric testing for posterior evaluation of the tissue heterogeneity. Results show that the joint exploitation of the information brought by DCE and DWI leads to consistent results accounting for both perfusion and microstructural information yielding a greater refinement of the segmentation than the separate processing of the two modalities, consistent with that drawn manually by a radiologist with access to the same data

    Addressing the Role of Angiogenesis in Patients with Advanced Pancreatic Neuroendocrine Tumors Treated with Everolimus: A Biological Prospective Analysis of Soluble Biomarkers and Clinical Outcomes

    Get PDF
    Simple Summary Despite the approval of new targeted therapies for pancreatic neuroendocrine tumors (PanNETs) over the past decades, the early identification of resistant tumors remains the major challenge, mainly because clear signs of tumor shrinkage are rarely achieved by imaging assessment. Starting from the hypothesis that angiogenesis can be implicated in the resistance to mTOR inhibitors, we evaluated a specific angiogenesis panel (through the measurement of soluble biomarkers for angiogenesis turnover, circulating endothelial cells, and circulating progenitors) as possible predictors of resistance to everolimus or everolimus efficacy in PanNETs. Our study showed that none of the investigated categories of biomarkers had a predictive value for everolimus resistance or efficacy. However, we suggest that circulating endothelial progenitors might be surrogate biomarkers for angiogenesis activity in PanNETs during everolimus treatment, and their baseline levels might correlate with survival outcomes. These data have never been reported before for NETs. Background: The success of targeted therapies in the treatment of pancreatic neuroendocrine tumors has emphasized the strategy of targeting angiogenesis and the PI3K/AKT/mTOR pathway. However, the major challenge in the targeted era remains the early identification of resistant tumors especially when the efficacy is rarely associated to a clear tumor shrinkage at by imaging assessment. Methods: In this prospective study (NCT02305810) we investigated the predictive and prognostic role of soluble biomarkers of angiogenesis turnover (VEGF, bFGF, VEGFR2, TSP-1) circulating endothelial cells and progenitors, in 43 patients with metastatic panNET receiving everolimus. Results: Among all tested biomarkers, we found a specific subpopulation of circulating cells, CD31+CD140b-, with a significantly increased tumor progression hazard for values less or equal to the first quartile. Conclusion: Our study suggested the evidence that circulating cells might be surrogate biomarkers of angiogenesis activity in patients treated with everolimus and their baseline levels can be correlated with survival. However, further studies are now needed to validate the role of these cells as surrogate markers for the selection of patients to be candidates for antiangiogenic treatments

    First Results of the &#8220;Carbonaceous Aerosol in Rome and Environs (CARE)&#8221; Experiment: Beyond Current Standards for PM10

    Get PDF
    In February 2017 the “Carbonaceous Aerosol in Rome and Environs (CARE)” experiment was carried out in downtown Rome to address the following specific questions: what is the color, size, composition, and toxicity of the carbonaceous aerosol in the Mediterranean urban background area of Rome? The motivation of this experiment is the lack of understanding of what aerosol types are responsible for the severe risks to human health posed by particulate matter (PM) pollution, and how carbonaceous aerosols influence radiative balance. Physicochemical properties of the carbonaceous aerosol were characterised, and relevant toxicological variables assessed. The aerosol characterisation includes: (i) measurements with high time resolution (min to 1–2 h) at a fixed location of black carbon (eBC), elemental carbon (EC), organic carbon (OC), particle number size distribution (0.008–10 ÎŒ m), major non refractory PM1 components, elemental composition, wavelength-dependent optical properties, and atmospheric turbulence; (ii) 24-h measurements of PM10 and PM2.5 mass concentration, water soluble OC and brown carbon (BrC), and levoglucosan; (iii) mobile measurements of eBC and size distribution around the study area, with computational fluid dynamics modeling; (iv) characterisation of road dust emissions and their EC and OC content. The toxicological assessment includes: (i) preliminary evaluation of the potential impact of ultrafine particles on lung epithelia cells (cultured at the air liquid interface and directly exposed to particles); (ii) assessment of the oxidative stress induced by carbonaceous aerosols; (iii) assessment of particle size dependent number doses deposited in different regions of the human body; (iv) PAHs biomonitoring (from the participants into the mobile measurements). The first experimental results of the CARE experiment are presented in this paper. The objective here is to provide baseline levels of carbonaceous aerosols for Rome, and to address future research directions. First, we found that BC and EC mass concentration in Rome are larger than those measured in similar urban areas across Europe (the urban background mass concentration of eBC in Rome in winter being on average 2.6 ± 2.5 ÎŒ g · m − 3 , mean eBC at the peak level hour being 5.2 (95% CI = 5.0–5.5) ÎŒ g · m − 3 ). Then, we discussed significant variations of carbonaceous aerosol properties occurring with time scales of minutes, and questioned on the data averaging period used in current air quality standard for PM 10 (24-h). Third, we showed that the oxidative potential induced by aerosol depends on particle size and composition, the effects of toxicity being higher with lower mass concentrations and smaller particle size. Albeit this is a preliminary analysis, findings reinforce the need for an urgent update of existing air quality standards for PM 10 and PM 2.5 with regard to particle composition and size distribution, and data averaging period. Our results reinforce existing concerns about the toxicity of carbonaceous aerosols, support the existing evidence indicating that particle size distribution and composition may play a role in the generation of this toxicity, and remark the need to consider a shorter averaging period (&lt;1 h) in these new standards

    Proteogenomics connects somatic mutations to signalling in breast cancer

    Get PDF
    Somatic mutations have been extensively characterized in breast cancer, but the effects of these genetic alterations on the proteomic landscape remain poorly understood. We describe quantitative mass spectrometry-based proteomic and phosphoproteomic analyses of 105 genomically annotated breast cancers of which 77 provided high-quality data. Integrated analyses allowed insights into the somatic cancer genome including the consequences of chromosomal loss, such as the 5q deletion characteristic of basal-like breast cancer. The 5q trans effects were interrogated against the Library of Integrated Network-based Cellular Signatures, thereby connecting CETN3 and SKP1 loss to elevated expression of EGFR, and SKP1 loss also to increased SRC. Global proteomic data confirmed a stromal-enriched group in addition to basal and luminal clusters and pathway analysis of the phosphoproteome identified a G Protein-coupled receptor cluster that was not readily identified at the mRNA level. Besides ERBB2, other amplicon-associated, highly phosphorylated kinases were identified, including CDK12, PAK1, PTK2, RIPK2 and TLK2. We demonstrate that proteogenomic analysis of breast cancer elucidates functional consequences of somatic mutations, narrows candidate nominations for driver genes within large deletions and amplified regions, and identifies therapeutic targets

    A chemical survey of exoplanets with ARIEL

    Get PDF
    Thousands of exoplanets have now been discovered with a huge range of masses, sizes and orbits: from rocky Earth-like planets to large gas giants grazing the surface of their host star. However, the essential nature of these exoplanets remains largely mysterious: there is no known, discernible pattern linking the presence, size, or orbital parameters of a planet to the nature of its parent star. We have little idea whether the chemistry of a planet is linked to its formation environment, or whether the type of host star drives the physics and chemistry of the planet’s birth, and evolution. ARIEL was conceived to observe a large number (~1000) of transiting planets for statistical understanding, including gas giants, Neptunes, super-Earths and Earth-size planets around a range of host star types using transit spectroscopy in the 1.25–7.8 ÎŒm spectral range and multiple narrow-band photometry in the optical. ARIEL will focus on warm and hot planets to take advantage of their well-mixed atmospheres which should show minimal condensation and sequestration of high-Z materials compared to their colder Solar System siblings. Said warm and hot atmospheres are expected to be more representative of the planetary bulk composition. Observations of these warm/hot exoplanets, and in particular of their elemental composition (especially C, O, N, S, Si), will allow the understanding of the early stages of planetary and atmospheric formation during the nebular phase and the following few million years. ARIEL will thus provide a representative picture of the chemical nature of the exoplanets and relate this directly to the type and chemical environment of the host star. ARIEL is designed as a dedicated survey mission for combined-light spectroscopy, capable of observing a large and well-defined planet sample within its 4-year mission lifetime. Transit, eclipse and phase-curve spectroscopy methods, whereby the signal from the star and planet are differentiated using knowledge of the planetary ephemerides, allow us to measure atmospheric signals from the planet at levels of 10–100 part per million (ppm) relative to the star and, given the bright nature of targets, also allows more sophisticated techniques, such as eclipse mapping, to give a deeper insight into the nature of the atmosphere. These types of observations require a stable payload and satellite platform with broad, instantaneous wavelength coverage to detect many molecular species, probe the thermal structure, identify clouds and monitor the stellar activity. The wavelength range proposed covers all the expected major atmospheric gases from e.g. H2O, CO2, CH4 NH3, HCN, H2S through to the more exotic metallic compounds, such as TiO, VO, and condensed species. Simulations of ARIEL performance in conducting exoplanet surveys have been performed – using conservative estimates of mission performance and a full model of all significant noise sources in the measurement – using a list of potential ARIEL targets that incorporates the latest available exoplanet statistics. The conclusion at the end of the Phase A study, is that ARIEL – in line with the stated mission objectives – will be able to observe about 1000 exoplanets depending on the details of the adopted survey strategy, thus confirming the feasibility of the main science objectives.Peer reviewedFinal Published versio
    • 

    corecore