88 research outputs found

    Estimating parameters for probabilistic linkage of privacy-preserved datasets.

    Get PDF
    Background: Probabilistic record linkage is a process used to bring together person-based records from within the same dataset (de-duplication) or from disparate datasets using pairwise comparisons and matching probabilities. The linkage strategy and associated match probabilities are often estimated through investigations into data quality and manual inspection. However, as privacy-preserved datasets comprise encrypted data, such methods are not possible. In this paper, we present a method for estimating the probabilities and threshold values for probabilistic privacy-preserved record linkage using Bloom filters. Methods: Our method was tested through a simulation study using synthetic data, followed by an application using real-world administrative data. Synthetic datasets were generated with error rates from zero to 20% error. Our method was used to estimate parameters (probabilities and thresholds) for de-duplication linkages. Linkage quality was determined by F-measure. Each dataset was privacy-preserved using separate Bloom filters for each field. Match probabilities were estimated using the expectation-maximisation (EM) algorithm on the privacy-preserved data. Threshold cut-off values were determined by an extension to the EM algorithm allowing linkage quality to be estimated for each possible threshold. De-duplication linkages of each privacy-preserved dataset were performed using both estimated and calculated probabilities. Linkage quality using the F-measure at the estimated threshold values was also compared to the highest F-measure. Three large administrative datasets were used to demonstrate the applicability of the probability and threshold estimation technique on real-world data. Results: Linkage of the synthetic datasets using the estimated probabilities produced an F-measure that was comparable to the F-measure using calculated probabilities, even with up to 20% error. Linkage of the administrative datasets using estimated probabilities produced an F-measure that was higher than the F-measure using calculated probabilities. Further, the threshold estimation yielded results for F-measure that were only slightly below the highest possible for those probabilities. Conclusions: The method appears highly accurate across a spectrum of datasets with varying degrees of error. As there are few alternatives for parameter estimation, the approach is a major step towards providing a complete operational approach for probabilistic linkage of privacy-preserved datasets

    Accounting for density reduction and structural loss in standing dead trees: Implications for forest biomass and carbon stock estimates in the United States

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Standing dead trees are one component of forest ecosystem dead wood carbon (C) pools, whose national stock is estimated by the U.S. as required by the United Nations Framework Convention on Climate Change. Historically, standing dead tree C has been estimated as a function of live tree growing stock volume in the U.S.'s National Greenhouse Gas Inventory. Initiated in 1998, the USDA Forest Service's Forest Inventory and Analysis program (responsible for compiling the Nation's forest C estimates) began consistent nationwide sampling of standing dead trees, which may now supplant previous purely model-based approaches to standing dead biomass and C stock estimation. A substantial hurdle to estimating standing dead tree biomass and C attributes is that traditional estimation procedures are based on merchantability paradigms that may not reflect density reductions or structural loss due to decomposition common in standing dead trees. The goal of this study was to incorporate standing dead tree adjustments into the current estimation procedures and assess how biomass and C stocks change at multiple spatial scales.</p> <p>Results</p> <p>Accounting for decay and structural loss in standing dead trees significantly decreased tree- and plot-level C stock estimates (and subsequent C stocks) by decay class and tree component. At a regional scale, incorporating adjustment factors decreased standing dead quaking aspen biomass estimates by almost 50 percent in the Lake States and Douglas-fir estimates by more than 36 percent in the Pacific Northwest.</p> <p>Conclusions</p> <p>Substantial overestimates of standing dead tree biomass and C stocks occur when one does not account for density reductions or structural loss. Forest inventory estimation procedures that are descended from merchantability standards may need to be revised toward a more holistic approach to determining standing dead tree biomass and C attributes (i.e., attributes of tree biomass outside of sawlog portions). Incorporating density reductions and structural loss adjustments reduces uncertainty associated with standing dead tree biomass and C while improving consistency with field methods and documentation.</p

    Polyfunctional T-Cell Responses Are Disrupted by the Ovarian Cancer Ascites Environment and Only Partially Restored by Clinically Relevant Cytokines

    Get PDF
    Host T-cell responses are associated with favorable outcomes in epithelial ovarian cancer (EOC), but it remains unclear how best to promote these responses in patients. Toward this goal, we evaluated a panel of clinically relevant cytokines for the ability to enhance multiple T-cell effector functions (polyfunctionality) in the native tumor environment.Experiments were performed with resident CD8+ and CD4+ T cells in bulk ascites cell preparations from high-grade serous EOC patients. T cells were stimulated with α-CD3 in the presence of 100% autologous ascites fluid with or without exogenous IL-2, IL-12, IL-18 or IL-21, alone or in combination. T-cell proliferation (Ki-67) and function (IFN-γ, TNF-α, IL-2, CCL4, and CD107a expression) were assessed by multi-parameter flow cytometry. In parallel, 27 cytokines were measured in culture supernatants. While ascites fluid had variable effects on CD8+ and CD4+ T-cell proliferation, it inhibited T-cell function in most patient samples, with CD107a, IFN-γ, and CCL4 showing the greatest inhibition. This was accompanied by reduced levels of IL-1β, IL-1ra, IL-9, IL-17, G-CSF, GM-CSF, Mip-1α, PDGF-bb, and bFGF in culture supernatants. T-cell proliferation was enhanced by exogenous IL-2, but other T-cell functions were largely unaffected by single cytokines. The combination of IL-2 with cytokines engaging complementary signaling pathways, in particular IL-12 and IL-18, enhanced expression of IFN-γ, TNF-α, and CCL4 in all patient samples by promoting polyfunctional T-cell responses. Despite this, other functional parameters generally remained inhibited.The EOC ascites environment disrupts multiple T-cell functions, and exogenous cytokines engaging diverse signaling pathways only partially reverse these effects. Our results may explain the limited efficacy of cytokine therapies for EOC to date. Full restoration of T-cell function will require activation of signaling pathways beyond those engaged by IL-2, IL-12, IL-18, and IL-21

    Major Depletion of Plasmacytoid Dendritic Cells in HIV-2 Infection, an Attenuated Form of HIV Disease

    Get PDF
    Plasmacytoid dendritic cells (pDC) provide an important link between innate and acquired immunity, mediating their action mainly through IFN-α production. pDC suppress HIV-1 replication, but there is increasing evidence suggesting they may also contribute to the increased levels of cell apoptosis and pan-immune activation associated with disease progression. Although having the same clinical spectrum, HIV-2 infection is characterized by a strikingly lower viremia and a much slower rate of CD4 decline and AIDS progression than HIV-1, irrespective of disease stage. We report here a similar marked reduction in circulating pDC levels in untreated HIV-1 and HIV-2 infections in association with CD4 depletion and T cell activation, in spite of the undetectable viremia found in the majority of HIV-2 patients. Moreover, the same overexpression of CD86 and PD-L1 on circulating pDC was found in both infections irrespective of disease stage or viremia status. Our observation that pDC depletion occurs in HIV-2 infected patients with undetectable viremia indicates that mechanisms other than direct viral infection determine the pDC depletion during persistent infections. However, viremia was associated with an impairment of IFN-α production on a per pDC basis upon TLR9 stimulation. These data support the possibility that diminished function in vitro may relate to prior activation by HIV virions in vivo, in agreement with our finding of higher expression levels of the IFN-α inducible gene, MxA, in HIV-1 than in HIV-2 individuals. Importantly, serum IFN-α levels were not elevated in HIV-2 infected individuals. In conclusion, our data in this unique natural model of “attenuated” HIV immunodeficiency contribute to the understanding of pDC biology in HIV/AIDS pathogenesis, showing that in the absence of detectable viremia a major depletion of circulating pDC in association with a relatively preserved IFN-α production does occur

    Shared heritability of attention-deficit/hyperactivity disorder and autism spectrum disorder

    Get PDF
    Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD) are both highly heritable neurodevelopmental disorders. Evidence indicates both disorders co-occur with a high frequency, in 20–50% of children with ADHD meeting criteria for ASD and in 30-80% of ASD children meeting criteria for ADHD. This review will provide an overview on all available studies [family based, twin, candidate gene, linkage, and genome wide association (GWA) studies] shedding light on the role of shared genetic underpinnings of ADHD and ASD. It is concluded that family and twin studies do provide support for the hypothesis that ADHD and ASD originate from partly similar familial/genetic factors. Only a few candidate gene studies, linkage studies and GWA studies have specifically addressed this co-occurrence, pinpointing to some promising pleiotropic genes, loci and single nucleotide polymorphisms (SNPs), but the research field is in urgent need for better designed and powered studies to tackle this complex issue. We propose that future studies examining shared familial etiological factors for ADHD and ASD use a family-based design in which the same phenotypic (ADHD and ASD), candidate endophenotypic, and environmental measurements are obtained from all family members. Multivariate multi-level models are probably best suited for the statistical analysis

    A Functional Phylogenomic View of the Seed Plants

    Get PDF
    A novel result of the current research is the development and implementation of a unique functional phylogenomic approach that explores the genomic origins of seed plant diversification. We first use 22,833 sets of orthologs from the nuclear genomes of 101 genera across land plants to reconstruct their phylogenetic relationships. One of the more salient results is the resolution of some enigmatic relationships in seed plant phylogeny, such as the placement of Gnetales as sister to the rest of the gymnosperms. In using this novel phylogenomic approach, we were also able to identify overrepresented functional gene ontology categories in genes that provide positive branch support for major nodes prompting new hypotheses for genes associated with the diversification of angiosperms. For example, RNA interference (RNAi) has played a significant role in the divergence of monocots from other angiosperms, which has experimental support in Arabidopsis and rice. This analysis also implied that the second largest subunit of RNA polymerase IV and V (NRPD2) played a prominent role in the divergence of gymnosperms. This hypothesis is supported by the lack of 24nt siRNA in conifers, the maternal control of small RNA in the seeds of flowering plants, and the emergence of double fertilization in angiosperms. Our approach takes advantage of genomic data to define orthologs, reconstruct relationships, and narrow down candidate genes involved in plant evolution within a phylogenomic view of species' diversification

    International comparisons of laboratory values from the 4CE collaborative to predict COVID-19 mortality

    Get PDF
    Given the growing number of prediction algorithms developed to predict COVID-19 mortality, we evaluated the transportability of a mortality prediction algorithm using a multi-national network of healthcare systems. We predicted COVID-19 mortality using baseline commonly measured laboratory values and standard demographic and clinical covariates across healthcare systems, countries, and continents. Specifically, we trained a Cox regression model with nine measured laboratory test values, standard demographics at admission, and comorbidity burden pre-admission. These models were compared at site, country, and continent level. Of the 39,969 hospitalized patients with COVID-19 (68.6% male), 5717 (14.3%) died. In the Cox model, age, albumin, AST, creatine, CRP, and white blood cell count are most predictive of mortality. The baseline covariates are more predictive of mortality during the early days of COVID-19 hospitalization. Models trained at healthcare systems with larger cohort size largely retain good transportability performance when porting to different sites. The combination of routine laboratory test values at admission along with basic demographic features can predict mortality in patients hospitalized with COVID-19. Importantly, this potentially deployable model differs from prior work by demonstrating not only consistent performance but also reliable transportability across healthcare systems in the US and Europe, highlighting the generalizability of this model and the overall approach

    HIV interactions with monocytes and dendritic cells: viral latency and reservoirs

    Get PDF
    HIV is a devastating human pathogen that causes serious immunological diseases in humans around the world. The virus is able to remain latent in an infected host for many years, allowing for the long-term survival of the virus and inevitably prolonging the infection process. The location and mechanisms of HIV latency are under investigation and remain important topics in the study of viral pathogenesis. Given that HIV is a blood-borne pathogen, a number of cell types have been proposed to be the sites of latency, including resting memory CD4+ T cells, peripheral blood monocytes, dendritic cells and macrophages in the lymph nodes, and haematopoietic stem cells in the bone marrow. This review updates the latest advances in the study of HIV interactions with monocytes and dendritic cells, and highlights the potential role of these cells as viral reservoirs and the effects of the HIV-host-cell interactions on viral pathogenesis
    corecore