43 research outputs found
A slice classification neural network for automated classification of axial PET/CT slices from a multi-centric lymphoma dataset
Automated slice classification is clinically relevant since it can be
incorporated into medical image segmentation workflows as a preprocessing step
that would flag slices with a higher probability of containing tumors, thereby
directing physicians attention to the important slices. In this work, we train
a ResNet-18 network to classify axial slices of lymphoma PET/CT images
(collected from two institutions) depending on whether the slice intercepted a
tumor (positive slice) in the 3D image or if the slice did not (negative
slice). Various instances of the network were trained on 2D axial datasets
created in different ways: (i) slice-level split and (ii) patient-level split;
inputs of different types were used: (i) only PET slices and (ii) concatenated
PET and CT slices; and different training strategies were employed: (i)
center-aware (CAW) and (ii) center-agnostic (CAG). Model performances were
compared using the area under the receiver operating characteristic curve
(AUROC) and the area under the precision-recall curve (AUPRC), and various
binary classification metrics. We observe and describe a performance
overestimation in the case of slice-level split as compared to the
patient-level split training. The model trained using patient-level split data
with the network input containing only PET slices in the CAG training regime
was the best performing/generalizing model on a majority of metrics. Our models
were additionally more closely compared using the sensitivity metric on the
positive slices from their respective test sets.Comment: 10 pages, 6 figures, 2 table
Land use and Europe’s renewable energy transition: identifying low-conflict areas for wind and solar development
Continued dependence on imported fossil fuels is rapidly becoming unsustainable in the face of the twin challenges of global climate change and energy security demands in Europe. Here we present scenarios in line with REPowerEU package to identify Renewables Acceleration Areas that support rapid renewable expansion, while ensuring minimal harm to places important for biodiversity and rural communities. We calculated the area needed to meet renewable energy objectives under Business-as-Usual (BAU) and Low-conflict (LCON) development scenarios within each country, providing a broad overview of the potential for renewable energy generation to reduce impacts when development is steered toward lower conflict lands. Our analysis shows that meeting renewable energy objectives would require a network of land-based wind turbines and solar arrays encompassing upwards of 164,789 km2 by 2030 and 445,654 km2 by 2050, the latter roughly equivalent to the land area of Sweden. Our results highlight that BAU development patterns disproportionately target high-conflict land cover types. By 2030, depending on the development pathway, solar and wind development are projected to impact approximately 4,386–20,996 km2 and 65,735–138,454 km2 of natural and agricultural lands, respectively. As renewable energy objectives increase from 2030 to 2050 impacts to natural and agricultural lands also increase, with upwards of 33,911 km2 from future solar development and 399,879 km2 from wind development. Despite this large footprint, low-conflict lands can generate substantial renewable energy: 6.6 million GWh of solar and 3.5 million GWh of wind, 8–31 times 2030 solar objectives and 3–5 times 2030 wind objectives. Given these patterns, we emphasize the need for careful planning in areas with greater impact potential, either due to a larger demand for land area or limited land availability. Top-emitting countries with large renewable energy objectives (Germany, Italy, Poland, France, Spain) and those with limited flexibility in meeting objectives on low-conflict land (Albania, Slovenia, Montenegro, Hungary, Croatia, Serbia, Bosnia Herzegovina, Finland, Greece, Portugal, and Norway) should be priorities for country-level customizations to guide low-conflict siting and avoid disproportionate impacts on high-value areas
Challenges of COVID-19 Case Forecasting in the US, 2020–2021
During the COVID-19 pandemic, forecasting COVID-19 trends to support planning and response was a priority for scientists and decision makers alike. In the United States, COVID-19 forecasting was coordinated by a large group of universities, companies, and government entities led by the Centers for Disease Control and Prevention and the US COVID-19 Forecast Hub (https://covid19forecasthub.org). We evaluated approximately 9.7 million forecasts of weekly state-level COVID-19 cases for predictions 1-4 weeks into the future submitted by 24 teams from August 2020 to December 2021. We assessed coverage of central prediction intervals and weighted interval scores (WIS), adjusting for missing forecasts relative to a baseline forecast, and used a Gaussian generalized estimating equation (GEE) model to evaluate differences in skill across epidemic phases that were defined by the effective reproduction number. Overall, we found high variation in skill across individual models, with ensemble-based forecasts outperforming other approaches. Forecast skill relative to the baseline was generally higher for larger jurisdictions (e.g., states compared to counties). Over time, forecasts generally performed worst in periods of rapid changes in reported cases (either in increasing or decreasing epidemic phases) with 95% prediction interval coverage dropping below 50% during the growth phases of the winter 2020, Delta, and Omicron waves. Ideally, case forecasts could serve as a leading indicator of changes in transmission dynamics. However, while most COVID-19 case forecasts outperformed a naïve baseline model, even the most accurate case forecasts were unreliable in key phases. Further research could improve forecasts of leading indicators, like COVID-19 cases, by leveraging additional real-time data, addressing performance across phases, improving the characterization of forecast confidence, and ensuring that forecasts were coherent across spatial scales. In the meantime, it is critical for forecast users to appreciate current limitations and use a broad set of indicators to inform pandemic-related decision making
Evaluation of individual and ensemble probabilistic forecasts of COVID-19 mortality in the United States
Short-term probabilistic forecasts of the trajectory of the COVID-19 pandemic in the United States have served as a visible and important communication channel between the scientific modeling community and both the general public and decision-makers. Forecasting models provide specific, quantitative, and evaluable predictions that inform short-term decisions such as healthcare staffing needs, school closures, and allocation of medical supplies. Starting in April 2020, the US COVID-19 Forecast Hub (https://covid19forecasthub.org/) collected, disseminated, and synthesized tens of millions of specific predictions from more than 90 different academic, industry, and independent research groups. A multimodel ensemble forecast that combined predictions from dozens of groups every week provided the most consistently accurate probabilistic forecasts of incident deaths due to COVID-19 at the state and national level from April 2020 through October 2021. The performance of 27 individual models that submitted complete forecasts of COVID-19 deaths consistently throughout this year showed high variability in forecast skill across time, geospatial units, and forecast horizons. Two-thirds of the models evaluated showed better accuracy than a naïve baseline model. Forecast accuracy degraded as models made predictions further into the future, with probabilistic error at a 20-wk horizon three to five times larger than when predicting at a 1-wk horizon. This project underscores the role that collaboration and active coordination between governmental public-health agencies, academic modeling teams, and industry partners can play in developing modern modeling capabilities to support local, state, and federal response to outbreaks
The United States COVID-19 Forecast Hub dataset
Academic researchers, government agencies, industry groups, and individuals have produced forecasts at an unprecedented scale during the COVID-19 pandemic. To leverage these forecasts, the United States Centers for Disease Control and Prevention (CDC) partnered with an academic research lab at the University of Massachusetts Amherst to create the US COVID-19 Forecast Hub. Launched in April 2020, the Forecast Hub is a dataset with point and probabilistic forecasts of incident cases, incident hospitalizations, incident deaths, and cumulative deaths due to COVID-19 at county, state, and national, levels in the United States. Included forecasts represent a variety of modeling approaches, data sources, and assumptions regarding the spread of COVID-19. The goal of this dataset is to establish a standardized and comparable set of short-term forecasts from modeling teams. These data can be used to develop ensemble models, communicate forecasts to the public, create visualizations, compare models, and inform policies regarding COVID-19 mitigation. These open-source data are available via download from GitHub, through an online API, and through R packages
Autoantibodies against type I IFNs in patients with critical influenza pneumonia
In an international cohort of 279 patients with hypoxemic influenza pneumonia, we identified 13 patients (4.6%) with autoantibodies neutralizing IFN-alpha and/or -omega, which were previously reported to underlie 15% cases of life-threatening COVID-19 pneumonia and one third of severe adverse reactions to live-attenuated yellow fever vaccine. Autoantibodies neutralizing type I interferons (IFNs) can underlie critical COVID-19 pneumonia and yellow fever vaccine disease. We report here on 13 patients harboring autoantibodies neutralizing IFN-alpha 2 alone (five patients) or with IFN-omega (eight patients) from a cohort of 279 patients (4.7%) aged 6-73 yr with critical influenza pneumonia. Nine and four patients had antibodies neutralizing high and low concentrations, respectively, of IFN-alpha 2, and six and two patients had antibodies neutralizing high and low concentrations, respectively, of IFN-omega. The patients' autoantibodies increased influenza A virus replication in both A549 cells and reconstituted human airway epithelia. The prevalence of these antibodies was significantly higher than that in the general population for patients 70 yr of age (3.1 vs. 4.4%, P = 0.68). The risk of critical influenza was highest in patients with antibodies neutralizing high concentrations of both IFN-alpha 2 and IFN-omega (OR = 11.7, P = 1.3 x 10(-5)), especially those <70 yr old (OR = 139.9, P = 3.1 x 10(-10)). We also identified 10 patients in additional influenza patient cohorts. Autoantibodies neutralizing type I IFNs account for similar to 5% of cases of life-threatening influenza pneumonia in patients <70 yr old
privGAN: Protecting GANs from membership inference attacks at low cost to utility
Abstract
Generative Adversarial Networks (GANs) have made releasing of synthetic images a viable approach to share data without releasing the original dataset. It has been shown that such synthetic data can be used for a variety of downstream tasks such as training classifiers that would otherwise require the original dataset to be shared. However, recent work has shown that the GAN models and their synthetically generated data can be used to infer the training set membership by an adversary who has access to the entire dataset and some auxiliary information. Current approaches to mitigate this problem (such as DPGAN [1]) lead to dramatically poorer generated sample quality than the original non–private GANs. Here we develop a new GAN architecture (privGAN), where the generator is trained not only to cheat the discriminator but also to defend membership inference attacks. The new mechanism is shown to empirically provide protection against this mode of attack while leading to negligible loss in downstream performances. In addition, our algorithm has been shown to explicitly prevent memorization of the training set, which explains why our protection is so effective. The main contributions of this paper are: i) we propose a novel GAN architecture that can generate synthetic data in a privacy preserving manner with minimal hyperparameter tuning and architecture selection, ii) we provide a theoretical understanding of the optimal solution of the privGAN loss function, iii) we empirically demonstrate the effectiveness of our model against several white and black–box attacks on several benchmark datasets, iv) we empirically demonstrate on three common benchmark datasets that synthetic images generated by privGAN lead to negligible loss in downstream performance when compared against non– private GANs. While we have focused on benchmarking privGAN exclusively on image datasets, the architecture of privGAN is not exclusive to image datasets and can be easily extended to other types of datasets. Repository link: https://github.com/microsoft/privGAN.</jats:p
Capturing COVID-19 Symptoms At-Scale using Banner Ads: A Novel Survey Methodology Pilot using an Online News Platform (Preprint)
BACKGROUND
Identifying new cases of COVID-19 is challenging. Not every suspected case undergoes testing, because testing kits and other equipment are limited in many parts of the world. Yet populations increasingly use the internet to manage both home and work life during the pandemic, giving researchers mediated connections to millions of people sheltering in place.
OBJECTIVE
To assess the feasibility of using an online news platform to recruit volunteers willing to report COVID-19 symptoms and behaviors.
METHODS
An online epidemiologic survey captured COVID-19 related symptoms and behaviors from individuals recruited through banner ads offered through Microsoft News. Respondents indicated whether they were experiencing symptoms, whether they received COVID-19 testing, and whether they traveled outside of their local area.
RESULTS
A total of 87,322 respondents completed the survey across a span of three weeks at the end of April 2020, with 54.3% responses from the U.S. and 32.0% from Japan. Of the respondents, 19,631 (22.5%) reported at least one symptom of COVID-19. Nearly two-fifths of these respondents (39.1%) reported >1 COVID-19 symptom. Just 4.8% of respondents reporting at least one symptom disclosed having received testing for COVID-19. Respondents were geographically diverse with all states and most ZIP Codes represented in the dataset. More than half of respondents from both countries were >50 years of age.
CONCLUSIONS
News platforms can be used to quickly recruit study participants, enabling collection of infectious disease symptoms at-scale and with populations that are distinct from those found through social media platforms. Such platforms could enable epidemiologists and researchers to quickly assess trends in emerging infections potentially before individuals present to clinics and hospitals for testing and/or treatment.
</sec
Capturing COVID-like Symptoms At-Scale using Banner Ads: A Novel Survey Methodology Pilot using an Online News Platform (Preprint)
Capturing COVID-19–Like Symptoms at Scale Using Banner Ads on an Online News Platform: Pilot Survey Study
Background: Identifying new COVID-19 cases is challenging. Not every suspected case undergoes testing, because testing kits and other equipment are limited in many parts of the world. Yet populations increasingly use the internet to manage both home and work life during the pandemic, giving researchers mediated connections to millions of people sheltering in place.
Objective: The goal of this study was to assess the feasibility of using an online news platform to recruit volunteers willing to report COVID-19–like symptoms and behaviors.
Methods: An online epidemiologic survey captured COVID-19–related symptoms and behaviors from individuals recruited through banner ads offered through Microsoft News. Respondents indicated whether they were experiencing symptoms, whether they received COVID-19 testing, and whether they traveled outside of their local area.
Results: A total of 87,322 respondents completed the survey across a 3-week span at the end of April 2020, with 54.3% of the responses from the United States and 32.0% from Japan. Of the total respondents, 19,631 (22.3%) reported at least one symptom associated with COVID-19. Nearly two-fifths of these respondents (39.1%) reported more than one COVID-19–like symptom. Individuals who reported being tested for COVID-19 were significantly more likely to report symptoms (47.7% vs 21.5%; P<.001). Symptom reporting rates positively correlated with per capita COVID-19 testing rates (R2=0.26; P<.001). Respondents were geographically diverse, with all states and most ZIP Codes represented. More than half of the respondents from both countries were older than 50 years of age.
Conclusions: News platforms can be used to quickly recruit study participants, enabling collection of infectious disease symptoms at scale and with populations that are older than those found through social media platforms. Such platforms could enable epidemiologists and researchers to quickly assess trends in emerging infections potentially before at-risk populations present to clinics and hospitals for testing and/or treatment
