78 research outputs found

    Two-step hyperparameter optimization method: Accelerating hyperparameter search by using a fraction of a training dataset

    Full text link
    Hyperparameter optimization (HPO) is an important step in machine learning (ML) model development, but common practices are archaic -- primarily relying on manual or grid searches. This is partly because adopting advanced HPO algorithms introduces added complexity to the workflow, leading to longer computation times. This poses a notable challenge to ML applications, as suboptimal hyperparameter selections curtail the potential of ML model performance, ultimately obstructing the full exploitation of ML techniques. In this article, we present a two-step HPO method as a strategic solution to curbing computational demands and wait times, gleaned from practical experiences in applied ML parameterization work. The initial phase involves a preliminary evaluation of hyperparameters on a small subset of the training dataset, followed by a re-evaluation of the top-performing candidate models post-retraining with the entire training dataset. This two-step HPO method is universally applicable across HPO search algorithms, and we argue it has attractive efficiency gains. As a case study, we present our recent application of the two-step HPO method to the development of neural network emulators for aerosol activation. Although our primary use case is a data-rich limit with many millions of samples, we also find that using up to 0.0025% of the data (a few thousand samples) in the initial step is sufficient to find optimal hyperparameter configurations from much more extensive sampling, achieving up to 135-times speedup. The benefits of this method materialize through an assessment of hyperparameters and model performance, revealing the minimal model complexity required to achieve the best performance. The assortment of top-performing models harvested from the HPO process allows us to choose a high-performing model with a low inference cost for efficient use in global climate models (GCMs)

    Direct Radiative Effect of Mineral Dust on the Development of African Easterly Wave in Late Summer, 2003-2007

    Get PDF
    Episodic events of both Saharan dust outbreaks and African Easterly Waves (AEWs) are observed to move westward over the eastern tropical Atlantic Ocean. The relationship between the warm, dry, and dusty Saharan Air Layer (SAL) on the nearby storms has been the subject of considerable debate. In this study, the Weather Research and Forecasting (WRF) model is used to investigate the radiative effect of dust on the development of AEWs during August and September, the months of maximum tropical cyclone activity, in years 2003-2007. The simulations show that dust radiative forcing enhances the convective instability of the environment. As a result, most AEWs intensify in the presence of a dust layer. The Lorenz energy cycle analysis reveals that the dust radiative forcing enhances the condensational heating, which elevates the zonal and eddy available potential energy. In turn, available potential energy is effectively converted to eddy kinetic energy, in which local convective overturning plays the primary role. The magnitude of the intensification effect depends on the initial environmental conditions, including moisture, baroclinity, and the depth of the boundary layer. We conclude that dust radiative forcing, albeit small, serves as a catalyst to promote local convection that facilitates AEW development

    Understanding and Visualizing Droplet Distributions in Simulations of Shallow Clouds

    Full text link
    Thorough analysis of local droplet-level interactions is crucial to better understand the microphysical processes in clouds and their effect on the global climate. High-accuracy simulations of relevant droplet size distributions from Large Eddy Simulations (LES) of bin microphysics challenge current analysis techniques due to their high dimensionality involving three spatial dimensions, time, and a continuous range of droplet sizes. Utilizing the compact latent representations from Variational Autoencoders (VAEs), we produce novel and intuitive visualizations for the organization of droplet sizes and their evolution over time beyond what is possible with clustering techniques. This greatly improves interpretation and allows us to examine aerosol-cloud interactions by contrasting simulations with different aerosol concentrations. We find that the evolution of the droplet spectrum is similar across aerosol levels but occurs at different paces. This similarity suggests that precipitation initiation processes are alike despite variations in onset times.Comment: 4 pages, 3 figures, accepted at NeurIPS 2023 (Machine Learning and the Physical Sciences Workshop

    Constraining Aging Processes of Black Carbon in the Community Atmosphere Model Using Environmental Chamber Measurements

    Get PDF
    The direct radiative forcing of black carbon aerosol (BC) on the Earth system remains unsettled, largely due to the uncertainty with physical properties of BC throughout their lifecycle. Here we show that ambient chamber measurements of BC properties provide a novel constraint on the crude BC aging representation in climate models. Observational evidence for significant absorption enhancement of BC can be reproduced when the aging processes in the four‐mode version of the Modal Aerosol Module (MAM4) aerosol scheme in the Community Atmosphere Model version 5 are calibrated by the recent in situ chamber measurements. An observation‐based scaling method is developed in the aging timescale calculation to alleviate the influence of biases in the simulated model chemical composition. Model sensitivity simulations suggest that the different monolayer settings in the BC aging parameterization of MAM4 can cause as large as 26% and 24% differences in BC burden and radiative forcing, respectively. We also find that an increase in coating materials (e.g., sulfate and secondary organic aerosols) reduces BC lifetime by increasing the hygroscopicity of the mixture but enhances its absorption, resulting in a net increase in BC direct radiative forcing. Our results suggest that accurate simulations of BC aging processes as well as other aerosol species are equally important in reducing the uncertainty of BC forcing estimation

    Aerosols in the E3SM Version 1: New Developments and Their Impacts on Radiative Forcing

    Full text link
    The new Energy Exascale Earth System Model Version 1 (E3SMv1) developed for the U.S. Department of Energy has significant new treatments of aerosols and lightĂą absorbing snow impurities as well as their interactions with clouds and radiation. This study describes seven sets of new aerosolĂą related treatments (involving emissions, new particle formation, aerosol transport, wet scavenging and resuspension, and snow radiative transfer) and examines how they affect global aerosols and radiative forcing in E3SMv1. Altogether, they give a reduced total aerosol radiative forcing (Ăą 1.6 W/m2) and sensitivity in cloud liquid water to aerosols, but an increased sensitivity in cloud droplet size to aerosols. A new approach for H2SO4 production and loss largely reduces a low bias in small particles concentrations and leads to substantial increases in cloud condensation nuclei concentrations and cloud radiative cooling. Emitting secondary organic aerosol precursor gases from elevated sources increases the column burden of secondary organic aerosol, contributing substantially to global clearĂą sky aerosol radiative cooling (Ăą 0.15 out of Ăą 0.5 W/m2). A new treatment of aerosol resuspension from evaporating precipitation, developed to remedy two shortcomings of the original treatment, produces a modest reduction in aerosols and cloud droplets; its impact depends strongly on the model physics and is much stronger in E3SM Version 0. New treatments of the mixing state and optical properties of snow impurities and snow grains introduce a positive presentĂą day shortwave radiative forcing (0.26 W/m2), but changes in aerosol transport and wet removal processes also affect the concentration and radiative forcing of lightĂą absorbing impurities in snow/ice.Plain Language SummaryAerosol and aerosolĂą cloud interactions continue to be a major uncertainty in Earth system models, impeding their ability to reproduce the observed historical warming and to project changes in global climate and water cycle. The U.S. DOE Energy Exascale Earth System Model version 1 (E3SMv1), a stateĂą ofĂą theĂą science Earth system model, was developed to use exascale computing to address the grand challenge of actionable predictions of variability and change in the Earth system critical to the energy sector. It has been publicly released with new treatments in many aspects, including substantial modifications to the physical treatments of aerosols in the atmosphere and lightĂą absorbing impurities in snow/ice, aimed at reducing some known biases or correcting model deficiencies in representing aerosols, their life cycle, and their impacts in various components of the Earth system. Compared to its predecessors (without the new treatments) and observations, E3SMv1 shows improvements in characterizing global distributions of aerosols and their radiative effects. We conduct sensitivity experiments to understand the impact of individual changes and provide guidance for future development of E3SM and other Earth system models.Key PointsA description and assessment of new aerosol treatments in the Energy Exascale Earth System Model Version 1 (E3SMv1) is providedContributions to the total aerosolĂą related radiative forcing by individual new treatments and different processes are quantifiedSome of the new treatments are found to depend on model physics and require further improvement for E3SM or other Earth system modelsPeer Reviewedhttps://deepblue.lib.umich.edu/bitstream/2027.42/153241/1/jame21034-sup-0001-Figure_SI-S01.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/153241/2/jame21034.pdfhttps://deepblue.lib.umich.edu/bitstream/2027.42/153241/3/jame21034_am.pd

    Constraining the Twomey effect from satellite observations: issues and perspectives

    Get PDF
    The Twomey effect describes the radiative forcing associated with a change in cloud albedo due to an increase in anthropogenic aerosol emissions. It is driven by the perturbation in cloud droplet number concentration (1Nd; ant) in liquid-water clouds and is currently understood to exert a cooling effect on climate. The Twomey effect is the key driver in the effective radiative forcing due to aerosol–cloud interactions, but rapid adjustments also contribute. These adjustments are essentially the responses of cloud fraction and liquid water path to 1Nd; ant and thus scale approximately with it. While the fundamental physics of the influence of added aerosol particles on the droplet concentration (Nd) is well described by established theory at the particle scale (micrometres), how this relationship is expressed at the large-scale (hundreds of kilometres) perturbation, 1Nd; ant, remains uncertain. The discrepancy between process understanding at particle scale and insufficient quantification at the climate-relevant large scale is caused by co-variability of aerosol particles and updraught velocity and by droplet sink processes. These operate at scales on the order of tens of metres at which only localised observations are available and at which no approach yet exists to quantify the anthropogenic perturbation. Different atmospheric models suggest diverse magnitudes of the Twomey effect even when applying the same anthropogenic aerosol emission perturbation. Thus, observational data are needed to quantify and constrain the Twomey effect. At the global scale, this means satellite data. There are four key uncertainties in determining 1Nd; ant, namely the quantification of (i) the cloud-active aerosol – the cloud condensation nuclei (CCN) concentrations at or above cloud base, (ii) Nd, (iii) the statistical approach for inferring the sensitivity of Nd to aerosol particles from the satellite data and (iv) uncertainty in the anthropogenic perturbation to CCN concentrations, which is not easily accessible from observational data. This review discusses deficiencies of current approaches for the different aspects of the problem and proposes several ways forward: in terms of CCN, retrievals of optical quantities such as aerosol optical depth suffer from a lack of vertical resolution, size and hygroscopicity information, non-direct relation to the concentration of aerosols, difficulty to quantify it within or below clouds, and the problem of insufficient sensitivity at low concentrations, in addition to retrieval errors. A future path forward can include utilising co-located polarimeter and lidar instruments, ideally including high-spectral-resolution lidar capability at two wavelengths to maximise vertically resolved size distribution information content. In terms of Nd, a key problem is the lack of operational retrievals of this quantity and the inaccuracy of the retrieval especially in broken-cloud regimes. As for the Nd-to-CCN sensitivity, key issues are the updraught distributions and the role of Nd sink processes, for which empirical assessments for specific cloud regimes are currently the best solutions. These considerations point to the conclusion that past studies using existing approaches have likely underestimated the true sensitivity and, thus, the radiative forcing due to the Twomey effect

    Opportunistic experiments to constrain aerosol effective radiative forcing

    Get PDF
    Aerosol–cloud interactions (ACIs) are considered to be the most uncertain driver of present-day radiative forcing due to human activities. The nonlinearity of cloud-state changes to aerosol perturbations make it challenging to attribute causality in observed relationships of aerosol radiative forcing. Using correlations to infer causality can be challenging when meteorological variability also drives both aerosol and cloud changes independently. Natural and anthropogenic aerosol perturbations from well-defined sources provide “opportunistic experiments” (also known as natural experiments) to investigate ACI in cases where causality may be more confidently inferred. These perturbations cover a wide range of locations and spatiotemporal scales, including point sources such as volcanic eruptions or industrial sources, plumes from biomass burning or forest fires, and tracks from individual ships or shipping corridors. We review the different experimental conditions and conduct a synthesis of the available satellite datasets and field campaigns to place these opportunistic experiments on a common footing, facilitating new insights and a clearer understanding of key uncertainties in aerosol radiative forcing. Cloud albedo perturbations are strongly sensitive to background meteorological conditions. Strong liquid water path increases due to aerosol perturbations are largely ruled out by averaging across experiments. Opportunistic experiments have significantly improved process-level understanding of ACI, but it remains unclear how reliably the relationships found can be scaled to the global level, thus demonstrating a need for deeper investigation in order to improve assessments of aerosol radiative forcing and climate change
    • 

    corecore