430 research outputs found

    Statistical/climatic models to predict and project extreme precipitation events dominated by large-scale atmospheric circulation over the central-eastern China

    Get PDF
    Global warming has posed non-negligible effects on regional extreme precipitation changes and increased the uncertainties when meteorologists predict such extremes. More importantly, floods, landslides, and waterlogging caused by extreme precipitation have had catastrophic societal impacts and led to steep economic damages across the world, in particular over central-eastern China (CEC), where heavy precipitation due to the Meiyu-front and typhoon activities often causes flood disaster. There is mounting evidence that the anomaly atmospheric circulation systems and water vapor transport have a dominant role in triggering and maintaining the processes of regional extreme precipitation. Both understanding and accurately predicting extreme precipitation events based on these anomalous signals are hot issues in the field of hydrological research. In this thesis, the self-organizing map (SOM) and event synchronization were used to cluster the large-scale atmospheric circulation reflected by geopotential height at 500 hPa and to quantify the level of synchronization between the identified circulation patterns with extreme precipitation over CEC. With the understanding of which patterns were associated with extreme precipitation events, and corresponding water vapor transport fields, a hybrid deep learning model of multilayer perceptron and convolutional neural networks (MLP-CNN) was proposed to achieve the binary predictions of extreme precipitation. The inputs to MLP-CNN were the anomalous fields of GP at 500 hPa and vertically integrated water vapor transport (IVT). Compared with the original MLP, CNN, and two other machine learning models (random forest and support vector machine), MLP-CNN showed the best performance. Additionally, since the coarse spatial resolution of global circulation models and its large biases in extremes precipitation estimations, a new precipitation downscaling framework that combination of ensemble-learning and nonhomogeneous hidden Markov model (Ensemble-NHMM) was developed, to improve the reliabilities of GCMs in historical simulations and future projection. The performances of downscaled precipitation from reanalysis and GCM datasets were validated against the gauge observations and also compared with the results of traditional NHMM. Finally, the Ensemble-NHMM downscaling model was applied to future scenario data of GCM. On the projections of change trends in precipitation over CEC in the early-, medium- and late- 21st centuries under different emission scenarios, the possible causes were discussed in term of both thermodynamic and dynamic factors. Main results are enumerated as follows. (1) The large-scale atmospheric circulation patterns and associated water vapor transport fields synchronized with extreme precipitation events over CEC were quantitatively identified, as well as the contribution of circulation pattern changes to extreme precipitation changes and their teleconnection with the interdecadal modes of the ocean. Firstly, based on the nonparametric Pettitt test, it was found that 23% of rain gauges had significant abrupt changes in the annual extreme precipitation from 1960 to 2015. The average change point in the annual extreme precipitation frequency and amount occurred near 1989. Complex network analysis showed that the rain gauges highly synchronized on extreme precipitation events can be clustered into four clusters based on modularity information. Secondly, the dominant circulation patterns over CEC were robustly identified based on the SOM. From the period 1960–1989 to 1990–2015, the categories of identified circulation patterns generally remain almost unchanged. Among these, the circulation patterns characterized by obvious positive anomalies of 500 hPa geopotential height over the Eastern Eurasia continent and negative values over the surrounding oceans are highly synchronized with extreme precipitation events. An obvious water vapor channel originating from the northern Indian Ocean driven by the southwesterly airflow was observed for the representative circulation patterns (synchronized with extreme precipitation). Finally, the circulation pattern changes produced an increase in extreme precipitation frequency from 1960–1989 to 1990–2015. Empirical mode decomposition of the annual frequency variation signals in the representative circulation pattern showed that the 2–4 yr oscillation in the annual frequency was closely related to the phase of El Niño and Southern Oscillation (ENSO); while the 20–25 yr and 42–50 yr periodic oscillations were responses to the Pacific Decadal Oscillation and the Atlantic Multidecadal Oscillation. (2) A regional extreme precipitation prediction model was constructed. Two deep learning models-MLP and CNN were linearly stacked and used two atmospheric variables associated with extreme precipitation, that is, geopotential height at 500 hPa and IVT. The hybrid model can learn both the local-scale information with MLP and large-scale circulation information with CNN. Validation results showed that the MLP-CNN model can predict extreme or non-extreme precipitation days with an overall accuracy of 86%. The MLP-CNN also showed excellent seasonal transferability with an 81% accuracy on the testing set from different seasons of the training set. MLP-CNN significantly outperformed over other machine learning models, including MLP, CNN, random forest, and support vector machine. Additionally, the MLP-CNN can be used to produce precursor signals by 1 to 2 days, though the accuracy drops quickly as the number of precursor days increases. (3) The GCM seriously underestimated extreme precipitation over CEC but showed convincing results for reproducing large-scale atmospheric circulation patterns. The accuracies of 10 GCMs in extreme precipitation and large-scale atmospheric circulation simulations were evaluated. First, five indices were selected to measure the characteristics of extreme precipitation and the performances of GCMs were compared to the gauge-based daily precipitation analysis dataset over the Chinese mainland. The results showed that except for FGOALS-g3, most GCMs can reproduce the spatial distribution characteristics of the average precipitation from 1960 to 2015. However, all GCMs failed to accurately estimate the extreme precipitation with large underestimation (relative bias exceeds 85%). In addition, using the circulation patterns identified by the fifth-generation reanalysis data (ERA5) as benchmarks, GCMs can reproduce most CP types for the periods 1960–1989 and 1990–2015. In terms of the spatial similarity of the identified CPs, MPI-ESM1-2-HR was superior. (4) To improve the reliabilities of precipitation simulations and future projections from GCMs, a new statistical downscaling framework was proposed. This framework comprises two models, ensemble learning and NHMM. First, the extreme gradient boosting (XGBoost) and random forest (RF) were selected as the basic- and meta- classifiers for constructing the ensemble learning model. Based on the top 50 principal components of GP at 500 hPa and IVT, this model was trained to predict the occurrence probabilities for the different levels of daily precipitation (no rain, very light, light, moderate, and heavy precipitation) aggregated by multi-sites. Confusion matrix results showed that the ensemble learning model had sufficient accuracy (>88%) in classifying no rain or rain days and (>83%) predicting moderate precipitation events. Subsequently, precipitation downscaling was done using the probability sequences of daily precipitation as large-scale predictors to NHMM. Statistical metrics showed that the Ensemble-NHMM downscaled results matched best to the gauge observations in precipitation variabilities and extreme precipitation simulations, compared with the result from the one that directly used circulation variables as predictors. Finally, the downscaling model also performed well in the historical simulations of MPI-ESM1-2-HR, which reproduced the change trends of annual precipitation and the means of total extreme precipitation index. (5) Three climate scenarios with different Shared Socioeconomic Pathways and Representative Concentration Pathways (SSPs) were selected to project the future precipitation change trends. The Ensemble-NHMM downscaling model was applied to the scenario data from MPI-ESM1-2-HR. Projection results showed that the CEC would receive more precipitation in the future by ~30% through the 2075–2100 period. Compared to the recent 26-year epoch (1990–2015), the frequency and magnitude of extreme precipitation would increase by 21.9–48.1% and 12.3–38.3% respectively under the worst emission scenario (SSP585). In particular, the south CEC region is projected to receive more extreme precipitation than the north. Investigations of thermodynamic and dynamic factors showed that climate warming would increase the probability of stronger water vapor convergence over CEC. More wet weather states due to the enhanced water vapor transport, as well as the increased favoring large-scale atmospheric circulation and the strengthen pressure gradient would be the factors for the increased precipitation

    Physics-based Machine Learning Approaches to Complex Systems and Climate Analysis

    Get PDF
    Komplexe Systeme wie das Klima der Erde bestehen aus vielen Komponenten, die durch eine komplizierte Kopplungsstruktur miteinander verbunden sind. Für die Analyse solcher Systeme erscheint es daher naheliegend, Methoden aus der Netzwerktheorie, der Theorie dynamischer Systeme und dem maschinellen Lernen zusammenzubringen. Durch die Kombination verschiedener Konzepte aus diesen Bereichen werden in dieser Arbeit drei neuartige Ansätze zur Untersuchung komplexer Systeme betrachtet. Im ersten Teil wird eine Methode zur Konstruktion komplexer Netzwerke vorgestellt, die in der Lage ist, Windpfade des südamerikanischen Monsunsystems zu identifizieren. Diese Analyse weist u.a. auf den Einfluss der Rossby-Wellenzüge auf das Monsunsystem hin. Dies wird weiter untersucht, indem gezeigt wird, dass der Niederschlag mit den Rossby-Wellen phasenkohärent ist. So zeigt der erste Teil dieser Arbeit, wie komplexe Netzwerke verwendet werden können, um räumlich-zeitliche Variabilitätsmuster zu identifizieren, die dann mit Methoden der nichtlinearen Dynamik weiter analysiert werden können. Die meisten komplexen Systeme weisen eine große Anzahl von möglichen asymptotischen Zuständen auf. Um solche Zustände zu beschreiben, wird im zweiten Teil die Monte Carlo Basin Bifurcation Analyse (MCBB), eine neuartige numerische Methode, vorgestellt. Angesiedelt zwischen der klassischen Analyse mit Ordnungsparametern und einer gründlicheren, detaillierteren Bifurkationsanalyse, kombiniert MCBB Zufallsstichproben mit Clustering, um die verschiedenen Zustände und ihre Einzugsgebiete zu identifizieren. Bei von Vorhersagen von komplexen Systemen ist es nicht immer einfach, wie Vorwissen in datengetriebenen Methoden integriert werden kann. Eine Möglichkeit hierzu ist die Verwendung von Neuronalen Partiellen Differentialgleichungen. Hier wird im letzten Teil der Arbeit gezeigt, wie hochdimensionale räumlich-zeitlich chaotische Systeme mit einem solchen Ansatz modelliert und vorhergesagt werden können.Complex systems such as the Earth's climate are comprised of many constituents that are interlinked through an intricate coupling structure. For the analysis of such systems it therefore seems natural to bring together methods from network theory, dynamical systems theory and machine learning. By combining different concepts from these fields three novel approaches for the study of complex systems are considered throughout this thesis. In the first part, a novel complex network construction method is introduced that is able to identify the most important wind paths of the South American Monsoon system. Aside from the importance of cross-equatorial flows, this analysis points to the impact Rossby Wave trains have both on the precipitation and low-level circulation. This connection is then further explored by showing that the precipitation is phase coherent to the Rossby Wave. As such, the first part of this thesis demonstrates how complex networks can be used to identify spatiotemporal variability patterns within large amounts of data, that are then further analysed with methods from nonlinear dynamics. Most complex systems exhibit a large number of possible asymptotic states. To investigate and track such states, Monte Carlo Basin Bifurcation analysis (MCBB), a novel numerical method is introduced in the second part. Situated between the classical analysis with macroscopic order parameters and a more thorough, detailed bifurcation analysis, MCBB combines random sampling with clustering methods to identify and characterise the different asymptotic states and their basins of attraction. Forecasts of complex system are the next logical step. When doing so, it is not always straightforward how prior knowledge in data-driven methods. One possibility to do is by using Neural Partial Differential Equations. Here, it is demonstrated how high-dimensional spatiotemporally chaotic systems can be modelled and predicted with such an approach in the last part of the thesis

    Integrating climatic information in water resources modelling and optimisation

    Get PDF

    Nonlinear Dimensionality Reduction Methods in Climate Data Analysis

    Full text link
    Linear dimensionality reduction techniques, notably principal component analysis, are widely used in climate data analysis as a means to aid in the interpretation of datasets of high dimensionality. These linear methods may not be appropriate for the analysis of data arising from nonlinear processes occurring in the climate system. Numerous techniques for nonlinear dimensionality reduction have been developed recently that may provide a potentially useful tool for the identification of low-dimensional manifolds in climate data sets arising from nonlinear dynamics. In this thesis I apply three such techniques to the study of El Nino/Southern Oscillation variability in tropical Pacific sea surface temperatures and thermocline depth, comparing observational data with simulations from coupled atmosphere-ocean general circulation models from the CMIP3 multi-model ensemble. The three methods used here are a nonlinear principal component analysis (NLPCA) approach based on neural networks, the Isomap isometric mapping algorithm, and Hessian locally linear embedding. I use these three methods to examine El Nino variability in the different data sets and assess the suitability of these nonlinear dimensionality reduction approaches for climate data analysis. I conclude that although, for the application presented here, analysis using NLPCA, Isomap and Hessian locally linear embedding does not provide additional information beyond that already provided by principal component analysis, these methods are effective tools for exploratory data analysis.Comment: 273 pages, 76 figures; University of Bristol Ph.D. thesis; version with high-resolution figures available from http://www.skybluetrades.net/thesis/ian-ross-thesis.pdf (52Mb download

    Spatial-Temporal Data Mining for Ocean Science: Data, Methodologies, and Opportunities

    Full text link
    With the increasing amount of spatial-temporal~(ST) ocean data, numerous spatial-temporal data mining (STDM) studies have been conducted to address various oceanic issues, e.g., climate forecasting and disaster warning. Compared with typical ST data (e.g., traffic data), ST ocean data is more complicated with some unique characteristics, e.g., diverse regionality and high sparsity. These characteristics make it difficult to design and train STDM models. Unfortunately, an overview of these studies is still missing, hindering computer scientists to identify the research issues in ocean while discouraging researchers in ocean science from applying advanced STDM techniques. To remedy this situation, we provide a comprehensive survey to summarize existing STDM studies in ocean. Concretely, we first summarize the widely-used ST ocean datasets and identify their unique characteristics. Then, typical ST ocean data quality enhancement techniques are discussed. Next, we classify existing STDM studies for ocean into four types of tasks, i.e., prediction, event detection, pattern mining, and anomaly detection, and elaborate the techniques for these tasks. Finally, promising research opportunities are highlighted. This survey will help scientists from the fields of both computer science and ocean science have a better understanding of the fundamental concepts, key techniques, and open challenges of STDM in ocean

    Quantitative Modelling of Climate Change Impact on Hydro-climatic Extremes

    Get PDF
    In recent decades, climate change has caused a more volatile climate leading to more extreme events such as severe rainstorms, heatwaves and floods which are likely to become more frequent. Aiming to reveal climate change impact on the hydroclimatic extremes in a quantitative sense, this thesis presents a comprehensive analysis from three main strands. The first strand focuses on developing a quantitative modelling framework to quantify the spatiotemporal variation of hydroclimatic extremes for the areas of concern. A spatial random sampling toolbox (SRS-GDA) is designed for randomizing the regions of interest (ROIs) with different geographic locations, sizes, shapes and orientations where the hydroclimatic extremes are parameterised by a nonstationary distribution model whose parameters are assumed to be time-varying. The parameters whose variation with respect to different spatial features of ROIs and climate change are finally quantified by various statistical models such as the generalised linear model. The framework is applied to quantify the spatiotemporal variation of rainfall extremes in Great Britain (GB) and Australia and is further used in a comparison study to quantify the bias between observed and climate projected extremes. Then the framework is extended to a multivariate framework to estimate the time-varying joint probability of more than one hydroclimatic variable in the perspective of non-stationarity. A case study for evaluating compound floods in Ho Chi Minh City, Vietnam is applied for demonstrating the application of the framework. The second strand aims to recognise, classify and track the development of hydroclimatic extremes (e.g., severe rainstorms) by developing a stable computer algorithm (i.e., the SPER toolbox). The SPER toolbox can detect the boundary of the event area, extract the spatial and physical features of the event, which can be used not only for pattern recognition but also to support AI-based training for labelling/cataloguing the pattern from the large-sized, grid-based, multi-scaled environmental datasets. Three illustrative cases are provided; and as the front-end of AI study, an example for training a convolution neural network is given for classifying the rainfall extremes in the last century of GB. The third strand turns to support decision making by building both theory-driven and data-driven decision-making models to simulate the decisions in the context of flood forecasting and early warning, using the data collected via laboratory-style experiments based on various information of probabilistic flood forecasts and consequences. The research work demonstrated in this thesis has been able to bridge the knowledge gaps in the related field and it also provides a precritical insight in managing future risks arising from hydroclimatic extremes, which makes perfect sense given the urgent situation of climate change and the related challenges our societies are facing

    The predictability of UK drought using European weather patterns

    Get PDF
    PhD thesisThis thesis explores the use of a 167-year daily weather pattern (WP) classification (MO-30) in UK meteorological drought prediction. As MO-30 was recently introduced, necessary analyses as a precursor to building a forecast model are conducted. First, an exploratory analysis of MO30’s fundamental characteristics and its relation to UK precipitation and drought climatology is carried out. Second, two novel methods to find weekly to seasonal persistence in MO-30 are used in order to assess if there is any inherent predictability within MO-30. Third, a statistical model based on historical analogues for predicting 30-day periods of WPs is constructed, from which precipitation forecasts are derived. Finally, a dynamical ensemble prediction system is applied to forecast WPs, with resultant precipitation estimated in the same way as for the statistical method. MO-30 is shown to be suitable for precipitation-based analyses in the UK. Furthermore, intraWP precipitation variability, defined by the interquartile range, is lower in MO-30 compared to another commonly used WP classification. Six WPs are associated with nationwide drought, with several other WPs linked to regional drought. Results from the persistence analysis show that there are multi-month periods when small sets of four to six WPs dominate, and some of these periods coincide with notable meteorological events, including droughts and storms. Some WPs also behave as ‘attractors’, showing increased probability of reoccurrence despite other WPs occurring in-between. The statistical method for WP and precipitation forecasts is no more skilful than climatology, suggesting that the model did not adequately exploit the persistence identified previously. However, WPs are shown to be potentially useful for drought forecasting, as an idealised, perfect prognostic model (with WP observations as inputs rather than predictions) substantially improves skill, with a skill score of almost 0.5 (out of one) for north-eastern regions. Using a dynamical model to predict WPs, while keeping the precipitation estimation procedure the same as for the purely statistical method, yields overall higher skill compared to a benchmark statistical method for predicting droughts. The model also outperforms direct (modelled) dynamical precipitation forecasts for lead-times greater than 16 days during winter and autumn, with the greatest skill advantage for western regions. This is despite the relatively modest skill scores of all forecast models (rarely above 0.4). Again, high skill scores, of almost 0.8 on occasions, are achieved by the perfect prognostic model, demonstrating the potential for incorporating WPs into precipitation and drought forecast systems

    Long-term Landsat-based monthly burned area dataset for the Brazilian biomes using Deep Learning

    Get PDF
    Fire is a significant agent of landscape transformation on Earth, and a dynamic and ephemeral process that is challenging to map. Difficulties include the seasonality of native vegetation in areas affected by fire, the high levels of spectral heterogeneity due to the spatial and temporal variability of the burned areas, distinct persistence of the fire signal, increase in cloud and smoke cover surrounding burned areas, and difficulty in detecting understory fire signals. To produce a large-scale time-series of burned area, a robust number of observations and a more efficient sampling strategy is needed. In order to overcome these challenges, we used a novel strategy based on a machine-learning algorithm to map monthly burned areas from 1985 to 2020 using Landsat-based annual quality mosaics retrieved from minimum NBR values. The annual mosaics integrated year-round observations of burned and unburned spectral data (i.e., RED, NIR, SWIR-1, and SWIR-2), and used them to train a Deep Neural Network model, which resulted in annual maps of areas burned by land use type for all six Brazilian biomes. The annual dataset was used to retrieve the frequency of the burned area, while the date on which the minimum NBR was captured in a year, was used to reconstruct 36 years of monthly burned area. Results of this effort indicated that 19.6% (1.6 million km2) of the Brazilian territory was burned from 1985 to 2020, with 61% of this area burned at least once. Most of the burning (83%) occurred between July and October. The Amazon and Cerrado, together, accounted for 85% of the area burned at least once in Brazil. Native vegetation was the land cover most affected by fire, representing 65% of the burned area, while the remaining 35% burned in areas dominated by anthropogenic land uses, mainly pasture. This novel dataset is crucial for understanding the spatial and long-term temporal dynamics of fire regimes that are fundamental for designing appropriate public policies for reducing and controlling fires in Brazil

    Climate variability and extended range flood forecasting for the Amazon basin

    Get PDF
    The aim of this research is to investigate how large-scale climate variability affects flooding in the Amazon basin, using this assessment to demonstrate the potential predictability that these modes can provide to enable earlier warning of impactful floods. To address this a multi-stage approach is adopted; first to understand the gaps and confidence in the state of current knowledge on how climate variability affects both rainfall and river discharge in the Amazon basin, secondly, to understand the skill of global hydrological models for undertaking further assessment, and thirdly to undertake a robust assessment of the impact of climate variability on different flood characteristics while considering different methodological approaches in more detail. An assessment of the robustness in the results of previous studies suggests the need to explore in detail the physical mechanisms leading to flood events on an individual basis. While composite analysis of several floods identified a particular response associated with La Niña conditions, investigation into individual events show it is unknown if the same response would be identified for all events individually. The performance of eight large-scale hydrological models are evaluated for their ability to capture previous peak river flows. The choice of precipitation input is found to be the dominant component of the hydrometeorological modelling chain, with improvement found when ERA5 is the chosen meteorological forcing. Calibration of the Lisflood routing model is identified to have no impact on the ability to capture flood peaks, stressing the need to use an objective function that fits the purpose of the model. Examination of how climate variability impacts flood characteristics in the Amazon basin identified significant changes for both flood magnitude and duration during the negative ENSO phase, particularly in the north-eastern Amazon. This response was not identified for eastern Pacific ENSO events, highlighting how results can differ between ENSO types, while no notable impact or pattern is observed for flood timing. This thesis has provided important information on how climate variability impacts less studied flood characteristics (flood timing and duration) which are associated with important flood types (e.g. early or long floods). Future work should focus on the improvement of climate reanalysis to produce a longer-term dataset consistent with observations to extend climate analysis. This would allow the examination on the impact of climate phases at a more granular scale (e.g. analysing the strength or combination of climate phases
    corecore