10 research outputs found

    Proceedings of the 18th Annual Meeting of the Asia Oceania Geosciences Society (AOGS 2021)

    The 18th Annual Meeting of the Asia Oceania Geosciences Society (AOGS 2021) was held from 1st to 6th August 2021. This proceedings volume includes selected extended abstracts from a challenging array of presentations at this conference. The AOGS Annual Meeting is a leading venue for professional interaction among researchers and practitioners, covering diverse disciplines of geosciences

    A comprehensive review of crop yield prediction using machine learning approaches with special emphasis on palm oil yield prediction

    An early and reliable estimate of crop yield is essential for quantitative and financial evaluation at the field level, for setting strategic plans for import-export policies on agricultural commodities, and for doubling farmers’ incomes. Predicting crop yield with machine learning algorithms, with a view to achieving higher yields, is one of the challenging issues in the agricultural sector. Given the growing significance of crop yield prediction, this article provides an exhaustive review of the use of machine learning algorithms to predict crop yield, with special emphasis on palm oil yield prediction. First, the current status of palm oil yield around the world is presented, along with a brief overview of widely used features and prediction algorithms. Then, a critical evaluation of state-of-the-art machine learning-based crop yield prediction, machine learning applications in the palm oil industry and a comparative analysis of related studies are presented. Next, the advantages and difficulties of machine learning-based crop yield prediction are studied in detail, and current and future challenges to the agricultural industry are identified. Potential solutions are also proposed to alleviate existing problems in crop yield prediction. Since one of the major objectives of this study is to explore the future perspectives of machine learning-based palm oil yield prediction, areas including the application of remote sensing, plant growth and disease recognition, mapping and tree counting, and optimum features and algorithms are broadly discussed. Finally, a prospective architecture for machine learning-based palm oil yield prediction is proposed based on the critical evaluation of existing related studies. This technology will fulfill its promise by addressing new research challenges in the analysis of crop yield prediction and the development

    Water requirements and cereal yields in the Southern Mediterranean: observation, seasonal forecasting and impact of climate change

    The agricultural sector is one of the pillars of the Moroccan economy. In addition to contributing 15% of Gross Domestic Product (GDP) and providing 35% of employment opportunities, it has an impact on growth rates, which are negatively or positively affected by climatic conditions, rainfall in particular. During drought years, characterized by a decline in agricultural production and in cereal production in particular, the growth of the Moroccan economy was severely affected and the kingdom's food imports increased significantly. In this context, it is important to assess the impact of agricultural drought on cereal yields, to develop early yield prediction models, and to determine the future impact of climate change on wheat yield and water requirements. The first aim of this work is to better understand the link between cereal yield and agricultural drought in Morocco. To detect drought, we used agricultural drought indices derived from several satellite data sets, together with the outputs of a Land Data Assimilation System (LDAS). The second aim is to develop empirical models for early prediction of cereal yields at the provincial scale. To this end, we built forecasting models using multi-source data as predictors, including remote sensing-based indices, weather data and regional climate indices, relying on machine learning algorithms such as Multiple Linear Regression (MLR), Support Vector Machine (SVM), Random Forest (RF) and eXtreme Gradient Boost (XGBoost). The third aim is to evaluate the impact of climate change on wheat yield and its water requirements. To do this, we relied on five regional climate models available in the Med-CORDEX database under two scenarios, RCP4.5 and RCP8.5, as well as on the AquaCrop model, and considered three periods: the reference period 1991-2010, a second period 2041-2060 and a third period 2081-2100. The results showed a close correlation between cereal yield and drought indices related to vegetation condition during the heading stage (March and April), to surface temperature during the development stage (January-February), and to soil moisture during the emergence stage (November-December). The results also showed that the LDAS outputs are able to accurately monitor agricultural drought. Concerning cereal yield forecasting, models combining data from multiple sources outperformed models based on a single source. In this context, the XGBoost model was able to predict cereal yield as early as January (about four months before harvest) with satisfactory statistical metrics (R² = 0.88 and RMSE = 0.22 t ha^-1). Regarding the impact of climate change on wheat yield and water requirements, the results showed that the increase in air temperature will shorten the wheat growth cycle by about 50 days. The results also showed a decrease in wheat yield of up to 30% if the rise in CO2 is not taken into account. However, the CO2 fertilization effect can offset the yield losses, and yield can increase by up to 27%. Finally, water requirements are expected to decrease by 13 to 42%, and this decrease is associated with a change in the irrigation schedule, with the peak requirement arriving two months earlier than under current conditions.
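
The two forecast metrics quoted in this abstract (R² and RMSE) can be illustrated with a minimal Python sketch. It assumes the common definitions, with R² taken as 1 - SS_res / SS_tot; the thesis may use a different convention, such as the squared correlation coefficient. The function names and the yield values are made up for illustration and are not taken from the thesis.

```python
import math

def rmse(observed, predicted):
    """Root mean square error, in the same units as the inputs (e.g. t/ha)."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def r_squared(observed, predicted):
    """Coefficient of determination, assumed here to be 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot

# Made-up illustrative yields in t/ha (not data from the thesis).
observed = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]

print(f"RMSE = {rmse(observed, predicted):.3f} t/ha")
print(f"R2   = {r_squared(observed, predicted):.3f}")
```

For these made-up values the RMSE is about 0.158 t/ha and R² is 0.98. RMSE keeps the units of the yield, which is why the abstract reports it in t ha^-1, while R² is dimensionless.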

    Achieving sustainable development goals coupling earth observation data with machine learning

    Doctoral thesis in Industrial Engineering and Management, Universidade Lusíada, Vila Nova de Famalicão, 2021. Public examination held on 9 June 2022. The main purpose of this work is to assess and understand the achievement of the Sustainable Development Goals (SDGs) by means of Earth Observation (EO) data and Machine Learning (ML) technologies. The study thus intends to promote the use of EO and ML for the benefit of the SDGs, to support and optimize current industry and field processes, and to provide new insights (techniques) into EO approaches and their applicability as well as into ML techniques. A review of the Sustainable Development concept and its goals is presented, followed by the EO data, methods and approaches relevant to this field, with special attention to the contribution of ML methods and algorithms and to their potential to support the achievement of the SDGs. Additionally, different ML approaches and techniques are reviewed (i.e., classification and regression techniques and the Non-Binary Decision Tree (NBDT)), and two novel methods are proposed: a Random Forest built on the Non-Binary Decision Tree (NBRF) and a Fusion of techniques. Both developed methods are applied, optimized and validated on two case studies aligned with specific SDGs: Case study I, identification and mapping of healthy or infected crops (SDGs 2, 8, 9 and 12); and Case study II, deep-sea mining exploitation (SDGs 8, 9, 12 and 14). This is achieved using data provided by European satellites or programmes, which also contributes to the goals of Europe’s Space Strategy. For Case study I, the parameters analysed to achieve the respective SDGs are several vegetation indices and the values of the spectral bands. These parameters were extracted from EO data (Sentinel-2) and validated with different ML approaches. The results suggest that, for Case study I, the best classification technique (overall accuracy of 92.87%) and the best regression technique (root mean square error of 0.148) both correspond to the Fusion of techniques. All the applied techniques nevertheless prove applicable to this case study with good results, except the NBDT, which is the “weakest” one (best result on all tests: accuracy of 57.07%). For Case study II, the parameters analysed to achieve the respective SDGs are the topography of the seafloor and physical and biogeochemical ocean parameters. These parameters were extracted from EO data (CMEMS and GEBCO) and validated with different ML approaches. The results suggest that the best technique is the Fusion of techniques, with a root mean square error of 0.196. However, not all the techniques proved appropriate: the NBDT presents the worst results (best result on all tests: accuracy of 60.62%). Overall, EO plays a key role in monitoring and achieving the SDGs, given its cost-effectiveness in data acquisition at all scales, its information richness, and the success of ML in EO data analysis. Both case studies show that ML techniques combined with EO data can contribute to the SDGs and can be extrapolated to other applications and fields. Keywords: Sustainable Development Goals; Earth Observation; Europe Space Strategy; Machine Learning; Deep-sea Mining; Agriculture

    Hydraulic-hydrologic model for the Zambezi River using satellite data and artificial intelligence techniques

    The development of hydraulic-hydrologic models is a challenge for large catchment areas with scarce or erroneous measurement data and observations. In his study, Dr. José Pedro Matos made several original contributions to overcome this challenge. The scientific developments were applied to the Zambezi River basin in Africa within the framework of the interdisciplinary African Dams research project (ADAPT). First, procedures and selection criteria for satellite data on topography, rainfall, land use, soil types and cover had to be developed. To extend the time span of the analysis, Dr. Matos introduced a novel Pattern-Oriented Memory (POM) historical rainfall interpolation methodology. When POM-interpolated rainfall is applied to hydrologic models, it opens up new possibilities for extended calibration and the simulation of historical events that would otherwise be difficult to exploit. A new scheme for rainfall aggregation was proposed, based on hydraulic considerations and easily implemented using remote sensing data, which enhanced forecasting results. Dr. Matos used machine-learning models in an innovative way for discharge forecasting and compared alternative models (e.g. Autoregressive Moving-Average (ARMA), Artificial Neural Networks (ANN) and Support-Vector Regression (SVR)). He then made significant contributions to the enhancement of rainfall aggregation techniques and to the study of the limitations inherent to the SVR forecasting model. He also proposed a novel method for developing empirical forecast probability distributions. Finally, Dr. Matos successfully calibrated, probably for the first time, a daily hydrological model covering the whole Zambezi River basin (ZRB).

    Error Propagation Analysis for Remotely Sensed Aboveground Biomass

    Edited version available. Full version will remain embargoed due to copyright. Above-Ground Biomass (AGB) assessment using remote sensing has been an active area of research since the 1970s. However, improvements in the reported accuracy of wide-scale studies remain relatively small. There is therefore a need to improve error analysis to answer the question: why is AGB assessment accuracy still in doubt? This project aimed to develop and implement a systematic quantitative methodology to analyse the uncertainty of remotely sensed AGB, including all perceptible error types, while reducing the associated costs and computational effort compared with conventional methods. An accuracy prediction tool was designed based on the inputs of previous studies and their resulting accuracy. The methodology included training a neural network tool to emulate human decision making for the optimal trade-off between cost and accuracy in forest biomass surveys. The training samples were based on outputs from a number of previous biomass surveys, including 64 optical data based studies, 62 Lidar data based studies, 100 Radar data based studies, and 50 combined data studies. The tool showed promising convergent results of medium production ability. However, it might take many years until enough studies are published to provide sufficient samples for accurate predictions. To provide field data for the next steps, 38 plots within six sites were scanned with a Leica ScanStation P20 terrestrial laser scanner. The Terrestrial Laser Scanning (TLS) data analysis used existing techniques such as 3D voxels and applied allometric equations, alongside exploring new features such as non-plane voxel layers, parent-child relationships between layers and skeletonising tree branches to speed up the overall processing time. The results were two maps for each plot: a tree trunk map and a branch map. An error analysis tool was designed to work in three stages. Stage 1 uses a Taylor method to propagate errors from remote sensing data for the products that were used as direct inputs to the biomass assessment process. Stage 2 applies a Monte Carlo method to propagate errors from the direct remote sensing and field inputs to the mathematical model. Stage 3 generates an error estimation model that is trained on the error behaviour of the training samples. The tool was applied to four biomass assessment scenarios, and the results show that the relative error of AGB, represented by the RMSE of the model fitting, was high (20-35% of the AGB) in spite of the relatively high correlation coefficients. About 65% of the RMSE is due to the remote sensing and field data errors, with the remaining 35% due to the ill-defined relationship between the remote sensing data and AGB. The error component with the largest influence was the remote sensing error (50-60% of the propagated error), with both the spatial and spectral error components having a clear influence on the total error. The influence of field data errors was close to that of the remote sensing data errors (40-50% of the propagated error) and its spatial and non-spatial [...] Overall, the study successfully traced the errors and applied certainty scenarios using the software tool designed for this purpose. The novel approach applied allowed for a relatively fast solution when mapping errors outside the fieldwork areas. HCED Iraq, Middle Technical Universit
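
The difference between the Taylor and Monte Carlo propagation methods named in this abstract can be sketched in Python as follows. This is a toy illustration and not the thesis tool: the allometric model `agb_model`, its coefficients, and the input values are hypothetical, and both methods are applied here to the same single-input function only so that their results can be compared.

```python
import random
import statistics

def agb_model(d, a=0.1, b=2.5):
    """Hypothetical allometric model: biomass = a * d**b (made-up coefficients)."""
    return a * d ** b

def taylor_sigma(d, sigma_d, a=0.1, b=2.5):
    """First-order Taylor propagation: sigma_y is approximately |dy/dd| * sigma_d."""
    derivative = a * b * d ** (b - 1)  # analytic derivative of a * d**b
    return abs(derivative) * sigma_d

def monte_carlo_sigma(d, sigma_d, n=20000, seed=0):
    """Monte Carlo propagation: sample the input, run the model, take the spread."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    samples = [agb_model(rng.gauss(d, sigma_d)) for _ in range(n)]
    return statistics.stdev(samples)

# Hypothetical input: a measured value of 20.0 with a standard error of 0.5.
print("Taylor      :", round(taylor_sigma(20.0, 0.5), 2))
print("Monte Carlo :", round(monte_carlo_sigma(20.0, 0.5), 2))
```

For a small input error and a smooth model the two estimates should agree closely. The Monte Carlo route needs many model evaluations but makes no linearity assumption, which is one plausible reason to reserve it for the stage that involves the mathematical model itself.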

    Advances in Evaporation and Evaporative Demand

    The importance of evapotranspiration is well-established in different disciplines such as hydrology, agronomy, climatology, and other geosciences. Reliable estimates of evapotranspiration are also vital to develop criteria for in-season irrigation management, water resource allocation, long-term estimates of water supply, demand and use, design and management of water resources infrastructure, and evaluation of the effect of land use and management changes on the water balance. The objective of this Special Issue is to define and discuss several ET terms, including potential, reference, and actual (crop) ET, and present a wide spectrum of innovative research papers and case studies

    Streamflow and soil moisture forecasting with hybrid data intelligent machine learning approaches: case studies in the Australian Murray-Darling basin

    For a drought-prone agricultural nation such as Australia, hydro-meteorological imbalances and increasing demand for water resources are severely constraining terrestrial water reservoirs and regional-scale agricultural productivity. Two important components of the terrestrial water reservoir, streamflow water level (SWL) and soil moisture (SM), are imperative for both agricultural and hydrological applications. Forecasted SWL and SM can enable prudent and sustainable decision-making for agriculture and water resources management. To feasibly emulate SWL and SM, data-intelligent machine learning models are a promising tool in today’s rapidly advancing data science era. Yet the naturally chaotic characteristics of hydro-meteorological variables, which can exhibit non-linear and non-stationary behaviors within the model dataset, are a key challenge for non-tuned machine learning models. Another important issue that could confound model accuracy or applicability is the selection of relevant features to emulate SWL and SM: too few inputs can provide insufficient information to construct an accurate model, while an excessive number of redundant model inputs could obscure the performance of the simulation algorithm. This research thesis focuses on the development of hybridized data-intelligent models for forecasting SWL and SM in the upper layer (surface to 0.2 m) and the lower layer (0.2–1.5 m depth) within the agricultural region of the Murray-Darling Basin, Australia. The SWL quantifies the availability of surface water resources, while the upper layer SM (or surface SM) is important for surface runoff, evaporation, and energy exchange at the Earth-atmosphere interface. The lower layer (or root zone) SM is essential for groundwater recharge, plant uptake and transpiration. The research study is built upon four primary objectives designed for the forecasting of SWL and SM, with subsequent robust evaluation by means of statistical metrics in tandem with diagnostic plots of observed and modeled datasets. The first objective establishes the importance of feature selection (or optimization) in forecasting monthly SWL at three study sites within the Murray-Darling Basin. An artificial neural network (ANN) model optimized with the iterative input selection (IIS) algorithm, named IIS-ANN, is developed, in which the IIS algorithm performs feature optimization. The IIS-ANN model outperforms the standalone models, and a further hybridization is performed by integrating a non-decimated, advanced maximum overlap discrete wavelet transform (MODWT) technique. The IIS-selected inputs are transformed into wavelet subseries via MODWT to unveil the embedded features, leading to the IIS-W-ANN model. The IIS-W-ANN outperforms the comparative IIS-W-M5 Model Tree, IIS-based and standalone models. In the second objective, improved self-adaptive multi-resolution analysis (MRA) techniques, ensemble empirical mode decomposition (EEMD) and complete ensemble empirical mode decomposition with adaptive noise (CEEMDAN), are used to address the non-stationarity issues in forecasting monthly upper and lower layer soil moisture at seven sites. The SM time series are decomposed using EEMD/CEEMDAN into their respective intrinsic mode functions (IMFs) and residual components. The significant lags identified with the partial autocorrelation function are then used as inputs to the extreme learning machine (ELM) and random forest (RF) models. The hybrid EEMD-ELM yielded better results than the CEEMDAN-ELM, EEMD-RF, CEEMDAN-RF and the classical ELM and RF models. Since SM depends on many influential meteorological, hydrological and atmospheric parameters, for the third objective sixty predictor inputs are collated for forecasting upper and lower layer soil moisture at four sites. An ANN-based ensemble committee of models (ANN-CoM) is developed, integrating a two-phase feature optimization via the Neighborhood Component Analysis based feature selection algorithm for regression (fsrnca) and a basic ELM. The ANN-CoM shows better predictive performance than the standalone second order Volterra, M5 Model Tree, RF, and ELM models. In the fourth objective, a new multivariate sequential EEMD based modelling approach is developed. Multivariate sequential EEMD is an advancement of the classical single-input EEMD approach and represents a further methodological improvement. This multivariate approach is developed to allow multiple inputs to be used in forecasting SM. The multivariate sequential EEMD, optimized with the cross-correlation function and the Boruta feature selection algorithm, is integrated with the ELM model to emulate weekly SM at four sites. The resulting hybrid multivariate sequential EEMD-Boruta-ELM performed better than its multivariate adaptive regression splines (MARS) counterpart (EEMD-Boruta-MARS) and the standalone ELM and MARS models. The research study ascertains the applicability of feature selection algorithms integrated with appropriate MRA for improved hydrological forecasting. Forecasting at shorter and near-real-time horizons (i.e., weekly) would help reinforce scientific tenets in designing knowledge-based systems for precision agriculture and climate change adaptation policy formulation.