10 research outputs found

    Appendix 1 for IUAVT

    No full text

    Assessing the Impact of Expert Labelling of Training Data on the Quality of Automatic Classification of Lithological Groups Using Artificial Neural Networks

    No full text
    Machine learning (ML) methods are nowadays widely used to automate geophysical study. Some of ML algorithms are used to solve lithological classification problems during uranium mining process. One of the key aspects of using classical ML methods is causing data features and estimating their influence on the classification. This paper presents a quantitative assessment of the impact of expert opinions on the classification process. In other words, we have prepared the data, identified the experts and performed a series of experiments with and without taking into account the fact that the expert identifier is supplied to the input of the automatic classifier during training and testing. Feedforward artificial neural network (ANN) has been used as a classifier. The results of the experiments show that the “knowledge” of the ANN of which expert interpreted the data improves the quality of the automatic classification in terms of accuracy (by 5 %) and recall (by 20 %). However, due to the fact that the input parameters of the model may depend on each other, the SHapley Additive exPlanations (SHAP) method has been used to further assess the impact of expert identifier. SHAP has allowed assessing the degree of parameter influence. It has revealed that the expert ID is at least two times more influential than any of the other input parameters of the neural network. This circumstance imposes significant restrictions on the application of ANNs to solve the task of lithological classification at the uranium deposits

    CLOUD SERVICES FOR NATURAL LANGUAGE PROCESSING

    No full text
    The paper presents the results of experiments conducted with the aim of a comparative analysis of the performance of the existing cloud services for natural language processing in Russian. The article provides an overview of 10 cloud services: TextRazor, RosetteTextAnalytics, EurekaEngine, CloudNaturalLanguage, Texterra, Pullenti, NER-ru, UDPipe, AOT, DeepPavlov. Quantitative studies of their performance were made for 6 of them. In the process of evaluating services, the execution of such functions as the part of speech tagging, sentiment analysis, named entity recognition and the categorization of texts were analyzed. For a comparative assessment of the quality of the services, the following competition materials were used: factRuEval-2016 (named entities), AlemResearch (sentiment) and the corpora, Taiga and OpenCorpora (part of speech). The named entities recognition quality was evaluated by calculating Accuracy, Precision, Recall, and F1 parameters. As a result of the study, it was shown that when solving natural language text processing tasks in Russian, the best result is shown by the EurekaEngine service for recognizing named entities and sentiment analysis of the text, RosetteTextAnalytics service proved best in part of speech tagging the and TextRazor service in text categorization

    Mass Media as a Mirror of the COVID-19 Pandemic

    No full text
    The media plays an important role in disseminating facts and knowledge to the public at critical times, and the COVID-19 pandemic is a good example of such a period. This research is devoted to performing a comparative analysis of the representation of topics connected with the pandemic in the internet media of Kazakhstan and the Russian Federation. The main goal of the research is to propose a method that would make it possible to analyze the correlation between mass media dynamic indicators and the World Health Organization COVID-19 data. In order to solve the task, three approaches related to the representation of mass media dynamics in numerical form—automatically obtained topics, average sentiment, and dynamic indicators—were proposed and applied according to a manually selected list of search queries. The results of the analysis indicate similarities and differences in the ways in which the epidemiological situation is reflected in publications in Russia and in Kazakhstan. In particular, the publication activity in both countries correlates with the absolute indicators, such as the daily number of new infections, and the daily number of deaths. However, mass media tend to ignore the positive rate of confirmed cases and the virus reproduction rate. If we consider strictness of quarantine measures, mass media in Russia show a rather high correlation, while in Kazakhstan, the correlation is much lower. Analysis of search queries revealed that in Kazakhstan the problem of fake news and disinformation is more acute during periods of deterioration of the epidemiological situation, when the level of crime and poverty increase. The novelty of this work is the proposal and implementation of a method that allows the performing of a comparative analysis of objective COVID-19 statistics and several mass media indicators. In addition, it is the first time that such a comparative analysis, between different countries, has been performed on a corpus in a language other than English

    Coverage Path Planning Optimization of Heterogeneous UAVs Group for Precision Agriculture

    No full text
    Precision farming is one of the ways of transition to the intensive methods of agricultural production. The case of application of unmanned aerial vehicles (UAVs) for solving problems of agriculture and animal husbandry is among the actively studied issues. The UAV is capable of solving the tasks of monitoring, fertilizing, herbicides, etc. However, the effective use of UAV requires to solve the tasks of flight planning, taking into account the heterogeneity of the available attachments and the problem solved in the process of the overflight. This research investigates the problem of flight planning of a group of heterogeneous UAVs applied to solving the issues of coverage, which may arise both in the course of monitoring and in the process of the implementation of agrotechnical measures. The method of coverage path planning of heterogenic UAVs group based on a genetic algorithm is proposed; this method provides planning of flight by a group of UAVs using a moving ground platform on which UAVs are recharged and refueled (multi heterogenic UAVs coverage path planning with moving ground platform (mhCPPmp)). This method allows calculating a fly by to solve the task of covering fields of different shapes and permits selecting the optimal subset of UAVs from the available set of devices; it also provides a 10% reduction in the cost of a flyby compared to an algorithm that does not use heterogeneous UAVs or a moving platform

    Mass Media as a Mirror of the COVID-19 Pandemic

    No full text
    The media plays an important role in disseminating facts and knowledge to the public at critical times, and the COVID-19 pandemic is a good example of such a period. This research is devoted to performing a comparative analysis of the representation of topics connected with the pandemic in the internet media of Kazakhstan and the Russian Federation. The main goal of the research is to propose a method that would make it possible to analyze the correlation between mass media dynamic indicators and the World Health Organization COVID-19 data. In order to solve the task, three approaches related to the representation of mass media dynamics in numerical form—automatically obtained topics, average sentiment, and dynamic indicators—were proposed and applied according to a manually selected list of search queries. The results of the analysis indicate similarities and differences in the ways in which the epidemiological situation is reflected in publications in Russia and in Kazakhstan. In particular, the publication activity in both countries correlates with the absolute indicators, such as the daily number of new infections, and the daily number of deaths. However, mass media tend to ignore the positive rate of confirmed cases and the virus reproduction rate. If we consider strictness of quarantine measures, mass media in Russia show a rather high correlation, while in Kazakhstan, the correlation is much lower. Analysis of search queries revealed that in Kazakhstan the problem of fake news and disinformation is more acute during periods of deterioration of the epidemiological situation, when the level of crime and poverty increase. The novelty of this work is the proposal and implementation of a method that allows the performing of a comparative analysis of objective COVID-19 statistics and several mass media indicators. In addition, it is the first time that such a comparative analysis, between different countries, has been performed on a corpus in a language other than English

    Operational Mapping of Salinization Areas in Agricultural Fields Using Machine Learning Models Based on Low-Altitude Multispectral Images

    No full text
    Salinization of cultivated soil is an important negative factor that reduces crop yields. Obtaining accurate and timely data on the salinity of soil horizons allows for planning the agrotechnical measures to reduce this negative impact. The method of soil salinity mapping of the 0–30 cm layer on irrigated arable land with the help of multispectral data received from the UAV is described in this article. The research was carried out in the south of the Almaty region of Kazakhstan. In May 2022, 80 soil samples were taken from the ground survey, and overflight of two adjacent fields was performed. The flight was carried out using a UAV equipped with a multispectral camera. The data preprocessing method is proposed herein, and several machine learning algorithms are compared (XGBoost, LightGBM, random forest, support vector machines, ridge regression, elastic net, etc.). Machine learning methods provided regression reconstruction to predict the electrical conductivity of the 0–30 cm soil layer based on an optimized list of spectral indices. The XGB regressor model showed the best quality results: the coefficient of determination was 0.701, the mean-squared error was 0.508, and the mean absolute error was 0.514. A comparison with the results obtained based on Landsat 8 data using a similar model was performed. Soil salinity mapping using UAVs provides much better spatial detailing than satellite data and has the possibility of an arbitrary selection of the survey time, less dependence on the conditions of cloud cover, and a comparable degree of accuracy of estimates

    Review of Some Applications of Unmanned Aerial Vehicles Technology in the Resource-Rich Country

    No full text
    The use of unmanned aerial vehicles (UAVs) in various spheres of human activity is a promising direction for countries with very different types of economies. This statement refers to resource-rich economies as well. The peculiarities of such countries are associated with the dependence on resource prices since their economies present low diversification. Therefore, the employment of new technologies is one of the ways of increasing the sustainability of such economy development. In this context, the use of UAVs is a prospect direction, since they are relatively cheap, reliable, and their use does not require a high-tech background. The most common use of UAVs is associated with various types of monitoring tasks. In addition, UAVs can be used for organizing communication, search, cargo delivery, field processing, etc. Using additional elements of artificial intelligence (AI) together with UAVs helps to solve the problems in automatic or semi-automatic mode. Such UAV is named intelligent unmanned aerial vehicle technology (IUAVT), and its employment allows increasing the UAV-based technology efficiency. However, in order to adapt IUAVT in the sectors of economy, it is necessary to overcome a range of limitations. The research is devoted to the analysis of opportunities and obstacles to the adaptation of IUAVT in the economy. The possible economic effect is estimated for Kazakhstan as one of the resource-rich countries. The review consists of three main parts. The first part describes the IUAVT application areas and the tasks it can solve. The following areas of application are considered: precision agriculture, the hazardous geophysical processes monitoring, environmental pollution monitoring, exploration of minerals, wild animals monitoring, technical and engineering structures monitoring, and traffic monitoring. The economic potential is estimated by the areas of application of IUAVT in Kazakhstan. The second part contains the review of the technical, legal, and software-algorithmic limitations of IUAVT and modern approaches aimed at overcoming these limitations. The third part—discussion—comprises the consideration of the impact of these limitations and unsolved tasks of the IUAVT employment in the areas of activity under consideration, and assessment of the overall economic effect

    Analysis of the Correlation between Mass-Media Publication Activity and COVID-19 Epidemiological Situation in Early 2022

    No full text
    The paper presents the results of a correlation analysis between the information trends in the electronic media of Kazakhstan and indicators of the epidemiological situation of COVID-19 according to the World Health Organization (WHO). The developed method is based on topic modeling and some other methods of processing natural language texts. The method allows for calculating the correlations between media topics, moods, the results of full-text search queries, and objective WHO data. The analysis of the results shows how the attitudes of society towards the problems of COVID-19 changed from 2021–2022. Firstly, the results reflect a steady trend of decreasing interest of electronic media in the topic of the pandemic, although to an unequal extent for different thematic groups. Secondly, there has been a tendency to shift the focus of attention to more pragmatic issues, such as remote learning problems, remote work, the impact of quarantine restrictions on the economy, etc

    Determination of Reservoir Oxidation Zone Formation in Uranium Wells Using Ensemble Machine Learning Methods

    No full text
    Approximately 50% of the world’s uranium is mined in a closed way using underground well leaching. In the process of uranium mining at formation-infiltration deposits, an important role is played by the correct identification of the formation of reservoir oxidation zones (ROZs), within which the uranium content is extremely low and which affect the determination of ore reserves and subsequent mining processes. The currently used methodology for identifying ROZs requires the use of highly skilled labor and resource-intensive studies using neutron fission logging; therefore, it is not always performed. At the same time, the available electrical logging measurements data collected in the process of geophysical well surveys and exploration well data can be effectively used to identify ROZs using machine learning models. This study presents a solution to the problem of detecting ROZs in uranium deposits using ensemble machine learning methods. This method provides an index of weighted harmonic measure (f1_weighted) in the range from 0.72 to 0.93 (XGB classifier), and sufficient stability at different ratios of objects in the input dataset. The obtained results demonstrate the potential for practical use of this method for detecting ROZs in formation-infiltration uranium deposits using ensemble machine learning
    corecore