    With the advance of intelligent systems, speech recognition technologies are being widely integrated into various aspects of human life. Speech recognition is applied in smart assistants, smart home infrastructure, bank call center applications, information system components for people with impairments, etc. However, these capabilities are available only for widely spoken languages such as English, Chinese, or Russian. For low-resource languages, they are still not implemented. Most modern speech recognition approaches have not yet been tested on agglutinative languages, especially languages of the Turkic group such as Kazakh, Tatar, and Turkish. The HMM-GMM (Hidden Markov Model - Gaussian Mixture Model) approach was for a long time the most popular in the field of Automatic Speech Recognition (ASR). Currently, neural networks are widely used in different fields of NLP, especially in automatic speech recognition. In a large number of works, applying neural networks at different stages of automatic speech recognition considerably improves the quality of these systems. Integral speech recognition systems based on neural networks are investigated in the article. The paper shows that the Connectionist Temporal Classification (CTC) model works accurately for agglutinative languages. The author conducted an experiment with an LSTM neural network using an attention-based encoder-decoder model. The experiment yielded a Character Error Rate (CER) of 8.01% and a Word Error Rate (WER) of 17.91%. This result demonstrates that a good ASR model can be obtained without the use of a Language Model (LM).
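
    The two reported metrics can be illustrated with a minimal sketch. It assumes the usual definitions (edit distance between reference and hypothesis divided by the reference length, over characters for CER and over words for WER); the function names and the example strings are hypothetical and are not taken from the paper.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            curr.append(min(
                prev[j] + 1,             # deletion
                curr[j - 1] + 1,         # insertion
                prev[j - 1] + (r != h),  # substitution (0 if symbols match)
            ))
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character Error Rate: character edits / reference length."""
    return edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word Error Rate: word edits / number of reference words."""
    ref_words = reference.split()
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


if __name__ == "__main__":
    ref, hyp = "the cat sat", "the cat sit"  # hypothetical transcripts
    print(f"CER = {cer(ref, hyp):.3f}, WER = {wer(ref, hyp):.3f}")
```

    Because a single wrong character makes the whole word wrong, WER is normally higher than CER, which is consistent with the 8.01% and 17.91% figures above.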

    Similar Text Fragments Extraction for Identifying Common Wikipedia Communities

    Extraction of similar text fragments from weakly formalized data is a task of natural language processing and intelligent data analysis, and is used to solve the problem of automatically identifying connected knowledge fields. To search for such common communities in Wikipedia, we propose using a logical-algebraic model for extracting similar collocations as an additional stage. With the Stanford Part-Of-Speech tagger and the Stanford Universal Dependencies parser, we identify the grammatical characteristics of collocation words. With WordNet synsets, we choose their synonyms. Our dataset includes Wikipedia articles from different portals and projects. The experimental results show the frequencies of synonymous text fragments in Wikipedia articles that form common information spaces. The number of highly frequent synonymous collocations can serve as an indicator of key common up-to-date Wikipedia communities.


    This paper focuses on the study of some aspects of the theory of oriented graphs in Bayesian networks. In some papers on the theory of Bayesian networks, the concept of “generation of vertices” denotes a certain set of vertices with many parents belonging to previous generations. The terminology for this concept, in our opinion, has not yet been fully developed. The concept of “generation” in some cases makes it easier to solve some problems in Bayesian networks and to build simpler algorithms. In this paper we consider the well-known example “Asia”, described in many articles and books, as well as in the technical documentation for various toolboxes. To construct this example, we used evaluation versions of AgenaRisk.
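
    A minimal sketch of one possible reading of "generations" follows. It assumes that generation 0 holds the vertices without parents and that each later generation holds the vertices whose parents all lie in earlier generations; this definition, the function name, and the vertex labels are assumptions made for illustration, not the paper's terminology. The edge list is the standard structure of the "Asia" network.

```python
from collections import defaultdict

# Standard "Asia" network structure as (parent, child) pairs.
ASIA_EDGES = [
    ("asia", "tuberculosis"),
    ("smoking", "lung_cancer"),
    ("smoking", "bronchitis"),
    ("tuberculosis", "either"),
    ("lung_cancer", "either"),
    ("either", "xray"),
    ("either", "dyspnea"),
    ("bronchitis", "dyspnea"),
]


def generations(edges):
    """Split the vertices of a directed acyclic graph into generations."""
    parents = defaultdict(set)
    vertices = set()
    for parent, child in edges:
        parents[child].add(parent)
        vertices.update((parent, child))

    placed = set()   # vertices already assigned to a generation
    layers = []
    remaining = set(vertices)
    while remaining:
        # A vertex is ready once all of its parents are already placed.
        layer = sorted(v for v in remaining if parents[v] <= placed)
        if not layer:
            raise ValueError("graph contains a directed cycle")
        layers.append(layer)
        placed.update(layer)
        remaining.difference_update(layer)
    return layers


if __name__ == "__main__":
    for number, layer in enumerate(generations(ASIA_EDGES)):
        print(number, layer)
```

    Under this assumed definition the example has four generations, and processing vertices generation by generation guarantees that every parent is handled before its children.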

    Clusterization by the K-means method when K is unknown

    Various methods of object clusterization are used in different areas of machine learning. Among the vast number of clusterization methods, the K-means method is one of the most popular. The method has both pros and cons. Its main advantage is the rather high speed of clusterization. Its main disadvantage is the need to know the number of clusters before the experiment. This paper describes a new clusterization method based on K-means. The suggested method is also quite fast; however, it does not require the user to know the exact number of clusters in advance. The user only has to define the range within which the number of clusters lies. In addition, the suggested method makes it possible to limit the radius of clusters, which allows finding the objects that express the criteria of a cluster in the most distinctive and accurate way, and to limit the number of objects in each cluster to a certain range.
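
    The idea of giving only a range for K together with a radius limit can be sketched as follows. This is not the paper's algorithm, which the abstract does not specify: the sketch simply runs plain K-means for each K in the range and keeps the smallest K whose clusters all fit within the radius limit. The selection rule, the farthest-point initialization, and all names are assumptions.

```python
import math


def assign(points, centers):
    """Put every point into the cluster of its nearest center."""
    clusters = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
        clusters[nearest].append(p)
    return clusters


def kmeans(points, k, max_iter=100):
    """Plain K-means with a deterministic farthest-point initialization."""
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    for _ in range(max_iter):
        clusters = assign(points, centers)
        new_centers = [
            tuple(sum(axis) / len(cluster) for axis in zip(*cluster)) if cluster else center
            for cluster, center in zip(clusters, centers)
        ]
        if new_centers == centers:  # assignments no longer change
            break
        centers = new_centers
    return centers, assign(points, centers)


def cluster_with_unknown_k(points, k_min, k_max, max_radius):
    """Return (k, centers, clusters) for the smallest K meeting the radius limit."""
    for k in range(k_min, k_max + 1):
        centers, clusters = kmeans(points, k)
        radius = max(math.dist(p, c) for c, cl in zip(centers, clusters) for p in cl)
        if radius <= max_radius:
            return k, centers, clusters
    return None  # no K in the range satisfies the limit


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10), (20, 0), (20, 1), (21, 0)]
    k, centers, clusters = cluster_with_unknown_k(data, 2, 5, max_radius=2.0)
    print(k, [len(c) for c in clusters])
```

    A limit on the number of objects per cluster, which the abstract also mentions, could be added as a second condition next to the radius check.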

    Cycles in Bayesian Networks

    The article is devoted to some critical problems of using Bayesian networks to solve practical problems in which the graph models contain directed cycles. The strict requirement that the directed graph representing a Bayesian network be acyclic does not allow most problems that contain directed cycles to be solved efficiently. The modern theory of Bayesian networks prohibits the use of directed cycles. The acyclicity requirement significantly simplifies the general theory of Bayesian networks, as well as the development of algorithms and their implementation in program code for calculations in Bayesian networks.
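
    The acyclicity requirement can be checked mechanically. The sketch below uses a standard depth-first search to find one directed cycle in a graph given as an adjacency dictionary; the function name and the example graphs are hypothetical, and the sketch only detects cycles, it does not show how the article proposes to handle them.

```python
def find_directed_cycle(graph):
    """Return one directed cycle as a list of vertices, or None if the graph is acyclic.

    `graph` maps each vertex to an iterable of its children.
    """
    state = {}  # vertex -> "active" (on the current DFS path) or "done"
    path = []   # vertices on the current DFS path, in order

    def visit(vertex):
        state[vertex] = "active"
        path.append(vertex)
        for child in graph.get(vertex, ()):
            if state.get(child) == "active":
                # Back edge: the cycle runs from `child` along the path and back.
                return path[path.index(child):] + [child]
            if child not in state:
                cycle = visit(child)
                if cycle:
                    return cycle
        path.pop()
        state[vertex] = "done"
        return None

    for vertex in list(graph):
        if vertex not in state:
            cycle = visit(vertex)
            if cycle:
                return cycle
    return None


if __name__ == "__main__":
    print(find_directed_cycle({"A": ["B"], "B": ["C"], "C": ["A"]}))
    print(find_directed_cycle({"A": ["B", "C"], "B": ["D"], "C": ["D"]}))
```

    A graph for which the function returns None satisfies the acyclicity requirement described above.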

    Voice Identification Using Classification Algorithms

    This article discusses classification algorithms for the problem of identifying a person by voice using machine learning methods. The MFCC algorithm was used in the speech preprocessing stage. To solve the problem, a comparative analysis of five classification algorithms was carried out. In the first experiment, the support vector method (0.90) and the multilayer perceptron (0.83) showed the best results. In the second experiment, a multilayer perceptron using the Robust scaler method was proposed for personal identification and reached an accuracy of 0.93. Therefore, a multilayer perceptron can be used to solve this problem, taking into account the specifics of the speech signal.
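
    The scaling step can be illustrated with a short sketch. It assumes that "Robust scaler" refers to the common scheme of centering a feature on its median and dividing by its interquartile range, as scikit-learn's RobustScaler does by default; the function name and the sample values, which stand in for one MFCC coefficient across recordings, are hypothetical.

```python
import statistics


def robust_scale(values):
    """Scale one feature column as (x - median) / IQR.

    Quartiles use linear interpolation over the sorted data.
    """
    median = statistics.median(values)
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    if iqr == 0:
        iqr = 1.0  # constant feature: only center it
    return [(x - median) / iqr for x in values]


if __name__ == "__main__":
    # Hypothetical values of one MFCC coefficient, with one outlier.
    column = [1.0, 2.0, 3.0, 4.0, 100.0]
    print(robust_scale(column))
```

    Because the median and the interquartile range ignore extreme values, a single outlier does not compress the remaining values into a narrow band, which can help when the features are passed to a multilayer perceptron.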


    A method of targeting advertising on Internet sites based on a structured self-learning database is considered. The database accumulates data on previously accepted requests to display ads from a closed auction, data on participation in the auction, and the results of displaying ads: the presence of a click and a product installation. The database is structured into streams by the features site, place, and price. Each such structural stream has statistical properties that are much simpler than those of the general ad impression stream, which makes it possible to predict the effectiveness of advertising. Selecting only those bid requests that are promising in terms of the result reduces the cost of displaying advertising.
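
    The stream structure can be sketched as a grouping of past impressions by (site, place, price), with a simple click rate per stream used to filter new bid requests. The record fields, the thresholds, and the decision rule are assumptions made for illustration; the abstract does not specify how effectiveness is predicted.

```python
from collections import defaultdict


def build_streams(history):
    """Accumulate impression and click counts per (site, place, price) stream."""
    streams = defaultdict(lambda: {"shown": 0, "clicks": 0})
    for record in history:
        key = (record["site"], record["place"], record["price"])
        streams[key]["shown"] += 1
        streams[key]["clicks"] += int(record["clicked"])
    return streams


def is_promising(request, streams, min_shown=3, min_click_rate=0.1):
    """Accept a bid request only if its stream has enough data and clicks."""
    stats = streams.get((request["site"], request["place"], request["price"]))
    if stats is None or stats["shown"] < min_shown:
        return False  # unknown or barely observed stream
    return stats["clicks"] / stats["shown"] >= min_click_rate


if __name__ == "__main__":
    # Hypothetical impression history for two streams.
    history = (
        [{"site": "news", "place": "top", "price": 5, "clicked": c} for c in (1, 0, 1, 0)]
        + [{"site": "blog", "place": "side", "price": 2, "clicked": 0} for _ in range(4)]
    )
    streams = build_streams(history)
    print(is_promising({"site": "news", "place": "top", "price": 5}, streams))
    print(is_promising({"site": "blog", "place": "side", "price": 2}, streams))
```

    Install counts could be tracked in the same per-stream record alongside clicks.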


    The paper examines the features of segmentation of the upper respiratory tract to determine nasal air conduction. 2D and 3D illustrations of the segmentation process and the obtained results are given. When forming an analytical model of the aerodynamics of the nasal cavity, the main indicator that characterizes the configuration of the nasal canal is the equivalent diameter, which is determined at each cross-section of the nasal cavity. It is calculated from the area and perimeter of the corresponding section of the nasal canal. When segmenting the nasal cavity, it is first necessary to eliminate air structures that do not affect the aerodynamics of the upper respiratory tract. These are, first of all, intact spaces of the paranasal sinuses, in which diffuse air exchange prevails. In the automatic mode, this is possible by eliminating unconnected isolated areas and, in the next step, finding the difference coefficients of the areas connected by confluences with the nasal canal. High difference coefficients between adjacent cross-sections indicate the presence of separated areas and contribute to their elimination. The complex configuration and high individual variability of the structures of the nasal cavity do not allow segmentation to be fully automated, but this approach makes interactive correction unnecessary in 80% of tomographic datasets. The proposed method, which takes into account the intensity of the image elements close to the contour, reduces the averaging error from tomographic reconstruction by up to a factor of 2 due to artificial sub-resolution. Future work is the development of methods for fully automatic segmentation of the structures of the nasal cavity, taking into account the individual anatomical variability of the upper respiratory tract.
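
    The equivalent diameter of a single cross-section can be sketched as follows. The abstract only says that it is calculated from the area and perimeter of the section; the formula D = 4A / P used here is the common hydraulic-diameter definition and is an assumption, as are the function names and the polygonal representation of the segmented contour.

```python
import math


def polygon_area_perimeter(contour):
    """Area (shoelace formula) and perimeter of a closed polygonal contour.

    `contour` is a list of (x, y) vertices in order, without repeating the first.
    """
    twice_area = 0.0
    perimeter = 0.0
    for (x1, y1), (x2, y2) in zip(contour, contour[1:] + contour[:1]):
        twice_area += x1 * y2 - x2 * y1
        perimeter += math.hypot(x2 - x1, y2 - y1)
    return abs(twice_area) / 2.0, perimeter


def equivalent_diameter(area, perimeter):
    """Assumed hydraulic-diameter definition: D = 4 * A / P."""
    return 4.0 * area / perimeter


if __name__ == "__main__":
    # Hypothetical cross-section: a 2 x 2 square contour.
    area, perimeter = polygon_area_perimeter([(0, 0), (2, 0), (2, 2), (0, 2)])
    print(area, perimeter, equivalent_diameter(area, perimeter))
```

    For a circular section this definition returns the ordinary diameter, while narrow, slit-like sections with a long perimeter give a much smaller value, which is why it is a convenient indicator of the configuration of the nasal canal.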