13 research outputs found
Система виявлення мережевих аномалій за допомогою гібриду неконтрольованої та керованої нейронної мережі
In this article realization method of attacks and anomalies detection with the use of training of ordinary and attacking packages, respectively. The method that was used to teach an attack on is a combination of an uncontrollable and controlled neural network. In an uncontrolled network, attacks are classified in smaller categories, taking into account their features and using the self- organized map. To manage clusters, a neural network based on back-propagation method used. We use PyBrain as the main framework for designing, developing and learning perceptron data. This framework has a sufficient number of solutions and algorithms for training, designing and testing various types of neural networks. Software architecture is presented using a procedural-object approach. Because there is no need to save intermediate result of the program (after learning entire perceptron is stored in the file), all the progress of learning is stored in the normal files on hard disk.У цій статті спосіб реалізації атак та виявлення аномалій із використанням тренінгу звичайних та атакувальних пакетів відповідно. Метод, який використовувався для навчання нападу, - це поєднання неконтрольованої та керованої нейронної мережі. У неконтрольованій мережі напади класифікуються на менші категорії з урахуванням їх особливостей та використання самоорганізованої карти. Для управління кластерами використовується нейронна мережа, заснована на методі зворотного поширення. Ми використовуємо PyBrain як основну основу для проектування, розробки та вивчення даних перцептрону. Цей фреймворк має достатню кількість рішень та алгоритмів для навчання, проектування та тестування різних типів нейронних мереж. Архітектура програмного забезпечення представлена з використанням процедурно-об’єктного підходу. Оскільки немає необхідності зберігати проміжний результат програми (після навчання весь персептрон зберігається у файлі), весь хід навчання зберігається у звичайних файлах на жорсткому диску
The fall of genres that did not happen: formalising history of the “universal” semantics of Russian iambic tetrameter
This paper examines thematic features of the “universal” poetic meters that are disproportionately popular in a tradition. Focusing on the example of 19th-century Russian iambic tetrameter, we propose a method for diachronic analysis of semantic structure of a meter based on the combination of topic modelling and network analysis. We represent each poem as topic probabilities and use most probable topics appearing within one poem to build up the connections of a network. This representation allows us to detect a chronological process of semantic expansion of the meter: its usage spreads to various thematic domains contributing to its perception as “universal”. At the same time, we show that the expansion of iambic tetrameter does not lead to the collapse of semantic traditions of other meters and their associations with certain genres. Testing the amount of shared connections between meters against randomized data demonstrates that the increase in number of topics within a meter is mostly driven by the sample size, rather than by the direct borrowing from other meters. The stability of thematic connections inside each meter displays the conservative nature of poetic meters and surprisingly strong retention of association between meter and semantics
Recent Trends in the Use of Statistical Tests for Comparing Swarm and Evolutionary Computing Algorithms: Practical Guidelines and a Critical Review
A key aspect of the design of evolutionary and swarm intelligence algorithms is studying their performance. Statistical comparisons are also a crucial part which allows for reliable conclusions to be drawn. In the present paper we gather and examine the approaches taken from different perspectives to summarise the assumptions made by these statistical tests, the conclusions reached and the steps followed to perform them correctly. In this paper, we conduct a survey on the current trends of the proposals of statistical analyses for the comparison of algorithms of computational intelligence and include a description of the statistical background of these tests. We illustrate the use of the most common tests in the context of the Competition on single-objective real parameter optimisation of the IEEE Congress on Evolutionary Computation (CEC) 2017 and describe the main advantages and drawbacks of the use of each kind of test and put forward some recommendations concerning their use.Spanish Ministry of Economy, Industry and CompetitivenessSpanish Ministry of Scienc
Modelling semantic relations with distributitional semantics and deep learning : question answering, entailment recognition and paraphrase detection
Nesta dissertação apresenta-se uma abordagem à tarefa de modelar relações semânticas
entre dois textos com base em modelos de semântica distribucional e em aprendizagem
profunda. O presente trabalho tira partido de várias disciplinas da ciência
cognitiva, com especial relevo para a computação, a linguística e a inteligência artificial,
e com fortes influência da neurociência e da psicologia cognitiva.
Os modelos de semântica distribucional (também conhecidos como ”word embeddings”)
são usados para representar o significado das palavras. As representações
semânticas das palavras podem ainda ser combinadas para obter o significado de
um excerto de um texto recorrendo ao uso da aprendizagem profunda, isto é, com o
apoio das redes neurais de convolução.
Esta abordagen é utilizada para replicar a experiência realizada por Bogdanova
et al. (2015) na tarefa de deteção de perguntas que podem ser respondidas as mesmas
respostas tal como estas foram respondidas em fóruns on-line. Os resultados do
desempenho obtidos pelas experiências apresentadas nesta dissertação são equivalentes
ou melhores que os resultados obtidos no trabalho de referência mencionado
acima.
Apresentao também um estudo sobre o impacto do pré-processamento apropriado
do texto, tendo em conta os resultados que podem ser obtidos pelas abordagens
adotadas no trabalho de referência supramencionado. Este estudo é levado a cabo
removendo-se certas pistas que podem levar o sistema, indevidamente, a detetar
perguntas equivalentes. Essa remoção das pistas leva a uma diminuição significativa
no desempenho do sistema desenvolvido no trabalho de referência.
Nesta dissertação é ainda apresentado um estudo sobre o impacto que os word
embeddings treinados previamente têm na tarefa de detetar perguntas semanticamente
equivalentes. Substituindo-se, aleatoriamente, word embeddings previamente
treinados por outros melhora-se o desempenho do sistema.
Além disso, o modelo foi utilizado na tarefa de reconhecimento de implicações
para Português, onde mostrou uma taxa de acerto similar à da baseline. Este trabalho também reporta os resultados da aplicação da abordagem adotada
numa competição para a deteção de paráfrases em Russo. A configuração final apresenta
duas melhorias: usa character embeddings em vez de word embeddings e usa
vários filtros de convolução. Esta configuração foi testado na execução padrão da
Tarefa 2 da competição relevante, e mostrou resultados competitivos.This dissertation presents an approach to the task of modelling semantic relations between
two texts, which is based on distributional semantic models and deep learning.
The present work takes advantage of various disciplines of cognitive science, mainly
computation, linguistics and artificial intelligence, with strong influences from neuroscience
and cognitive psychology.
Distributional semantic models (also known as word embeddings) are used to
represent the meaning of words. Word semantic representations can be further combined
towards obtaining the meaning of a larger chunk of a text using a deep learning
approach, namely with the support of convolutional neural networks.
These approaches are used to replicate the experiment carried out, by Bogdanova
et al. (2015), for the task of detecting questions that can be answered by exactly the
same answer in online user forums. Performance results obtained by my experiments
are comparable or better than the ones reported in that referenced work.
I present also a study on the impact of appropriate text preprocessing with respect
to the results that can be obtained by the approaches adopted in that referenced
work. Removing certain clues that can unduly help the system to detect equivalent
questions leads to a significant decrease in system’s performance supported by that
referenced work.
I also present a study of the impact that pre-trained word embeddings have in the
task of detecting the semantically equivalent questions. Replacing pre-trained word
embeddings by randomly initialised ones improves the performance of the system.
Additionally, the model was applied to the task of entailment recognition for Portuguese
and showed an accuracy on a level with the baseline.
This dissertation also reports on the results of an experimental study on the application
of the adopted approach to the shared task of sentence paraphrase detection
in Russian. The final set up contained two improvements: it uses several convolutional
filters and it uses character embeddings instead of word embeddings. It was tested in Task 2 standard run of the relevant shared task and it showed competitive
results
Optimisation of a weightless neural network using particle swarms
Among numerous pattern recognition methods the neural network approach has been the subject of much research due to its ability to learn from a given collection of representative examples. This thesis is concerned with the design of weightless neural networks, which decompose a given pattern into several sets of n points, termed n-tuples. Considerable research has shown that by optimising the input connection mapping of such n-tuple networks classification performance can be improved significantly. In this thesis the application of a population-based stochastic optimisation technique, known as Particle Swarm Optimisation (PSO), to the optimisation of the connectivity pattern of such “n-tuple” classifiers is explored.
The research was aimed at improving the discriminating power of the classifier in recognising handwritten characters by exploiting more efficient learning strategies. The proposed "learning" scheme searches for ‘good’ input connections of the n-tuples in the solution space and shrinks the search area step by step. It refines its search by attracting the particles to positions with good solutions in an iterative manner. Every iteration the performance or fitness of each input connection is evaluated, so a reward and punishment based fitness function was modelled for the task. The original PSO was refined by combining it with other bio-inspired approaches like Self-Organized Criticality and Nearest Neighbour Interactions. The hybrid algorithms were adapted for the n-tuple system and the performance was measured in selecting better connectivity patterns. The Genetic Algorithm (GA) has been shown to be accomplishing the same goals as the PSO, so the performances and convergence properties of the GA were compared against the PSO to optimise input connections.
Experiments were conducted to evaluate the proposed methods by applying the trained classifiers to recognise handprinted digits from a widely used database. Results revealed the superiority of the particle swarm optimised training for the n-tuples over other algorithms including the GA. Low particle velocity in PSO was favourable for exploring more areas in the solution space and resulted in better recognition rates. Use of hybridisation was helpful and one of the versions of the hybrid PSO was found to be the best performing algorithm in finding the optimum set of input maps for the n-tuple network
Machine Learning
Machine Learning can be defined in various ways related to a scientific domain concerned with the design and development of theoretical and implementation tools that allow building systems with some Human Like intelligent behavior. Machine learning addresses more specifically the ability to improve automatically through experience