505 research outputs found

    Role of biases in neural network models

    Get PDF

    Empiricism without Magic: Transformational Abstraction in Deep Convolutional Neural Networks

    Get PDF
    In artificial intelligence, recent research has demonstrated the remarkable potential of Deep Convolutional Neural Networks (DCNNs), which seem to exceed state-of-the-art performance in new domains weekly, especially on the sorts of very difficult perceptual discrimination tasks that skeptics thought would remain beyond the reach of artificial intelligence. However, it has proven difficult to explain why DCNNs perform so well. In philosophy of mind, empiricists have long suggested that complex cognition is based on information derived from sensory experience, often appealing to a faculty of abstraction. Rationalists have frequently complained, however, that empiricists never adequately explained how this faculty of abstraction actually works. In this paper, I tie these two questions together, to the mutual benefit of both disciplines. I argue that the architectural features that distinguish DCNNs from earlier neural networks allow them to implement a form of hierarchical processing that I call “transformational abstraction”. Transformational abstraction iteratively converts sensory-based representations of category exemplars into new formats that are increasingly tolerant to “nuisance variation” in input. Reflecting upon the way that DCNNs leverage a combination of linear and non-linear processing to efficiently accomplish this feat allows us to understand how the brain is capable of bi-directional travel between exemplars and abstractions, addressing longstanding problems in empiricist philosophy of mind. I end by considering the prospects for future research on DCNNs, arguing that rather than simply implementing 80s connectionism with more brute-force computation, transformational abstraction counts as a qualitatively distinct form of processing ripe with philosophical and psychological significance, because it is significantly better suited to depict the generic mechanism responsible for this important kind of psychological processing in the brain

    Machine Learning Enhanced Free-Space and Underwater OAM Optical Communications

    Get PDF
    Communications, bandwidth, security, and hardware simplicity are principles of interest to society at large. Recent advances in optics and in understanding properties of light, such as orbital angular momentum (OAM), have provided new potential mediums for communication. Machine learning has wound its way into a broad range of fascinating areas. An emerging field of research is the use of a unique property of lasers called orbital angular momentum (OAM). With the proper hardware, a laser can go from a Gaussian shaped distribution to a doughnut shaped pattern, where the radius can be changed. Multiple OAM patterns, or modes, can be combined to create unique patterns. This research explores the use of machine learning to de-multiplex OAM patterns. The OAM patterns can be used to encode bits for communication. This work explores ways to improve pattern recognition or classification in both underwater and free-space environments. Specifically, various approaches are applied to train convolutional neural networks to make them more robust to signal degradation through turbulence and attenuation. A new image transform is used to improve OAM pattern classification. Finally, some of the state of the art deep convolutional neural networks are explored to see which provide the most robust performance in free-space and underwater communications. A variety of methods are shown to improve the state of the art in pattern classification in OAM communications

    Learning and generalization in feed-forward neural networks

    Get PDF

    Sensorimotor neural systems for a predatory stealth behaviour camouflaging motion

    Get PDF
    A thesis submitted to the University of London in partial fulfillment of the requirements for the admission to the degree of Doctor of Philosophy

    Convolutional neural networks for malware classification

    Get PDF
    According to AV vendors malicious software has been growing exponentially last years. One of the main reasons for these high volumes is that in order to evade detection, malware authors started using polymorphic and metamorphic techniques. As a result, traditional signature-based approaches to detect malware are being insufficient against new malware and the categorization of malware samples had become essential to know the basis of the behavior of malware and to fight back cybercriminals. During the last decade, solutions that fight against malicious software had begun using machine learning approaches. Unfortunately, there are few opensource datasets available for the academic community. One of the biggest datasets available was released last year in a competition hosted on Kaggle with data provided by Microsoft for the Big Data Innovators Gathering (BIG 2015). This thesis presents two novel and scalable approaches using Convolutional Neural Networks (CNNs) to assign malware to its corresponding family. On one hand, the first approach makes use of CNNs to learn a feature hierarchy to discriminate among samples of malware represented as gray-scale images. On the other hand, the second approach uses the CNN architecture introduced by Yoon Kim [12] to classify malware samples according their x86 instructions. The proposed methods achieved an improvement of 93.86% and 98,56% with respect to the equal probability benchmark

    NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds

    Full text link
    In order for artificial agents to successfully perform tasks in changing environments, they must be able to both detect and adapt to novelty. However, visual novelty detection research often only evaluates on repurposed datasets such as CIFAR-10 originally intended for object classification, where images focus on one distinct, well-centered object. New benchmarks are needed to represent the challenges of navigating the complex scenes of an open world. Our new NovelCraft dataset contains multimodal episodic data of the images and symbolic world-states seen by an agent completing a pogo stick assembly task within a modified Minecraft environment. In some episodes, we insert novel objects of varying size within the complex 3D scene that may impact gameplay. Our visual novelty detection benchmark finds that methods that rank best on popular area-under-the-curve metrics may be outperformed by simpler alternatives when controlling false positives matters most. Further multimodal novelty detection experiments suggest that methods that fuse both visual and symbolic information can improve time until detection as well as overall discrimination. Finally, our evaluation of recent generalized category discovery methods suggests that adapting to new imbalanced categories in complex scenes remains an exciting open problem.Comment: Published in Transactions on Machine Learning Research (03/2023
    corecore