505 research outputs found
Empiricism without Magic: Transformational Abstraction in Deep Convolutional Neural Networks
In artificial intelligence, recent research has demonstrated the remarkable potential of Deep Convolutional Neural Networks (DCNNs), which seem to exceed state-of-the-art performance in new domains weekly, especially on the sorts of very difficult perceptual discrimination tasks that skeptics thought would remain beyond the reach of artificial intelligence. However, it has proven difficult to explain why DCNNs perform so well. In philosophy of mind, empiricists have long suggested that complex cognition is based on information derived from sensory experience, often appealing to a faculty of abstraction. Rationalists have frequently complained, however, that empiricists never adequately explained how this faculty of abstraction actually works. In this paper, I tie these two questions together, to the mutual benefit of both disciplines. I argue that the architectural features that distinguish DCNNs from earlier neural networks allow them to implement a form of hierarchical processing that I call “transformational abstraction”. Transformational abstraction iteratively converts sensory-based representations of category exemplars into new formats that are increasingly tolerant to “nuisance variation” in input. Reflecting upon the way that DCNNs leverage a combination of linear and non-linear processing to efficiently accomplish this feat allows us to understand how the brain is capable of bi-directional travel between exemplars and abstractions, addressing longstanding problems in empiricist philosophy of mind. I end by considering the prospects for future research on DCNNs, arguing that rather than simply implementing 80s connectionism with more brute-force computation, transformational abstraction counts as a qualitatively distinct form of processing ripe with philosophical and psychological significance, because it is significantly better suited to depict the generic mechanism responsible for this important kind of psychological processing in the brain
Machine Learning Enhanced Free-Space and Underwater OAM Optical Communications
Communications, bandwidth, security, and hardware simplicity are principles of interest to society at large. Recent advances in optics and in understanding properties of light, such as orbital angular momentum (OAM), have provided new potential mediums for communication.
Machine learning has wound its way into a broad range of fascinating areas. An emerging field of research is the use of a unique property of lasers called orbital angular momentum (OAM). With the proper hardware, a laser can go from a Gaussian shaped distribution to a doughnut shaped pattern, where the radius can be changed. Multiple OAM patterns, or modes, can be combined to create unique patterns. This research explores the use of machine learning to de-multiplex OAM patterns. The OAM patterns can be used to encode bits for communication.
This work explores ways to improve pattern recognition or classification in both underwater and free-space environments. Specifically, various approaches are applied to train convolutional neural networks to make them more robust to signal degradation through turbulence and attenuation. A new image transform is used to improve OAM pattern classification. Finally, some of the state of the art deep convolutional neural networks are explored to see which provide the most robust performance in free-space and underwater communications. A variety of methods are shown to improve the state of the art in pattern classification in OAM communications
Sensorimotor neural systems for a predatory stealth behaviour camouflaging motion
A thesis submitted to the University of London in partial fulfillment of the requirements for the admission to the degree of Doctor of Philosophy
Convolutional neural networks for malware classification
According to AV vendors malicious software has been growing exponentially
last years. One of the main reasons for these high volumes is that in order
to evade detection, malware authors started using polymorphic and metamorphic
techniques. As a result, traditional signature-based approaches to
detect malware are being insufficient against new malware and the categorization
of malware samples had become essential to know the basis of the
behavior of malware and to fight back cybercriminals.
During the last decade, solutions that fight against malicious software had
begun using machine learning approaches. Unfortunately, there are few opensource
datasets available for the academic community. One of the biggest
datasets available was released last year in a competition hosted on Kaggle
with data provided by Microsoft for the Big Data Innovators Gathering
(BIG 2015). This thesis presents two novel and scalable approaches using
Convolutional Neural Networks (CNNs) to assign malware to its corresponding
family. On one hand, the first approach makes use of CNNs to learn a
feature hierarchy to discriminate among samples of malware represented as
gray-scale images. On the other hand, the second approach uses the CNN
architecture introduced by Yoon Kim [12] to classify malware samples according
their x86 instructions. The proposed methods achieved an improvement
of 93.86% and 98,56% with respect to the equal probability benchmark
NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds
In order for artificial agents to successfully perform tasks in changing
environments, they must be able to both detect and adapt to novelty. However,
visual novelty detection research often only evaluates on repurposed datasets
such as CIFAR-10 originally intended for object classification, where images
focus on one distinct, well-centered object. New benchmarks are needed to
represent the challenges of navigating the complex scenes of an open world. Our
new NovelCraft dataset contains multimodal episodic data of the images and
symbolic world-states seen by an agent completing a pogo stick assembly task
within a modified Minecraft environment. In some episodes, we insert novel
objects of varying size within the complex 3D scene that may impact gameplay.
Our visual novelty detection benchmark finds that methods that rank best on
popular area-under-the-curve metrics may be outperformed by simpler
alternatives when controlling false positives matters most. Further multimodal
novelty detection experiments suggest that methods that fuse both visual and
symbolic information can improve time until detection as well as overall
discrimination. Finally, our evaluation of recent generalized category
discovery methods suggests that adapting to new imbalanced categories in
complex scenes remains an exciting open problem.Comment: Published in Transactions on Machine Learning Research (03/2023
Recommended from our members
Weightless neural networks for face recognition
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The interface with the real-world has proved to be extremely challenging throughout the past 70 years in which computer technology has been developing. The problem initially is assumed to be somewhat trivial, as humans are exceptionally skilled at interpreting real-world data, for example pictures and sounds. Traditional analytical methods have so far not provided the complete answer to what will be termed pattern recognition.
Biological inspiration has motivated pattern recognition researchers since the early days of the subject, and the idea of a neural network which has self-evolving properties has always been seen to be a potential solution to this endeavour. Unlike the development of computer technology in which successive generations of improved devices have been developed, the neural network approach has been less successful, with major setbacks occurring in its development. However, the fact that natural processing in animals and humans is a voltage-based process, devoid of software, and self-evolving, provides an on-going motivation for pattern recognition in artificial neural networks. This thesis addresses the application of weightless neural networks using a ranking pre-processor to implement general pattern recognition with specific reference to face processing. The evaluation of the system will be carried out on open source databases in order to obtain a direct comparison of the efficacy of the method, in particular considerable use will be made of the MIT-CBCL face database. The methodology is cost effective in both software and hardware forms, offers real-time video processing, and can be implemented on all computer platforms. The results of this research show significant improvements over published results, and provide a viable commercial methodology for general pattern recognition
- …