Search CORE

505 research outputs found

Role of biases in neural network models

Author: West Ansgar Heinrich Ludolf
Publication venue: The University of Edinburgh
Publication date: 01/01/1997
Field of study

Empiricism without Magic: Transformational Abstraction in Deep Convolutional Neural Networks

Author: Buckner Cameron
Publication venue
Publication date: 01/01/2018
Field of study

In artificial intelligence, recent research has demonstrated the remarkable potential of Deep Convolutional Neural Networks (DCNNs), which seem to exceed state-of-the-art performance in new domains weekly, especially on the sorts of very difficult perceptual discrimination tasks that skeptics thought would remain beyond the reach of artificial intelligence. However, it has proven difficult to explain why DCNNs perform so well. In philosophy of mind, empiricists have long suggested that complex cognition is based on information derived from sensory experience, often appealing to a faculty of abstraction. Rationalists have frequently complained, however, that empiricists never adequately explained how this faculty of abstraction actually works. In this paper, I tie these two questions together, to the mutual benefit of both disciplines. I argue that the architectural features that distinguish DCNNs from earlier neural networks allow them to implement a form of hierarchical processing that I call “transformational abstraction”. Transformational abstraction iteratively converts sensory-based representations of category exemplars into new formats that are increasingly tolerant to “nuisance variation” in input. Reflecting upon the way that DCNNs leverage a combination of linear and non-linear processing to efficiently accomplish this feat allows us to understand how the brain is capable of bi-directional travel between exemplars and abstractions, addressing longstanding problems in empiricist philosophy of mind. I end by considering the prospects for future research on DCNNs, arguing that rather than simply implementing 80s connectionism with more brute-force computation, transformational abstraction counts as a qualitatively distinct form of processing ripe with philosophical and psychological significance, because it is significantly better suited to depict the generic mechanism responsible for this important kind of psychological processing in the brain

PhilPapers

PhilSci Archive

Machine Learning Enhanced Free-Space and Underwater OAM Optical Communications

Author: Neary Patrick L.
Publication venue: DigitalCommons@USU
Publication date: 01/08/2020
Field of study

Communications, bandwidth, security, and hardware simplicity are principles of interest to society at large. Recent advances in optics and in understanding properties of light, such as orbital angular momentum (OAM), have provided new potential mediums for communication. Machine learning has wound its way into a broad range of fascinating areas. An emerging field of research is the use of a unique property of lasers called orbital angular momentum (OAM). With the proper hardware, a laser can go from a Gaussian shaped distribution to a doughnut shaped pattern, where the radius can be changed. Multiple OAM patterns, or modes, can be combined to create unique patterns. This research explores the use of machine learning to de-multiplex OAM patterns. The OAM patterns can be used to encode bits for communication. This work explores ways to improve pattern recognition or classification in both underwater and free-space environments. Specifically, various approaches are applied to train convolutional neural networks to make them more robust to signal degradation through turbulence and attenuation. A new image transform is used to improve OAM pattern classification. Finally, some of the state of the art deep convolutional neural networks are explored to see which provide the most robust performance in free-space and underwater communications. A variety of methods are shown to improve the state of the art in pattern classification in OAM communications

DigitalCommons@USU

Learning and generalization in feed-forward neural networks

Author: Smieja Frank Jan
Publication venue: The University of Edinburgh
Publication date: 01/01/1989
Field of study

Edinburgh Research Archive

Sensorimotor neural systems for a predatory stealth behaviour camouflaging motion

Author: Anderson Andrew
Publication venue
Publication date: 30/12/2013
Field of study

A thesis submitted to the University of London in partial fulfillment of the requirements for the admission to the degree of Doctor of Philosophy

Queen Mary Research Online

Convolutional neural networks for malware classification

Author: Gibert Llauradó Daniel
Publication venue: Universitat Politècnica de Catalunya
Publication date: 20/10/2016
Field of study

According to AV vendors malicious software has been growing exponentially last years. One of the main reasons for these high volumes is that in order to evade detection, malware authors started using polymorphic and metamorphic techniques. As a result, traditional signature-based approaches to detect malware are being insufficient against new malware and the categorization of malware samples had become essential to know the basis of the behavior of malware and to fight back cybercriminals. During the last decade, solutions that fight against malicious software had begun using machine learning approaches. Unfortunately, there are few opensource datasets available for the academic community. One of the biggest datasets available was released last year in a competition hosted on Kaggle with data provided by Microsoft for the Big Data Innovators Gathering (BIG 2015). This thesis presents two novel and scalable approaches using Convolutional Neural Networks (CNNs) to assign malware to its corresponding family. On one hand, the first approach makes use of CNNs to learn a feature hierarchy to discriminate among samples of malware represented as gray-scale images. On the other hand, the second approach uses the CNN architecture introduced by Yoon Kim [12] to classify malware samples according their x86 instructions. The proposed methods achieved an improvement of 93.86% and 98,56% with respect to the equal probability benchmark

UPCommons. Portal del coneixement obert de la UPC

NovelCraft: A Dataset for Novelty Detection and Discovery in Open Worlds

Author: Feeney Patrick
Hughes Michael C.
Liu Li-Ping
Lymperopoulos Panagiotis
Scheutz Matthias
Schneider Sarah
Publication venue
Publication date: 28/03/2023
Field of study

In order for artificial agents to successfully perform tasks in changing environments, they must be able to both detect and adapt to novelty. However, visual novelty detection research often only evaluates on repurposed datasets such as CIFAR-10 originally intended for object classification, where images focus on one distinct, well-centered object. New benchmarks are needed to represent the challenges of navigating the complex scenes of an open world. Our new NovelCraft dataset contains multimodal episodic data of the images and symbolic world-states seen by an agent completing a pogo stick assembly task within a modified Minecraft environment. In some episodes, we insert novel objects of varying size within the complex 3D scene that may impact gameplay. Our visual novelty detection benchmark finds that methods that rank best on popular area-under-the-curve metrics may be outperformed by simpler alternatives when controlling false positives matters most. Further multimodal novelty detection experiments suggest that methods that fuse both visual and symbolic information can improve time until detection as well as overall discrimination. Finally, our evaluation of recent generalized category discovery methods suggests that adapting to new imbalanced categories in complex scenes remains an exciting open problem.Comment: Published in Transactions on Machine Learning Research (03/2023

arXiv.org e-Print Archive

Recommended from our members

Weightless neural networks for face recognition

Author: Khaki Kazimali M
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2013
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The interface with the real-world has proved to be extremely challenging throughout the past 70 years in which computer technology has been developing. The problem initially is assumed to be somewhat trivial, as humans are exceptionally skilled at interpreting real-world data, for example pictures and sounds. Traditional analytical methods have so far not provided the complete answer to what will be termed pattern recognition. Biological inspiration has motivated pattern recognition researchers since the early days of the subject, and the idea of a neural network which has self-evolving properties has always been seen to be a potential solution to this endeavour. Unlike the development of computer technology in which successive generations of improved devices have been developed, the neural network approach has been less successful, with major setbacks occurring in its development. However, the fact that natural processing in animals and humans is a voltage-based process, devoid of software, and self-evolving, provides an on-going motivation for pattern recognition in artificial neural networks. This thesis addresses the application of weightless neural networks using a ranking pre-processor to implement general pattern recognition with specific reference to face processing. The evaluation of the system will be carried out on open source databases in order to obtain a direct comparison of the efficacy of the method, in particular considerable use will be made of the MIT-CBCL face database. The methodology is cost effective in both software and hardware forms, offers real-time video processing, and can be implemented on all computer platforms. The results of this research show significant improvements over published results, and provide a viable commercial methodology for general pattern recognition

Brunel University Research Archive