5,739 research outputs found

    Graduate Catalog of Studies, 2023-2024

    Get PDF

    Modular lifelong machine learning

    Get PDF
    Deep learning has drastically improved the state-of-the-art in many important fields, including computer vision and natural language processing (LeCun et al., 2015). However, it is expensive to train a deep neural network on a machine learning problem. The overall training cost further increases when one wants to solve additional problems. Lifelong machine learning (LML) develops algorithms that aim to efficiently learn to solve a sequence of problems, which become available one at a time. New problems are solved with less resources by transferring previously learned knowledge. At the same time, an LML algorithm needs to retain good performance on all encountered problems, thus avoiding catastrophic forgetting. Current approaches do not possess all the desired properties of an LML algorithm. First, they primarily focus on preventing catastrophic forgetting (Diaz-Rodriguez et al., 2018; Delange et al., 2021). As a result, they neglect some knowledge transfer properties. Furthermore, they assume that all problems in a sequence share the same input space. Finally, scaling these methods to a large sequence of problems remains a challenge. Modular approaches to deep learning decompose a deep neural network into sub-networks, referred to as modules. Each module can then be trained to perform an atomic transformation, specialised in processing a distinct subset of inputs. This modular approach to storing knowledge makes it easy to only reuse the subset of modules which are useful for the task at hand. This thesis introduces a line of research which demonstrates the merits of a modular approach to lifelong machine learning, and its ability to address the aforementioned shortcomings of other methods. Compared to previous work, we show that a modular approach can be used to achieve more LML properties than previously demonstrated. Furthermore, we develop tools which allow modular LML algorithms to scale in order to retain said properties on longer sequences of problems. First, we introduce HOUDINI, a neurosymbolic framework for modular LML. HOUDINI represents modular deep neural networks as functional programs and accumulates a library of pre-trained modules over a sequence of problems. Given a new problem, we use program synthesis to select a suitable neural architecture, as well as a high-performing combination of pre-trained and new modules. We show that our approach has most of the properties desired from an LML algorithm. Notably, it can perform forward transfer, avoid negative transfer and prevent catastrophic forgetting, even across problems with disparate input domains and problems which require different neural architectures. Second, we produce a modular LML algorithm which retains the properties of HOUDINI but can also scale to longer sequences of problems. To this end, we fix the choice of a neural architecture and introduce a probabilistic search framework, PICLE, for searching through different module combinations. To apply PICLE, we introduce two probabilistic models over neural modules which allows us to efficiently identify promising module combinations. Third, we phrase the search over module combinations in modular LML as black-box optimisation, which allows one to make use of methods from the setting of hyperparameter optimisation (HPO). We then develop a new HPO method which marries a multi-fidelity approach with model-based optimisation. We demonstrate that this leads to improvement in anytime performance in the HPO setting and discuss how this can in turn be used to augment modular LML methods. Overall, this thesis identifies a number of important LML properties, which have not all been attained in past methods, and presents an LML algorithm which can achieve all of them, apart from backward transfer

    Beam scanning by liquid-crystal biasing in a modified SIW structure

    Get PDF
    A fixed-frequency beam-scanning 1D antenna based on Liquid Crystals (LCs) is designed for application in 2D scanning with lateral alignment. The 2D array environment imposes full decoupling of adjacent 1D antennas, which often conflicts with the LC requirement of DC biasing: the proposed design accommodates both. The LC medium is placed inside a Substrate Integrated Waveguide (SIW) modified to work as a Groove Gap Waveguide, with radiating slots etched on the upper broad wall, that radiates as a Leaky-Wave Antenna (LWA). This allows effective application of the DC bias voltage needed for tuning the LCs. At the same time, the RF field remains laterally confined, enabling the possibility to lay several antennas in parallel and achieve 2D beam scanning. The design is validated by simulation employing the actual properties of a commercial LC medium

    Optical Remote Sensing of Oil Spills by using Machine Learning Methods in the Persian Gulf: A Multi-Class Approach

    Get PDF
    Marine oil spills are harmful for the environment and costly for society. Coastal areas are particularly vulnerable since they provide habitats for organisms, animals and marine ecosystems. This thesis studied machine learning methods to classify thick oil in a multi-class case, using remotely sensed multi-spectral data in the Persian Gulf. The study area covers a large area between United Arab Emirates (UAE) and Iran. The dataset is extracted from 10 Sentinel-2 tiles on six spectral bands between 492 nm to 2202 nm. These images were annotated for four classes, namely thick oil, thin oil, ocean water and turbid water by using the Bonn Agreement to analyse true color composite images. A variety of machine learning methods were trained and evaluated using this dataset. Then a robustness evaluation was done by using selected machine learning methods on an independent dataset. Initially multiple machine learning methods were included; three decision trees, six K-Nearest Neighbor (KNN) models, two Artificial Neural Network (ANN) models, two Naive bayes models, and two discriminant models. Two KNN models and two ANN models were then picked for further evaluation. The results show that the fine KNN approach with two nearest neighbors had the best performance based on the computed statistical measures. However, the robustness evaluation showed that the tri-layered NN performed better. This thesis has shown that supervised machine learning with a multi-class approach can be used for oil spill monitoring using multi-spectral remote sensing data in the Persian Gulf

    Resilience and food security in a food systems context

    Get PDF
    This open access book compiles a series of chapters written by internationally recognized experts known for their in-depth but critical views on questions of resilience and food security. The book assesses rigorously and critically the contribution of the concept of resilience in advancing our understanding and ability to design and implement development interventions in relation to food security and humanitarian crises. For this, the book departs from the narrow beaten tracks of agriculture and trade, which have influenced the mainstream debate on food security for nearly 60 years, and adopts instead a wider, more holistic perspective, framed around food systems. The foundation for this new approach is the recognition that in the current post-globalization era, the food and nutritional security of the world’s population no longer depends just on the performance of agriculture and policies on trade, but rather on the capacity of the entire (food) system to produce, process, transport and distribute safe, affordable and nutritious food for all, in ways that remain environmentally sustainable. In that context, adopting a food system perspective provides a more appropriate frame as it incites to broaden the conventional thinking and to acknowledge the systemic nature of the different processes and actors involved. This book is written for a large audience, from academics to policymakers, students to practitioners

    Predictive Demand Response Modeling for Logistic Systems Innovation and Optimization

    Get PDF
    In the ever-increasing dynamics of global business markets, logistic systems must optimize the usage of all possible sources to continually innovate. Scenario-based demand prediction plays an important role in the effective economic operations and planning of logistics. However, many uncertainties and demand variability, which are associated with innovative changes, complicate demand forecasting and expose system operators to the risk of failing to meet demand. This dissertation presents new approaches to predictively explore how customer preferences will change and consequently demand would respond to the new setup of services caused by an innovative transformation of the logistic layout. The critical challenge is that the responses from customers in particular and demand in general to the innovative changes and corresponding adjustments are uncertain and unknown in practice, and there is no historical data to learn from and directly support the predictive model. In this dissertation, we are dealing with three different predictive demand response modeling approaches, jointly shaping a new methodological pathway. Chapter 1 provides a novel approach for predictive modeling probabilistic customer behavior over new service offers which are much faster than ever done before, based on the case of a large Chinese parcel-delivery service provider. Chapter 2 introduces an approach for predicting scenario-based erection-site demand schedules under uncertainty of disruptive events in construction projects whose logistics transformed from traditional to modular style, based on the case of a USA-based innovative leader in modular building production. For such a leader to advance in its logistics design innovations and associated capacity adjustments, and also to enhance its capability for taking more market share, it is crucial to estimate potential future demand for modular construction and corresponding probable projects in terms of their potential location, size, and characteristics. For this purpose, Chapter 3 introduces a methodological approach for estimating scenario-based future demand for modular construction projects to be implemented over the US metropolitan statistical areas.Ph.D

    Soundscape in Urban Forests

    Get PDF
    This Special Issue of Forests explores the role of soundscapes in urban forested areas. It is comprised of 11 papers involving soundscape studies conducted in urban forests from Asia and Africa. This collection contains six research fields: (1) the ecological patterns and processes of forest soundscapes; (2) the boundary effects and perceptual topology; (3) natural soundscapes and human health; (4) the experience of multi-sensory interactions; (5) environmental behavior and cognitive disposition; and (6) soundscape resource management in forests

    Innovation in Energy Security and Long-Term Energy Efficiency â…¡

    Get PDF
    The sustainable development of our planet depends on the use of energy. The increasing world population inevitably causes an increase in the demand for energy, which, on the one hand, threatens us with the potential to encounter a shortage of energy supply, and, on the other hand, causes the deterioration of the environment. Therefore, our task is to reduce this demand through different innovative solutions (i.e., both technological and social). Social marketing and economic policies can also play their role by affecting the behavior of households and companies and by causing behavioral change oriented to energy stewardship, with an overall switch to renewable energy resources. This reprint provides a platform for the exchange of a wide range of ideas, which, ultimately, would facilitate driving societies toward long-term energy efficiency

    A Survey on Event-based News Narrative Extraction

    Full text link
    Narratives are fundamental to our understanding of the world, providing us with a natural structure for knowledge representation over time. Computational narrative extraction is a subfield of artificial intelligence that makes heavy use of information retrieval and natural language processing techniques. Despite the importance of computational narrative extraction, relatively little scholarly work exists on synthesizing previous research and strategizing future research in the area. In particular, this article focuses on extracting news narratives from an event-centric perspective. Extracting narratives from news data has multiple applications in understanding the evolving information landscape. This survey presents an extensive study of research in the area of event-based news narrative extraction. In particular, we screened over 900 articles that yielded 54 relevant articles. These articles are synthesized and organized by representation model, extraction criteria, and evaluation approaches. Based on the reviewed studies, we identify recent trends, open challenges, and potential research lines.Comment: 37 pages, 3 figures, to be published in the journal ACM CSU

    A review of technical factors to consider when designing neural networks for semantic segmentation of Earth Observation imagery

    Full text link
    Semantic segmentation (classification) of Earth Observation imagery is a crucial task in remote sensing. This paper presents a comprehensive review of technical factors to consider when designing neural networks for this purpose. The review focuses on Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Generative Adversarial Networks (GANs), and transformer models, discussing prominent design patterns for these ANN families and their implications for semantic segmentation. Common pre-processing techniques for ensuring optimal data preparation are also covered. These include methods for image normalization and chipping, as well as strategies for addressing data imbalance in training samples, and techniques for overcoming limited data, including augmentation techniques, transfer learning, and domain adaptation. By encompassing both the technical aspects of neural network design and the data-related considerations, this review provides researchers and practitioners with a comprehensive and up-to-date understanding of the factors involved in designing effective neural networks for semantic segmentation of Earth Observation imagery.Comment: 145 pages with 32 figure
    • …
    corecore