19 research outputs found

    Direct Learning-Based Deep Spiking Neural Networks: A Review

    Full text link
    The spiking neural network (SNN), as a promising brain-inspired computational model with binary spike information transmission mechanism, rich spatially-temporal dynamics, and event-driven characteristics, has received extensive attention. However, its intricately discontinuous spike mechanism brings difficulty to the optimization of the deep SNN. Since the surrogate gradient method can greatly mitigate the optimization difficulty and shows great potential in directly training deep SNNs, a variety of direct learning-based deep SNN works have been proposed and achieved satisfying progress in recent years. In this paper, we present a comprehensive survey of these direct learning-based deep SNN works, mainly categorized into accuracy improvement methods, efficiency improvement methods, and temporal dynamics utilization methods. In addition, we also divide these categorizations into finer granularities further to better organize and introduce them. Finally, the challenges and trends that may be faced in future research are prospected.Comment: Accepted by Frontiers in Neuroscienc

    AutoML @ NeurIPS 2018 challenge: Design and Results

    Get PDF
    Preprint submitted to NeurIPS2018 Volume of Springer Series on Challenges in Machine LearningInternational audienceWe organized a competition on Autonomous Lifelong Machine Learning with Drift that was part of the competition program of NeurIPS 2018. This data driven competition asked participants to develop computer programs capable of solving supervised learning problems where the i.i.d. assumption did not hold. Large data sets were arranged in a lifelong learning and evaluation scenario and CodaLab was used as the challenge platform. The challenge attracted more than 300 participants in its two month duration. This chapter describes the design of the challenge and summarizes its main results

    人の行動分類のための教師なし転移学習

    Get PDF
    筑波大学 (University of Tsukuba)201

    A GPR-GPS-GIS-integrated, information-rich and error-aware system for detecting, locating and characterizing underground utilities

    Get PDF
    Underground utilities have proliferated throughout the years. The location and dimension of many underground utilities have not always been properly collected and documented, leading to utility conflicts and utility strikes, and thus resulting in property damages, project delays, cost overruns, environment pollutions, injuries and deaths. The underlying reasons are twofold. First, the reliable data regarding the location and dimension of underground utility are missing or incomplete. Existing methods to collect data are not efficient and effective. Second, positional uncertainties are inherent in the measured utility locations. An effective means is not yet available to visualize and communicate the inherent positional uncertainties associated with utility location data to end-users (e.g., excavator operator). To address the aforementioned problems, this research integrate ground penetrating radar (GPR), global positioning system (GPS) and geographic information system (GIS) to form a total 3G system to collect, inventory and visualize underground utility data. Furthermore, a 3D probabilistic error band is created to model and visualize the inherent positional uncertainties in utility data. ^ Three main challenges are addressed in this research. The first challenge is the interpretation of GPR and GPS raw data. A novel method is created in this research to simultaneously estimate the radius and buried depth of underground utilities using GPR scans and auxiliary GPS data. The proposed method was validated using GPR field scans obtained under various settings. It was found that this newly created method increases the accuracy of estimating the buried depth and radius of the buried utility under a general scanning condition. The second challenge is the geo-registration of detected utility locations. This challenge is addressed by integration of GPR, GPS and GIS. The newly created system takes advantages of GPR and GPS to detect and locate underground utilities in 3D and uses GIS for storing, updating, modeling, and visualizing collected utility data in a real world coordinate system. The third challenge is positional error/uncertainty assessment and modeling. The locational errors of GPR system are evaluated in different depth and soil conditions. Quantitative linkages between error magnitudes and its influencing factors (i.e., buried depths and soil conditions) are established. In order to handle the positional error of underground utilities, a prototype of 3D probabilistic error band is created and implemented in GIS environment. This makes the system error-aware and also paves the way to a more intelligent error-aware GIS. ^ To sum up, the newly created system is able to detect, locate and characterize underground utilities in an information-rich and error-aware manner

    Next-generation energy systems for sustainable smart cities: Roles of transfer learning

    Get PDF
    Smart cities attempt to reach net-zero emissions goals by reducing wasted energy while improving grid stability and meeting service demand. This is possible by adopting next-generation energy systems, which leverage artificial intelligence, the Internet of things (IoT), and communication technologies to collect and analyze big data in real-time and effectively run city services. However, training machine learning algorithms to perform various energy-related tasks in sustainable smart cities is a challenging data science task. These algorithms might not perform as expected, take much time in training, or do not have enough input data to generalize well. To that end, transfer learning (TL) has been proposed as a promising solution to alleviate these issues. To the best of the authors’ knowledge, this paper presents the first review of the applicability of TL for energy systems by adopting a well-defined taxonomy of existing TL frameworks. Next, an in-depth analysis is carried out to identify the pros and cons of current techniques and discuss unsolved issues. Moving on, two case studies illustrating the use of TL for (i) energy prediction with mobility data and (ii) load forecasting in sports facilities are presented. Lastly, the paper ends with a discussion of the future directions

    Deep Learning Based Upper-limb Motion Estimation Using Surface Electromyography

    Get PDF
    To advance human-machine interfaces (HMI) that can help disabled people reconstruct lost functions of upper-limbs, machine learning (ML) techniques, particularly classification-based pattern recognition (PR), have been extensively implemented to decode human movement intentions from surface electromyography (sEMG) signals. However, performances of ML can be substantially affected, or even limited, by feature engineering that requires expertise in both domain knowledge and experimental experience. To overcome this limitation, researchers are now focusing on deep learning (DL) techniques to derive informative, representative, and transferable features from raw data automatically. Despite some progress reported in recent literature, it is still very challenging to achieve reliable and robust interpretation of user intentions in practical scenarios. This is mainly because of the high complexity of upper-limb motions and the non-stable characteristics of sEMG signals. Besides, the PR scheme only identifies discrete states of motion. To complete coordinated tasks such as grasping, users have to rely on a sequential on/off control of each individual function, which is inherently different from the simultaneous and proportional control (SPC) strategy adopted by the natural motions of upper-limbs. The aim of this thesis is to develop and advance several DL techniques for the estimation of upper-limb motions from sEMG, and the work is centred on three themes: 1) to improve the reliability of gesture recognition by rejecting uncertain classification outcomes; 2) to build regression frameworks for joint kinematics estimation that enables SPC; and 3) to reduce the degradation of estimation performances when DL model is applied to a new individual. In order to achieve these objectives, the following efforts were made: 1) a confidence model was designed to predict the possibility of correctness with regard to each classification of convolutional neural networks (CNN), such that the uncertain recognition can be identified and rejected; 2) a hybrid framework using CNN for deep feature extraction and long short-term memory neural network (LSTM) was constructed to conduct sequence regression, which could simultaneously exploit the temporal and spatial information in sEMG data; 3) the hybrid framework was further extended by integrating Kalman filter with LSTM units in the recursive learning process, obtaining a deep Kalman filter network (DKFN) to perform kinematics estimation more effectively; and 4) a novel regression scheme was proposed for supervised domain adaptation (SDA), based on which the model generalisation among subjects can be substantially enhanced

    Creating public value in information and communication technology: a learning analytics approach

    Get PDF
    This thesis contributes to the ongoing global discourse in ICT4D on ICT and its effect on socio-economic development in both theory and practice. The thesis comprises five studies presented logically from chapters 5 to 9. The thesis employs Mixed Methods research methodology within the Critical Realist epistemological perspective in Information Systems Research. Studies 1-4 employ different quantitative research and analytical methods while study 5 employs a qualitative research and analytical method. Study 1 proposes and operationalizes a predictive analytics framework in Learning Analytics by using a case study of the Computer Science Department of the University of Jos, Nigeria. Multiple Linear Regression was used with the aid of the Statistical Package for Social Sciences (SPSS) analysis tool. Statistical Hypothesis testing was then used to validate the model with a 5% level of significance. Results show how predictive learning analytics can be successfully operationalized and used for predicting students’ academic performances. In Study 2 the relative efficiency of ICT infrastructure utilization with respect to the educational component of the Human Development Index (HDI) is investigated. A Novel conceptual model is proposed and the Data Envelopment Analysis (DEA) methodology is used to measure the relative efficiency of the components of ICT infrastructure (Inputs) and the components of education (Outputs). Ordinary Least Squares (OLS) Regression Analysis is used to determine the effect of ICT infrastructure on Educational Attainment/Adult Literacy Rates. Results show a strong positive effect of ICT infrastructure on educational attainment and adult literacy rates, a strong correlation between this infrastructure and literacy rates as well as provide a theoretical support for the argument of increasing ICT infrastructure to provide an increase in human development especially within the educational context. In Study 3 the relative efficiency and productivity of ICT Infrastructure Utilization in Education are examined. The research employs the Data Envelopment Analysis (DEA) and Malmquist Index (MI), well established non-parametric data analysis methodologies, applied to archival data on International countries divided into Arab States, Europe, Sub-Saharan Africa and World regions. Ordinary Least Squares (OLS) Regression analysis is applied to determine the effect of ICT infrastructure on Adult Literacy Rates. Findings show a relatively efficient utilization and steady increase in productivity for the regions but with only Europe and the Arab States currently operating in a state of positive growth in productivity. A strong positive effect of ICT infrastructure on Adult Literacy Rates is also observed. Study 4 investigates the efficiency and productivity of ICT utilization in public value creation with respect to Adult Literacy Rates. The research employs Data Envelopment Analysis (DEA) and Malmquist Index (MI), well established non-parametric data analysis methodologies, applied to archival data on International countries divided into Arab States, Europe, Sub-Saharan Africa and World regions. Findings show a relatively efficient utilization of ICT in public value creation but an average decline in productivity levels. Finally, in Study 5 a Critical Discourse Analysis (CDA) on the UNDP Human Development Research Reports from 2010-2016 is carried out to determine whether or not any public value is created or derived from the policy directions being put forward and their subsequent implementations. The CDA is operationalized by Habermas’ Theory of Communicative Action (TCA). Findings show that Public Value is indeed being created and at the core of the policy directions being called for in these reports.School of ComputingPh.D. (Information Systems

    Apprentissage d'espaces sémantiques

    Get PDF
    Dans cette dissertation, nous présentons plusieurs techniques d’apprentissage d’espaces sémantiques pour plusieurs domaines, par exemple des mots et des images, mais aussi à l’intersection de différents domaines. Un espace de représentation est appelé sémantique si des entités jugées similaires par un être humain, ont leur similarité préservée dans cet espace. La première publication présente un enchaînement de méthodes d’apprentissage incluant plusieurs techniques d’apprentissage non supervisé qui nous a permis de remporter la compétition “Unsupervised and Transfer Learning Challenge” en 2011. Le deuxième article présente une manière d’extraire de l’information à partir d’un contexte structuré (177 détecteurs d’objets à différentes positions et échelles). On montrera que l’utilisation de la structure des données combinée à un apprentissage non supervisé permet de réduire la dimensionnalité de 97% tout en améliorant les performances de reconnaissance de scènes de +5% à +11% selon l’ensemble de données. Dans le troisième travail, on s’intéresse à la structure apprise par les réseaux de neurones profonds utilisés dans les deux précédentes publications. Plusieurs hypothèses sont présentées et testées expérimentalement montrant que l’espace appris a de meilleures propriétés de mixage (facilitant l’exploration de différentes classes durant le processus d’échantillonnage). Pour la quatrième publication, on s’intéresse à résoudre un problème d’analyse syntaxique et sémantique avec des réseaux de neurones récurrents appris sur des fenêtres de contexte de mots. Dans notre cinquième travail, nous proposons une façon d’effectuer de la recherche d’image ”augmentée” en apprenant un espace sémantique joint où une recherche d’image contenant un objet retournerait aussi des images des parties de l’objet, par exemple une recherche retournant des images de ”voiture” retournerait aussi des images de ”pare-brises”, ”coffres”, ”roues” en plus des images initiales.In this work, we focus on learning semantic spaces for multiple domains, but also at the intersection of different domains. The semantic space is where the learned representation lives. This space is called semantic if similar entities from a human perspective have their similarity preserved in this space. We use different machine learning algorithms to learn representations with interesting intrinsic properties. The first article presents a pipeline including many different unsupervised learning techniques used to win the Unsupervised and Transfer Learning Challenge in 2011. In the second article, we present a pipeline taking advantage of the structure of the data for a scene classification problem. This approach allows us to drastically reduce the dimensionality while improving significantly on the scene recognition accuracy. The third article focuses on the space structure learned by deep representations. We show that performing the sampling procedure from deeper levels of representation space explores more of the different classes. In the fourth article, we tackle a semantic parsing problem with several Recurrent Neural Network architectures taking as input context windows of word embeddings. In the fifth article, an investigation on learning a single semantic space at the intersection of words and images is presented. We propose a way to perform ”augmented search” where a search on an image containing an object would also return images of the object’s parts
    corecore