
    Thirty Years of Machine Learning: The Road to Pareto-Optimal Wireless Networks

    Future wireless networks have substantial potential to support a broad range of complex and compelling applications in both military and civilian fields, in which users can enjoy high-rate, low-latency, low-cost and reliable information services. Achieving this ambitious goal requires new radio techniques for adaptive learning and intelligent decision making, owing to the complex, heterogeneous nature of the network structures and wireless services. Machine learning (ML) algorithms have achieved great success in supporting big data analytics, efficient parameter estimation and interactive decision making. Hence, in this article, we review the thirty-year history of ML, elaborating on supervised learning, unsupervised learning, reinforcement learning and deep learning. Furthermore, we investigate their employment in compelling wireless-network applications, including heterogeneous networks (HetNets), cognitive radio (CR), the Internet of Things (IoT), machine-to-machine (M2M) networks, and so on. This article aims to help readers grasp the motivation and methodology of the various ML algorithms, so that they can be invoked for hitherto unexplored services and scenarios in future wireless networks.
    Comment: 46 pages, 22 figures
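    As a concrete illustration of the reinforcement-learning methodology this survey covers, the sketch below applies a single-state (bandit-style) Q-learning update to a toy dynamic channel-selection task, a staple cognitive-radio example. The occupancy model, reward definition and all hyperparameters are illustrative assumptions, not taken from the article.

```python
import numpy as np

# Minimal single-state Q-learning sketch for dynamic channel selection
# in a cognitive radio. The channel occupancy probabilities and all
# hyperparameters below are illustrative assumptions.

rng = np.random.default_rng(0)

N_CHANNELS = 4                                # actions: which channel to use
BUSY_PROB = np.array([0.9, 0.6, 0.3, 0.1])    # hypothetical occupancy rates

Q = np.zeros(N_CHANNELS)                      # one Q-value per channel
alpha, epsilon, episodes = 0.1, 0.1, 5000

for _ in range(episodes):
    # epsilon-greedy action selection
    if rng.random() < epsilon:
        a = int(rng.integers(N_CHANNELS))
    else:
        a = int(np.argmax(Q))
    # reward: +1 if the channel is idle (successful transmission), else 0
    r = 0.0 if rng.random() < BUSY_PROB[a] else 1.0
    Q[a] += alpha * (r - Q[a])                # incremental Q-value update

print("Learned channel values:", np.round(Q, 2))  # should favor channel 3
```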

    Mobility aids for visually impaired persons: Journals reviewed

    This paper reviews the literature on mobile assistive devices for visually impaired people, in order to build a clear picture of the technology and of the technological progress made in assisting them. From this review, it derives basic guidelines for analyzing the most relevant assistive equipment and highlights the improvements that can still be achieved. The most common approach is to integrate sensors and electronic components into a walking stick to improve its obstacle-detection ability. In addition, camera-based equipment incorporating computer vision algorithms and artificial intelligence has been developed to improve performance and efficiency. Finally, the basic characteristics of these assistive systems are introduced, and the review finds that no existing device fully meets users' needs.
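    To make the "sensors in a walking stick" idea concrete, here is a minimal sketch of a threshold-based obstacle alert using an HC-SR04-style ultrasonic sensor via the gpiozero library on a Raspberry Pi; the pin numbers, alert threshold and buzzer feedback are illustrative assumptions, not taken from any device in the review.

```python
from time import sleep
from gpiozero import Buzzer, DistanceSensor  # assumes a Raspberry Pi host

# Minimal obstacle-alert sketch for a sensor-augmented walking stick.
# Pin assignments and the alert threshold are illustrative assumptions.
sensor = DistanceSensor(echo=17, trigger=4, max_distance=2.0)  # metres
buzzer = Buzzer(22)

ALERT_DISTANCE_M = 0.8  # warn when an obstacle is closer than ~0.8 m

while True:
    if sensor.distance < ALERT_DISTANCE_M:
        buzzer.on()    # simple audible warning; real aids often vary
    else:              # pitch or vibration with distance instead
        buzzer.off()
    sleep(0.1)
```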

    Inferring transportation modes from GPS trajectories using a convolutional neural network

    Identifying the distribution of users' transportation modes is an essential part of travel demand analysis and transportation planning. With the advent of ubiquitous GPS-enabled devices (e.g., smartphones), a cost-effective approach to inferring commuters' mobility modes is to leverage their GPS trajectories. A majority of studies have proposed mode inference models based on hand-crafted features and traditional machine learning algorithms. However, manual features have major drawbacks, including vulnerability to traffic and environmental conditions and the human bias involved in designing effective features. One way to overcome these issues is to use Convolutional Neural Network (CNN) schemes, which can automatically derive high-level features from the raw input. Accordingly, in this paper, we take advantage of CNN architectures to predict travel modes from raw GPS trajectories alone, where the modes are labeled as walk, bike, bus, driving, and train. Our key contribution is designing the layout of the CNN's input layer so that it is not only compatible with CNN schemes but also represents the fundamental motion characteristics of a moving object, including speed, acceleration, jerk, and bearing rate. Furthermore, we improve the quality of the GPS logs through several data preprocessing steps. Using the clean input layer, a variety of CNN configurations are evaluated to find the best CNN architecture. The highest accuracy of 84.8% is achieved by an ensemble of the best CNN configuration. We also contrast our methodology with traditional machine learning algorithms as well as the seminal and most closely related studies to demonstrate the superiority of our framework.
    Comment: 12 pages, 3 figures, 7 tables, Transportation Research Part C: Emerging Technologies
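    To make the input-layer idea concrete, here is a minimal sketch of how the four motion channels (speed, acceleration, jerk, bearing rate) can be computed from consecutive GPS fixes; the haversine distance step, function names and truncation-based alignment are illustrative assumptions, and the paper's exact preprocessing may differ.

```python
import numpy as np

EARTH_R = 6_371_000.0  # metres

def haversine(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two GPS fixes."""
    p1, p2 = np.radians(lat1), np.radians(lat2)
    dp, dl = p2 - p1, np.radians(lon2 - lon1)
    a = np.sin(dp / 2) ** 2 + np.cos(p1) * np.cos(p2) * np.sin(dl / 2) ** 2
    return 2 * EARTH_R * np.arcsin(np.sqrt(a))

def bearing(lat1, lon1, lat2, lon2):
    """Initial bearing in degrees from point 1 to point 2."""
    p1, p2 = np.radians(lat1), np.radians(lat2)
    dl = np.radians(lon2 - lon1)
    y = np.sin(dl) * np.cos(p2)
    x = np.cos(p1) * np.sin(p2) - np.sin(p1) * np.cos(p2) * np.cos(dl)
    return np.degrees(np.arctan2(y, x)) % 360.0

def motion_channels(lat, lon, t):
    """Per-segment speed, acceleration, jerk and bearing rate for one
    trajectory; channels are aligned by truncation for a CNN-style input."""
    d = haversine(lat[:-1], lon[:-1], lat[1:], lon[1:])
    dt = np.diff(t).astype(float)
    speed = d / dt
    accel = np.diff(speed) / dt[1:]
    jerk = np.diff(accel) / dt[2:]
    brg = bearing(lat[:-1], lon[:-1], lat[1:], lon[1:])
    bearing_rate = np.abs(np.diff(brg)) / dt[1:]
    n = len(jerk)  # shortest channel after repeated differencing
    return np.stack([speed[:n], accel[:n], jerk[:n], bearing_rate[:n]])
```

    Stacking the four channels yields a (4, L) array that can be fed to a 1-D CNN, mirroring the paper's idea of treating motion characteristics as input channels.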

    Commonsense Knowledge in Sentiment Analysis of Ordinance Reactions for Smart Governance

    Smart Governance is an emerging research area that has attracted both scientific and policy interest; it aims to improve collaboration between government, citizens, and other stakeholders. Our project aims to enable lawmakers to incorporate data-driven decision making in enacting ordinances. Our first objective is to create a mechanism for mapping ordinances (local laws) and tweets to Smart City Characteristics (SCC). The use of SCC has allowed us to map a huge number of ordinances and tweets, and the use of Commonsense Knowledge (CSK) has allowed us to incorporate human judgment into the mapping. We have then enhanced the mapping technique to link multiple tweets to SCC. To promote transparency in government through increased public participation, we have conducted sentiment analysis of tweets to evaluate public opinion on ordinances passed in a particular region. Our final objective is to develop a mapping algorithm that directly relates ordinances to tweets. To fulfill this objective, we have developed a mapping technique called TOLCS (Tweets Ordinance Linkage by Commonsense and Semantics). This technique uses pragmatic aspects captured by Commonsense Knowledge as well as semantic aspects captured by domain knowledge. By reducing the sample space of big data to be processed, the method accomplishes this task efficiently. The ultimate goal of the project is to assess how closely a given region is moving toward the concept of a Smart City.
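    As a concrete illustration of the tweet-sentiment step, below is a minimal sketch that scores ordinance-related tweets with NLTK's off-the-shelf VADER analyzer and aggregates the result per ordinance. The sample tweets, ordinance IDs and the choice of VADER are assumptions for illustration; this is not the project's TOLCS pipeline, which handles the harder ordinance-to-tweet linkage.

```python
# pip install nltk; then download the lexicon once:
#   python -m nltk.downloader vader_lexicon
from collections import defaultdict
from nltk.sentiment.vader import SentimentIntensityAnalyzer

# Hypothetical tweets already linked to ordinance IDs (the linkage is
# the hard part TOLCS addresses); sentiment scoring is then simple.
tweets = [
    ("ORD-101", "Great to see the city finally fixing bike lanes!"),
    ("ORD-101", "This ordinance is a waste of taxpayer money."),
    ("ORD-202", "Neutral on the new parking rules, need more details."),
]

sia = SentimentIntensityAnalyzer()
scores = defaultdict(list)
for ordinance_id, text in tweets:
    # 'compound' is VADER's normalized overall score in [-1, 1]
    scores[ordinance_id].append(sia.polarity_scores(text)["compound"])

for ordinance_id, vals in scores.items():
    print(ordinance_id, "mean sentiment:", round(sum(vals) / len(vals), 3))
```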

    OLIV: An Artificial Intelligence-Powered Assistant for Object Localization for Impaired Vision

    This paper introduces OLIV, a novel end-to-end artificial intelligence-powered assistant system designed to aid individuals with impaired vision in their day-to-day task of locating displaced objects. To achieve this goal, OLIV leverages current advances in AI-based speech recognition, speech generation, and object detection to understand the user's request and give directions to the relative location of the displaced object. OLIV consists of three main modules: i) a speech module, ii) an object detection module, and iii) a logic unit module. The speech module interfaces with the user, interpreting the user's verbal query and responding verbally. The object detection module identifies the objects of interest and their associated locations in a scene. Finally, the logic unit module makes sense of the user's intent along with the localized objects of interest, and builds a semantic description that the user can understand, which the speech module conveys verbally back to the user. Initial results from a proof-of-concept system trained to localize four different types of objects show promise for the feasibility of OLIV as a useful aid for individuals with impaired vision.
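    To make the three-module architecture concrete, here is a minimal orchestration sketch in which stubbed speech and detection modules feed a logic unit that turns detections into a spoken direction. All class names, the stub data and the description format are hypothetical stand-ins, as the paper does not publish an implementation.

```python
from dataclasses import dataclass

# Minimal sketch of OLIV's three-module architecture. The interfaces
# and example data are hypothetical stand-ins for the paper's speech,
# object-detection and logic-unit modules.

@dataclass
class Detection:
    label: str
    x_center: float  # normalized [0, 1], 0 = left edge of the frame

class SpeechModule:
    def transcribe(self) -> str:
        return "where are my keys"            # stub for speech recognition

    def speak(self, text: str) -> None:
        print(f"[speech out] {text}")         # stub for speech synthesis

class ObjectDetectionModule:
    def detect(self) -> list[Detection]:
        # stub for a detector (e.g., an SSD/YOLO-style network)
        return [Detection("keys", 0.15), Detection("mug", 0.7)]

class LogicUnit:
    def describe(self, query: str, dets: list[Detection]) -> str:
        target = next((d for d in dets if d.label in query), None)
        if target is None:
            return "I could not find that object."
        side = "left" if target.x_center < 0.33 else (
            "right" if target.x_center > 0.66 else "center")
        return f"Your {target.label} are on the {side} side of the scene."

# One request/response cycle through the three modules.
speech, detector, logic = SpeechModule(), ObjectDetectionModule(), LogicUnit()
speech.speak(logic.describe(speech.transcribe(), detector.detect()))
```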

    The Evolution of First Person Vision Methods: A Survey

    The emergence of new wearable technologies, such as action cameras and smart glasses, has increased the interest of computer vision scientists in the first-person perspective. Nowadays, this field is attracting the attention and investment of companies aiming to develop commercial devices with First Person Vision recording capabilities. Owing to this interest, an increasing demand for methods to process these videos, possibly in real time, is expected. Current approaches combine different image features and quantitative methods to accomplish specific objectives such as object detection, activity recognition, and user-machine interaction. This paper summarizes the evolution of the state of the art in First Person Vision video analysis between 1997 and 2014, highlighting, among other things, the most commonly used features, methods, challenges and opportunities within the field.
    Comment: First Person Vision, Egocentric Vision, Wearable Devices, Smart Glasses, Computer Vision, Video Analytics, Human-machine Interaction

    Machine Learning-Aided Operations and Communications of Unmanned Aerial Vehicles: A Contemporary Survey

    The ongoing amalgamation of UAV and ML techniques is creating significant synergy and empowering UAVs with unprecedented intelligence and autonomy. This survey aims to provide a timely and comprehensive overview of ML techniques used in UAV operations and communications, and to identify potential growth areas and research gaps. We emphasise the four key components of UAV operations and communications to which ML can significantly contribute, namely, perception and feature extraction, feature interpretation and regeneration, trajectory and mission planning, and aerodynamic control and operation. We classify the latest popular ML tools by their applications to these four components and conduct gap analyses. This survey also takes a step forward by pointing out significant challenges in the upcoming realm of ML-aided automated UAV operations and communications. It is revealed that different ML techniques dominate the applications to the four key modules of UAV operations and communications. While there is an increasing trend towards cross-module designs, little effort has been devoted to an end-to-end ML framework spanning from perception and feature extraction to aerodynamic control and operation. It is also unveiled that the reliability and trustworthiness of ML in UAV operations and applications require significant attention before full automation of UAVs, and potential cooperation between UAVs and humans, can come to fruition.
    Comment: 36 pages, 304 references, 19 figures

    Visual, navigation and communication aid for visually impaired person

    The loss of vision restrains visually impaired people from performing their daily tasks and impedes their free movement, turning them into dependent persons. For a long time, few technologies addressed their situation; with the advent of computer vision and artificial intelligence, it has improved considerably. The proposed design implements a wearable device capable of performing a wide range of functions. It provides visual assistance by recognizing objects and identifying chosen faces. The device runs a pre-trained model to classify common objects, from household items to automobiles. Optical character recognition and Google Translate are employed to read text from images and to convert the user's speech to text, respectively. The user can also search for a topic of interest by spoken command. Additionally, ultrasonic sensors fixed at three positions sense obstacles during navigation. An attached display helps in communicating with deaf persons, and GPS and GSM modules aid in tracking the user. All these features are driven by voice commands passed through the microphone of any earphone. Visual input is received through the camera, and computation is performed on a Raspberry Pi board. The device proved effective during testing and validation.
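    The description above is essentially a voice-driven dispatch loop running on a Raspberry Pi. The sketch below shows one hypothetical way to route recognized commands to the device's features; the command phrases and handler stubs are illustrative assumptions, not the authors' code, and the real device would feed the loop from an earphone microphone through a speech-to-text engine.

```python
# Hypothetical command-dispatch loop for the wearable aid described
# above; handlers are stubs standing in for the real object
# recognition, OCR and GPS/GSM features.

def recognize_objects() -> str:
    return "I can see a cup and a chair."       # stub: camera + model

def read_text() -> str:
    return "Reading the text in front of you."  # stub: OCR on a frame

def send_location() -> str:
    return "Location sent via GSM."             # stub: GPS + GSM modules

HANDLERS = {
    "what is in front of me": recognize_objects,
    "read this": read_text,
    "where am i": send_location,
}

def handle(command: str) -> str:
    """Route a transcribed voice command to the matching feature."""
    action = HANDLERS.get(command.strip().lower())
    return action() if action else "Sorry, I did not understand that."

# Simulate two utterances instead of live speech-to-text input.
for utterance in ["What is in front of me", "play music"]:
    print(handle(utterance))
```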