Mining recurrent concepts in data streams using the discrete Fourier transform

Author: A. Bifet
C. Alippi
H. Kargupta
H. Morshedlou
J. Gama
J.B. Gomes
M.J. Hosseini
N. Linial
S. Hoeglinger
Publication venue: arXiv
Publication date: 01/01/2014

In this research we address the problem of capturing recurring concepts in a data stream environment. Recurrence capture enables the re-use of previously learned classifiers without the need for re-learning, while providing better accuracy during the concept recurrence interval. We capture concepts by applying the Discrete Fourier Transform (DFT) to Decision Tree classifiers to obtain highly compressed versions of the trees at concept drift points in the stream, and we store these trees in a repository for future use. Our empirical results on real-world and synthetic data exhibiting varying degrees of recurrence show that the Fourier-compressed trees are more robust to noise and capture recurring concepts with higher precision than a meta-learning approach that re-uses classifiers in their originally occurring form.
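The idea of encoding a decision tree as a Fourier spectrum can be illustrated with a minimal sketch. It is not the authors' implementation: the toy tree, the three binary attributes, and the helper names (tree_predict, basis, fourier_spectrum, reconstruct) are all assumptions made for illustration, and the transform is the standard DFT over binary attribute vectors (the Walsh-Hadamard basis). The sketch computes every coefficient by brute force, which is only feasible for a handful of attributes.

```python
from itertools import product

D = 3  # number of binary attributes (assumed for this illustration)


def tree_predict(x):
    """Hypothetical toy decision tree: tests x[0], then x[1]; x[2] is never tested."""
    if x[0] == 0:
        return x[1]
    return 1


def basis(j, x):
    """Fourier basis function psi_j(x) = (-1)^(j . x) over binary vectors."""
    return -1 if sum(a * b for a, b in zip(j, x)) % 2 else 1


def fourier_spectrum(f, d):
    """Brute-force spectrum: w_j = (1 / 2^d) * sum over x of f(x) * psi_j(x)."""
    points = list(product((0, 1), repeat=d))
    return {
        j: sum(f(x) * basis(j, x) for x in points) / len(points)
        for j in points
    }


def reconstruct(spectrum, x, max_order=None):
    """Inverse transform at x, optionally keeping only coefficients whose
    index j has at most max_order ones (a low-order truncation)."""
    total = 0.0
    for j, w in spectrum.items():
        if max_order is None or sum(j) <= max_order:
            total += w * basis(j, x)
    return total


spectrum = fourier_spectrum(tree_predict, D)
nonzero = {j: w for j, w in spectrum.items() if w != 0}
```

For this toy tree only four of the eight coefficients are non-zero, because every coefficient whose index involves the untested attribute vanishes. Storing the non-zero (or only the low-order) coefficients in place of the tree is the kind of compression the abstract refers to; how the paper selects and stores coefficients is not stated here.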

Crossref

AUT Scholarly Commons

Finding Your Way Back: Comparing Path Odometry Algorithms for Assisted Return.

Author: Elyasi Fatemeh
Manduchi Roberto
Ren Peng
Tsai Chia Hsuan
Publication venue: eScholarship, University of California
Publication date: 01/03/2021

We present a comparative analysis of inertial-based odometry algorithms for the purpose of assisted return. An assisted return system facilitates backtracking of a path previously taken, and can be particularly useful for blind pedestrians. We present a new algorithm for path matching, and test it in simulated assisted return tasks with data from WeAllWalk, the only existing data set with inertial data recorded from blind walkers. We consider two odometry systems: one based on deep learning (RoNIN), and one based on robust turn detection and step counting. Our results show that the best path matching results are obtained using the turns/steps odometry system.

PubMed Central

eScholarship - University of California

Use of Ensembles of Fourier Spectra in Capturing Recurrent Concepts in Data Streams

Author: Bifet Albert
Pears Russel
Pfahringer Bernhard
Sakthithasan Sakthithasan
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2015

In this research, we apply ensembles of Fourier-encoded spectra to capture and mine recurring concepts in a data stream environment. Previous research showed that compact versions of Decision Trees can be obtained by applying the Discrete Fourier Transform to accurately capture recurrent concepts in a data stream. However, in highly volatile environments where new concepts emerge often, encoding each concept in a separate spectrum is no longer viable due to memory overload. We therefore present an ensemble approach that addresses this problem. Our empirical results on real-world and synthetic data exhibiting varying degrees of recurrence reveal that the ensemble approach outperforms the single-spectrum approach in terms of classification accuracy, memory, and execution time.

arXiv.org e-Print Archive

Crossref

Research Commons@Waikato

Generic Object Detection and Segmentation for Real-World Environments

Author: Johansen Anders Skaarup
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2023

VBN

Linking social media, medical literature, and clinical notes using deep learning.

Author: Asghari Mohsen
Publication venue: ThinkIR: The University of Louisville's Institutional Repository
Publication date: 01/08/2021

Researchers analyze data, information, and knowledge through many sources, formats, and methods. The dominant data formats are text and images. In the healthcare industry, professionals generate a large quantity of unstructured data. The complexity of this data and the lack of computational power cause delays in analysis. However, with emerging deep learning algorithms and access to computational resources such as graphics processing units (GPUs) and tensor processing units (TPUs), processing text and images is becoming more accessible. Deep learning algorithms achieve remarkable results in natural language processing (NLP) and computer vision. In this study, we focus on NLP in the healthcare industry and collect data not only from electronic medical records (EMRs) but also from medical literature and social media. We propose a framework for linking social media, medical literature, and EMR clinical notes using deep learning algorithms. Connecting data sources requires defining a link between them, and our key is finding concepts in the medical text. The National Library of Medicine (NLM) provides the Unified Medical Language System (UMLS), and we use this system as the foundation of our own. We recognize social media's dynamic nature and apply supervised and semi-supervised methodologies to generate concepts. Named entity recognition (NER) allows efficient extraction of information, or entities, from medical literature, and we extend the model to process EMR clinical notes via transfer learning. The results include an integrated, end-to-end, web-based system that unifies social media, literature, and clinical notes, and improves access to medical knowledge for the public and experts.

University of Louisville