356,419 research outputs found
Web Mining Research: A Survey
With the huge amount of information available online, the World Wide Web is a
fertile area for data mining research. The Web mining research is at the cross
road of research from several research communities, such as database,
information retrieval, and within AI, especially the sub-areas of machine
learning and natural language processing. However, there is a lot of confusions
when comparing research efforts from different point of views. In this paper,
we survey the research in the area of Web mining, point out some confusions
regarded the usage of the term Web mining and suggest three Web mining
categories. Then we situate some of the research with respect to these three
categories. We also explore the connection between the Web mining categories
and the related agent paradigm. For the survey, we focus on representation
issues, on the process, on the learning algorithm, and on the application of
the recent works as the criteria. We conclude the paper with some research
issues.Comment: 15 page
Online Data Poisoning Attack
We study data poisoning attacks in the online setting where training items
arrive sequentially, and the attacker may perturb the current item to
manipulate online learning. Importantly, the attacker has no knowledge of
future training items nor the data generating distribution. We formulate online
data poisoning attack as a stochastic optimal control problem, and solve it
with model predictive control and deep reinforcement learning. We also upper
bound the suboptimality suffered by the attacker for not knowing the data
generating distribution. Experiments validate our control approach in
generating near-optimal attacks on both supervised and unsupervised learning
tasks
Symbolic Methodology in Numeric Data Mining: Relational Techniques for Financial Applications
Currently statistical and artificial neural network methods dominate in
financial data mining. Alternative relational (symbolic) data mining methods
have shown their effectiveness in robotics, drug design and other applications.
Traditionally symbolic methods prevail in the areas with significant
non-numeric (symbolic) knowledge, such as relative location in robot
navigation. At first glance, stock market forecast looks as a pure numeric area
irrelevant to symbolic methods. One of our major goals is to show that
financial time series can benefit significantly from relational data mining
based on symbolic methods. The paper overviews relational data mining
methodology and develops this techniques for financial data mining.Comment: 20 pages, 1 figure, 16 table
HOList: An Environment for Machine Learning of Higher-Order Theorem Proving
We present an environment, benchmark, and deep learning driven automated
theorem prover for higher-order logic. Higher-order interactive theorem provers
enable the formalization of arbitrary mathematical theories and thereby present
an interesting, open-ended challenge for deep learning. We provide an
open-source framework based on the HOL Light theorem prover that can be used as
a reinforcement learning environment. HOL Light comes with a broad coverage of
basic mathematical theorems on calculus and the formal proof of the Kepler
conjecture, from which we derive a challenging benchmark for automated
reasoning. We also present a deep reinforcement learning driven automated
theorem prover, DeepHOL, with strong initial results on this benchmark.Comment: Accepted at ICML 201
Are Disentangled Representations Helpful for Abstract Visual Reasoning?
A disentangled representation encodes information about the salient factors
of variation in the data independently. Although it is often argued that this
representational format is useful in learning to solve many real-world
down-stream tasks, there is little empirical evidence that supports this claim.
In this paper, we conduct a large-scale study that investigates whether
disentangled representations are more suitable for abstract reasoning tasks.
Using two new tasks similar to Raven's Progressive Matrices, we evaluate the
usefulness of the representations learned by 360 state-of-the-art unsupervised
disentanglement models. Based on these representations, we train 3600 abstract
reasoning models and observe that disentangled representations do in fact lead
to better down-stream performance. In particular, they enable quicker learning
using fewer samples.Comment: Accepted to NeurIPS 201
Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds
Cloud based medical image analysis has become popular recently due to the
high computation complexities of various deep neural network (DNN) based
frameworks and the increasingly large volume of medical images that need to be
processed. It has been demonstrated that for medical images the transmission
from local to clouds is much more expensive than the computation in the clouds
itself. Towards this, 3D image compression techniques have been widely applied
to reduce the data traffic. However, most of the existing image compression
techniques are developed around human vision, i.e., they are designed to
minimize distortions that can be perceived by human eyes. In this paper we will
use deep learning based medical image segmentation as a vehicle and demonstrate
that interestingly, machine and human view the compression quality differently.
Medical images compressed with good quality w.r.t. human vision may result in
inferior segmentation accuracy. We then design a machine vision oriented 3D
image compression framework tailored for segmentation using DNNs. Our method
automatically extracts and retains image features that are most important to
the segmentation. Comprehensive experiments on widely adopted segmentation
frameworks with HVSMR 2016 challenge dataset show that our method can achieve
significantly higher segmentation accuracy at the same compression rate, or
much better compression rate under the same segmentation accuracy, when
compared with the existing JPEG 2000 method. To the best of the authors'
knowledge, this is the first machine vision guided medical image compression
framework for segmentation in the clouds.Comment: IEEE Computer Society Conference on Computer Vision and Pattern
Recognition(CVPR), Long Beach, CA, 201
Fluctuation-dissipation theorem and models of learning
Advances in statistical learning theory have resulted in a multitude of
different designs of learning machines. But which ones are implemented by
brains and other biological information processors? We analyze how various
abstract Bayesian learners perform on different data and argue that it is
difficult to determine which learning-theoretic computation is performed by a
particular organism using just its performance in learning a stationary target
(learning curve). Basing on the fluctuation-dissipation relation in statistical
physics, we then discuss a different experimental setup that might be able to
solve the problem.Comment: 23 pages, 1 figure; manuscript restructured following reviewers'
suggestions; references added; misprints correcte
Deep learning methods based on cross-section images for predicting effective thermal conductivity of composites
Effective thermal conductivity is an important property of composites for
different thermal management applications. Although physics-based methods, such
as effective medium theory and solving partial differential equation, dominate
the relevant research, there is significant interest to establish the
structure-property linkage through the machine learning method. The performance
of general machine learning methods is highly dependent on features selected to
represent the microstructures. 3D convolutional neural networks (CNNs) can
directly extract geometric features of composites, which have been demonstrated
to establish structure-property linkages with high accuracy. However, to obtain
the 3D microstructure in composite is generally challenging in reality. In this
work, we attempt to use 2D cross-section images which can be easier to obtain
in real applications as input of 2D CNNs to predict effective thermal
conductivity of 3D composites. The results show that by using multiple
cross-section images along or perpendicular to the preferred directionality of
the fillers, the prediction accuracy of 2D CNNs can be as good as 3D CNNs. Such
a result is demonstrated with the particle filled composite and a stochastic
complex composite. The prediction accuracy is dependent on the
representativeness of cross-section images used. Multiple cross-section images
can fully determine the shape and distribution of fillers. The average over
multiple images and the use of large-size images can reduce the uncertainty and
increase the prediction accuracy. Besides, since cross-section images along the
heat flow direction can distinguish between serial structures and parallel
structures, they are more representative than cross-section images
perpendicular to the heat flow direction
On Learning to Think: Algorithmic Information Theory for Novel Combinations of Reinforcement Learning Controllers and Recurrent Neural World Models
This paper addresses the general problem of reinforcement learning (RL) in
partially observable environments. In 2013, our large RL recurrent neural
networks (RNNs) learned from scratch to drive simulated cars from
high-dimensional video input. However, real brains are more powerful in many
ways. In particular, they learn a predictive model of their initially unknown
environment, and somehow use it for abstract (e.g., hierarchical) planning and
reasoning. Guided by algorithmic information theory, we describe RNN-based AIs
(RNNAIs) designed to do the same. Such an RNNAI can be trained on never-ending
sequences of tasks, some of them provided by the user, others invented by the
RNNAI itself in a curious, playful fashion, to improve its RNN-based world
model. Unlike our previous model-building RNN-based RL machines dating back to
1990, the RNNAI learns to actively query its model for abstract reasoning and
planning and decision making, essentially "learning to think." The basic ideas
of this report can be applied to many other cases where one RNN-like system
exploits the algorithmic information content of another. They are taken from a
grant proposal submitted in Fall 2014, and also explain concepts such as
"mirror neurons." Experimental results will be described in separate papers.Comment: 36 pages, 1 figure. arXiv admin note: substantial text overlap with
arXiv:1404.782
A Framework for learning multi-agent dynamic formation strategy in real-time applications
Formation strategy is one of the most important parts of many multi-agent
systems with many applications in real world problems. In this paper, a
framework for learning this task in a limited domain (restricted environment)
is proposed. In this framework, agents learn either directly by observing an
expert behavior or indirectly by observing other agents or objects behavior.
First, a group of algorithms for learning formation strategy based on limited
features will be presented. Due to distributed and complex nature of many
multi-agent systems, it is impossible to include all features directly in the
learning process; thus, a modular scheme is proposed in order to reduce the
number of features. In this method, some important features have indirect
influence in learning instead of directly involving them as input features.
This framework has the ability to dynamically assign a group of positions to a
group of agents to improve system performance. In addition, it can change the
formation strategy when the context changes. Finally, this framework is able to
automatically produce many complex and flexible formation strategy algorithms
without directly involving an expert to present and implement such complex
algorithms.Comment: 27 pages, 9 figure
- …