633 research outputs found

    Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

    Task decomposition is effective in many applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictability and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values, which are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA lightweight arm and on a pick-and-delivery task with a Pioneer robot.
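    The optimistic node-selection rule at the heart of UCT can be sketched as follows. This is a generic UCB1-style selection over learned action values, not the paper's full HOP algorithm; the function name and exploration constant are illustrative.

```python
import math

def ucb1_select(counts, values, c=1.4):
    """Pick the action maximizing learned value + exploration bonus,
    the core selection rule of UCT-style optimistic planning.

    counts[a]: number of visits to action a.
    values[a]: current mean learned value of action a.
    Unvisited actions are selected first (implicitly infinite bonus)."""
    total = sum(counts)
    best, best_score = None, float("-inf")
    for a, n in enumerate(counts):
        if n == 0:
            return a  # optimism: try every action at least once
        score = values[a] + c * math.sqrt(math.log(total) / n)
        if score > best_score:
            best, best_score = a, score
    return best
```

In a hierarchical setting, the same rule would be applied at each level of the task decomposition, with the learned values constraining which subtrees are expanded.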

    A Non-Sequential Representation of Sequential Data for Churn Prediction

    We investigate which event-sequence length gives the best predictions when using a continuous HMM approach to churn prediction from sequential data. Motivated by the observation that predictions based on only the few most recent events seem to be the most accurate, a non-sequential dataset is constructed from customer event histories by averaging features of the last few events. A simple K-nearest-neighbour algorithm on this dataset is found to give significantly improved performance. It is intuitive that most people react only to events in the fairly recent past: telecommunications events occurring months or years ago are unlikely to have a large impact on a customer's future behaviour, and these results bear this out. Methods that deal with sequential data also tend to be much more complex than those dealing with simple non-temporal data, giving an added benefit to expressing the recent information in a non-sequential manner.
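    The construction described above (average the last few events, then classify with K-nearest neighbours) can be sketched in a few lines. Function names, the value of k, and the toy feature vectors are illustrative, not taken from the paper.

```python
import math
from collections import Counter

def last_k_average(events, k=3):
    """Collapse a customer's event history into one non-sequential
    vector by averaging the feature vectors of the k most recent events."""
    recent = events[-k:]
    dim = len(recent[0])
    return [sum(e[i] for e in recent) / len(recent) for i in range(dim)]

def knn_predict(train, query, k=3):
    """train: list of (vector, label) pairs.
    Majority vote among the k nearest neighbours by Euclidean distance."""
    nearest = sorted(train, key=lambda t: math.dist(t[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

A history would first be reduced with `last_k_average`, and the resulting fixed-length vectors fed to `knn_predict` in place of any sequence model.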

    Boosting parallel perceptrons for label noise reduction in classification problems

    The final publication is available at Springer via http://dx.doi.org/10.1007/11499305_60. Proceedings of the First International Work-Conference on the Interplay Between Natural and Artificial Computation, IWINAC 2005, Las Palmas, Canary Islands, Spain, June 15-18, 2005.
    Boosting combines an ensemble of weak learners to construct a new weighted classifier that is often more accurate than any of its components. The construction of such learners, whose training sets depend on the performance of the previous members of the ensemble, is carried out by successively focusing on the patterns that are harder to classify. This degrades boosting's results when dealing with malicious noise such as mislabeled training examples. In order to detect and avoid those noisy examples during the learning process, we propose the use of Parallel Perceptrons. Among other things, these novel machines allow one to naturally define margins for hidden-unit activations. We use these margins to detect which patterns may have an incorrect label, and also which are safe, in the sense of being well represented in the training sample by many other similar patterns. We reduce the weights of the former, as candidate noisy examples, and augment the weights of the latter to support the overall detection procedure. With partial support of Spain's CICyT, TIC 01-572, TIN 2004-0767.
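    The margin-based reweighting idea can be sketched as below: examples whose margin is strongly negative are treated as likely mislabeled and down-weighted, while examples with a comfortably positive margin are treated as safe and up-weighted. The thresholds and scaling factors here are illustrative assumptions, not the paper's actual rule.

```python
def reweight_by_margin(weights, margins, noise_margin=-0.5, safe_margin=0.5,
                       down=0.5, up=1.5):
    """Adjust boosting weights using per-pattern margins.

    margins[i] < noise_margin : candidate noisy example -> reduce weight
    margins[i] > safe_margin  : well-supported example  -> increase weight
    Weights are renormalized to a distribution at the end."""
    new = []
    for w, m in zip(weights, margins):
        if m < noise_margin:
            w *= down
        elif m > safe_margin:
            w *= up
        new.append(w)
    total = sum(new)
    return [w / total for w in new]
```

In a boosting loop this would replace the usual step of blindly increasing the weight of every misclassified pattern, which is exactly what makes standard boosting fragile under label noise.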

    Transductive Learning with String Kernels for Cross-Domain Text Classification

    For many text classification tasks, a major problem is posed by the lack of labeled data in the target domain. Although classifiers for a target domain can be trained on labeled text data from a related source domain, the accuracy of such classifiers is usually lower in the cross-domain setting. Recently, string kernels have obtained state-of-the-art results in various text classification tasks such as native language identification and automatic essay scoring. Moreover, classifiers based on string kernels have been found to be robust to the distribution gap between different domains. In this paper, we formally describe an algorithm composed of two simple yet effective transductive learning approaches to further improve the results of string kernels in cross-domain settings. By adapting string kernels to the test set without using the ground-truth test labels, we report significantly better accuracy rates in cross-domain English polarity classification.
    Comment: Accepted at ICONIP 2018. arXiv admin note: substantial text overlap with arXiv:1808.0840
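    For reference, one common string-kernel variant is the p-spectrum kernel: the inner product of the character p-gram count vectors of two strings. This is a generic sketch of that family, not necessarily the exact kernel the paper uses.

```python
from collections import Counter

def spectrum_kernel(s, t, p=3):
    """p-spectrum string kernel: count, for every character p-gram,
    how often it occurs in s times how often it occurs in t, and sum."""
    grams = lambda x: Counter(x[i:i + p] for i in range(len(x) - p + 1))
    gs, gt = grams(s), grams(t)
    return sum(count * gt[g] for g, count in gs.items())
```

Because the kernel operates on raw characters rather than tokens, it is largely insensitive to domain-specific vocabulary, which is one plausible reason for the cross-domain robustness noted above.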

    Adaptive Anomaly Detection via Self-Calibration and Dynamic Updating

    The deployment and use of Anomaly Detection (AD) sensors often requires the intervention of a human expert to manually calibrate and optimize their performance. Depending on the site and the type of traffic it receives, the operators might have to provide recent and sanitized training data sets, the characteristics of expected traffic (i.e., the outlier ratio), and exceptions or even expected future modifications of the system's behavior. In this paper, we study the potential performance issues that stem from fully automating the AD sensors' day-to-day maintenance and calibration. Our goal is to remove the dependence on a human operator, using an unlabeled, and thus potentially dirty, sample of incoming traffic. To that end, we propose to enhance the training phase of AD sensors with a self-calibration phase, leading to the automatic determination of the optimal AD parameters. We show how this novel calibration phase can be employed in conjunction with previously proposed methods for training data sanitization, resulting in a fully automated AD maintenance cycle. Our approach is completely agnostic to the underlying AD sensor algorithm. Furthermore, the self-calibration can be applied in an online fashion to ensure that the resulting AD models reflect changes in the system's behavior which would otherwise render the sensor's internal state inconsistent. We verify the validity of our approach through a series of experiments in which we compare the manually obtained optimal parameters with the ones computed from the self-calibration phase. Modeling traffic from two different sources, the fully automated calibration shows a 7.08% reduction in detection rate and a 0.06% increase in false positives, in the worst case, when compared to the optimal selection of parameters. Finally, our adaptive models outperform the statically generated ones, retaining the gains in performance from the sanitization process over time.
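    One simple form of such self-calibration can be sketched as follows: given anomaly scores computed over an unlabeled traffic sample, pick the detection threshold so that the fraction of flagged traffic matches an assumed outlier ratio. This is a minimal illustration of the idea, not the paper's actual calibration procedure.

```python
def calibrate_threshold(scores, outlier_ratio=0.01):
    """Pick a score threshold from unlabeled data so that roughly
    `outlier_ratio` of the sample scores exceed it.

    scores: anomaly scores from the underlying AD sensor (higher =
    more anomalous); no labels are required."""
    ranked = sorted(scores)
    cut = int(len(ranked) * (1 - outlier_ratio))
    cut = min(cut, len(ranked) - 1)  # guard against ratio == 0
    return ranked[cut]
```

Run online, the same computation over a sliding window of recent scores would let the threshold track drift in the system's behavior, matching the adaptive setting described above.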

    Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization

    The use of Reinforcement Learning in real-world scenarios is strongly limited by issues of scale. Most RL learning algorithms are unable to deal with problems composed of hundreds or sometimes even dozens of possible actions, and therefore cannot be applied to many real-world problems. We consider the RL problem in the supervised classification framework, where the optimal policy is obtained through a multiclass classifier whose set of classes is the set of actions of the problem. We introduce error-correcting output codes (ECOCs) in this setting and propose two new methods for reducing complexity when using rollout-based approaches. The first method uses an ECOC-based classifier as the multiclass classifier, reducing the learning complexity from O(A^2) to O(A log(A)). We then propose a novel method that profits from the ECOC's coding dictionary to split the initial MDP into O(log(A)) separate two-action MDPs. This second method reduces learning complexity even further, from O(A^2) to O(log(A)), thus rendering problems with large action sets tractable. We finish by experimentally demonstrating the advantages of our approach on a set of benchmark problems, both in speed and performance.
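    The ECOC machinery referred to above can be sketched generically: each action gets a binary codeword, each codeword column defines one two-class problem, and an action is recovered by nearest-codeword (Hamming) decoding. The random code construction and function names here are illustrative assumptions.

```python
import random

def make_code_matrix(n_actions, code_len, seed=0):
    """Random binary ECOC dictionary: one codeword (row) per action.
    With code_len on the order of log(n_actions), each of the code_len
    columns defines one two-action subproblem."""
    rng = random.Random(seed)
    return [[rng.randint(0, 1) for _ in range(code_len)]
            for _ in range(n_actions)]

def ecoc_decode(bits, code_matrix):
    """Map the binary classifiers' outputs to the action whose codeword
    is nearest in Hamming distance (this is what gives ECOCs their
    error-correcting behavior)."""
    ham = lambda a, b: sum(x != y for x, y in zip(a, b))
    return min(range(len(code_matrix)),
               key=lambda a: ham(bits, code_matrix[a]))
```

The second method in the paper exploits exactly this structure: each column's two-class problem becomes its own two-action MDP, giving the O(log(A)) decomposition.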

    Database Architecture (R)evolution: New Hardware vs. New Software


    Multiclass Semi-Supervised Learning on Graphs using Ginzburg-Landau Functional Minimization

    We present a graph-based variational algorithm for classification of high-dimensional data, generalizing the binary diffuse-interface model to the case of multiple classes. Motivated by total variation techniques, the method involves minimizing an energy functional made up of three terms. The first two terms promote a stepwise continuous classification function with sharp transitions between classes, while preserving symmetry among the class labels. The third term is a data fidelity term, allowing us to incorporate prior information into the model in a semi-supervised framework. The performance of the algorithm on synthetic data, as well as on the COIL and MNIST benchmark datasets, is competitive with state-of-the-art graph-based multiclass segmentation methods.
    Comment: 16 pages, to appear in Springer's Lecture Notes in Computer Science volume "Pattern Recognition Applications and Methods 2013", part of the series on Advances in Intelligent and Soft Computing
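    The three-term energy can be illustrated in the simpler binary case (the paper generalizes this to multiple classes): a graph smoothness term, a double-well potential pushing node values toward the labels -1 and +1, and a fidelity term on the labeled nodes. Parameter names and the discretization are illustrative.

```python
def gl_energy(u, edges, eps=1.0, fidelity=None, mu=1.0):
    """Discrete binary Ginzburg-Landau energy on a graph.

    u        : list of node values, ideally near -1 or +1.
    edges    : (i, j, weight) triples of the similarity graph.
    fidelity : optional dict {node: observed label} for the
               semi-supervised data-fidelity term, weighted by mu."""
    # 1) smoothness: penalize disagreement across heavy edges
    smooth = 0.5 * eps * sum(w * (u[i] - u[j]) ** 2 for i, j, w in edges)
    # 2) double-well potential: zero at u = +/-1, positive in between,
    #    so minimizers make sharp transitions between the two classes
    well = sum((1 - x * x) ** 2 for x in u) / (4 * eps)
    # 3) data fidelity on the labeled nodes
    fid = 0.0
    if fidelity:
        fid = 0.5 * mu * sum((u[i] - y) ** 2 for i, y in fidelity.items())
    return smooth + well + fid
```

Classification then amounts to (approximately) minimizing this energy over u, e.g. by gradient descent, and thresholding the result; the multiclass version replaces the scalar u and double well with a vector-valued field and a multi-well potential that is symmetric in the class labels.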