Search CORE

156 research outputs found

Scalable stellar evolution forecasting: Deep learning emulation vs. hierarchical nearest neighbor interpolation

Author: Jordan A. I.
Maltsev K.
Qadir G. A.
Riedmiller K.
Roepke F. K.
Schneider F. R. N.
Publication venue
Publication date: 22/09/2023
Field of study

Many astrophysical applications require efficient yet reliable forecasts of stellar evolution tracks. One example is population synthesis, which generates forward predictions of models for comparison with observations. The majority of state-of-the-art population synthesis methods are based on analytic fitting formulae to stellar evolution tracks that are computationally cheap to sample statistically over a continuous parameter range. Running detailed stellar evolution codes, such as MESA, over wide and densely sampled parameter grids is prohibitively expensive computationally, while stellar-age based linear interpolation in-between sparsely sampled grid points leads to intolerably large systematic prediction errors. In this work, we provide two solutions of automated interpolation methods that find satisfactory trade-off points between cost-efficiency and accuracy. We construct a timescale-adapted evolutionary coordinate and use it in a two-step interpolation scheme that traces the evolution of stars from zero age main sequence all the way to the end of core helium burning while covering a mass range from

{0.65}

300 \, \mathrm{M_\odot}

. The feedforward neural network regression model (first solution) that we train to predict stellar surface variables can make millions of predictions, sufficiently accurate over the entire parameter space, within tens of seconds on a 4-core CPU. The hierarchical nearest neighbor interpolation algorithm (second solution) that we hard-code to the same end achieves even higher predictive accuracy, the same algorithm remains applicable to all stellar variables evolved over time, but it is two orders of magnitude slower. Our methodological framework is demonstrated to work on the MIST data set. Finally, we discuss prospective applications and provide guidelines how to generalize our methods to higher dimensional parameter spaces.Comment: Submitted to A&

arXiv.org e-Print Archive

Learning object relationships which determine the outcome of actions

Author: A Ferry
A Stoytchev
AGE Collins
B Rosman
D E Knuth
D Mareschal
E Bates
E Thelen
E Ugur
F Guerin
G Kootstra
H Chaput
I C Uzgiris
J A Jorgensen
J J Lockman
J M Mandler
J M Mandler
J Mugan
J Piaget
J Piaget
J Piaget
K J Rohlfing
K Khoshelham
K Mourao
K S Bourgeois
K W Fischer
L B Smith
M Casasola
M Kaariainen
M Kubat
M Riedmiller
M Riedmiller
M Rolf
M Schlesinger
N Pugeault
P Tommasino
P Willatts
PJ Kellman
R B Rusu
S Fichtl
S Fichtl
S J Hespos
S Olesen
S Stolbach
T G R Bower
T Zimmerman
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/12/2012
Field of study

Peer reviewedPublisher PD

Aberdeen University Research

Crossref

Directory of Open Access Journals

University of Southern Denmark Research Output

Weighing Counts: Sequential Crowd Counting by Reinforcement Learning

Author: A Hussein
D Silver
H Idrees
H Lu
H Xiong
IH Laradji
L Liu
L Van Hove
M Riedmiller
O Vinyals
R Guerrero-Gómez-Olmedo
T Stahl
V Mnih
Publication venue
Publication date: 01/01/2020
Field of study

We formulate counting as a sequential decision problem and present a novel crowd counting model solvable by deep reinforcement learning. In contrast to existing counting models that directly output count values, we divide one-step estimation into a sequence of much easier and more tractable sub-decision problems. Such sequential decision nature corresponds exactly to a physical process in reality scale weighing. Inspired by scale weighing, we propose a novel 'counting scale' termed LibraNet where the count value is analogized by weight. By virtually placing a crowd image on one side of a scale, LibraNet (agent) sequentially learns to place appropriate weights on the other side to match the crowd count. At each step, LibraNet chooses one weight (action) from the weight box (the pre-defined action pool) according to the current crowd image features and weights placed on the scale pan (state). LibraNet is required to learn to balance the scale according to the feedback of the needle (Q values). We show that LibraNet exactly implements scale weighing by visualizing the decision process how LibraNet chooses actions. Extensive experiments demonstrate the effectiveness of our design choices and report state-of-the-art results on a few crowd counting benchmarks. We also demonstrate good cross-dataset generalization of LibraNet. Code and models are made available at: https://git.io/libranetComment: Accepted to Proc. Eur. Conf. Computer Vision (ECCV) 202

arXiv.org e-Print Archive

Crossref

Adelaide Research & Scholarship

Distance in audio for VR: Constraints and opportunities

Author: Björling O
Clark A.
Daniel J.
Gerzon MA.
Kruszielski LF
Penha R.
Pike C
Riedmiller N.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/08/2017
Field of study

Spatial audio is enjoying a surge in attention in both scene and object based paradigms, due to the trend for, and accessibility of, immersive experience. This has been enabled through convergence in computing enhancements, component size reduction, and associated price reductions. For the first time, applications such as virtual reality (VR) are technologies for the consumer. Audio for VR is captured to provide a counterpart to the video or animated image, and can be rendered to combine elements of physical and psychoacoustic modelling, as well as artistic design. Given that distance is an inherent property of spatial audio, that it can augment sound's efficacy in cueing user attention (a problem which practitioners are seeking to solve), and that conventional film sound practices have intentionally exploited its use, the absence of research on its implementation and effects in immersive environments is notable. This paper sets out the case for its importance, from a perspective of research and practice. It focuses on cinematic VR, whose challenges for spatialized audio are clear, and at times stretches beyond the restrictions specific to distance in audio for VR, into more general audio constraints

Crossref

Queen Mary Research Online

AI to enhance interactive simulation-based training in resuscitation medicine

Author: AG G EM R, Champion H
Bellomo R Goldsmith D, Uchino S
Brisk R
Confidential N
CW C Soar J, Aibiki M
FF A Santana N
Hogan H Healey F, Neale G, Thomson R, Vincent C, Black N
JP N Soar J, Smith G
Kaneva B Torralba A, Freeman W
Kolb D
Li W Fritz M
MG H Little L
Mnih V Badia A, Mirza P, Graves A, Lillicrap T, Harley P, Silver D, Kavukcuoglu K
Mnih V Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M
Mnih V Kavukcuoglu K, Silver D, Rusu A, Veness J, Bellemare M, Graves A, Riedmiller M, Fidjeland A, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, Kumaran D, Wierstra D, Legg S, Hassabis D
National Institute for Health and Clinical Excellence
NE S AG G, SA R
Perkins G Kimani P, Bullock I
Perkins G Kimani P, Bullock I
Pishchulin L Jain A, Wojek C, Andriluka M, Thormaehlen T, Schiele B
RM S Niles D, Meaney P
Schneider M Rittle-Johnson B, Star J
Silver D
Sutton R Barto A
Thomson R Leuttel D, Healey F, Scobie S
Wang S Summers R
Young G
Publication venue: 'BCS Learning and Development Limited'
Publication date: 10/05/2018
Field of study

Crossref

Ulster University's Research Portal

Pattern Recognition and Event Reconstruction in Particle Physics Experiments

This report reviews methods of pattern recognition and event reconstruction used in modern high energy physics experiments. After a brief introduction into general concepts of particle detectors and statistical evaluation, different approaches in global and local methods of track pattern recognition are reviewed with their typical strengths and shortcomings. The emphasis is then moved to methods which estimate the particle properties from the signals which pattern recognition has associated. Finally, the global reconstruction of the event is briefly addressed.Comment: 101 pages, 58 figure

arXiv.org e-Print Archive

Crossref

CERN Document Server

OpenGrey Repository

Learning model-free robot control by a Monte Carlo EM algorithm

Author: A. P. Dempster
D. P. Bertsekas
G. Wei
Georgios Kontes
J. Peters
J. Peters
M. Riedmiller
Marc Toussaint
Nikos Vlassis
P. Dayan
R. M. Neal
R. S. Sutton
Savas Piperidis
Y. Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Neural networks for modeling gene-gene interactions in association studies

Author: A Jakulin
AA Motsinger
AA Motsinger
AA Motsinger-Reif
AA Motsinger-Reif
AA Motsinger-Reif
AA Motsinger-Reif
AD Flouris
AG Heidema
B North
BA McKinney
CM Bishop
Frauke Günther
G Schwarz
H Akaike
HJ Cordell
I Ruczinski
J Liu
J Millstein
J Ott
JH Moore
JH Moore
JR Koza
K Bammann
K Broberg
Karin Bammann
L Breiman
L Briollais
LW Hahn
M Riedmiller
MB Lanktree
MD Ritchie
MD Ritchie
ME Sáez
MJ Wade
MR Nelson
N Risch
Nina Wawro
NR Cook
P McCullagh
PR Lucek
R Development Core Team
R Foraita
R Hecht-Nielsen
R Tibshirani
RL Milne
S Fritsch
SH Chen
SK Musani
W Branicki
W Li
WS Bush
X Tang
Y Amit
Y Qi
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Our aim is to investigate the ability of neural networks to model different two-locus disease models. We conduct a simulation study to compare neural networks with two standard methods, namely logistic regression models and multifactor dimensionality reduction. One hundred data sets are generated for each of six two-locus disease models, which are considered in a low and in a high risk scenario. Two models represent independence, one is a multiplicative model, and three models are epistatic. For each data set, six neural networks (with up to five hidden neurons) and five logistic regression models (the null model, three main effect models, and the full model) with two different codings for the genotype information are fitted. Additionally, the multifactor dimensionality reduction approach is applied. Results The results show that neural networks are more successful in modeling the structure of the underlying disease model than logistic regression models in most of the investigated situations. In our simulation study, neither logistic regression nor multifactor dimensionality reduction are able to correctly identify biological interaction. Conclusions Neural networks are a promising tool to handle complex data situations. However, further research is necessary concerning the interpretation of their parameters.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Predicting Bevirimat resistance of HIV-1 from genotype

Author: A Kernytsky
A Löytynoja
AD Sevin
C Cole
C Notredame
CS Adamson
CS Adamson
D Heider
D Nguyen
D Wang
Daniel Hoffmann
DK Worthylake
Dominik Heider
E Frank
ER Wright
F Li
F Li
F Wilcoxon
GC Cawley
HB Shen
IH Witten
J Demsar
J Kingston
J Kyte
J Thompson
J Verheyen
J Zhou
Jens Verheyen
K Salzwedel
K Salzwedel
KC Chou
KV Baelen
L Breiman
L Nanni
M Borschbach
M Miller
M Riedmiller
MA Accola
N Beerenwinkel
N Beerenwinkel
N Margot
N Morellet
R Development Core Team
R King
R Lathrop
RC Edgar
RE Banfield
RJ Murray
S Draghici
S McCallister
S Ong
S Tzafestas
SR Eddy
T Fawcett
T Sing
W Resch
WW Cohen
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Maturation inhibitors are a new class of antiretroviral drugs. Bevirimat (BVM) was the first substance in this class of inhibitors entering clinical trials. While the inhibitory function of BVM is well established, the molecular mechanisms of action and resistance are not well understood. It is known that mutations in the regions CS p24/p2 and p2 can cause phenotypic resistance to BVM. We have investigated a set of p24/p2 sequences of HIV-1 of known phenotypic resistance to BVM to test whether BVM resistance can be predicted from sequence, and to identify possible molecular mechanisms of BVM resistance in HIV-1. Results We used artificial neural networks and random forests with different descriptors for the prediction of BVM resistance. Random forests with hydrophobicity as descriptor performed best and classified the sequences with an area under the Receiver Operating Characteristics (ROC) curve of 0.93 ± 0.001. For the collected data we find that p2 sequence positions 369 to 376 have the highest impact on resistance, with positions 370 and 372 being particularly important. These findings are in partial agreement with other recent studies. Apart from the complex machine learning models we derived a number of simple rules that predict BVM resistance from sequence with surprising accuracy. According to computational predictions based on the data set used, cleavage sites are usually not shifted by resistance mutations. However, we found that resistance mutations could shorten and weaken the <it>α</it>-helix in p2, which hints at a possible resistance mechanism. Conclusions We found that BVM resistance of HIV-1 can be predicted well from the sequence of the p2 peptide, which may prove useful for personalized therapy if maturation inhibitors reach clinical practice. Results of secondary structure analysis are compatible with a possible route to BVM resistance in which mutations weaken a six-helix bundle discovered in recent experiments, and thus ease Gag cleavage by the retroviral protease.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Evolutionary Multi-objective Optimization for Simultaneous Generation of Signal-Type and Symbol-Type Representations

Author: A. Martin
A. Roselito Teixeira de
C. Coello Coello
C.M. Bishop
D. Badre
D.A. Miller
H.A. Abbass
H.A. Abbass
I. Taha
J. Gabrieli
J.A. Fodor
J.R. Quinlan
K. Deb
K.P. Burnham
M. Hüsken
M. Ishikawa
M. Riedmiller
R. Andrews
R. Setiono
R. Setiono
R.D. Reed
W. Duch
W. Duch
Y. Jin
Y. Jin
Y. Jin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

It has been a controversial issue in the research of cognitive science and artificial intelligence whether signal-type representations (typically connectionist networks) or symbol-type representations (e.g., semantic networks, production systems) should be used. Meanwhile, it has also been recognized that both types of information representations might exist in the human brain. In addition, symbol-type representations are often very helpful in gaining insights into unknown systems. For these reasons, comprehensible symbolic rules need to be extracted from trained neural networks. In this paper, an evolutionary multi-objective algorithm is employed to generate multiple models that facilitate the generation of signal-type and symbol-type representations simultaneously. It is argued that one main difference between signal-type and symbol-type representations lies in the fact that the signal-type representations are models of a higher complexity (fine representation), whereas symbol-type representations are models of a lower complexity (coarse representation). Thus, by generating models with a spectrum of model complexity, we are able to obtain a population of models of both signal-type and symbol-type quality, although certain post-processing is needed to get a fully symbol-type representation. An illustrative example is given on generating neural networks for the breast cancer diagnosis benchmark problem. © Springer-Verlag Berlin Heidelberg 2005

Crossref

Surrey Research Insight