1,233 research outputs found
Path integral policy improvement with differential dynamic programming
Path Integral Policy Improvement with Covariance Matrix Adaptation (PI2-CMA) is a step-based model free reinforcement learning approach that combines statistical estimation techniques with fundamental results from Stochastic Optimal Control. Basically, a policy distribution is improved iteratively using reward weighted averaging of the corresponding rollouts. It was assumed that PI2-CMA somehow exploited gradient information that was contained by the reward weighted statistics. To our knowledge we are the first to expose the principle of this gradient extraction rigorously. Our findings reveal that PI2-CMA essentially obtains gradient information similar to the forward and backward passes in the Differential Dynamic Programming (DDP) method. It is then straightforward to extend the analogy with DDP by introducing a feedback term in the policy update. This suggests a novel algorithm which we coin Path Integral Policy Improvement with Differential Dynamic Programming (PI2-DDP). The resulting algorithm is similar to the previously proposed Sampled Differential Dynamic Programming (SaDDP) but we derive the method independently as a generalization of the framework of PI2-CMA. Our derivations suggest to implement some small variations to SaDDP so to increase performance. We validated our claims on a robot trajectory learning task
Characterizing local anisotropy of coercive force in motor laminations with the moving magnet hysteresis comparator
Effect of transmitter position on the torque generation of a magnetic resonance based motoring system
Strongly coupled magnetic resonance is most often used to transfer electrical power from a transmitter to a resonant receiver coil to supply devices over an air gap. In this work, the induced current in two receiver coils (stator and rotor) is used to generate torque on the rotor coil. The effect of the transmitter position relative to the stator and rotor receiver coils on the torque generation is studied in detail, both in simulation and experimentally. Results show a 36% to 37% gain in peak torque when properly varying the stator orientation for a given transmitter distance
Modeling the number of hidden events subject to observation delay
This paper considers the problem of predicting the number of events that have
occurred in the past, but which are not yet observed due to a delay. Such
delayed events are relevant in predicting the future cost of warranties,
pricing maintenance contracts, determining the number of unreported claims in
insurance and in modeling the outbreak of diseases. Disregarding these
unobserved events results in a systematic underestimation of the event
occurrence process. Our approach puts emphasis on modeling the time between the
occurrence and observation of the event, the so-called observation delay. We
propose a granular model for the heterogeneity in this observation delay based
on the occurrence day of the event and on calendar day effects in the
observation process, such as weekday and holiday effects. We illustrate this
approach on a European general liability insurance data set where the
occurrence of an accident is reported to the insurer with delay
Computational analysis of the effect of superparamagnetic nanoparticle properties on bioheat transfer in magnetic nanoparticle hyperthermia
P
Coupled electrical-thermal model for monopolar and bipolar radiofrequency liver tumor ablation
Transfer learning in ECG classification from human to horse using a novel parallel neural network architecture
Automatic or semi-automatic analysis of the equine electrocardiogram (eECG) is currently not possible because human or small animal ECG analysis software is unreliable due to a different ECG morphology in horses resulting from a different cardiac innervation. Both filtering, beat detection to classification for eECGs are currently poorly or not described in the literature. There are also no public databases available for eECGs as is the case for human ECGs. In this paper we propose the use of wavelet transforms for both filtering and QRS detection in eECGs. In addition, we propose a novel robust deep neural network using a parallel convolutional neural network architecture for ECG beat classification. The network was trained and tested using both the MIT-BIH arrhythmia and an own made eECG dataset with 26.440 beats on 4 classes: normal, premature ventricular contraction, premature atrial contraction and noise. The network was optimized using a genetic algorithm and an accuracy of 97.7% and 92.6% was achieved for the MIT-BIH and eECG database respectively. Afterwards, transfer learning from the MIT-BIH dataset to the eECG database was applied after which the average accuracy, recall, positive predictive value and F1 score of the network increased with an accuracy of 97.1%
- …