664 research outputs found
Contributions to statistical machine learning algorithm
This thesis's research focus is on computational statistics along with DEAR (abbreviation of differential equation associated regression) model direction, and that in mind, the journal papers are written as contributions to statistical machine learning algorithm literature
Deep Reinforcement Learning and its application to games
The following project aims is to review the main concepts of Reinforcement Learning and
combine them with the tools of Deep Learning, studying in depth the application of these
methodologies, the Deep Reinforcement Learning algorithms, that are having such an
impact today being applied to numerous fields such as autonomous driving, robot control, gaming
and many more. In order to do this, first, in chapter 1, we will give a general overview of Deep
Reinforcement Learning as a introduction, as well as which is motivation to study this topic.
Then, in chapter 2, since it will be fundamental to achieve our goal, we give a brief review of
Deep Learning. We get into details with chapter 3, where we define Reinforcement Learning
mathematically, formalizing the concepts in order to build the classic solution algorithms in
chapter 4. As an application of these techniques, the implementation of the algorithms for the
game of Blackjack is presented in chapter 5. Finally, in chapter 6, we reach our initial objective by
building the algorithm that hides behind the Deep Q-Networks and we apply it to the Gridworld
games in chapter 7. A conclusions and improvements section for the project culminates the text.El siguiente proyecto tiene como objetivo revisar los principales conceptos del Aprendizaje
con Refuerzo y combinarlo con las herramientas del Aprendizaje Profundo, estudiando con
detalle la aplicación de estas metodologías, Aprendizaje con Refuerzo Profundo, que están
teniendo tanto impacto en la actualidad siendo aplicados a numerosos campos como la conducción
autónoma, el control de robots, juegos y muchos más. Para ello, en primer lugar, en el capítulo 1,
situaremos al Aprendizaje con Refuerzo Profundo a modo de introducción, motivando el estudio
de este campo. Acto seguido, en el capítulo 2, ya que será fundamental para lograr nuestro
objetivo, se realiza una breve revisión del Aprendizaje Profundo. Entraremos en detalles con
el capítulo 3, donde definiremos matemáticamente que se entiende Aprendizaje con Refuerzo,
formalizando los conceptos con el fin de construir los algoritmos de solución clásicos en el capítulo
4. Como aplicación de estas técnicas, en el capítulo 5 se presenta la implementación de los
algoritmos para el juego del Blackjack. Finalmente, en el capítulo 5, alcanzaremos nuestro
objetivo inicial construyendo el algoritmo detrás de las Deep Q-Networks y lo aplicamos a los
juegos Gridworld en capítulo 7. Una sección de conclusiones y mejoras para el proyecto culmina
el texto.Universidad de Sevilla. Grado en Matemáticas y Estadístic
Hybrid Behaviour of Markov Population Models
We investigate the behaviour of population models written in Stochastic
Concurrent Constraint Programming (sCCP), a stochastic extension of Concurrent
Constraint Programming. In particular, we focus on models from which we can
define a semantics of sCCP both in terms of Continuous Time Markov Chains
(CTMC) and in terms of Stochastic Hybrid Systems, in which some populations are
approximated continuously, while others are kept discrete. We will prove the
correctness of the hybrid semantics from the point of view of the limiting
behaviour of a sequence of models for increasing population size. More
specifically, we prove that, under suitable regularity conditions, the sequence
of CTMC constructed from sCCP programs for increasing population size converges
to the hybrid system constructed by means of the hybrid semantics. We
investigate in particular what happens for sCCP models in which some
transitions are guarded by boolean predicates or in the presence of
instantaneous transitions
On the Termination Problem for Probabilistic Higher-Order Recursive Programs
In the last two decades, there has been much progress on model checking of
both probabilistic systems and higher-order programs. In spite of the emergence
of higher-order probabilistic programming languages, not much has been done to
combine those two approaches. In this paper, we initiate a study on the
probabilistic higher-order model checking problem, by giving some first
theoretical and experimental results. As a first step towards our goal, we
introduce PHORS, a probabilistic extension of higher-order recursion schemes
(HORS), as a model of probabilistic higher-order programs. The model of PHORS
may alternatively be viewed as a higher-order extension of recursive Markov
chains. We then investigate the probabilistic termination problem -- or,
equivalently, the probabilistic reachability problem. We prove that almost sure
termination of order-2 PHORS is undecidable. We also provide a fixpoint
characterization of the termination probability of PHORS, and develop a sound
(but possibly incomplete) procedure for approximately computing the termination
probability. We have implemented the procedure for order-2 PHORSs, and
confirmed that the procedure works well through preliminary experiments that
are reported at the end of the article
Topics in Programming Languages, a Philosophical Analysis through the case of Prolog
[EN]Programming languages seldom find proper anchorage in philosophy of logic, language and science. is more, philosophy of language seems to be restricted to natural languages and linguistics, and even philosophy of logic is rarely framed into programming languages topics. The logic programming paradigm and Prolog are, thus, the most adequate paradigm and programming language to work on this subject, combining natural language processing and linguistics, logic programming and constriction methodology on both algorithms and procedures, on an overall philosophizing declarative status. Not only this, but the dimension of the Fifth Generation Computer system related to strong Al wherein Prolog took a major role. and its historical frame in the very crucial dialectic between procedural and declarative paradigms, structuralist and empiricist biases, serves, in exemplar form, to treat straight ahead philosophy of logic, language and science in the contemporaneous age as well.
In recounting Prolog's philosophical, mechanical and algorithmic harbingers, the opportunity is open to various routes. We herein shall exemplify some:
- the mechanical-computational background explored by Pascal, Leibniz, Boole, Jacquard, Babbage, Konrad Zuse, until reaching to the ACE (Alan Turing) and EDVAC (von Neumann), offering the backbone in computer architecture, and the work of Turing, Church, Gödel, Kleene, von Neumann, Shannon, and others on computability, in parallel lines, throughly studied in detail, permit us to interpret ahead the evolving realm of programming languages. The proper line from lambda-calculus, to the Algol-family, the declarative and procedural split with the C language and Prolog, and the ensuing branching and programming languages explosion and further delimitation, are thereupon inspected as to relate them with the proper syntax, semantics and philosophical élan of logic programming and Prolog
- …