Search CORE

24 research outputs found

Βαθιά υπό καθοδήγηση ενισχυτική μάθηση

Author: Chrysomallis Iason http://users.isc.tuc.gr/~ichrysomallis
Χρυσομαλλης Ιασων http://users.isc.tuc.gr/~ichrysomallis
Publication venue: Technical University of Crete::School of Electrical and Computer Engineering
Publication date
Field of study

Summarization: Imitation is a popular technique of behavioral learning widely practiced in nature. The most famous applications involve animal babies imitating their parents, with imitation providing the stepping stone to walk their first steps in life survival. Additionally, imitation examples can be found in cross species instances, with most known samples the voice imitation of parrots or crow behavioral imitation. The imitation learning paradigm has naturally been taken up in machine learning applications, implemented in supervised learning and in reinforcement learning, mostly with the use of explicit imitation, where the mentor agent attempts to explicitly teach learners. Implicit imitation, on the other hand, assumes that learning agents observe the state transitions of an agent they use as a mentor, and try to recreate them based on their own abilities and knowledge of their environment. Though it has also been employed with some success in the past, implicit imitation has only recently been utilized in conjunction with deep reinforcement learning, the current leading reinforcement learning paradigm. In this thesis, we enhance the operation of implicit imitation by adding four state-of-the-art deep reinforcement learning algorithms, treated as "imitation optimization modules". These include Double Deep Q-network [Hasselt, Guez, and Silver, 2016], Prioritized Experience Replay [Schaul et al., 2016], Dueling Network Architecture [Wang et al., 2016] and Parameter Space Noise for Exploration [Plappert et al., 2018]. We modify these appropriately to better fit the implicit imitation learning paradigm. By enabling and disabling those methods we create diverse combinations of them; systematically test and compare the viability of each one of these combinations; and end up with a clear "winner": the combination of Double Deep Q-network, Prioritized Experience Replay and Dueling Network Architecture.Περίληψη: Η μίμηση αποτελεί μία τεχνική συμπεριφορικής εκμάθησης, ευρέως χρησιμοποιούμενη στην φύση. Στο ζωϊκό βασίλειο, για παράδειγμα, τα μωρά μιμούνται τους γονείς τους, και η μίμηση τα εφοδιάζει με τις κατάλληλες γνώσεις για να περπατήσουν στα πρώτα τους βήματα επιβίωσης. Παραδείγματα μίμησης παρατηρούνται και μεταξύ διαφορετικών ειδών, όπως στην φωνητική μίμηση των παπαγάλων ή στην συμπεριφορική μίμηση των κορακιών. Η μίμηση, λοιπόν, δεν θα μπορούσε να μην συμπεριληφθεί σε εφαρμογές της μηχανικής μάθησης, όπου αλγόριθμοι των πεδίων επιτηρούμενης μάθησης και ενισχυτικής μάθησης εκμεταλλεύονται την χρήση τεχνικών απευθείας, κυρίως, μίμησης, όπου ο πράκτορας που λειτουργεί ως "μέντορας'' προσπαθεί να "διδάξει'' απευθείας άλλους. Η μηχανική εκμάθηση μέσω έμμεσης μίμησης, από την άλλη, θεωρεί ότι οι πράκτορες-μιμητές απλά παρατηρούν τις αλλαγές καταστάσεων που προκύπτουν από την συμπεριφορά ενός πράκτορα που επιλέγουν ως μέντορα, και προσπαθούν να τις αναπαράγουν με βάση τις δικές τους δυνατότητες και γνώση του περιβάλλοντός τους. Αν και η έμμεση μίμηση έχει χρησιμοποιηθεί με ικανοποιητικά αποτελέσματα στο απώτερο παρελθόν, μόλις πρόσφατα έχει αξιοποιηθεί σε συνδυασμό με βαθιά ενισχυτική μάθηση, η οποία αποτελεί μια τρέχουσα τεχνολογία αιχμής στη μηχανική μάθηση. Στην παρούσα διπλωματική εργασία, βελτιώνουμε περαιτέρω την διαδικασία της έμμεσης μηχανικής εκμάθησης ενσωματώνοντας τέσσερις σύγχρονους αλγόριθμους βαθειάς ενισχυτικής μάθησης, τους οποίους θεωρούμε και χρησιμοποιούμε ως δομικά στοιχεία βελτιστοποίησης της προσπάθειας μίμησης. Οι εν λόγω αλγόριθμοι είναι οι Double Deep Q-network [Hasselt, Guez, and Silver, 2016], Prioritized Experience Replay [Schaul et al., 2016], Dueling Network Architecture [Wang et al., 2016] και Parameter Space Noise for Exploration [Plappert et al., 2018]. Προσαρμόσαμε τη λειτουργία των αλγορίθμων ώστε να συνάδει με το μοντέλο της έμμεσης μίμησης. Ενεργοποιώντας και απενεργοποιώντας τις παραπάνω μεθόδους, δημιουργούμε ποικίλους συνδυασμούς αυτών, και δοκιμάζουμε μεθοδικά και συγκρίνουμε την βιωσιμότητα του κάθε ενός από αυτούς τους συνδυασμούς. Οι πειραματισμοί μας κατέληξαν στην ανάδειξη ενός ξεκάθαρου "νικητή”: συγκεκριμένα, του συνδυασμού των Double Deep Q-network, Prioritized Experience Replay και Dueling Network Architecture

Institutional Repository of the Technical University of Crete

A Design Procedure for Aperture-Coupled Microstrip Antennas Based on Approximate Equivalent Networks

Author: Breinbjerg Olav
Chrysomallis M.
Kyriacou G. A.
Mavrides O.
Sahalos J. N.
Publication venue
Publication date: 01/01/2000
Field of study

Online Research Database In Technology

High dose oral prednisone vs. prednisone plus azathioprine for the treatment of oral pemphigus: a retrospective, bi-centre, comparative study

Author: Agarwal
Benoit Corven
Bystryn
Chaidemenos
Chrysomallis
Ioannides
Leshem
Lever
Lever
Mourellou
Murrell
Scully
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

On the Reduction of Transmission Complexity in MIMO-WCDMA Frequency-Selective Fading Orientations via Eigenvalue Analysis

Author: C. T. Dervos
D. I. Kaklamani
G. A. Kyriakou
I. S. Venieris
K. Siakavara
M. T. Chrysomallis
P. K. Gkonis
Publication venue: 'MDPI AG'
Publication date: 01/10/2018
Field of study

In this paper, a novel transmission strategy for Mutliple Input Multiple Output Wideband Code Division Multiple Access (MIMO-WCDMA) orientations operating in frequency-selective fading environments is investigated, in terms of overall algorithmic complexity reduction. To this end, Principal Component Analysis (PCA) is employed on the received data matrix, in order to define the significant terms that are taken into account during transmission matrix formulation. According to the presented results, feedback information of only the primary eigenvector of the corresponding covariance matrix of the received data matrix is required, in order to maintain the mean Bit Error Rate (BER) at acceptable levels. In particular, a complexity reduction of up to 10% can be achieved, when comparing BER values derived by the selection of all components of the received covariance matrix during transmission matrix formulation, and the corresponding BER when selecting half of the components. This reduction is maintained to 10%, when considering a realistic four-element antenna design; however, in this case mean BER inaccuracy is further reduced to 1%

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Oropharyngeal Pemphigus Vulgaris and Clinical Remission

Author: Amagai
Amagai
Bystryn
Chrysomallis
David
Elvira Ruoppo
Giulio Fortuna
Herbst
Kavusi
Michele D. Mignogna
Mignogna
Mimouni
Murrell
Nair
Qasmi
Robinson
Scully
Shirakata
Stefania Leuci
Yeh
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A comparative effectiveness research of azathioprine and cyclophosphamide on the clinical and serological response in pemphigus vulgaris

Author: Abasq
Akhtar
Amagai
Anand
Beena Uppal
Beissert
Belloni-Fortina
Chams-Davatchi
Cheng
Chrysomallis
Cummins
Dutz
Frew
Grando
Harman
Harman
Kabir Sardana
Kasperkiewicz
Kent
Mignogna
Murrell
Olszewska
Pooja Agarwal
Rose
Sardana
Schiavo
Shivani Bansal
VijayK Garg
Zivanovic
Publication venue: 'Medknow'
Publication date: 01/01/2016
Field of study

Crossref

Eine Methode für den Entwurf von Hilfe der Chebyschevschen Polynome

Author: B. D. Popovic
B. J. Strait
B. J. Strait
D. H. Sinnott
D. H. Sinnott
J. L. Fernandez Jambrina
J. N. Sahalos
J. Sahalos
K. Hirasawa
K. Siakavara
M. Chrysomallis
R. H. Duhamel
T. Milligan
W. L. Stutzman
W. L. Weeks
Y. W. Kong
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Pressure effects on protein secondary structure and hydrogen deuterium exchange in chymotrypsinogen: a Fourier transform infrared spectroscopic study

Author: Anderle
Barksdale
Brandts
Byler
Carter
Chrysomallis
Cooper
Cortigo
Englander
Englander
Gekko
Gill
Gregory
Hawley
Hawley
Hawley
Heremans
Heremans
Heremans
Hvidt
Kawai
Kossiakoff
Krimm
Levitt
Li
Nystrom
Parker
Peticolas
Rothschild
Susi
Susi
Weber
Williams
Willumsen
Wong
Wong
Wong
Wong
Wong
Wong
Wong
Woodward
Woodward
Zipp
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

The Management of Pemphigus Vulgaris in a Burn Intensive Care Unit

Author: Ahmed
Ahmed
Ahmed
Ahmed
Amagi
Arnold
Auerbach
Beissert
Beissert
Bystryn
Bystryn
Bystryn
Bystryn
Bystryn
Chad Hivnor
Chams-Davatchi
Chrysomallis
Cianchini
Cummins
Daniel
Eming
Enk
Feldman
Grundmann-Kollmann
Guillaume
Hashimoto
Ioannides
Ioannides
Iraji
Iraji
Joly
Kasperkiewicz
Kasperkiewicz
Kevin K. Chung
Lever
Lolis
Martin
Mary E. Miller
Mimouni
Morrison
Nathanial Miletta
Pisanti
Powell
ReMine
Roujeau
Ruocco
Ruocco
Segura
Strowd
Tabrizi
Tan-Lim
Thomas Lam
Tuohy
Turner
Zakka
Publication venue: 'Ovid Technologies (Wolters Kluwer Health)'
Publication date: 01/01/2014
Field of study

Crossref

A ROS-based energy management system for a prototype fuel cell hybrid vehicle

Author: Chrysomallis Iason(http://users.isc.tuc.gr/~ichrysomallis)
Doitsidis Eleftherios(http://users.isc.tuc.gr/~edoitsidis)
Georgakopoulos Stavros(http://users.isc.tuc.gr/~sgeorgakopoulos)
Gkionis Nikolaos(http://users.isc.tuc.gr/~ngkionis)
Piperidis Savvas(http://users.isc.tuc.gr/~spiperidis)
Tsourveloudis Nikolaos(http://users.isc.tuc.gr/~ntsourveloudis)
Γεωργακοπουλος Σταυρος(http://users.isc.tuc.gr/~sgeorgakopoulos)
Γκιωνης Νικολαος(http://users.isc.tuc.gr/~ngkionis)
Δοϊτσιδης Ελευθεριος(http://users.isc.tuc.gr/~edoitsidis)
Πιπεριδης Σαββας(http://users.isc.tuc.gr/~spiperidis)
Τσουρβελουδης Νικολαος(http://users.isc.tuc.gr/~ntsourveloudis)
Χρυσομαλλης Ιασων(http://users.isc.tuc.gr/~ichrysomallis)
Publication venue: 'MDPI AG'
Publication date
Field of study

Summarization: The automotive industry has been rapidly transforming and moving further from internal combustion engines, towards hybrid or electric vehicles. A key component for the successful adoption of the aforementioned approach is their Energy Management Systems (EMSs). In the proposed work, we describe in detail a custom EMS, with unique characteristics, which was developed and installed in a hydrogen-powered prototype vehicle. The development of the EMS was based on off-the-shelf components and the adoption of a Robot Operating System (ROS), a meta-operating system developed for robotic-oriented applications. Our approach offers soft real-time control and the ability to organize the controller of the EMS as a straightforward and comprehensive message system that provides the necessary inter-process communication at the core of the EMS control procedure. We describe in detail the software-based implementation and validate our approach through experimental results obtained while the prototype was racing in a low-energy consumption competitionPresented on: Energie

Institutional Repository of the Technical University of Crete