Search CORE

30 research outputs found

A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games

Author: Srikant R.
Winnicki Anna
Publication date: 28/10/2023

Optimal policies in standard MDPs can be obtained using either value iteration or policy iteration. However, in the case of zero-sum Markov games, there is no efficient policy iteration algorithm; e.g., it has been shown that one has to solve Omega(1/(1-alpha)) MDPs, where alpha is the discount factor, to implement the only known convergent version of policy iteration. Another algorithm, called naive policy iteration, is easy to implement but is only provably convergent under very restrictive assumptions. Prior attempts to fix the naive policy iteration algorithm have several limitations. Here, we show that a simple variant of naive policy iteration for games converges exponentially fast. The only addition we propose to naive policy iteration is the use of lookahead policies, which are used in practical algorithms anyway. We further show that lookahead can be implemented efficiently in the function approximation setting of linear Markov games, which are the counterpart of the much-studied linear MDPs. We illustrate the application of our algorithm by providing bounds for policy-based RL (reinforcement learning) algorithms. We extend the results to the function approximation setting. Comment: 41 pages

arXiv.org e-Print Archive

The Role of Lookahead and Approximate Policy Evaluation in Reinforcement Learning with Linear Value Function Approximation

Author: Livesay Michael
Lubars Joseph
Srikant R.
Winnicki Anna
Publication date: 12/07/2022

Function approximation is widely used in reinforcement learning to handle the computational difficulties associated with very large state spaces. However, function approximation introduces errors which may lead to instabilities when using approximate dynamic programming techniques to obtain the optimal policy. Therefore, techniques such as lookahead for policy improvement and m-step rollout for policy evaluation are used in practice to improve the performance of approximate dynamic programming with function approximation. We quantitatively characterize, for the first time, the impact of lookahead and m-step rollout on the performance of approximate dynamic programming (DP) with function approximation: (i) without a sufficient combination of lookahead and m-step rollout, approximate DP may not converge, (ii) both lookahead and m-step rollout improve the convergence rate of approximate DP, and (iii) lookahead helps mitigate the effect of function approximation and the discount factor on the asymptotic performance of the algorithm. Our results are presented for two approximate DP methods: one which uses least-squares regression to perform function approximation and another which performs several steps of gradient descent on the least-squares objective in each iteration. Comment: 36 pages, 4 figures

arXiv.org e-Print Archive

How to improve adherence to therapeutic recommendations and the quality of physician-patient cooperation

Author: Basiński Krzysztof
Chrostowska Marzena
Narkiewicz Krzysztof
Szyndler Anna
Winnicki Michał
Publication venue: 'Salvia Medical Sciences Ltd'
Publication date: 05/09/2016

Good cooperation between patient and physician is a very important part of treatment, especially in chronic diseases. Studies conducted by the World Health Organization show that, on average, every second patient does not follow therapeutic recommendations properly. In Poland this percentage is even higher, and for some diseases it exceeds 70%. Importantly, these results are based primarily on patients' self-reports obtained with questionnaire-based instruments, so in practice the percentage of patients who do not cooperate properly may be even larger. The reasons for this phenomenon lie with both patients and health care professionals. Patients are most strongly affected by psychological and socio-economic factors. The first group includes primarily cognitive functioning, life satisfaction, personality, sense of control and mental state. The second group is associated mainly with material status, but, as cyclic surveys of the Polish population show, the impact of income on treatment adherence is becoming smaller from year to year. Causes related to health care are primarily poor communication between doctor and patient and a lack of patient involvement in setting the therapy plan. Previous studies indicate how important the quality of the physician-patient relationship is for adherence to therapeutic recommendations. Healthcare professionals should get to know the patient and fit the treatment process to the patient's needs and possibilities. Better cooperation can be achieved by conducting a motivational dialogue and engaging the patient in setting the therapy plan.

Via Medica Journals

Enhanced methods for unbiased deep sequencing of Lassa and Ebola RNA viruses from clinical and biological samples

Author: Andersen Kristian G
Berlin Aaron
Birren Bruce W
Busby Michele
Ehiane Philomena E
England Eleina
Folarin Onikepe
Garry Robert F
Gire Stephen K
Gladden Adrianne D
Gnirke Andreas
Goba Augustine
Grant Donald S
Happi Christian
Hensley Lisa
Honko Anna
Kahn S Humarr
Levin Joshua Z
Malboeuf Christine M
Matranga Christian B
Mikkelsen Tarjei S
Moses Lina M
Odia Ikponmwonsa
Sabeti Pardis C
Stremlau Matthew
Tewhey Ryan
Winnicki Sarah
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/01/2015

We have developed a robust RNA sequencing method for generating complete de novo assemblies with intra-host variant calls of Lassa and Ebola virus genomes in clinical and biological samples. Our method uses targeted RNase H-based digestion to remove contaminating poly(rA) carrier and ribosomal RNA. This depletion step improves both the quality of data and quantity of informative reads in unbiased total RNA sequencing libraries. We have also developed a hybrid-selection protocol to further enrich the viral content of sequencing libraries. These protocols have enabled rapid deep sequencing of both Lassa and Ebola virus and are broadly applicable to other viral genomics studies. Electronic supplementary material: The online version of this article (doi:10.1186/s13059-014-0519-7) contains supplementary material, which is available to authorized users.

Harvard University - DASH

Serum EPO and VEGF levels in patients with sleep-disordered breathing and acute myocardial infarction

Author: Adam Galazka
Andrzej Kukwa
Anna Budaj-Fidecka
Anna M. Czarnecka
Antoni Krzeski
CH Lee
CJ Parsa
EJ Yeo
F Andreotti
FS Angeli
G Niccoli
Grzegorz Opolski
I Bin-Jaliah
J Szenajch
JM Goldman
Krzysztof J. Filipiak
L Calvillo
L Lavie
L Xu
M Ferrario
M Winnicki
Marcin Grabowski
Monika Kuzminska
NA Shah
R Dittadi
R Schulz
RE Friedrich
Renata Glowczynska
S Imagawa
S Koch
S Namiuchi
S Steiner
SE Schiza
T Mooe
VK Somers
W Jelkmann
Wojciech Kukwa
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Genome-wide association study identifies human genetic variants associated with fatal outcome from Lassa fever

Author: Akpede George O
Andersen Kristian G
Asogun Danny A
Barnes Kayla
Branco Luis M
Broodie Nisha
Chak Bridget
Chapin Sarah R
Dic-Ijiewere Mercy
Eromon Philomena E
Folarin Onikepe A
Garry Robert F
Gire Stephen K
Gladden-Young Adrianne
Goba Augustine
Grant Donald S
Günther Stephan
Happi Anise N
Happi Christian T
Hensley Lisa
Heuklom Shannon
Honko Anna N
Iraoyah Kelly
Iruolagbe Christopher O
Jalloh Simbirie
Jiang Pan-Pan
Kales Susan
Kanneh Lansana
Karlsson Elinor K
Kennedy Sharon G
Khan S Humarr
Kotliar Dylan
Kunz Stefan
Lander Eric S
McCormick Joseph B
Mehta Samar
Momoh Mambu
Moses Lina M
Nair Parvathy
Odia Ikponmwosa
Okogbenin Sylvanus A
Okokhere Peter O
Okonkwo Alexander K
Oldstone Michael B A
Ollila Hanna M
Omoniwa Omowunmi
Osazuwa Omoregie O
Pauthner Matthias
Phelan Eric
Raju Siddharth
Reilly Steven K
Robles-Sikisaka Refugio
Rubins Kathleen
Sabeti Pardis C
Sandi John Demby
Schaffner Stephen F
Schieffelin John S
Siddle Katherine J
Stremlau Matt
Tabrizi Shervin
Tariyal Ridhi
Tewhey Ryan
Vitti Joseph J
Winnicki Sarah
Yozwiak Nathan
Publication venue: Nature Research
Publication date: 07/02/2024

Infection with Lassa virus (LASV) can cause Lassa fever, a haemorrhagic illness with an estimated fatality rate of 29.7%, but causes no or mild symptoms in many individuals. Here, to investigate whether human genetic variation underlies the heterogeneity of LASV infection, we carried out genome-wide association studies (GWAS) as well as seroprevalence surveys, human leukocyte antigen typing and high-throughput variant functional characterization assays. We analysed Lassa fever susceptibility and fatal outcomes in 533 cases of Lassa fever and 1,986 population controls recruited over a 7 year period in Nigeria and Sierra Leone. We detected genome-wide significant variant associations with Lassa fever fatal outcomes near GRM7 and LIF in the Nigerian cohort. We also show that a haplotype bearing signatures of positive selection and overlapping LARGE1, a required LASV entry factor, is associated with decreased risk of Lassa fever in the Nigerian cohort but not in the Sierra Leone cohort. Overall, we identified variants and genes that may impact the risk of severe Lassa fever, demonstrating how GWAS can provide insight into viral pathogenesis

LSTM Online Archive

The Jackson Laboratory: The Mouseion at the JAXlibrary

Helsingin yliopiston digitaalinen arkisto

Functional Anatomy, Histology and Biomechanics of the human Achilles Tendon - a comprehensive review

Author: Ochała-Kłos Anna
Pękala Przemysław
Rutowicz Bartosz
Tomaszewski Krzysztof
Winnicki Kamil
Publication venue: 'Elsevier BV'
Publication date: 01/01/2020

Jagiellonian University Repository