Search CORE

235,450 research outputs found

Feature-based methods for large scale dynamic programming

Author
Publication venue: Massachusetts Institute of Technology, Laboratory for Information and Decision Systems]
Publication date: 01/01/1994
Field of study

Caption title.Includes bibliographical references (p. 40-42).Supported by the NSF. ECS 9216531 Supported by the EPRI. 8030-10John N. Tsitsiklis and Benjamin Van Roy

DSpace@MIT

Feature-based methods for large scale dynamic programming

Author: A. G. Barto
B. R. Bakshi
Benjamin van Roy
C. J. C. H. Watkina
D. P. Bertsekas
D. P. Bertsekas
D. P. Bertsekas
D. P. Bortsekas
D. Reetz
G. Tesauro
J. N. Tsitsiklis
John N. Tsitsiklis
P. D. Dayan
P. J. Schweitzer
R. E. Bellman
R. E. Korf
R. P. Lippman
R. S. Sutton
T. Poggio
W. Whitt
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Linear Programming for Large-Scale Markov Decision Problems

Author: Abbasi-Yadkori Yasin
Bartlett Peter L.
Malek Alan
Publication venue
Publication date: 01/01/2014
Field of study

We consider the problem of controlling a Markov decision process (MDP) with a large state space, so as to minimize average cost. Since it is intractable to compete with the optimal policy for large scale problems, we pursue the more modest goal of competing with a low-dimensional family of policies. We use the dual linear programming formulation of the MDP average cost problem, in which the variable is a stationary distribution over state-action pairs, and we consider a neighborhood of a low-dimensional subset of the set of stationary distributions (defined in terms of state-action features) as the comparison class. We propose two techniques, one based on stochastic convex optimization, and one based on constraint sampling. In both cases, we give bounds that show that the performance of our algorithms approaches the best achievable by any policy in the comparison class. Most importantly, these results depend on the size of the comparison class, but not on the size of the state space. Preliminary experiments show the effectiveness of the proposed algorithms in a queuing application.Comment: 27 pages, 3 figure

arXiv.org e-Print Archive

CiteSeerX

Queensland University of Technology ePrints Archive

Applying inspection to object-oriented software

Author: Brooks A.
Macdonald F.
Miller J.
Roper M.
Wood M.
Publication venue
Publication date: 01/01/1995
Field of study

The benefits of the object-oriented paradigmare widely cited. At the same time, inspection is deemed to be the most cost-effective means of detecting defects in software products. Why then, is there no published experience, let alone quantitative data, on the application of inspection to object-oriented systems? We describe the facilities of the object-oriented paradigm and the issues that these raise when inspecting object-oriented code. Several problems are caused by the disparity between the static code structure and its dynamic runtime behaviour. The large number of small methods in object-oriented systems can also cause problems. We then go on to describe three areas which may help mitigate problems found. Firstly, the use of various programming methods may assist in making object-oriented code easier to inspect. Secondly, improved program documentation can help the inspector understand the code which is under inspection. Finally, tool support can help the inspector to analyse the dynamic behaviour of the code. We conclude that while both the object-oriented paradigm and inspection provide excellent benefits on their own, combining the two may be a difficult exercise, requiring extensive support if it is to be successful

CiteSeerX

University of Strathclyde Institutional Repository

Convolutional Neural Networks over Tree Structures for Programming Language Processing

Author: Jin Zhi
Li Ge
Mou Lili
Wang Tao
Zhang Lu
Publication venue
Publication date: 08/12/2015
Field of study

Programming language processing (similar to natural language processing) is a hot research topic in the field of software engineering; it has also aroused growing interest in the artificial intelligence community. However, different from a natural language sentence, a program contains rich, explicit, and complicated structural information. Hence, traditional NLP models may be inappropriate for programs. In this paper, we propose a novel tree-based convolutional neural network (TBCNN) for programming language processing, in which a convolution kernel is designed over programs' abstract syntax trees to capture structural information. TBCNN is a generic architecture for programming language processing; our experiments show its effectiveness in two different program analysis tasks: classifying programs according to functionality, and detecting code snippets of certain patterns. TBCNN outperforms baseline methods, including several neural models for NLP.Comment: Accepted at AAAI-1

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications