53 research outputs found

    Survey of Intrusion Detection Research

    Get PDF
    The literature holds a great deal of research in the intrusion detection area. Much of this describes the design and implementation of specific intrusion detection systems. While the main focus has been the study of different detection algorithms and methods, there are a number of other issues that are of equal importance to make these systems function well in practice. I believe that the reason that the commercial market does not use many of the ideas described is that there are still too many unresolved issues. This survey focuses on presenting the different issues that must be addressed to build fully functional and practically usable intrusion detection systems (IDSs). It points out the state of the art in each area and suggests important open research issues

    Adaptive Reward-Free Exploration

    Get PDF
    Reward-free exploration is a reinforcement learning setting studied by Jin et al. (2020), who address it by running several algorithms with regret guarantees in parallel. In our work, we instead give a more natural adaptive approach for reward-free exploration which directly reduces upper bounds on the maximum MDP estimation error. We show that, interestingly, our reward-free UCRL algorithm can be seen as a variant of an algorithm of Fiechter from 1994, originally proposed for a different objective that we call best-policy identification. We prove that RF-UCRL needs of order (SAH4/ε2)(log(1/δ)+S)({SAH^4}/{\varepsilon^2})(\log(1/\delta) + S) episodes to output, with probability 1δ1-\delta, an ε\varepsilon-approximation of the optimal policy for any reward function. This bound improves over existing sample-complexity bounds in both the small ε\varepsilon and the small δ\delta regimes. We further investigate the relative complexities of reward-free exploration and best-policy identification

    Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

    Get PDF
    We propose MDP-GapE, a new trajectory-based Monte-Carlo Tree Search algorithm for planning in a Markov Decision Process in which transitions have a finite support. We prove an upper bound on the number of calls to the generative models needed for MDP-GapE to identify a near-optimal action with high probability. This problem-dependent sample complexity result is expressed in terms of the sub-optimality gaps of the state-action pairs that are visited during exploration. Our experiments reveal that MDP-GapE is also effective in practice, in contrast with other algorithms with sample complexity guarantees in the fixed-confidence setting, that are mostly theoretical

    Fast active learning for pure exploration in reinforcement learning

    Get PDF
    Realistic environments often provide agents with very limited feedback. When the environment is initially unknown, the feedback, in the beginning, can be completely absent, and the agents may first choose to devote all their effort on exploring efficiently. The exploration remains a challenge while it has been addressed with many hand-tuned heuristics with different levels of generality on one side, and a few theoretically-backed exploration strategies on the other. Many of them are incarnated by intrinsic motivation and in particular explorations bonuses. A common rule of thumb for exploration bonuses is to use 1/n1/\sqrt{n} bonus that is added to the empirical estimates of the reward, where nn is a number of times this particular state (or a state-action pair) was visited. We show that, surprisingly, for a pure-exploration objective of reward-free exploration, bonuses that scale with 1/n1/n bring faster learning rates, improving the known upper bounds with respect to the dependence on the horizon HH. Furthermore, we show that with an improved analysis of the stopping time, we can improve by a factor HH the sample complexity in the best-policy identification setting, which is another pure-exploration objective, where the environment provides rewards but the agent is not penalized for its behavior during the exploration phase

    A Cross-Disciplinary Survey of Beliefs about Human Nature and Culture.

    Get PDF
    How far has the Darwinian revolution come? To what extent have evolutionary ideas penetrated into the social sciences and humanities? Are the “science wars” over? Or do whole blocs of disciplines face off over an unbridgeable epistemic gap? To answer questions like these, contributors to top journals in 22 disciplines were surveyed on their beliefs about human nature, culture, and science. More than 600 respondents completed the survey. Scoring patterns divided into two main sets of disciplines. Genetic influences were emphasized in the evolutionary social sciences, evolutionary humanities, psychology, empirical study of the arts, philosophy, economics, and political science. Environmental influences were emphasized in most of the humanities disciplines and in anthropology, sociology, education, and women’s or gender studies. Confidence in scientific explanation correlated positively with emphasizing genetic influences on behavior, and negatively with emphasizing environmental influences. Knowing the current actual landscape of belief should help scholars avoid sterile debates and ease the way toward fruitful collaborations with neighboring disciplines

    Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

    Get PDF
    International audienceWe propose MDP-GapE, a new trajectory-based Monte-Carlo Tree Search algorithm for planning in a Markov Decision Process in which transitions have a finite support. We prove an upper bound on the number of calls to the generative models needed for MDP-GapE to identify a near-optimal action with high probability. This problem-dependent sample complexity result is expressed in terms of the sub-optimality gaps of the state-action pairs that are visited during exploration. Our experiments reveal that MDP-GapE is also effective in practice, in contrast with other algorithms with sample complexity guarantees in the fixed-confidence setting, that are mostly theoretical

    Fast active learning for pure exploration in reinforcement learning

    Get PDF
    International audienceRealistic environments often provide agents with very limited feedback. When the environment is initially unknown, the feedback, in the beginning, can be completely absent, and the agents may first choose to devote all their effort on exploring efficiently. The exploration remains a challenge while it has been addressed with many hand-tuned heuristics with different levels of generality on one side, and a few theoretically backed exploration strategies on the other. Many of them are incarnated by intrinsic motivation and in particular explorations bonuses. A common rule of thumb for exploration bonuses is to use 1/ √ n bonus that is added to the empirical estimates of the reward, where nn is a number of times this particular state (or a state-action pair) was visited. We show that, surprisingly, for a pure-exploration objective of reward-free exploration, bonuses that scale with 1/n bring faster learning rates, improving the known upper bounds with respect to the dependence on the horizon HH. Furthermore, we show that with an improved analysis of the stopping time, we can improve by a factor HH the sample complexity in the best-policy identification setting, which is another pure-exploration objective, where the environment provides rewards but the agent is not penalized for its behavior during the exploration phase

    Adaptive reward-free exploration

    Get PDF
    International audienceReward-free exploration is a reinforcement learning setting recently studied by Jin et al., who address it by running several algorithms with regret guarantees in parallel. In our work, we instead propose a more adaptive approach for reward-free exploration which directly reduces upper bounds on the maximum MDP estimation error. We show that, interestingly, our reward-free UCRL algorithm can be seen as a variant of an algorithm of Fiechter from 1994 [11], originally proposed for a different objective that we call best-policy identification. We prove that RF-UCRL needs O (SAH 4 /ε 2) log(1/δ) episodes to output, with probability 1 − δ, an ε-approximation of the optimal policy for any reward function. We empirically compare it to oracle strategies using a generative model

    Effects of deletion of the Streptococcus pneumoniae lipoprotein diacylglyceryl transferase gene lgt on ABC transporter function and on growth in vivo

    Get PDF
    Lipoproteins are an important class of surface associated proteins that have diverse roles and frequently are involved in the virulence of bacterial pathogens. As prolipoproteins are attached to the cell membrane by a single enzyme, prolipoprotein diacylglyceryl transferase (Lgt), deletion of the corresponding gene potentially allows the characterisation of the overall importance of lipoproteins for specific bacterial functions. We have used a Δlgt mutant strain of Streptococcus pneumoniae to investigate the effects of loss of lipoprotein attachment on cation acquisition, growth in media containing specific carbon sources, and virulence in different infection models. Immunoblots of triton X-114 extracts, flow cytometry and immuno-fluorescence microscopy confirmed the Δlgt mutant had markedly reduced lipoprotein expression on the cell surface. The Δlgt mutant had reduced growth in cation depleted medium, increased sensitivity to oxidative stress, reduced zinc uptake, and reduced intracellular levels of several cations. Doubling time of the Δlgt mutant was also increased slightly when grown in medium with glucose, raffinose and maltotriose as sole carbon sources. These multiple defects in cation and sugar ABC transporter function for the Δlgt mutant were associated with only slightly delayed growth in complete medium. However the Δlgt mutant had significantly reduced growth in blood or bronchoalveolar lavage fluid and a marked impairment in virulence in mouse models of nasopharyngeal colonisation, sepsis and pneumonia. These data suggest that for S. pneumoniae loss of surface localisation of lipoproteins has widespread effects on ABC transporter functions that collectively prevent the Δlgt mutant from establishing invasive infection

    Geographical patterns in blood lead in relation to industrial emissions and traffic in Swedish children, 1978–2007

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Blood lead concentrations (B-Pb) were measured in 3 879 Swedish school children during the period 1978–2007. The objective was to study the effect of the proximity to lead sources based on the children's home and school location.</p> <p>Methods</p> <p>The children's home address and school location were geocoded and their proximity to a lead smelter and major roads was calculated using geographical information system (GIS) software. All the statistical analyses were carried out using means of generalized log-linear modelling, with natural-logarithm-transformed B-Pb, adjusted for sex, school year, lead-exposing hobby, country of birth and, in the periods 1988–1994 and 1995–2007, parents' smoking habits.</p> <p>Results</p> <p>The GIS analysis revealed that although the emission from the smelter and children's B-Pb levels had decreased considerably since 1978, proximity to the lead smelter continued to affect levels of B-Pb, even in recent years (geometric mean: near smelter: 22.90 μg/l; far from smelter 19.75 μg/l; p = 0.001). The analysis also revealed that proximity to major roads noticeably affected the children's B-Pb levels during the period 1978–1987 (geometric mean near major roads: 44.26 μg/l; far from roads: 38.32 μg/l; p = 0.056), due to the considerable amount of lead in petrol. This effect was, however, not visible after 1987 due to prohibition of lead in petrol.</p> <p>Conclusion</p> <p>The results show that proximity to the lead smelter still has an impact on the children's B-Pb levels. This is alarming since it could imply that living or working in the vicinity of a former lead source could pose a threat years after reduction of the emission. The analysis also revealed that urban children exposed to lead from traffic were only affected during the early period, when there were considerable amounts of lead in petrol, and that the prohibition of lead in petrol in later years led to reduced levels of lead in the blood of urban children.</p
    corecore