1,274 research outputs found

    Reinforcement Learning via AIXI Approximation

    Full text link
    This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a Monte Carlo Tree Search algorithm along with an agent-specific extension of the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a number of stochastic, unknown, and partially observable domains.Comment: 8 LaTeX pages, 1 figur

    Probabilities on Sentences in an Expressive Logic

    Get PDF
    Automated reasoning about uncertain knowledge has many applications. One difficulty when developing such systems is the lack of a completely satisfactory integration of logic and probability. We address this problem directly. Expressive languages like higher-order logic are ideally suited for representing and reasoning about structured knowledge. Uncertain knowledge can be modeled by using graded probabilities rather than binary truth-values. The main technical problem studied in this paper is the following: Given a set of sentences, each having some probability of being true, what probability should be ascribed to other (query) sentences? A natural wish-list, among others, is that the probability distribution (i) is consistent with the knowledge base, (ii) allows for a consistent inference procedure and in particular (iii) reduces to deductive logic in the limit of probabilities being 0 and 1, (iv) allows (Bayesian) inductive reasoning and (v) learning in the limit and in particular (vi) allows confirmation of universally quantified hypotheses/sentences. We translate this wish-list into technical requirements for a prior probability and show that probabilities satisfying all our criteria exist. We also give explicit constructions and several general characterizations of probabilities that satisfy some or all of the criteria and various (counter) examples. We also derive necessary and sufficient conditions for extending beliefs about finitely many sentences to suitable probabilities over all sentences, and in particular least dogmatic or least biased ones. We conclude with a brief outlook on how the developed theory might be used and approximated in autonomous reasoning agents. Our theory is a step towards a globally consistent and empirically satisfactory unification of probability and logic.Comment: 52 LaTeX pages, 64 definiton/theorems/etc, presented at conference Progic 2011 in New Yor

    Processing Plant Diseases Using Transformer Model

    Get PDF
    Agriculture faces challenges in achieving high-yield production while minimizing the use of chemicals. The excessive use of chemicals in agriculture poses many problems. Accurate disease diagnosis is crucial for effective plant disease detection and treatment. Automatic identification of plant diseases using computer vision techniques offers new and efficient approaches compared to traditional methods. Transformers, a type of deep learning model, have shown great promise in computer vision, but as the technology is still new, many vision transformer models struggle to identify diseases by examining the entire leaf. This paper aims to utilize the vision transformer model in analyzing and identifying common diseases that hinder the growth and development of plants through the plant leave images. Besides, it aims to improve the model's stability by focusing more on the entire leaf than individual parts and generalizing better results on leaves not in the image center. Added features such as Shift Patch Tokenization, Locality Self Attention, and Positional Encoding help focus on the whole leaf. The final test accuracy obtained is 89.58%, with relatively slight variances in precision, accuracy, and F1 score across classes, as well as satisfactory model robustness towards changes in leaf orientation and position within the image. The model's effectiveness shows the vision transformer's potential for automated plant disease diagnosis, which can help farmers take timely measures to prevent losses and ensure food security

    Context tree switching

    No full text
    This paper describes the Context Tree Switching technique, a modification of Context Tree Weighting for the prediction of binary, stationary, n-Markov sources. By modifying Context Tree Weighting’s recursive weighting scheme, it is possible to mix over a strictly larger class of models without increasing the asymptotic time or space complexity of the original algorithm. We prove that this generalization preserves the desirable theoretical properties of Context Tree Weighting on stationary n-Markov sources, and show empirically that this new technique leads to consistent improvements over Context Tree Weighting as measured on the Calgary Corpus

    Reinforcement Learning via AIXI Approximation

    Get PDF
    This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian optimality notion for general reinforcement learning agents. Previously, it has been unclear whether the theory of AIXI could motivate the design of practical algorithms. We answer this hitherto open question in the affirmative, by providing the first computationally feasible approximation to the AIXI agent. To develop our approximation, we introduce a Monte Carlo Tree Search algorithm along with an agent-specific extension of the Context Tree Weighting algorithm. Empirically, we present a set of encouraging results on a number of stochastic, unknown, and partially observable domains

    Treating of Rayon-flocked Fabric by Atmospheric Pressure Plasma

    Get PDF
    This study investigates hydrophobisation of the surface of rayon-flocked fabric by means of atmospheric pressure plasma (APP) treatment with tetramethylsilane (TMS). Plasma deposition of TMS is regarded as an effective, single-step low pollution method. A detailed study of the process parameters was conducted. A highly hydrophobic surface was successfully fabricated on rayon-flocked fabric and the hydrophobic surface was found to have good stain resistance to coffee and milk tea

    Learned Garbage Collection

    Get PDF
    Several programming languages use garbage collectors (GCs) to automatically manage memory for the programmer. Such collectors must decide when to look for unreachable objects to free, which can have a large performance impact on some applications. In this preliminary work, we propose a design for a learned garbage collector that autonomously learns over time when to perform collections. By using reinforcement learning, our design can incorporate user-defined reward functions, allowing an autonomous garbage collector to learn to optimize the exact metric the user desires (e.g., request latency or queries per second). We conduct an initial experimental study on a prototype, demonstrating that an approach based on tabular Q learning may be promising

    A probabilistic chemical programmable computer

    Get PDF
    The exponential growth of the power of modern digital computers is based upon the miniaturisation of vast nanoscale arrays of electronic switches, but this will be eventually constrained by fabrication limits and power dissipation. Chemical processes have the potential to scale beyond these limits performing computations through chemical reactions, yet the lack of well-defined programmability limits their scalability and performance. We present a hybrid digitally programmable chemical array as a probabilistic computational machine that uses chemical oscillators partitioned in interconnected cells as a computational substrate. This hybrid architecture performs efficient computation by distributing between chemical and digital domains together with error correction. The efficiency is gained by combining digital with probabilistic chemical logic based on nearest neighbour interactions and hysteresis effects. We demonstrated the implementation of one- and two- dimensional Chemical Cellular Automata and solutions to combinatorial optimization problems.Comment: 20 page manuscript, 6 figures, 112 page supplementary volum

    CD8α is expressed by human monocytes and enhances FcγR-dependent responses

    Get PDF
    Abstract Background CD8α enhances the responses of antigen-specific CTL activated through TCR through binding MHC class I, favoring lipid raft partitioning of TCR, and inducing intracellular signaling. CD8α is also found on dendritic cells and rat macrophages, but whether CD8α enhances responses of a partner receptor, like TCR, to activate these cells is not known. TCR and FcR, use analogous or occasionally interchangeable signaling mechanisms suggesting the possibility that CD8α co-activates FcR responses. Interestingly, CD8α+ monocytes are often associated with rat models of disease involving immune-complex deposition and FcR-mediated pathology, such as arthritis, glomerulonephritis, ischaemia, and tumors. While rat macrophages have been shown to express CD8α evidence for CD8α expression by mouse or human monocytes or macrophages was incomplete. Results We detected CD8α, but not CD8β on human monocytes and the monocytic cell line THP-1 by flow cytometry. Reactivity of anti-CD8α mAb with monocytes is at least partly independent of FcR as anti-CD8α mAb detect CD8α by western blot and inhibit binding of MHC class I tetramers. CD8α mRNA is also found in monocytes and THP-1 suggesting CD8α is synthesized by monocytes and not acquired from other CD8α+ cell types. Interestingly, CD8α from monocytes and blood T cells presented distinguishable patterns by 2-D electrophoresis. Anti-CD8α mAb alone did not activate monocyte TNF release. In comparison, TNF release by human monocytes stimulated in a FcR-dependent manner with immune-complexes was enhanced by inclusion of anti-CD8α mAb in immune-complexes. Conclusion Human monocytes express CD8α. Co-engagement of CD8α and FcR enhances monocyte TNF release, suggesting FcR may be a novel partner receptor for CD8α on innate immune cells.</p
    corecore