864 research outputs found
Discrete Prompt Compression with Reinforcement Learning
Instruction-tuned Language Models (LMs) are widely used by users to address
various problems with task-specific prompts. Constraints associated with the
context window length and computational costs encourage the development of
compressed prompts. Existing methods rely heavily on training embeddings, which
are designed to accommodate multiple token meanings. This presents challenges
in terms of interpretability, a fixed number of embedding tokens, reusability
across different LMs, and inapplicability when interacting with black-box APIs.
This study proposes prompt compression with reinforcement learning (PCRL), a
novel discrete prompt compression method that addresses these issues. PCRL
employs a computationally efficient policy network that directly edits prompts.
The PCRL training approach can be flexibly applied to various types of LMs, as
well as decoder-only and encoder-decoder architecture, and can be trained
without gradient access to LMs or labeled data. PCRL achieves an average
reduction of 24.6% in token count across various instruction prompts while
preserving performance. Further, we demonstrate that the learned policy can be
transferred to larger LMs, and through various analyses, we aid the
understanding of token importance within prompts
Evolving speciated checkers players with crowding algorithm
Abstract-Conventiona1 evolutionary algorithms have a property that only one solution often dominates and it is sometimes useful to find diverse solutions and combine them because there might be many different solutions to one problem in real world problems. Recently, developing checkers player using evolutionary algorithms has been widely exploited to show the power of evolution for machine learning. In this paper, we propose an evolutionary checkers player that is developed by a speciation technique called crowding algorithm. In many experiments, our checkers player with ensemble structure shows better performance than non-speciated checkers players
The interplay between real and pseudo magnetic field in graphene with strain
We investigate electric and magnetic properties of graphene with rotationally
symmetric strain. The strain generates large pseudo magnetic field with
alternating sign in space, which forms a strongly confined quantum dot
connected to six chiral channels. The orbital magnetism, degeneracy, and
channel opening can be understood from the interplay between real and pseudo
magnetic field which have different parities under time reversal and mirror
reflection.
While the orbital magnetic response of the confined state is diamagnetic, it
can be paramagnetic if there is an accidental degeneracy with opposite mirror
reflection parity
Bacterial community analysis in upflow multilayer anaerobic reactor (UMAR) treating high-solids organic wastes
A novel anaerobic digestion configuration, the upflow multi-layer anaerobic reactor (UMAR), was developed to treat high-solids organic wastes. The UMAR was hypothesized to form multi-layer along depth due to the upflow plug flow; use of a recirculation system and a rotating distributor and baffles aimed to assist treating high-solids influent. The chemical oxygen demand (COD) removal efficiency and methane (CH4) production rate were 89% and 2.10 L CH4/L/day, respectively, at the peak influent COD concentration (110.4 g/L) and organic loading rate (7.5 g COD/L/day). The 454 pyrosequencing results clearly indicated heterogeneous distribution of bacterial communities at different vertical locations (upper, middle, and bottom) of the UMAR. Firmicutes was the dominant (>70%) phylum at the middle and bottom parts, while Deltaproteobacteria and Chloroflexi were only found in the upper part. Potential functions of the bacteria were discussed to speculate on their roles in the anaerobic performance of the UMAR system
Recent Advances in General Game Playing
The goal of General Game Playing (GGP) has been to develop computer programs that can perform well across various game types. It is natural for human game players to transfer knowledge from games they already know how to play to other similar games. GGP research attempts to design systems that work well across different game types, including unknown new games. In this review, we present a survey of recent advances (2011 to 2014) in GGP for both traditional games and video games. It is notable that research on GGP has been expanding into modern video games. Monte-Carlo Tree Search and its enhancements have been the most influential techniques in GGP for both research domains. Additionally, international competitions have become important events that promote and increase GGP research. Recently, a video GGP competition was launched. In this survey, we review recent progress in the most challenging research areas of Artificial Intelligence (AI) related to universal game playing
Charge Transfer Induced Molecular Hole Doping into Thin Film of Metal-Organic-Frameworks
Despite the highly porous nature with significantly large surface area, metal
organic frameworks (MOFs) can be hardly used in electronic, and optoelectronic
devices due to their extremely poor electrical conductivity. Therefore, the
study of MOF thin films that require electron transport or conductivity in
combination with the everlasting porosity is highly desirable. In the present
work, thin films of Co3(NDC)3DMF4 MOFs with improved electronic conductivity
are synthesized using layer-by-layer and doctor blade coating techniques
followed by iodine doping. The as-prepared and doped films are characterized
using FE-SEM, EDX, UV/Visible spectroscopy, XPS, current-voltage measurement,
photoluminescence spectroscopy, cyclic voltammetry, and incident photon to
current efficiency measurements. In addition, the electronic and semiconductor
property of the MOF films are characterized using Hall Effect measurement,
which reveals that in contrast to the insulator behavior of the as-prepared
MOFs, the iodine doped MOFs behave as a p-type semiconductor. This is caused by
charge transfer induced hole doping into the frameworks. The observed charge
transfer induced hole doping phenomenon is also confirmed by calculating the
densities of states of the as-prepared and iodine doped MOFs based on density
functional theory. Photoluminescence spectroscopy demonstrate an efficient
interfacial charge transfer between TiO2 and iodine doped MOFs, which can be
applied to harvest solar radiations.Comment: Main paper (19 pages, 6 figures) and supplementary information (15
pages, 10 figures), accepted in ACS Appl. Materials & Interface
- …