248,457 research outputs found
Meta-Reinforcement Learning via Language Instructions
Although deep reinforcement learning has recently been very successful at
learning complex behaviors, it requires a tremendous amount of data to learn a
task. One of the fundamental reasons causing this limitation lies in the nature
of the trial-and-error learning paradigm of reinforcement learning, where the
agent communicates with the environment and progresses in the learning only
relying on the reward signal. This is implicit and rather insufficient to learn
a task well. On the contrary, humans are usually taught new skills via natural
language instructions. Utilizing language instructions for robotic motion
control to improve the adaptability is a recently emerged topic and
challenging. In this paper, we present a meta-RL algorithm that addresses the
challenge of learning skills with language instructions in multiple
manipulation tasks. On the one hand, our algorithm utilizes the language
instructions to shape its interpretation of the task, on the other hand, it
still learns to solve task in a trial-and-error process. We evaluate our
algorithm on the robotic manipulation benchmark (Meta-World) and it
significantly outperforms state-of-the-art methods in terms of training and
testing task success rates. Codes are available at
\url{https://tumi6robot.wixsite.com/million}
Assessing the impact of representational and contextual problem features on student use of right-hand rules
Students in introductory physics struggle with vector algebra and these
challenges are often associated with contextual and representational features
of the problems. Performance on problems about cross product direction is
particularly poor and some research suggests that this may be primarily due to
misapplied right-hand rules. However, few studies have had the resolution to
explore student use of right-hand rules in detail. This study reviews
literature in several disciplines, including spatial cognition, to identify ten
contextual and representational problem features that are most likely to
influence performance on problems requiring a right-hand rule. Two quantitative
measures of performance (correctness and response time) and two qualitative
measures (methods used and type of errors made) were used to explore the impact
of these problem features on student performance. Quantitative results are
consistent with expectations from the literature, but reveal that some features
(such as the type of reasoning required and the physical awkwardness of using a
right-hand rule) have a greater impact than others (such as whether the vectors
are placed together or separate). Additional insight is gained by the
qualitative analysis, including identifying sources of difficulty not
previously discussed in the literature and revealing that the use of
supplemental methods, such as physically rotating the paper, can mitigate
errors associated with certain features
Silicon CMOS architecture for a spin-based quantum computer
Recent advances in quantum error correction (QEC) codes for fault-tolerant
quantum computing \cite{Terhal2015} and physical realizations of high-fidelity
qubits in a broad range of platforms \cite{Kok2007, Brown2011, Barends2014,
Waldherr2014, Dolde2014, Muhonen2014, Veldhorst2014} give promise for the
construction of a quantum computer based on millions of interacting qubits.
However, the classical-quantum interface remains a nascent field of
exploration. Here, we propose an architecture for a silicon-based quantum
computer processor based entirely on complementary metal-oxide-semiconductor
(CMOS) technology, which is the basis for all modern processor chips. We show
how a transistor-based control circuit together with charge-storage electrodes
can be used to operate a dense and scalable two-dimensional qubit system. The
qubits are defined by the spin states of a single electron confined in a
quantum dot, coupled via exchange interactions, controlled using a microwave
cavity, and measured via gate-based dispersive readout \cite{Colless2013}. This
system, based entirely on available technology and existing components, is
compatible with general surface code quantum error correction
\cite{Terhal2015}, enabling large-scale universal quantum computation
- …