298 research outputs found
Learning Semantic-Agnostic and Spatial-Aware Representation for Generalizable Visual-Audio Navigation
Visual-audio navigation (VAN) is attracting more and more attention from the
robotic community due to its broad applications, \emph{e.g.}, household robots
and rescue robots. In this task, an embodied agent must search for and navigate
to the sound source with egocentric visual and audio observations. However, the
existing methods are limited in two aspects: 1) poor generalization to unheard
sound categories; 2) sample inefficient in training. Focusing on these two
problems, we propose a brain-inspired plug-and-play method to learn a
semantic-agnostic and spatial-aware representation for generalizable
visual-audio navigation. We meticulously design two auxiliary tasks for
respectively accelerating learning representations with the above-desired
characteristics. With these two auxiliary tasks, the agent learns a
spatially-correlated representation of visual and audio inputs that can be
applied to work on environments with novel sounds and maps. Experiment results
on realistic 3D scenes (Replica and Matterport3D) demonstrate that our method
achieves better generalization performance when zero-shot transferred to scenes
with unseen maps and unheard sound categories
Learning Gradient Fields for Scalable and Generalizable Irregular Packing
The packing problem, also known as cutting or nesting, has diverse
applications in logistics, manufacturing, layout design, and atlas generation.
It involves arranging irregularly shaped pieces to minimize waste while
avoiding overlap. Recent advances in machine learning, particularly
reinforcement learning, have shown promise in addressing the packing problem.
In this work, we delve deeper into a novel machine learning-based approach that
formulates the packing problem as conditional generative modeling. To tackle
the challenges of irregular packing, including object validity constraints and
collision avoidance, our method employs the score-based diffusion model to
learn a series of gradient fields. These gradient fields encode the
correlations between constraint satisfaction and the spatial relationships of
polygons, learned from teacher examples. During the testing phase, packing
solutions are generated using a coarse-to-fine refinement mechanism guided by
the learned gradient fields. To enhance packing feasibility and optimality, we
introduce two key architectural designs: multi-scale feature extraction and
coarse-to-fine relation extraction. We conduct experiments on two typical
industrial packing domains, considering translations only. Empirically, our
approach demonstrates spatial utilization rates comparable to, or even
surpassing, those achieved by the teacher algorithm responsible for training
data generation. Additionally, it exhibits some level of generalization to
shape variations. We are hopeful that this method could pave the way for new
possibilities in solving the packing problem
Development of 146nm Vacuum UV Light Source
AbstractThe principle of dielectric-barrier discharge (DBD) producing 146nm vacuum ultraviolet (VUV) light is introduced in this article. MgF2 and Kr are used as the output window and the discharge gas, respectively, in the VUV light source. Fairly wide, narrow-bandwidth UV light could be generated with peak wavelength of 146nm and a full width at half maxima of 12nm. In addition, the impact of air pressure, voltage amplitude and frequency to the light source is also analyzed
JNET: Learning User Representations via Joint Network Embedding and Topic Embedding
User representation learning is vital to capture diverse user preferences,
while it is also challenging as user intents are latent and scattered among
complex and different modalities of user-generated data, thus, not directly
measurable. Inspired by the concept of user schema in social psychology, we
take a new perspective to perform user representation learning by constructing
a shared latent space to capture the dependency among different modalities of
user-generated data. Both users and topics are embedded to the same space to
encode users' social connections and text content, to facilitate joint modeling
of different modalities, via a probabilistic generative framework. We evaluated
the proposed solution on large collections of Yelp reviews and StackOverflow
discussion posts, with their associated network structures. The proposed model
outperformed several state-of-the-art topic modeling based user models with
better predictive power in unseen documents, and state-of-the-art network
embedding based user models with improved link prediction quality in unseen
nodes. The learnt user representations are also proved to be useful in content
recommendation, e.g., expert finding in StackOverflow
Electrochemical Reduction of CO2 on Compositionally Variant Au-Pt Bimetallic Thin Films
The electrocatalytic reduction of CO2 on Au-Pt bimetallic catalysts with different compositions was evaluated, offering a platform for uncovering the correlation between the catalytic activity and the surface composition of bimetallic electrocatalysts. The Au-Pt alloy films were synthesized by a magnetron sputtering co-deposition technique with tunable composition. It was found that the syngas ratio (CO:H2) on the Au-Pt films is able to be tuned by systematically controlling the binary composition. This tunable catalytic selectivity is attributed to the variation of binding strength of COOH and CO intermediates, influenced by the surface electronic structure (d-band center energy) which is linked to the surface composition of the bimetallic films. Notably, a gradual shift of the d-band center away from the Fermi level was observed with increasing Au content, which correspondingly reduces the binding strength of the COOH and CO intermediates, leading to the distinct catalytic activity for the reduction of CO2 on the compositionally variant Au-Pt bimetallic films. In addition, the formation of formic acid in the bimetallic systems at reduced overpotentials and higher yield indicates that synergistic effects can facilitate reaction pathways for products that are not accessible with the individual components.</p
Pyrolyzing soft template-containing poly(ionic liquid) into hierarchical N-doped porous carbon for electroreduction of carbon dioxide
Heteroatom-doped carbon materials have demonstrated great potential in the electrochemical reduction reaction of CO2 (CO2RR) due to their versatile structure and function. However, rational structure control remains one challenge. In this work, we reported a unique carbon precursor of soft template-containing porous poly(ionic liquid) (PIL) that was directly synthesized via free-radical self-polymerization of ionic liquid monomer in a soft template route. Variation of the carbonization temperature in a direct pyrolysis process without any additive yielded a series of carbon materials with facile adjustable textural properties and N species. Significantly, the integration of soft-template in the PIL precursor led to the formation of hierarchical porous carbon material with a higher surface area and larger pore size than that from the template-free precursor. In CO2RR to CO, the champion catalyst gave a Faraday efficiency of 83.0% and a current density of 1.79 mA?cm?2 at ?0.9 V vs. reversible hydrogen electrode (vs. RHE). The abundant graphite N species and hierarchical pore structure, especially the unique hierarchical small-/ultra-micropores were revealed to enable better CO2RR performance
Resilient smart power grid synchronization estimation method for system resilience with partial missing measurements
With the increasing demand for power system stability and resilience, effective real-time tracking plays a crucial role in smart grid synchronization. However, most studies have focused on measurement noise, while they seldom think about the problem of measurement data loss in smart power grid synchronization. To solve this problem, a resilient fault-tolerant extended Kalman filter (RFTEKF) is proposed to track voltage amplitude, voltage phase angle and frequency dynamically. First, a three-phase unbalanced network's positive sequence fast estimation model is established. Then, the loss phenomenon of measurements occurs randomly, and the randomness of data loss's randomness is defined by discrete interval distribution [0], [1]. Subsequently, a resilient fault-tolerant extended Kalman filter based on the real-time estimation framework is designed using the time-stamp technique to acquire partial data loss information. Finally, extensive simulation results manifest the proposed RFTEKF can synchronize the smart grid more effectively than the traditional extended Kalman filter (EKF)
Robotic Cane as a Soft SuperLimb for Elderly Sit-to-Stand Assistance
Many researchers have identified robotics as a potential solution to the
aging population faced by many developed and developing countries. If so, how
should we address the cognitive acceptance and ambient control of elderly
assistive robots through design? In this paper, we proposed an explorative
design of an ambient SuperLimb (Supernumerary Robotic Limb) system that
involves a pneumatically-driven robotic cane for at-home motion assistance, an
inflatable vest for compliant human-robot interaction, and a depth sensor for
ambient intention detection. The proposed system aims at providing active
assistance during the sit-to-stand transition for at-home usage by the elderly
at the bedside, in the chair, and on the toilet. We proposed a modified
biomechanical model with a linear cane robot for closed-loop control
implementation. We validated the design feasibility of the proposed ambient
SuperLimb system including the biomechanical model, our result showed the
advantages in reducing lower limb efforts and elderly fall risks, yet the
detection accuracy using depth sensing and adjustments on the model still
require further research in the future. Nevertheless, we summarized empirical
guidelines to support the ambient design of elderly-assistive SuperLimb systems
for lower limb functional augmentation.Comment: 8 pages, 9 figures, accepted for IEEE RoboSoft 202
- …