1,350 research outputs found

    Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks

    Full text link
    Human emotion understanding is pivotal in making conversational technology mainstream. We view speech emotion understanding as a perception task which is a more realistic setting. With varying contexts (languages, demographics, etc.) different share of people perceive the same speech segment as a non-unanimous emotion. As part of the ACM Multimedia 2023 Computational Paralinguistics ChallengE (ComParE) in the EMotion Share track, we leverage their rich dataset of multilingual speakers and multi-label regression target of 'emotion share' or perception of that emotion. We demonstrate that the training scheme of different foundation models dictates their effectiveness for tasks beyond speech recognition, especially for non-semantic speech tasks like emotion understanding. This is a very complex task due to multilingual speakers, variability in the target labels, and inherent imbalance in the regression dataset. Our results show that HuBERT-Large with a self-attention-based light-weight sequence model provides 4.6% improvement over the reported baseline.Comment: Accepted to appear at ACM Multimedia 2023 Multimedia Grand Challenges Trac

    Jacobian Methods for Dynamic Polarization Control in Optical Applications

    Full text link
    Dynamic polarization control (DPC) is beneficial for many optical applications. It uses adjustable waveplates to perform automatic polarization tracking and manipulation. Efficient algorithms are essential to realizing an endless polarization control process at high speed. However, the standard gradientbased algorithm is not well analyzed. Here we model the DPC with a Jacobian-based control theory framework that finds a lot in common with robot kinematics. We then give a detailed analysis of the condition of the Stokes vector gradient as a Jacobian matrix. We identify the multi-stage DPC as a redundant system enabling control algorithms with null-space operations. An efficient, reset-free algorithm can be found. We anticipate more customized DPC algorithms to follow the same framework in various optical systems

    Neural encoding of socially adjusted value during competitive and hazardous foraging

    Get PDF
    In group foraging organisms, optimizing the conflicting demands of competitive food loss and safety is critical. We demonstrate that humans select competition avoidant and risk diluting strategies during foraging depending on socially adjusted value. We formulate a mathematically grounded quantification of socially adjusted value in foraging environments and show using multivariate fMRI analyses that socially adjusted value is encoded by mid-cingulate and ventromedial prefrontal cortices, regions that integrate value and action signals

    StereoPose: Category-Level 6D Transparent Object Pose Estimation from Stereo Images via Back-View NOCS

    Full text link
    Most existing methods for category-level pose estimation rely on object point clouds. However, when considering transparent objects, depth cameras are usually not able to capture meaningful data, resulting in point clouds with severe artifacts. Without a high-quality point cloud, existing methods are not applicable to challenging transparent objects. To tackle this problem, we present StereoPose, a novel stereo image framework for category-level object pose estimation, ideally suited for transparent objects. For a robust estimation from pure stereo images, we develop a pipeline that decouples category-level pose estimation into object size estimation, initial pose estimation, and pose refinement. StereoPose then estimates object pose based on representation in the normalized object coordinate space~(NOCS). To address the issue of image content aliasing, we further define a back-view NOCS map for the transparent object. The back-view NOCS aims to reduce the network learning ambiguity caused by content aliasing, and leverage informative cues on the back of the transparent object for more accurate pose estimation. To further improve the performance of the stereo framework, StereoPose is equipped with a parallax attention module for stereo feature fusion and an epipolar loss for improving the stereo-view consistency of network predictions. Extensive experiments on the public TOD dataset demonstrate the superiority of the proposed StereoPose framework for category-level 6D transparent object pose estimation.Comment: 7 pages, 6 figures, Project homepage: https://appsrv.cse.cuhk.edu.hk/~kaichen/stereopose.htm

    Research on Flavor Substances of Cooked Rice by Electromagnetic Induction Heating with Pots Made of Different Materials Based on GC-IMS

    Get PDF
    The rice cooked by IH electromagnetic heating do not normally appear to be pinched and appear to be more uniform, while the rice heated by the electric heating element is easier to pinch and paste pot.Based on gas phase ion mobility spectrometry analysis of the volatile substances produced by IH electromagnetic heating of Japonica rice and Indica rice in pots with stainless steel, enamel, aluminum alloy and cast iron as inner tanks, the changes of flavor substances in the heated rice cooked with pots of different materials and utensils were investigated and the IH electromagnetic optimum is taken as a confirmed material.The results show that the material of the IH electromagnetic heating pot will affect the concentration of rice flavor substances, and the effects on different varieties of rice are distinctive. The concentration of caprylic aldehyde and 1-octene-3-one in the stainless-steel pot heated Japonica rice increases, and the concentration of 3-hydroxy-2-butanone in the rice cooked with the cast iron pot increases. The aluminum alloy pot and the cast iron pot bring up the concentration of the alcohols and ketones in cooked Indica rice. The concentration of similar substances increases too. Both Japonica rice and Indica rice heated with aluminum alloy pots show an increase in the concentration of butyl acetate, and furfural is produced while cooking rice in enamel pots.Through comparative analysis of all flavor substances and their concentrations of IH electromagnetic in heating rice, we conclude that rice cooked in cast iron pot have the best flavor, followed by that in stainless steel pot, while aluminum alloy pot brings down the rice flavor to below average level, and pasting pot is often observed in the enamel pot

    Supervised physical training improves fine motor skills of 5-year-old children

    Get PDF
    Introduction: Fine motor skills are important for children not only in the activities of daily living, but also for learning activities. In the present study, the effects of supervised physical training were investigated in normal children. Objective: To evaluate the effects of supervised training by combining full-body exercise and the eye-hand coordination activities to improve fine motor skills in a group of five-year-old normal children. Methods: Fifty-two children were selected and randomized in exercise and control groups. The exercise group participated in three 30-minute training sessions per week for 24 weeks. Results: The fine motor skills and hand grip strength of the exercise group were significantly increased, while there was no significant change in the control group during the experimental period. Conclusion: The results indicate that the current exercise training program is effective and can be applied to 5-year-old normal children to improve their fine motor skills. In addition, this program has simple physical activities that are appropriate to the physical and mental level of child development. The 30-minute training session would be easily implemented in the kindergarten program
    • …
    corecore