56 research outputs found

    RL-Duet: Online Music Accompaniment Generation Using Deep Reinforcement Learning

    Full text link
    This paper presents a deep reinforcement learning algorithm for online accompaniment generation, with potential for real-time interactive human-machine duet improvisation. Different from offline music generation and harmonization, online music accompaniment requires the algorithm to respond to human input and generate the machine counterpart in a sequential order. We cast this as a reinforcement learning problem, where the generation agent learns a policy to generate a musical note (action) based on previously generated context (state). The key of this algorithm is the well-functioning reward model. Instead of defining it using music composition rules, we learn this model from monophonic and polyphonic training data. This model considers the compatibility of the machine-generated note with both the machine-generated context and the human-generated context. Experiments show that this algorithm is able to respond to the human part and generate a melodic, harmonic and diverse machine part. Subjective evaluations on preferences show that the proposed algorithm generates music pieces of higher quality than the baseline method

    Continuously Controllable Facial Expression Editing in Talking Face Videos

    Full text link
    Recently audio-driven talking face video generation has attracted considerable attention. However, very few researches address the issue of emotional editing of these talking face videos with continuously controllable expressions, which is a strong demand in the industry. The challenge is that speech-related expressions and emotion-related expressions are often highly coupled. Meanwhile, traditional image-to-image translation methods cannot work well in our application due to the coupling of expressions with other attributes such as poses, i.e., translating the expression of the character in each frame may simultaneously change the head pose due to the bias of the training data distribution. In this paper, we propose a high-quality facial expression editing method for talking face videos, allowing the user to control the target emotion in the edited video continuously. We present a new perspective for this task as a special case of motion information editing, where we use a 3DMM to capture major facial movements and an associated texture map modeled by a StyleGAN to capture appearance details. Both representations (3DMM and texture map) contain emotional information and can be continuously modified by neural networks and easily smoothed by averaging in coefficient/latent spaces, making our method simple yet effective. We also introduce a mouth shape preservation loss to control the trade-off between lip synchronization and the degree of exaggeration of the edited expression. Extensive experiments and a user study show that our method achieves state-of-the-art performance across various evaluation criteria.Comment: Demo video: https://youtu.be/WD-bNVya6k

    A critical review of cyber-physical security for building automation systems

    Full text link
    Modern Building Automation Systems (BASs), as the brain that enables the smartness of a smart building, often require increased connectivity both among system components as well as with outside entities, such as optimized automation via outsourced cloud analytics and increased building-grid integrations. However, increased connectivity and accessibility come with increased cyber security threats. BASs were historically developed as closed environments with limited cyber-security considerations. As a result, BASs in many buildings are vulnerable to cyber-attacks that may cause adverse consequences, such as occupant discomfort, excessive energy usage, and unexpected equipment downtime. Therefore, there is a strong need to advance the state-of-the-art in cyber-physical security for BASs and provide practical solutions for attack mitigation in buildings. However, an inclusive and systematic review of BAS vulnerabilities, potential cyber-attacks with impact assessment, detection & defense approaches, and cyber-secure resilient control strategies is currently lacking in the literature. This review paper fills the gap by providing a comprehensive up-to-date review of cyber-physical security for BASs at three levels in commercial buildings: management level, automation level, and field level. The general BASs vulnerabilities and protocol-specific vulnerabilities for the four dominant BAS protocols are reviewed, followed by a discussion on four attack targets and seven potential attack scenarios. The impact of cyber-attacks on BASs is summarized as signal corruption, signal delaying, and signal blocking. The typical cyber-attack detection and defense approaches are identified at the three levels. Cyber-secure resilient control strategies for BASs under attack are categorized into passive and active resilient control schemes. Open challenges and future opportunities are finally discussed.Comment: 38 pages, 7 figures, 6 tables, submitted to Annual Reviews in Contro

    Carbon–biodiversity relationships in a highly diverse subtropical forest

    Full text link
    Carbon‐focused climate mitigation strategies are becoming increasingly important in forests. However, with ongoing biodiversity declines we require better knowledge of how much such strategies account for biodiversity. We particularly lack information across multiple trophic levels and on established forests, where the interplay between carbon stocks, stand age, and tree diversity might influence carbon–biodiversity relationships. Using a large dataset (>4600 heterotrophic species of 23 taxonomic groups) from secondary, subtropical forests, we tested how multitrophic diversity and diversity within trophic groups relate to aboveground, belowground, and total carbon stocks at different levels of tree species richness and stand age. Our study revealed that aboveground carbon, the key component of climate‐based management, was largely unrelated to multitrophic diversity. By contrast, total carbon stocks—that is, including belowground carbon—emerged as a significant predictor of multitrophic diversity. Relationships were nonlinear and strongest for lower trophic levels, but nonsignificant for higher trophic level diversity. Tree species richness and stand age moderated these relationships, suggesting long‐term regeneration of forests may be particularly effective in reconciling carbon and biodiversity targets. Our findings highlight that biodiversity benefits of climate‐oriented management need to be evaluated carefully, and only maximizing aboveground carbon may fail to account for biodiversity conservation requirements

    Direct and indirect effects of climate on richness drive the latitudinal diversity gradient in forest trees

    Get PDF
    Data accessibility statement: Full census data are available upon reasonable request from the ForestGEO data portal, http://ctfs.si.edu/datarequest/ We thank Margie Mayfield, three anonymous reviewers and Jacob Weiner for constructive comments on the manuscript. This study was financially supported by the National Key R&D Program of China (2017YFC0506100), the National Natural Science Foundation of China (31622014 and 31570426), and the Fundamental Research Funds for the Central Universities (17lgzd24) to CC. XW was supported by the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB3103). DS was supported by the Czech Science Foundation (grant no. 16-26369S). Yves Rosseel provided us valuable suggestions on using the lavaan package conducting SEM analyses. Funding and citation information for each forest plot is available in the Supplementary Information Text 1.Peer reviewedPostprin

    Multiscale entropy algorithms and their applications in cardiac disease discrimination

    No full text
    In recent years, the number of cardiac disease patients has been increasing. Modern medical research has shown that the complexity of electrocardiogram (ECG) signals is related to cardiovascular diseases. This paper investigates the difference in complexity of ECG data from the people with different cardiovascular diseases, such as atrial fibrillation (AF), ventricular arrhythmia (VA) and congestive heart failure (CHF). The empirical mode decomposition (EMD) and multiscale entropy method are used to analyze the ECG data, and a mathematical model established by a support vector machine is used to identify different diseases. The accuracy recognition rate of the AF recognition is 96.25%, and that of the CHF and VA reach 90.26% and 92.20%, respectively. The experimental results show that the recognition method proposed in this paper is successful

    Association between Serum Uric Acid and Liver Enzymes in Adults Aged 20 Years and Older in the United States: NHANES 2005–2012

    No full text
    Although the relationship between serum uric acid (SUA) and nonalcoholic fatty liver disease has been widely reported, the relationship between SUA and liver enzymes has rarely been reported. The purpose of this study was to evaluate the association of SUA levels with alanine aminotransferase (ALT) and aspartate aminotransferase (AST) in populations aged 20 years and older in the United States. We analyzed 7165 individuals aged 20 years and older from the National Health and Nutrition Examination Survey (NHANES) in the United States. Weighted multiple linear regression models were used to analyze the relationship between SUA and ALT and AST. A generalized additive model and a smooth curve fitting were used to observe the linear relationship. SUA was positively correlated with ALT and AST. In addition, the overall increasing trend of ALT and SUA was observed across the SUA quartile groups. In the stratified analysis by sex and race, the SUA levels in male, female, Mexican American, and Non-Hispanic White individuals, and those of another race, were positively correlated with ALT and AST. However, the SUA levels in Non-Hispanic Black individuals had a nonlinear relationship with ALT and AST. In individuals aged 20 years and older in the United States (excluding Non-Hispanic Black individuals), SUA levels were positively associated with ALT and AST. Therefore, with a rise in SUA levels, liver function should be monitored or intervened with in people aged 20 years and older in the United States

    Clinical nurses’ compassion fatigue psychological experience process: a constructivist grounded theory study

    No full text
    Abstract Background Clinical nurses are susceptible to compassion fatigue when exposed to various types of traumatic events in patients for extended periods of time. However, the developmental process, staging, and psychological responses distinct to each stage of compassion fatigue in nurses are not fully clarified. This study aimed to explore the processes of compassion fatigue and the psychological experiences specific to each phase of compassion fatigue among clinical nurses. Methods Charmaz’s Constructivist Grounded Theory methodology was used in this qualitative research. Semi-structured interviews were conducted with 13 clinical nurses with varying degrees of compassion fatigue from December 2020 to January 2021. Interview data were analyzed using grounded theory processes. Results The data were categorized into five separate categories and 22 sub-categories. This study found that the process of compassion fatigue is dynamic and cumulative, which was classified into five phases: compassion experience period, compassion decrement period, compassion discomfort period, compassion distress period, and compassion fatigue period. Conclusion Clinical nurses who experience compassion fatigue may go through five stages that are stage-specific and predictable. The findings can shed light on local and global applications to better understand the problem of nurses’ compassion fatigue. The interventions for addressing compassion fatigue in clinical nurses should be stage-specific, targeted, and individualized

    A non-negative and high-resolution finite volume method for the depth-integrated solute transport equation using an unstructured triangular mesh

    No full text
    This paper proposes a new high-resolution finite volume method for solving the two-dimensional (2D) solute transport equation using an unstructured mesh. A new simple r-factor algorithm is introduced into the Total Variation Diminishing flux limiter to achieve a more efficient yet accurate high-resolution scheme for solving the advection term. To avoid the physically-meaningless negative solutions resulted from using the Green–Gauss theorem, a nonlinear two-point flux approximation scheme is adopted to deal with the anisotropic diffusion term. The developed method can be readily coupled with a two-dimensional finite-volume-based flow models under unstructured triangular mesh. By integrating with the ELCIRC flow model, the proposed method was verified using three idealized benchmark cases (i.e., advection of a circle-shaped solute field, advection in a cyclogenesis flow field and transport of a initially square-shaped solute plume), and further applied to simulate the non-reactive solute transport process driven by irregular tides in the Deep Bay, eastern Pearl River Estuary of China. These cases are also simulated by models using other existing methods, including different r-factor for advection term and the Green–Gauss theorem for diffusion term. The comparison between the results from the new method and those from other existing methods demonstrated the new method could describe advection induced concentration shock and discontinuities, and anisotropic diffusion at high resolution without providing spurious oscillations and negative values
    • 

    corecore