186 research outputs found
EmoGen: Eliminating Subjective Bias in Emotional Music Generation
Music is used to convey emotions, and thus generating emotional music is
important in automatic music generation. Previous work on emotional music
generation directly uses annotated emotion labels as control signals, which
suffers from subjective bias: different people may annotate different emotions
on the same music, and one person may feel different emotions under different
situations. Therefore, directly mapping emotion labels to music sequences in an
end-to-end way would confuse the learning process and hinder the model from
generating music with general emotions. In this paper, we propose EmoGen, an
emotional music generation system that leverages a set of emotion-related music
attributes as the bridge between emotion and music, and divides the generation
into two stages: emotion-to-attribute mapping with supervised clustering, and
attribute-to-music generation with self-supervised learning. Both stages are
beneficial: in the first stage, the attribute values around the clustering
center represent the general emotions of these samples, which help eliminate
the impacts of the subjective bias of emotion labels; in the second stage, the
generation is completely disentangled from emotion labels and thus free from
the subjective bias. Both subjective and objective evaluations show that EmoGen
outperforms previous methods on emotion control accuracy and music quality
respectively, which demonstrate our superiority in generating emotional music.
Music samples generated by EmoGen are available via this
link:https://ai-muzic.github.io/emogen/, and the code is available at this
link:https://github.com/microsoft/muzic/.Comment: 12 pages, 7 page
MuseCoco: Generating Symbolic Music from Text
Generating music from text descriptions is a user-friendly mode since the
text is a relatively easy interface for user engagement. While some approaches
utilize texts to control music audio generation, editing musical elements in
generated audio is challenging for users. In contrast, symbolic music offers
ease of editing, making it more accessible for users to manipulate specific
musical elements. In this paper, we propose MuseCoco, which generates symbolic
music from text descriptions with musical attributes as the bridge to break
down the task into text-to-attribute understanding and attribute-to-music
generation stages. MuseCoCo stands for Music Composition Copilot that empowers
musicians to generate music directly from given text descriptions, offering a
significant improvement in efficiency compared to creating music entirely from
scratch. The system has two main advantages: Firstly, it is data efficient. In
the attribute-to-music generation stage, the attributes can be directly
extracted from music sequences, making the model training self-supervised. In
the text-to-attribute understanding stage, the text is synthesized and refined
by ChatGPT based on the defined attribute templates. Secondly, the system can
achieve precise control with specific attributes in text descriptions and
offers multiple control options through attribute-conditioned or
text-conditioned approaches. MuseCoco outperforms baseline systems in terms of
musicality, controllability, and overall score by at least 1.27, 1.08, and 1.32
respectively. Besides, there is a notable enhancement of about 20% in objective
control accuracy. In addition, we have developed a robust large-scale model
with 1.2 billion parameters, showcasing exceptional controllability and
musicality
Design monolayer iodinenes based on halogen bond and tiling theory
Xenes, two-dimensional (2D) monolayers composed of a single element, with
graphene as a typical representative, have attracted widespread attention. Most
of the previous Xenes, X from group-IIIA to group-VIA elements have bonding
characteristics of covalent bonds. In this work, we for the first time unveil
the pivotal role of a halogen bond, which is a distinctive type of bonding with
interaction strength between that of a covalent bond and a van der Waals
interaction, in 2D group-VIIA monolayers. Combing the ingenious
non-edge-to-edge tiling theory and state-of-art ab initio method with refined
local density functional M06-L, we provide a precise and effective bottom-up
construction of 2D iodine monolayer sheets, iodinenes, primarily governed by
halogen bonds, and successfully design a category of stable iodinenes,
encompassing herringbone, Pythagorean, gyrated truncated hexagonal, i.e.
diatomic-kagome, and gyrated hexagonal tiling pattern. These iodinene
structures exhibit a wealth of properties, such as flat bands, nontrivial
topology, and fascinating optical characteristics, offering valuable insights
and guidance for future experimental investigations. Our work not only unveils
the unexplored halogen bonding mechanism in 2D materials but also opens a new
avenue for designing other non-covalent bonding 2D materials.Comment: 6 pages, 4 figure
Non-covalent interactions in electrochemical reactions and implications in clean energy applications
Understanding and controlling non-covalent interactions associated with solvent molecules and redox-inactive ions provide new opportunities to enhance the reaction entropy changes and reaction kinetics of metal redox centers, which can increase the thermodynamic efficiency of energy conversion and storage devices. Here, we report systematic changes in the redox entropy of one-electron transfer reactions including [Fe(CN)6]3-/4-, [Fe(H2O)6]3+/2+and [Ag(H2O)4]+/0induced by the addition of redox inactive ions, where approximately twenty different known structure making/breaking ions were employed. The measured reaction entropy changes of these redox couples were found to increase linearly with higher concentration and greater structural entropy (having greater structure breaking tendency) for inactive ions with opposite charge to the redox centers. The trend could be attributed to the altered solvation shells of oxidized and reduced redox active species due to non-covalent interactions among redox centers, inactive ions and water molecules, which was supported by Raman spectroscopy. Not only were these non-covalent interactions shown to increase reaction entropy, but they were also found to systematically alter the redox kinetics, where increasing redox reaction energy changes associated with the presence of water structure breaking cations were correlated linearly with the greater exchange current density of [Fe(CN)6]3-/4-.United States. Department of Energy. Office of Basic Energy Science (Award Number DE-SC0001299/DE-FG02-09ER46577)Hong Kong (China). Innovation and Technology Commission (Project No. ITS/ 020/16FP)United States. Department of Energy (Contract No. DE-AC02-5CH11231
Bis(μ-2,2′-disulfanediyldibenzoato)bis[aqua(2,2′-bipyridine)nickel(II)]
In the centrosymmetric title complex, [Ni2(C14H8O4S2)2(C10H8N2)2(H2O)2], the NiII atom is coordinated by two N atoms from one 2,2′-bipyridine ligand, three carboxylate O atoms (one bidentate and one monodentate) from two different disulfanediyldibenzoate ligands and one O atom from a coordinated water molecule in an octahedral coordination geometry. The disulfanediyldibenzoate dianion bridges two NiII atoms. Adjacent molecules are linked through the coordinated water molecules, forming a O—H⋯O hydrogen-bonded chain running along the a axis
ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
Recent 2D-to-3D human pose estimation (HPE) utilizes temporal consistency
across sequences to alleviate the depth ambiguity problem but ignore the action
related prior knowledge hidden in the pose sequence. In this paper, we propose
a plug-and-play module named Action Prompt Module (APM) that effectively mines
different kinds of action clues for 3D HPE. The highlight is that, the mining
scheme of APM can be widely adapted to different frameworks and bring
consistent benefits. Specifically, we first present a novel Action-related Text
Prompt module (ATP) that directly embeds action labels and transfers the rich
language information in the label to the pose sequence. Besides, we further
introduce Action-specific Pose Prompt module (APP) to mine the position-aware
pose pattern of each action, and exploit the correlation between the mined
patterns and input pose sequence for further pose refinement. Experiments show
that APM can improve the performance of most video-based 2D-to-3D HPE
frameworks by a large margin.Comment: 6 pages, 4 figures, 2023ICM
- …