6,905 research outputs found

    Enhancing the vocal range of single-speaker singing voice synthesis with melody-unsupervised pre-training

    Full text link
    The single-speaker singing voice synthesis (SVS) usually underperforms at pitch values that are out of the singer's vocal range or associated with limited training samples. Based on our previous work, this work proposes a melody-unsupervised multi-speaker pre-training method conducted on a multi-singer dataset to enhance the vocal range of the single-speaker, while not degrading the timbre similarity. This pre-training method can be deployed to a large-scale multi-singer dataset, which only contains audio-and-lyrics pairs without phonemic timing information and pitch annotation. Specifically, in the pre-training step, we design a phoneme predictor to produce the frame-level phoneme probability vectors as the phonemic timing information and a speaker encoder to model the timbre variations of different singers, and directly estimate the frame-level f0 values from the audio to provide the pitch information. These pre-trained model parameters are delivered into the fine-tuning step as prior knowledge to enhance the single speaker's vocal range. Moreover, this work also contributes to improving the sound quality and rhythm naturalness of the synthesized singing voices. It is the first to introduce a differentiable duration regulator to improve the rhythm naturalness of the synthesized voice, and a bi-directional flow model to improve the sound quality. Experimental results verify that the proposed SVS system outperforms the baseline on both sound quality and naturalness

    Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-Based Approach for One-Shot Singing Voice Conversion

    Full text link
    Any-to-any singing voice conversion (SVC) is confronted with the challenge of ``timbre leakage'' issue caused by inadequate disentanglement between the content and the speaker timbre. To address this issue, this study introduces NeuCoSVC, a novel neural concatenative SVC framework. It consists of a self-supervised learning (SSL) representation extractor, a neural harmonic signal generator, and a waveform synthesizer. The SSL extractor condenses audio into fixed-dimensional SSL features, while the harmonic signal generator leverages linear time-varying filters to produce both raw and filtered harmonic signals for pitch information. The synthesizer reconstructs waveforms using SSL features, harmonic signals, and loudness information. During inference, voice conversion is performed by substituting source SSL features with their nearest counterparts from a matching pool which comprises SSL features extracted from the reference audio, while preserving raw harmonic signals and loudness from the source audio. By directly utilizing SSL features from the reference audio, the proposed framework effectively resolves the ``timbre leakage" issue caused by previous disentanglement-based approaches. Experimental results demonstrate that the proposed NeuCoSVC system outperforms the disentanglement-based speaker embedding approach in one-shot SVC across intra-language, cross-language, and cross-domain evaluations

    Gradual Enhancement of Stripe-Type Antiferromagnetism in Spin Ladder Material BaFe2_2S3_3 Under Pressure

    Full text link
    We report pressure-dependent neutron diffraction and muon spin relaxation/rotation measurements combined with first-principles calculations to investigate the structural, magnetic, and electronic properties of BaFe2_2S3_3 under pressure. The experimental results reveal a gradual enhancement of the stripe-type ordering temperature with increasing pressure up to 2.6 GPa and no observable change in the size of the ordered moment. The ab initio calculations suggest that the magnetism is highly sensitive to the Fe-S bond lengths and angles, clarifying discrepancies with previously published results. In contrast to our experimental observations, the calculations predict a monotonic reduction of the ordered moment with pressure. We suggest that the robustness of the stripe-type antiferromagnetism is due to strong electron correlations not fully considered in the calculations

    On the definition of noise

    Get PDF
    Urbanization has exposed people to extreme sound levels. Although researchers have investigated the ability of people to listen, analyze, and distinguish sound, the concept of noise has not been clearly articulated from a human perspective. The lack of knowledge on how people perceive noise limits our capacity to control it in a targeted manner. This study aimed to interpret the definition of noise from the public perspective based on a grounded theory approach. Seventy-eight participants were interviewed about noise, and four categories of perceived understanding of noise were identified: challenges, definitions of noise, opportunities, and action. As one of the challenges, urbanization is associated with increased noise levels around the human environment. In terms of definition, perceiving sound as noise is considered to be a result of the complex and dynamic process that includes sound, the environment, and humans. Sound and humans interact with the environment. In terms of opportunities, noise may have positive roles on certain occasions, dispelling the misconception that noise is exclusively negative. In addition, we found that noise perception has gradually shifted from noise control to noise utilization. In terms of action, noise can be controlled at the sound sources, susceptible target groups, susceptible behaviors and states, locations, and times where noise is perceived with high frequency. In this study, we investigated several aspects of noise, ranging from noise control, soundscape definition, and ‘soundscape indices’ (SSID) integration and application. Our findings provide an additional basis for developing better definitions, control, and utilization strategies of noise in the future, thereby improving the quality of the sound environment

    Size does matter:Effects of in-game advertising stimuli on brand recall and brand recognition

    Get PDF
    Rapid growth in the global gaming industry has created substantial opportunities for marketers. The potential and effectiveness of in-game advertising have attracted increasing attention from academics.Drawing on product placement literature, this study explores the influence of size (small, large), order (primacy, recency) and level of absorption (low, high) on consumer memory (brand recall and recognition) of well-known brands placed in a racing game. Using a controlled experiment, 285 participants were recruited to play the sports video game Trackmania 2 Canyon. Results indicate a positive effect of size, but order and level of absorption had no effect on brand recall and brand recognition. In particular, large size brands are recalled and recognised significantly better. Findings offer important implicationsfor marketers in a global gaming industry that is steadily growing

    Luxury fashion brands: factors influencing young female consumers' luxury fashion purchasing in Taiwan

    Get PDF
    Purpose - Against the background of increasing consumption of luxury fashion brands by young female consumers in Asian countries, this paper offers insights into the consumption motives and purchasing behaviour of that market segment in Taiwan. Design/methodology/approach - Analysis of data collected using face-to-face semi-structured interviews with twenty-three 18-32 year-old fashion-conscious females was completed and new empirical insights are offered. Findings - The study found a high level of involvement in the world of luxury fashion retailing. Asian consumers devoured media commentary, drew inspiration from female celebrities, and treated information-seeking and discussion of luxury fashion brands with friends as a serious and enjoyable pursuit. The social status conferred by expensive fashion wear motivated them to spend on luxury brands even if their discretionary income was limited. Potential guilt in so doing was assuaged by rationalising that the quality was good and the purchase would be long lasting. Marketers targeting this valuable segment should communicate appeals to an aspirational lifestyle in traditional and social media, effective at reaching young women. Originality/value - The study reported in this article contributes to the limited published research into the luxury marketing sector in Asia by examining the buying behaviour of female Strawberry Generation consumers in Taiwan. It is the first to research and investigate the meanings attached to luxury by these individuals in the collectivist culture of Taiwan, as well as their motivations, and the factors influencing their purchase of luxury fashions. The study thus contributes with new knowledge to the buying of luxury fashion products by young female Taiwanese consumers, which may be extended to other collectivist cultures in Asia
    • …
    corecore