91 research outputs found

    Automatic Music Transcription: Breaking the Glass Ceiling

    Automatic music transcription is considered by many to be the Holy Grail in the field of music signal analysis. However, the performance of transcription systems is still significantly below that of a human expert, and accuracies reported in recent years seem to have reached a limit, although the field is still very active. In this paper we analyse limitations of current methods and identify promising directions for future research. Current transcription methods use general purpose models which are unable to capture the rich diversity found in music signals. In order to overcome the limited performance of transcription systems, algorithms have to be tailored to specific use-cases. Semi-automatic approaches are another way of achieving a more reliable transcription. Also, the wealth of musical scores and corresponding audio data now available is a rich potential source of training data, via forced alignment of audio to scores, but large scale utilisation of such data has yet to be attempted. Other promising approaches include the integration of information across different methods and musical aspects.

    Telerobotic Pointing Gestures Shape Human Spatial Cognition

    This paper aimed to explore whether human beings can understand gestures produced by telepresence robots. If so, they should be able to derive the meaning conveyed in telerobotic gestures when processing spatial information. We conducted two experiments over Skype in the present study. Participants were presented with a robotic interface that had arms, which were teleoperated by an experimenter. The robot could point to virtual locations that represented certain entities. In Experiment 1, the experimenter described spatial locations of fictitious objects sequentially in two conditions: speech condition (SO, verbal descriptions clearly indicated the spatial layout) and speech and gesture condition (SR, verbal descriptions were ambiguous but accompanied by robotic pointing gestures). Participants were then asked to recall the objects' spatial locations. We found that the number of spatial locations recalled in the SR condition was on par with that in the SO condition, suggesting that telerobotic pointing gestures compensated for ambiguous speech during the processing of spatial information. In Experiment 2, the experimenter described spatial locations non-sequentially in the SR and SO conditions. Surprisingly, the number of spatial locations recalled in the SR condition was even higher than that in the SO condition, suggesting that telerobotic pointing gestures were more powerful than speech in conveying spatial information when information was presented in an unpredictable order. The findings provide evidence that human beings are able to comprehend telerobotic gestures, and importantly, integrate these gestures with co-occurring speech. This work promotes engaging remote collaboration among humans through a robot intermediary. Comment: 27 pages, 7 figures.

    Communicating Auditory Impairments Using Electroacoustic Composition

    Changes in human sensory perception can occur for a variety of reasons. In the case of distortions or transformations in the human auditory system, the aetiology may include factors such as medical conditions affecting cognition or physiology, interaction of the ears with mechanical waves, or stem from chemically induced sources, such as the consumption of alcohol. These changes may be permanent, intermittent, or temporary. In order to communicate such effects to an audience in an accessible and easily understood manner, a series of electroacoustic compositions were produced. This concept follows on from previous work on the theme of representing auditory hallucinations. Specifically, these compositions relate to auditory impairments that humans can experience due to tinnitus or through the consumption of alcohol. In the case of tinnitus, whilst much is known about the causes and symptoms, the experience of what it is like to live with tinnitus is less explored and those who have acquired the condition may often feel frustration when trying to convey the experience of ‘what it is like’ for them. In terms of impairment from alcohol consumption, whilst there is much hearsay, little research exists on the immediate and short-term effects of alcohol consumption on the human auditory system, despite over half of the UK population reported as consuming alcohol in 2017. The methodology employed to design these compositions draws upon scientific research findings, including experimental and explorative studies involving human participants, coupled with electroacoustic composition techniques. The pieces are typically constructed by mixing field recordings with synthesised materials and applying a range of temporal and frequency domain manipulations to the elements therein. In this way, the listener is able to experience the phenomenon in a recognisable context, where distortions of reality can be emulated to varying degrees.
It is intended that these compositions can serve as easily accessible and understood examples of auditory impairments and that they might find utility in the communication of symptoms to those who have never experienced the underlying causes or conditions. This presents opportunities for pieces like these to be used in scenarios such as education and public health awareness campaigns.

    Impact of Safety-Related Dose Reductions or Discontinuations on Sustained Virologic Response in HCV-Infected Patients: Results from the GUARD-C Cohort.

    BACKGROUND: Despite the introduction of direct-acting antiviral agents for chronic hepatitis C virus (HCV) infection, peginterferon alfa/ribavirin remains relevant in many resource-constrained settings. The non-randomized GUARD-C cohort investigated baseline predictors of safety-related dose reductions or discontinuations (sr-RD) and their impact on sustained virologic response (SVR) in patients receiving peginterferon alfa/ribavirin in routine practice. METHODS: A total of 3181 HCV-mono-infected treatment-naive patients were assigned to 24 or 48 weeks of peginterferon alfa/ribavirin by their physician. Patients were categorized by time-to-first sr-RD (Week 4/12). Detailed analyses of the impact of sr-RD on SVR24 (HCV RNA <50 IU/mL) were conducted in 951 Caucasian, noncirrhotic genotype (G)1 patients assigned to peginterferon alfa-2a/ribavirin for 48 weeks. The probability of SVR24 was identified by a baseline scoring system (range: 0-9 points) on which scores of 5 to 9 and <5 represent high and low probability of SVR24, respectively. RESULTS: SVR24 rates were 46.1% (754/1634), 77.1% (279/362), 68.0% (514/756), and 51.3% (203/396), respectively, in G1, 2, 3, and 4 patients. Overall, 16.9% and 21.8% of patients experienced ≥1 sr-RD for peginterferon alfa and ribavirin, respectively. Among Caucasian noncirrhotic G1 patients: female sex, lower body mass index, pre-existing cardiovascular/pulmonary disease, and low hematological indices were prognostic factors of sr-RD; SVR24 was lower in patients with ≥1 vs. no sr-RD by Week 4 (37.9% vs. 54.4%; P = 0.0046) and Week 12 (41.7% vs. 55.3%; P = 0.0016); sr-RD by Week 4/12 significantly reduced SVR24 in patients with scores <5 but not ≥5. CONCLUSIONS: sr-RD to peginterferon alfa-2a/ribavirin significantly impacts SVR24 rates in treatment-naive G1 noncirrhotic Caucasian patients.
Baseline characteristics can help select patients with a high probability of SVR24 and a low probability of sr-RD with peginterferon alfa-2a/ribavirin. This study was sponsored by F. Hoffmann-La Roche Ltd, Basel, Switzerland. Support for third-party writing assistance for this manuscript, furnished by Blair Jarvis MSc, ELS, of Health Interactions, was provided by F. Hoffmann-La Roche Ltd, Basel, Switzerland.