380 research outputs found

    SAFE: A System for Extraction and Retrieval of Semantic Audio Descriptors

    Keywords: semantic audio, VST plugins, data collection, ISMIR demo. In this paper, we present an overview of the Semantic Audio Feature Extraction (SAFE) Project, a system for the extraction and retrieval of semantic descriptions of musical timbre, deployed within the digital audio workstation. By embedding the data capture system into the music production workflow, we are able to maximise the return of semantically annotated music production data, whilst mitigating against issues such as musical and environmental bias. Users of the plugins are free to submit semantic descriptions of their own music, whilst utilising the continually growing collaborative dataset of musical descriptors. In order to provide more contextually representative timbral transformations, the dataset is partitioned using metadata, captured within the application.

    Towards Fully Integrated Real-time Detection Framework for Online Contents Analysis - RED-Alert Approach

    Social media is used extensively and continues to gain popularity as network capacity, connectivity, and speed increase. Moreover, affordable data plans, especially mobile data packages, have considerably increased the use of multimedia. These users include terrorists, who use social media platforms to promote their ideology and intimidate their adversaries. It is therefore very important to develop automated solutions that semantically analyse online content to assist law enforcement agencies in the preventive policing of online activities. A major challenge for social media forensic analysis is to preserve the privacy of citizens who use online social networking platforms. This paper presents results of the European H2020 project RED-Alert, which aims to enable secure and privacy-preserving data processing so that malicious content and the corresponding personality can be ethically tracked. We have mined seven social media channels for content and provide support for ten languages for analysis. Our proposed solution is designed to ensure security and policing of online content by detecting terrorist material. We have used social network analysis, speech recognition, and face and object detection, as well as audio event detection, to extract information from online sources, which is then fed into a complex event processor. We discuss the challenges and prospects of this work, especially the need to analyse online content while respecting European and national data protection laws, notably GDPR.

    The structure of fluid trifluoromethane and methylfluoride

    We present hard X-ray and neutron diffraction measurements on the polar fluorocarbons HCF3 and H3CF under supercritical conditions and for a range of molecular densities spanning about a factor of ten. The Levesque-Weiss-Reatto inversion scheme has been used to deduce the site-site potentials underlying the measured partial pair distribution functions. The orientational correlations between adjacent fluorocarbon molecules, which are characterized by quite large dipole moments but no tendency to form hydrogen bonds, are small compared to a highly polar system like fluid hydrogen chloride. In fact, the orientational correlations in HCF3 and H3CF are found to be nearly as small as those of fluid CF4, a fluorocarbon with no dipole moment. Comment: 11 pages, 9 figures.

    Semantic Description of Timbral Transformations in Music Production

    In music production, descriptive terminology is used to define perceived sound transformations. By understanding the underlying statistical features associated with these descriptions, we can aid the retrieval of contextually relevant processing parameters using natural language, and create intelligent systems capable of assisting in audio engineering. In this study, we present an analysis of a dataset containing descriptive terms gathered using a series of processing modules, embedded within a Digital Audio Workstation. By applying hierarchical clustering to the audio feature space, we show that similarity in term representations exists within and between transformation classes. Furthermore, the organisation of terms in low-dimensional timbre space can be explained using perceptual concepts such as size and dissonance. We conclude by performing Latent Semantic Indexing to show that similar groupings exist based on term frequency.

    The Structure, Dynamics and Electronic Structure of Liquid Ag-Se Alloys Investigated by Ab Initio Simulation

    Ab initio molecular-dynamics simulations have been used to investigate the structure, dynamics and electronic properties of the liquid alloy Ag(1-x)Se(x) at 1350 K and at the three compositions x=0.33, 0.42 and 0.65. The calculations are based on density-functional theory in the local density approximation and on the pseudopotential plane-wave method. The reliability of the simulations is confirmed by detailed comparisons with very recent neutron diffraction results for the partial structure factors and radial distribution functions (RDF) of the stoichiometric liquid Ag2Se. The simulations show a dramatic change of the Se-Se RDF with increasing Se content. This change is due to the formation of Se clusters bound by covalent bonds, the Se-Se bond length being almost the same as in pure c-Se and l-Se. The clusters are predominantly chain-like, but for higher x a large fraction of 3-fold coordinated Se atoms is also found. It is shown that the equilibrium fractions of Se present as isolated atoms and in clusters can be understood on a simple charge-balance model based on an ionic interpretation. The Ag and Se diffusion coefficients both increase with Se content, in spite of the Se clustering. An analysis of the Se-Se bond dynamics reveals surprisingly short bond lifetimes of less than 1 ps. The changes in the density of states with composition arise directly from the formation of Se-Se covalent bonds. Results for the electronic conductivity obtained using the Kubo-Greenwood approximation are in adequate agreement with experiment for l-Ag2Se, but not for the high Se contents. Possible reasons for this are discussed. Comment: 14 pages, Revtex, 14 Postscript figures embedded in the text.

    Aphasia and Spirituality: the feasibility of assessment and intervention using WELLHEAD and SHALOM

    Background: Spiritual aspects of aphasia rehabilitation are poorly understood, though identified within adjustment. Existing spiritual health assessments have not been used with people with aphasia, and no structured program to facilitate intervention has been documented, despite acknowledgments that spirituality is important in health and wellbeing and distinct from quality of life and mental health. Aims: Mixed methods were used to investigate the accessibility and acceptability of a spiritual health assessment (SHALOM) and WELLHEAD, a toolkit originated by Mumby for spiritual health assessment and intervention, using the religiously neutral dimensions of ‘WIDE, LONG, HIGH and DEEP’. Method: A Steering group (five people with aphasia) shaped the feasibility study cyclically, agreeing that ‘Meaning and purpose’ defined spirituality, and selecting SHALOM. WELLHEAD was modified collaboratively with the Steering group and Hospital Chaplain. A convenience sample of 10 people with aphasia (discharged from therapy) represented diverse aphasia histories, ages and religious backgrounds. Participants completed a two-hour session using SHALOM, the WELLHEAD toolkit and a feedback questionnaire within video-recorded interviews. Quantitative results from all three components were integrated with a qualitative thematic analysis in NVivo 11 including numerical and descriptive summaries verified by the participants, feedback interview transcripts and field notes with reflections. The thematic analysis was systematically and independently verified by a co-researcher. Feedback from participants was further verified by incorporating their comments from reviewing the overall findings. Results: Quantitative and qualitative feedback evaluated the materials positively. Thematic analysis provided evidence of the accessibility, acceptability and positive impact of WELLHEAD irrespective of aphasia severity or aetiology, and religious background. 
    ‘Belief’, ‘Faith’ and ‘Religion’ were disambiguated. SHALOM was also linguistically and cognitively accessible with communication support even for those with severe aphasia. Scores from WELLHEAD and SHALOM were compared and set into the context of wider standardisation of SHALOM, providing the first evidence of spiritual health measures in participants with aphasia. Conclusions: This preliminary work lays foundations for spiritual assessment and intervention in aphasia. Establishing the psychometric properties of SHALOM and WELLHEAD in people with aphasia requires a larger sample. Additional study of intervention is proposed, with clear potential for wider application of WELLHEAD in diverse settings and populations.

    Motivational interviewing for low mood and adjustment early after stroke: a feasibility randomised trial

    Background Management of psychological adjustment and low mood after stroke can result in positive health outcomes. We have adapted a talk-based therapy, motivational interviewing (MI), and shown it to be potentially effective for managing low mood and supporting psychological adjustment post-stroke in a single-centre trial. In the current study, we aimed to explore the feasibility of delivering MI using clinical stroke team members, and using an attention control (AC), to inform the protocol for a future definitive trial. Methods This parallel two-arm feasibility trial took place in north-west England. Recruitment occurred between December 2012 and November 2013. Participants were stroke patients aged 18 years or over, who were medically stable, had no severe communication problems, and were residents of the hospital catchment. Randomisation was to MI or AC, and was conducted by a researcher not involved in recruitment using opaque sealed envelopes. The main outcome measures were descriptions of study feasibility (recruitment/retention rates, MI delivery by clinical staff, use of AC) and acceptability (through qualitative interviews and completion of study measures), and fidelity to MI and AC (through review of session audio-recordings). Information was also collected on participants’ mood, quality of life, adjustment, and resource-use. Results Over 12 months, 461 patients were screened, 124 were screened eligible, and 49 were randomised: 23 to MI, 26 to AC. At 3 months, 13 MI and 18 AC participants completed the follow-up assessment (63% retention). This was less than expected based on our original trial. An AC was successfully implemented. Alternative approaches would be required to ensure the feasibility of clinical staff delivering MI. The study measures, MI, and AC interventions were considered acceptable, and there was good fidelity to the interventions. There were no adverse events related to study participation. 
    Conclusions It was possible to recruit and retain participants, train clinical staff to deliver MI, and implement an appropriate AC. Changes would be necessary to conduct a future multi-centre trial, including: assuming a recruitment rate lower than that in the current study; implementing more strategies to increase participant retention; and considering alternative clinical staff groups to undertake the delivery of MI and AC.

    An EMA analysis of the effect of increasing word length on consonant production in apraxia of speech: A case study

    The effect of increasing word length on the articulatory dynamics (i.e. duration, distance, maximum acceleration, maximum deceleration, and maximum velocity) of consonant production in acquired apraxia of speech (AOS) was investigated using electromagnetic articulography (EMA). Tongue-tip and tongue-back movements of one apraxic patient were recorded using the AG-200 EMA system during word-initial consonant productions in one-, two-, and three-syllable words. Significantly deviant articulatory parameters were recorded for each of the target consonants in one-, two-, and three-syllable words. Word length effects were most evident during the release phase of target consonant productions. The results are discussed with respect to theories of speech motor control as they relate to AOS.