ELVIS: Entertainment-led video summaries
© ACM, 2010. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in ACM Transactions on Multimedia Computing, Communications, and Applications, 6(3): Article no. 17 (2010) http://doi.acm.org/10.1145/1823746.1823751
Video summaries present the user with a condensed and succinct representation of the content of a video stream. Usually this is achieved by attaching degrees of importance to low-level image, audio and text features. However, video content elicits strong and measurable physiological responses in the user, which are potentially rich indicators of what video content is memorable to or emotionally engaging for an individual user. This article proposes a technique that exploits such physiological responses to a given video stream by a given user to produce Entertainment-Led VIdeo Summaries (ELVIS). ELVIS is made up of five analysis phases which correspond to the analyses of five physiological response measures: electro-dermal response (EDR), heart rate (HR), blood volume pulse (BVP), respiration rate (RR), and respiration amplitude (RA). Through these analyses, the temporal locations of the most entertaining video subsegments, as they occur within the video stream as a whole, are automatically identified. The effectiveness of the ELVIS technique is verified through a statistical analysis of data collected during a set of user trials. Our results show that ELVIS is more consistent than RANDOM, EDR, HR, BVP, RR and RA selections in identifying the most entertaining video subsegments for content in the comedy, horror/comedy, and horror genres. Subjective user reports also reveal that ELVIS video summaries are comparatively easy to understand, enjoyable, and informative.
Is there still a place for the concept of therapeutic regression in psychoanalysis?
The author uses his own failure to find a place for the idea of therapeutic regression in his clinical thinking or practice as the basis for an investigation into its meaning and usefulness. He makes a distinction between three ways the term ‘regression’ is used in psychoanalytic discourse: as a way of evoking a primitive level of experience; as a reminder in some clinical situations of the value of non-intervention on the part of the analyst; and as a description of a phase of an analytic treatment with some patients where the analyst needs to put aside normal analytic technique in order to foster a regression in the patient. It is this third meaning, which the author terms “therapeutic regression”, that this paper examines, principally by means of an extended discussion of two clinical examples of a patient making a so-called therapeutic regression, one given by Winnicott and the other by Masud Khan. The author argues that in these examples the introduction of the concept of therapeutic regression obscures rather than clarifies the clinical process. He concludes that, as a substantial clinical concept, the idea of therapeutic regression has outlived its usefulness. However, he also notes that many psychoanalytic writers continue to find a use for the more generic concept of regression, and that the very engagement with the more particular idea of therapeutic regression has value in provoking questions as to what is truly therapeutic in psychoanalytic treatment.
How Local is the Local Diversity? Reinforcing Sequential Determinantal Point Processes with Dynamic Ground Sets for Supervised Video Summarization
The large volume of video content and high viewing frequency demand automatic
video summarization algorithms, of which a key property is the capability of
modeling diversity. If videos are lengthy like hours-long egocentric videos, it
is necessary to track the temporal structures of the videos and enforce local
diversity. Local diversity means that the shots selected from a short
time span are diverse, while visually similar shots may still co-exist in
the summary if they appear far apart in the video. In this paper, we propose a
novel probabilistic model, built upon SeqDPP, to dynamically control the time
span of a video segment upon which the local diversity is imposed. In
particular, we enable SeqDPP to learn to automatically infer how local the
local diversity is supposed to be from the input video. The resulting model is
extremely involved to train by the hallmark maximum likelihood estimation
(MLE), which further suffers from the exposure bias and non-differentiable
evaluation metrics. To tackle these problems, we instead devise a reinforcement
learning algorithm for training the proposed model. Extensive experiments
verify the advantages of our model and the new learning algorithm over
MLE-based methods.
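The idea of local diversity described in this abstract can be illustrated with a small sketch. This is not SeqDPP and not the authors' reinforcement-learning model; it is a simple greedy filter, written under the assumption that each shot is represented by a feature vector. The names `select_locally_diverse`, `window`, and `max_sim` are hypothetical and introduced only for illustration. Note that the window here is fixed by hand, whereas the abstract's contribution is to infer it from the input video.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two non-zero feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def select_locally_diverse(features, window, max_sim):
    """Greedy illustration of local diversity (hypothetical helper).

    A shot is kept unless it is too similar (cosine > max_sim) to an
    already selected shot that lies at most `window` positions earlier.
    Similar shots that are far apart in time can therefore both be kept.
    """
    selected = []
    for i, feat in enumerate(features):
        clash = any(
            i - j <= window and cosine(feat, features[j]) > max_sim
            for j in selected
        )
        if not clash:
            selected.append(i)
    return selected


# Toy shot features (assumed): shots 0, 1 and 4 look alike, as do shots 2 and 3.
shots = [(1.0, 0.0), (0.9, 0.1), (0.0, 1.0), (0.1, 0.9), (1.0, 0.0)]

# Short window: shot 4 repeats shot 0 but is far enough away to be kept.
summary = select_locally_diverse(shots, window=2, max_sim=0.95)
print(summary)  # [0, 2, 4]

# Longer window: diversity is enforced over more of the video, so shot 4 is dropped.
print(select_locally_diverse(shots, window=4, max_sim=0.95))  # [0, 2]
```

Changing `window` alone changes the summary, which is the quantity the abstract proposes to learn rather than fix in advance.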
Research priorities relating to communication and swallowing for people with learning disabilities across the lifespan
Purpose
This research priority setting partnership (PSP) aims to collaboratively identify the “top ten” research priorities relating to communication and swallowing for children and adults with learning disabilities, across the lifespan in the UK, using a modified James Lind Alliance approach.
Design/methodology/approach
A steering group and reference group were established to oversee the PSP. A survey of speech and language therapists (SLTs) resulted in 157 research suggestions. These were further developed into 95 research questions through a multi-stakeholder workshop. Questions were prioritised via an online card-sort activity completed by SLTs, health-care or education professionals and carers. Research questions were analysed thematically. Ten adults with learning disabilities were supported to assign ratings to themes reflecting their prioritisation. The top ten research priorities were identified by combining results from these activities.
Findings
The top ten research priorities related to intervention, outcome measurement and service delivery around communication and dysphagia.
Originality/value
To the best of the authors’ knowledge, this is the first UK-wide research PSP on learning disabilities and speech and language therapy across the lifespan. It uses a novel approach to incorporate the preferences of people with learning disabilities in the prioritisation.
The Fastest Flights in Nature: High-Speed Spore Discharge Mechanisms among Fungi
BACKGROUND: A variety of spore discharge processes have evolved among the fungi. Those with the longest ranges are powered by hydrostatic pressure and include "squirt guns" that are most common in the Ascomycota and Zygomycota. In these fungi, fluid-filled stalks that support single spores or spore-filled sporangia, or cells called asci that contain multiple spores, are pressurized by osmosis. Because spores are discharged at such high speeds, most of the information on launch processes from previous studies has been inferred from mathematical models and is subject to a number of errors. METHODOLOGY/PRINCIPAL FINDINGS: In this study, we have used ultra-high-speed video cameras running at maximum frame rates of 250,000 fps to analyze the entire launch process in four species of fungi that grow on the dung of herbivores. For the first time we have direct measurements of launch speeds and empirical estimates of acceleration in these fungi. Launch speeds ranged from 2 to 25 m s⁻¹ and corresponding accelerations of 20,000 to 180,000 g propelled spores over distances of up to 2.5 meters. In addition, quantitative spectroscopic methods were used to identify the organic and inorganic osmolytes responsible for generating the turgor pressures that drive spore discharge. CONCLUSIONS/SIGNIFICANCE: The new video data allowed us to test different models for the effect of viscous drag and identify errors in the previous approaches to modeling spore motion. The spectroscopic data show that high-speed spore discharge mechanisms in fungi are powered by the same levels of turgor pressure that are characteristic of fungal hyphae and do not require any special mechanisms of osmolyte accumulation.
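To put the figures quoted in this abstract into SI terms, the following short sketch converts the reported accelerations from g to m s⁻² and works out how far a spore travels between consecutive frames at the stated maximum frame rate. It uses only the numbers given above plus standard gravity (9.80665 m s⁻²); the function names are hypothetical, and the calculation assumes the spore moves at its launch speed across a single frame interval.

```python
STANDARD_GRAVITY = 9.80665  # m/s^2, conventional standard value of g


def g_to_si(accel_in_g):
    """Convert an acceleration expressed in multiples of g to m/s^2."""
    return accel_in_g * STANDARD_GRAVITY


def frame_interval_s(fps):
    """Time between consecutive frames, in seconds."""
    return 1.0 / fps


def travel_per_frame_m(speed_m_s, fps):
    """Distance covered in one frame interval at constant speed (assumption)."""
    return speed_m_s / fps


FPS = 250_000  # maximum frame rate quoted in the abstract

low_accel = g_to_si(20_000)    # about 1.96e5 m/s^2
high_accel = g_to_si(180_000)  # about 1.77e6 m/s^2
interval = frame_interval_s(FPS)           # 4 microseconds
slow_step = travel_per_frame_m(2.0, FPS)   # 8 micrometres per frame
fast_step = travel_per_frame_m(25.0, FPS)  # 100 micrometres per frame

print(f"{low_accel:.3e} to {high_accel:.3e} m/s^2")
print(f"frame interval: {interval * 1e6:.1f} us")
print(f"travel per frame: {slow_step * 1e6:.0f} to {fast_step * 1e6:.0f} um")
```

Even at 250,000 fps, the fastest spores move on the order of 100 µm between frames, which gives a sense of why such frame rates were needed to capture the launch directly.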