27,793 research outputs found
Articulation rate in Swedish child-directed speech increases as a function of the age of the child even when surprisal is controlled for
In earlier work, we have shown that articulation rate in Swedish
child-directed speech (CDS) increases as a function of the age of the child,
even when utterance length and differences in articulation rate between
subjects are controlled for. In this paper we show on utterance level in
spontaneous Swedish speech that i) for the youngest children, articulation rate
in CDS is lower than in adult-directed speech (ADS), ii) there is a significant
negative correlation between articulation rate and surprisal (the negative log
probability) in ADS, and iii) the increase in articulation rate in Swedish CDS
as a function of the age of the child holds, even when surprisal along with
utterance length and differences in articulation rate between speakers are
controlled for. These results indicate that adults adjust their articulation
rate to make it fit the linguistic capacity of the child.Comment: 5 pages, Interspeech 201
Size constancy in bat biosonar?
Perception and encoding of object size is an important feature of sensory systems. In the visual system object size is encoded by the visual angle (visual aperture) on the retina, but the aperture depends on the distance of the object. As object distance is not unambiguously encoded in the visual system, higher computational mechanisms are needed. This phenomenon is termed "size constancy". It is assumed to reflect an automatic re-scaling of visual aperture with perceived object distance. Recently, it was found that in echolocating bats, the 'sonar aperture', i.e., the range of angles from which sound is reflected from an object back to the bat, is unambiguously perceived and neurally encoded. Moreover, it is well known that object distance is accurately perceived and explicitly encoded in bat sonar. Here, we addressed size constancy in bat biosonar, recruiting virtual-object techniques. Bats of the species Phyllostomus discolor learned to discriminate two simple virtual objects that only differed in sonar aperture. Upon successful discrimination, test trials were randomly interspersed using virtual objects that differed in both aperture and distance. It was tested whether the bats spontaneously assigned absolute width information to these objects by combining distance and aperture. The results showed that while the isolated perceptual cues encoding object width, aperture, and distance were all perceptually well resolved by the bats, the animals did not assign absolute width information to the test objects. This lack of sonar size constancy may result from the bats relying on different modalities to extract size information at different distances. Alternatively, it is conceivable that familiarity with a behaviorally relevant, conspicuous object is required for sonar size constancy, as it has been argued for visual size constancy. Based on the current data, it appears that size constancy is not necessarily an essential feature of sonar perception in bats
Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors
The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology and human factors is achieved. This synergy is very important for making speech interfaces a natural and acceptable form of human-machine interaction. Important concepts such as interfaces, human factors and speech recognition are discussed. Additionally, an indication is given as to how the synergy of human factors and technology can be realised by a sketch of the interface's implementation. An explanation is also provided of how the interface might be integrated in different applications fruitfully
Recommended from our members
Virtual reality in the rehabilitation of people with intellectual disabilities
Virtual reality (VR) possesses many qualities that give it rehabilitative potential for people with intellectual disabilities, both as an intervention and an assessment. It can provide a safe setting in which to practice skills that might carry too many risks in the real world. Unlike human tutors, computers are infinitely patient and consistent. Virtual worlds can be manipulated in ways the real world cannot be and can convey concepts without the use of language or other symbol systems. Published applications for this client group have all been as rehabilitative interventions. These are described in three groups: promoting skills for independent living, enhancing cognitive performance, and improving social skills. Five groups of studies are reviewed that utilize virtual technology to promote skills for independent living: grocery shopping, preparing food, orientation, road safety, and manufacturing skills. Fears that skills or habits learnt in a virtual setting would not transfer to the real world setting have not been supported by the available evidence, apart from those studies with people with autistic spectrum disorders. Future directions are in the development of more applications for independent living skills, exploring interventions for promoting motor and cognitive skills, and the developments of ecologically valid forms of assessment
Focal Spot, Spring 1990
https://digitalcommons.wustl.edu/focal_spot_archives/1054/thumbnail.jp
Distributional effects and individual differences in L2 morphology learning
Second language (L2) learning outcomes may depend on the structure of the input and learners’ cognitive abilities. This study tested whether less predictable input might facilitate learning and generalization of L2 morphology while evaluating contributions of statistical learning ability, nonverbal intelligence, phonological short-term memory, and verbal working memory. Over three sessions, 54 adults were exposed to a Russian case-marking paradigm with a balanced or skewed item distribution in the input. Whereas statistical learning ability and nonverbal intelligence predicted learning of trained items, only nonverbal intelligence also predicted generalization of case-marking inflections to new vocabulary. Neither measure of temporary storage capacity predicted learning. Balanced, less predictable input was associated with higher accuracy in generalization but only in the initial test session. These results suggest that individual differences in pattern extraction play a more sustained role in L2 acquisition than instructional manipulations that vary the predictability of lexical items in the input
Access to recorded interviews: A research agenda
Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform the way in which recorded interviews are made accessible, but this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state-of-the-art for key component technologies. A large number of important research issues are identified, and from that set of issues, a coherent research agenda is proposed
Spartan Daily, March 9, 1961
Volume 48, Issue 81https://scholarworks.sjsu.edu/spartandaily/4135/thumbnail.jp
ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements
International audienceThe now-acknowledged vulnerabilities of automatic speaker verification (ASV) technology to spoofing attacks have spawned interests to develop so-called spoofing countermeasures. By providing common databases, protocols and metrics for their assessment, the ASVspoof initiative was born to spear-head research in this area. The first competitive ASVspoof challenge held in 2015 focused on the assessment of countermeasures to protect ASV technology from voice conversion and speech synthesis spoofing attacks. The second challenge switched focus to the consideration of replay spoofing attacks and countermeasures. This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation. The paper contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments. These support the analysis of replay detection performance and limits. Also described are new results for the official ASVspoof baseline system which is based upon a constant Q cesptral coefficient frontend and a Gaussian mixture model backend. Reported are enhancements to the baseline system in the form of log-energy coefficients and cepstral mean and variance normalisation in addition to an alternative i-vector backend. The best results correspond to a 48% relative reduction in equal error rate when compared to the original baseline system
- …