Search CORE

27,793 research outputs found

Articulation rate in Swedish child-directed speech increases as a function of the age of the child even when surprisal is controlled for

Author: Bjerva Johannes
Hörberg Thomas
Sjons Johan
Östling Robert
Publication venue
Publication date: 01/01/2017
Field of study

In earlier work, we have shown that articulation rate in Swedish child-directed speech (CDS) increases as a function of the age of the child, even when utterance length and differences in articulation rate between subjects are controlled for. In this paper we show on utterance level in spontaneous Swedish speech that i) for the youngest children, articulation rate in CDS is lower than in adult-directed speech (ADS), ii) there is a significant negative correlation between articulation rate and surprisal (the negative log probability) in ADS, and iii) the increase in articulation rate in Swedish CDS as a function of the age of the child holds, even when surprisal along with utterance length and differences in articulation rate between speakers are controlled for. These results indicate that adults adjust their articulation rate to make it fit the linguistic capacity of the child.Comment: 5 pages, Interspeech 201

arXiv.org e-Print Archive

Crossref

Proceedings - University of Groningen

Publikationer från Stockholms universitet

University of Groningen

ARTS repository - University of Groningen

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Dissertations of the University of Groningen

Size constancy in bat biosonar?

Author: AH Holway
AS Feng
BC Carstens
BD Lawrence
C Hagemann
C Hagemann
CF Moss
CF Moss
CM DeLong
ES Airapetianz
G Joermann
G Rother
G Schuller
GG Kwiecinski
H Hartrige
HR Goerlitz
HR Goerlitz
JA Simmons
JA Simmons
JA Simmons
JJ Wenstrup
JM Foley
L Wiegrebe
Lutz Wiegrebe
LW Gellermann
M Heinrich
Manuel S. Malmierca
ME Bates
Melina Heinrich
MJ Morgan
MW Holderied
N Kopco
P Holler
P Weissenbacher
R Aubauer
R Haber
RA Suthers
RS Heffner
S Mistry
S Schoernich
SO Murray
SP Dear
SP McKee
U Firzlaff
W Thies
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

Perception and encoding of object size is an important feature of sensory systems. In the visual system object size is encoded by the visual angle (visual aperture) on the retina, but the aperture depends on the distance of the object. As object distance is not unambiguously encoded in the visual system, higher computational mechanisms are needed. This phenomenon is termed "size constancy". It is assumed to reflect an automatic re-scaling of visual aperture with perceived object distance. Recently, it was found that in echolocating bats, the 'sonar aperture', i.e., the range of angles from which sound is reflected from an object back to the bat, is unambiguously perceived and neurally encoded. Moreover, it is well known that object distance is accurately perceived and explicitly encoded in bat sonar. Here, we addressed size constancy in bat biosonar, recruiting virtual-object techniques. Bats of the species Phyllostomus discolor learned to discriminate two simple virtual objects that only differed in sonar aperture. Upon successful discrimination, test trials were randomly interspersed using virtual objects that differed in both aperture and distance. It was tested whether the bats spontaneously assigned absolute width information to these objects by combining distance and aperture. The results showed that while the isolated perceptual cues encoding object width, aperture, and distance were all perceptually well resolved by the bats, the animals did not assign absolute width information to the test objects. This lack of sonar size constancy may result from the bats relying on different modalities to extract size information at different distances. Alternatively, it is conceivable that familiarity with a behaviorally relevant, conspicuous object is required for sonar size constancy, as it has been argued for visual size constancy. Based on the current data, it appears that size constancy is not necessarily an essential feature of sonar perception in bats

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

Open Access LMU

PubMed Central

MPG.PuRe

Design and implementation of a user-oriented speech recognition interface: the synergy of technology and human factors

Author: Kloosterman Sietse H.
Publication venue: Elsevier
Publication date: 01/01/1994
Field of study

The design and implementation of a user-oriented speech recognition interface are described. The interface enables the use of speech recognition in so-called interactive voice response systems which can be accessed via a telephone connection. In the design of the interface a synergy of technology and human factors is achieved. This synergy is very important for making speech interfaces a natural and acceptable form of human-machine interaction. Important concepts such as interfaces, human factors and speech recognition are discussed. Additionally, an indication is given as to how the synergy of human factors and technology can be realised by a sketch of the interface's implementation. An explanation is also provided of how the interface might be integrated in different applications fruitfully

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

University of Twente Research Information

Dissertations of the University of Groningen

Recommended from our members

Virtual reality in the rehabilitation of people with intellectual disabilities

Author: Brown DJ
Standen PJ
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/01/2005
Field of study

Virtual reality (VR) possesses many qualities that give it rehabilitative potential for people with intellectual disabilities, both as an intervention and an assessment. It can provide a safe setting in which to practice skills that might carry too many risks in the real world. Unlike human tutors, computers are infinitely patient and consistent. Virtual worlds can be manipulated in ways the real world cannot be and can convey concepts without the use of language or other symbol systems. Published applications for this client group have all been as rehabilitative interventions. These are described in three groups: promoting skills for independent living, enhancing cognitive performance, and improving social skills. Five groups of studies are reviewed that utilize virtual technology to promote skills for independent living: grocery shopping, preparing food, orientation, road safety, and manufacturing skills. Fears that skills or habits learnt in a virtual setting would not transfer to the real world setting have not been supported by the available evidence, apart from those studies with people with autistic spectrum disorders. Future directions are in the development of more applications for independent living skills, exploring interventions for promoting motor and cognitive skills, and the developments of ecologically valid forms of assessment

Nottingham Trent Institutional Repository (IRep)

Focal Spot, Spring 1990

Author
Publication venue: Digital Commons@Becker
Publication date: 01/04/1990
Field of study

https://digitalcommons.wustl.edu/focal_spot_archives/1054/thumbnail.jp

Digital Commons@Becker

Distributional effects and individual differences in L2 morphology learning

Author: Brooks Patricia J.
Kempe Vera
Kwoka Nicole
Publication venue
Publication date: 02/05/2016
Field of study

Second language (L2) learning outcomes may depend on the structure of the input and learners’ cognitive abilities. This study tested whether less predictable input might facilitate learning and generalization of L2 morphology while evaluating contributions of statistical learning ability, nonverbal intelligence, phonological short-term memory, and verbal working memory. Over three sessions, 54 adults were exposed to a Russian case-marking paradigm with a balanced or skewed item distribution in the input. Whereas statistical learning ability and nonverbal intelligence predicted learning of trained items, only nonverbal intelligence also predicted generalization of case-marking inflections to new vocabulary. Neither measure of temporary storage capacity predicted learning. Balanced, less predictable input was associated with higher accuracy in generalization but only in the initial test session. These results suggest that individual differences in pattern extraction play a more sustained role in L2 acquisition than instructional manipulations that vary the predictability of lexical items in the input

Abertay Research Portal

Crossref

Access to recorded interviews: A research agenda

Author: Heeren W.F.L.
Jong F.M.G. de
Oard D.W.
Ordelman R.J.F.
Publication venue: ACM
Publication date: 01/01/2008
Field of study

Recorded interviews form a rich basis for scholarly inquiry. Examples include oral histories, community memory projects, and interviews conducted for broadcast media. Emerging technologies offer the potential to radically transform the way in which recorded interviews are made accessible, but this vision will demand substantial investments from a broad range of research communities. This article reviews the present state of practice for making recorded interviews available and the state-of-the-art for key component technologies. A large number of important research issues are identified, and from that set of issues, a coherent research agenda is proposed

University of Twente Research Information

Spartan Daily, March 9, 1961

Author: San Jose State University School of Journalism and Mass Communications
Publication venue: SJSU ScholarWorks
Publication date: 09/03/1960
Field of study

Volume 48, Issue 81https://scholarworks.sjsu.edu/spartandaily/4135/thumbnail.jp

SJSU ScholarWorks

Loquendo - Politecnico di Torino's 2006 NIST Speaker Recognition Evaluation System

Author: CASTALDO F
COLIBRO D
DALMASSO E
LAFACE P.
VAIR C
Publication venue: ISCA
Publication date
Field of study

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

Author: Aik Lee Kong
Delgado Hector
Evans Nicholas
Kinnunen Tomi
Sahidullah Md
Todisco Massimiliano
Yamagishi Junichi
Publication venue: 'International Speech Communication Association'
Publication date: 26/06/2018
Field of study

International audienceThe now-acknowledged vulnerabilities of automatic speaker verification (ASV) technology to spoofing attacks have spawned interests to develop so-called spoofing countermeasures. By providing common databases, protocols and metrics for their assessment, the ASVspoof initiative was born to spear-head research in this area. The first competitive ASVspoof challenge held in 2015 focused on the assessment of countermeasures to protect ASV technology from voice conversion and speech synthesis spoofing attacks. The second challenge switched focus to the consideration of replay spoofing attacks and countermeasures. This paper describes Version 2.0 of the ASVspoof 2017 database which was released to correct data anomalies detected post-evaluation. The paper contains as-yet unpublished meta-data which describes recording and playback devices and acoustic environments. These support the analysis of replay detection performance and limits. Also described are new results for the official ASVspoof baseline system which is based upon a constant Q cesptral coefficient frontend and a Gaussian mixture model backend. Reported are enhancements to the baseline system in the form of log-energy coefficients and cepstral mean and variance normalisation in addition to an alternative i-vector backend. The best results correspond to a 48% relative reduction in equal error rate when compared to the original baseline system

Crossref

INRIA a CCSD electronic archive server

Edinburgh Research Explorer

EURECOM Repository