13,171 research outputs found

    SHOE: The extraction of hierarchical structure for machine learning of natural language


    Gender detection in children’s speech utterances for human-robot interaction

    Human speech inherently carries paralinguistic information that is exploited in many real-time applications. Detecting a child's gender from speech is considered a more challenging task than detecting an adult's. In this study, a system for human-robot interaction (HRI) is proposed that detects gender from children's speech utterances without depending on the text. The robot's perception comprises three phases. In the feature extraction phase, four formants are measured at each glottal pulse and a median is computed across these measurements; from the medians, three feature types are derived: formant average (AF), formant dispersion (DF), and formant position (PF). In the feature standardization phase, the measured feature dimensions are standardized using the z-score method. In the semantic understanding phase, the child's gender is detected using a logistic regression classifier. At the same time, the robot's action is delivered as a speech response using the text-to-speech (TTS) technique. Experiments were conducted on the Carnegie Mellon University (CMU) Kids dataset to measure the suggested system's performance. The suggested system reaches an overall accuracy of 98%, a relatively clear improvement of up to 13% in accuracy compared to related works that utilized the CMU Kids dataset.
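The formant-derived feature pipeline described above can be sketched as follows. This is a minimal NumPy/scikit-learn sketch, not the authors' implementation: the exact AF/DF/PF formulas are illustrative assumptions, since the abstract does not spell them out.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def median_formants(tracks):
    """tracks: (n_pulses, 4) array of F1..F4 in Hz, one row per glottal pulse.
    Returns the per-utterance median of each formant."""
    return np.median(tracks, axis=0)

def build_features(F):
    """F: (n_utterances, 4) median formants per utterance.
    Returns a (n_utterances, 3) matrix of [AF, DF, PF] features."""
    af = F.mean(axis=1)                       # formant average (AF)
    df = (F[:, 3] - F[:, 0]) / 3.0            # formant dispersion (DF): mean spacing
    z = (F - F.mean(axis=0)) / F.std(axis=0)  # z-score each formant across utterances
    pf = z.mean(axis=1)                       # formant position (PF): mean standardized formant
    return np.column_stack([af, df, pf])

# Usage (hypothetical labels, e.g. 0 = girl, 1 = boy):
# clf = LogisticRegression().fit(build_features(F_train), y_train)
# y_pred = clf.predict(build_features(F_test))
```

The z-scoring inside `build_features` plays the role of the standardization phase; in a real system it would be fit on training data only and reapplied to test data.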

    A process-oriented language for describing aspects of reading comprehension

    Includes bibliographical references (p. 36-38). The research described herein was supported in part by the National Institute of Education under Contract No. MS-NIE-C-400-76-011.

    c

    In this article, we describe and interpret a set of acoustic and linguistic features that characterise emotional/emotion-related user states, confined to the one database processed: four classes in a German corpus of children interacting with a pet robot. To this end, we collected a very large feature vector consisting of more than 4,000 features extracted at different sites. We performed extensive feature selection (Sequential Forward Floating Search) for seven acoustic and four linguistic feature types, ending up with a small number of 'most important' features, which we interpret by discussing the impact of different feature and extraction types. We establish different measures of impact and discuss the mutual influence of acoustics and linguistics.
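The selection step named above can be illustrated with a minimal Sequential Forward Floating Search. This is a generic sketch under stated assumptions (logistic-regression estimator, 3-fold cross-validation score, fixed target size `k`), not the authors' implementation.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def sffs(X, y, k, estimator=None):
    """Sequential Forward Floating Search (SFFS): greedy forward
    inclusion followed by conditional backward exclusion steps."""
    est = estimator or LogisticRegression(max_iter=1000)
    score = lambda idx: cross_val_score(est, X[:, idx], y, cv=3).mean()
    selected = []
    while len(selected) < k:
        # forward step: add the single best remaining feature
        remaining = [j for j in range(X.shape[1]) if j not in selected]
        best = max(remaining, key=lambda j: score(selected + [j]))
        selected.append(best)
        # floating step: drop any earlier feature whose removal improves
        # the score (never the feature just added, to avoid cycling)
        improved = True
        while improved and len(selected) > 2:
            improved = False
            current = score(selected)
            for j in [i for i in selected if i != best]:
                if score([i for i in selected if i != j]) > current:
                    selected.remove(j)
                    improved = True
                    break
    return selected
```

The floating (conditional exclusion) step is what distinguishes SFFS from plain forward selection: a feature added early can later be discarded if it becomes redundant.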

    USING DEEP LEARNING-BASED FRAMEWORK FOR CHILD SPEECH EMOTION RECOGNITION

    Biological signals of the body through which human emotion can be detected abound, including heart rate, facial expressions, movement of the eyelids and dilation of the eyes, body posture, skin conductance, and even the speech we make. Speech emotion recognition research started some three decades ago, and the popular Interspeech Emotion Challenge has helped to propagate this research area. However, most speech emotion recognition research focuses on adults, and there is very little research on child speech. This dissertation describes the development and evaluation of a child speech emotion recognition framework. The higher-level components of the framework are designed to sort and separate speech by the speaker's age, ensuring that focus is only on speech produced by children. The framework uses Baddeley's Theory of Working Memory to model a Working Memory Recurrent Network that can process and recognize emotions from speech. Baddeley's theory offers one of the best explanations of how the human brain holds and manipulates temporary information, which is crucial for developing neural networks that learn effectively. Experiments were designed and performed to answer the research questions, evaluate the proposed framework, and benchmark its performance against other methods. Satisfactory results were obtained, and in many cases the framework outperformed other popular approaches. This study has implications for various applications of child speech emotion recognition, such as child abuse detection and child learning robots.

    Contexts for writing: understanding the child’s perspective

    The integration of social theories into a cognitive explanation of the composing process enlarges our notion of context, calling attention to the historical, social and ideological forces that shape the making of knowledge in educational settings. These approaches suggest that context cues certain actions and that students gain entry into academic contexts if they learn the appropriate forms and discourse conventions. However, methodological approaches to teaching do not address how individuals construct meaning, use knowledge for their own purposes, or engage in reflective processes that influence how they will act in a socially-governed situation. Nor do they address how school-acquired knowledge may be transformed to enable individual students to take ownership of their writing. These concerns motivate the attempt to form a cognitive-social epistemic that acknowledges and explains the role of the individual in constructing meaning within culturally-organized activities in primary educational systems. Through questionnaires, interviews and classroom observations, and applying qualitative analytical procedures, the study discloses layers of complexity in a multi-level description of the ways context and cognition interact. At the general level, a comparative analysis of teachers' and pupils' rationales underlying given writing tasks produces converging references to the educational purposes for writing. At a deeper level, the finding that writing possibilities and social possibilities are dynamically interlinked with the emergence of identity suggests that learning is a constructive process of meaning-making, uniquely manifested in diverse ways. Studies of classroom interaction determine the impact of strategies deployed within classroom communication to control the meaning-making process, and make it possible to discuss the efficacy of peer interaction in the classroom.
    A second strand of context-oriented research in a non-school setting, which incorporates the computer as a writing tool, reinforces the view that children are primarily social players negotiating roles and relationships by whatever mediational means are made available to them. In light of these results, the thesis acknowledges the complexity of a largely implicit cultural architecture for directing the context of action, and concludes that this structure will be explicated only by adopting an inclusive research strategy that encompasses simultaneously acting influences.

    Frustration recognition from speech during game interaction using wide residual networks

    Background
    Although frustration is a common emotional reaction during game play, an excessive level of frustration can harm the user's experience, discouraging further game interactions. The automatic detection of players' frustration enables the development of adaptive systems which, through real-time difficulty adjustment, adapt the game to the user's specific needs, thus maximising the player's experience and the game's success. To this end, we present our speech-based approach for the automatic detection of frustration during game interactions, a task still under-explored in research.
    Method
    The experiments were performed on the Multimodal Game Frustration Database (MGFD), an audiovisual dataset, collected within a Wizard-of-Oz framework, specially tailored to investigate verbal and facial expressions of frustration during game interactions. We explored the performance of a variety of acoustic feature sets, including Mel-spectrograms and Mel-Frequency Cepstral Coefficients (MFCCs), as well as the low-dimensional knowledge-based acoustic feature set eGeMAPS. Given the steady improvements achieved by Convolutional Neural Networks (CNNs) in speech recognition tasks, and unlike the MGFD baseline (a Long Short-Term Memory (LSTM) architecture with a Support Vector Machine (SVM) classifier), in the present work we consider commonly used CNNs, including ResNets, VGG, and AlexNet. Furthermore, given the still open debate on the suitability of shallow vs deep networks, we also examine the performance of two of the latest deep CNNs, i.e., WideResNets and EfficientNet.
    Results
    Our best result, achieved with WideResNets and Mel-spectrogram features, increases the system performance from 58.8% Unweighted Average Recall (UAR) to 93.1% UAR for speech-based automatic frustration recognition.
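The log-Mel spectrogram front end used as CNN input above can be sketched in plain NumPy. Parameter values (16 kHz sampling, 512-point FFT, 160-sample hop, 40 Mel bands) are illustrative assumptions, not the paper's settings; production systems typically use librosa or torchaudio instead.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_spectrogram(y, sr=16000, n_fft=512, hop=160, n_mels=40):
    """Log-Mel spectrogram: framing + Hann window + |FFT|^2 + Mel filterbank."""
    # slice the signal into overlapping windowed frames
    n_frames = 1 + (len(y) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = y[idx] * np.hanning(n_fft)
    power = np.abs(np.fft.rfft(frames, n_fft)) ** 2   # (n_frames, n_fft//2 + 1)
    # triangular Mel filterbank, equally spaced on the Mel scale
    mels = np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mels) / sr).astype(int)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for m in range(1, n_mels + 1):
        l, c, r = bins[m - 1], bins[m], bins[m + 1]
        fb[m - 1, l:c] = (np.arange(l, c) - l) / max(c - l, 1)   # rising slope
        fb[m - 1, c:r] = (r - np.arange(c, r)) / max(r - c, 1)   # falling slope
    return np.log(power @ fb.T + 1e-10)               # (n_frames, n_mels)
```

The resulting (frames x Mel-bands) matrix is treated as a single-channel image, which is what makes image-style CNNs such as WideResNets applicable to speech.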

    The effects of music on brain development

    The influence of music on brain development is a complex and widely studied area of science. Studies show that exposure to music, especially in the early stages of life, can improve cognitive abilities such as language, reasoning, and spatial-temporal skills. Engaging with music also helps in regulating emotions and developing social skills. Furthermore, music education supports creativity and self-expression, which are crucial for overall brain development. With a deeper understanding of how music affects the brain at a neurological level, educators, therapists, and parents can leverage its advantages to enhance well-being and cognitive function for people of all ages.