1,817 research outputs found

    UMSL Bulletin 2023-2024

    Get PDF
    The 2023-2024 Bulletin and Course Catalog for the University of Missouri St. Louis.https://irl.umsl.edu/bulletin/1088/thumbnail.jp

    Sound Event Detection by Exploring Audio Sequence Modelling

    Get PDF
    Everyday sounds in real-world environments are a powerful source of information by which humans can interact with their environments. Humans can infer what is happening around them by listening to everyday sounds. At the same time, it is a challenging task for a computer algorithm in a smart device to automatically recognise, understand, and interpret everyday sounds. Sound event detection (SED) is the process of transcribing an audio recording into sound event tags with onset and offset time values. This involves classification and segmentation of sound events in the given audio recording. SED has numerous applications in everyday life which include security and surveillance, automation, healthcare monitoring, multimedia information retrieval, and assisted living technologies. SED is to everyday sounds what automatic speech recognition (ASR) is to speech and automatic music transcription (AMT) is to music. The fundamental questions in designing a sound recognition system are, which portion of a sound event should the system analyse, and what proportion of a sound event should the system process in order to claim a confident detection of that particular sound event. While the classification of sound events has improved a lot in recent years, it is considered that the temporal-segmentation of sound events has not improved in the same extent. The aim of this thesis is to propose and develop methods to improve the segmentation and classification of everyday sound events in SED models. In particular, this thesis explores the segmentation of sound events by investigating audio sequence encoding-based and audio sequence modelling-based methods, in an effort to improve the overall sound event detection performance. In the first phase of this thesis, efforts are put towards improving sound event detection by explicitly conditioning the audio sequence representations of an SED model using sound activity detection (SAD) and onset detection. To achieve this, we propose multi-task learning-based SED models in which SAD and onset detection are used as auxiliary tasks for the SED task. The next part of this thesis explores self-attention-based audio sequence modelling, which aggregates audio representations based on temporal relations within and between sound events, scored on the basis of the similarity of sound event portions in audio event sequences. We propose SED models that include memory-controlled, adaptive, dynamic, and source separation-induced self-attention variants, with the aim to improve overall sound recognition

    UMSL Bulletin 2022-2023

    Get PDF
    The 2022-2023 Bulletin and Course Catalog for the University of Missouri St. Louis.https://irl.umsl.edu/bulletin/1087/thumbnail.jp

    VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models

    Full text link
    The VNHSGE (VietNamese High School Graduation Examination) dataset, developed exclusively for evaluating large language models (LLMs), is introduced in this article. The dataset, which covers nine subjects, was generated from the Vietnamese National High School Graduation Examination and comparable tests. 300 literary essays have been included, and there are over 19,000 multiple-choice questions on a range of topics. The dataset assesses LLMs in multitasking situations such as question answering, text generation, reading comprehension, visual question answering, and more by including both textual data and accompanying images. Using ChatGPT and BingChat, we evaluated LLMs on the VNHSGE dataset and contrasted their performance with that of Vietnamese students to see how well they performed. The results show that ChatGPT and BingChat both perform at a human level in a number of areas, including literature, English, history, geography, and civics education. They still have space to grow, though, especially in the areas of mathematics, physics, chemistry, and biology. The VNHSGE dataset seeks to provide an adequate benchmark for assessing the abilities of LLMs with its wide-ranging coverage and variety of activities. We intend to promote future developments in the creation of LLMs by making this dataset available to the scientific community, especially in resolving LLMs' limits in disciplines involving mathematics and the natural sciences.Comment: 74 pages, 44 figure

    The Texture of Everyday Life: Carceral Realism and Abolitionist Speculation

    Get PDF
    Exploring the ways in which prisons shape the subjectivity of free-world thinkers, and the ways that subjectivity is expressed in literary texts, this dissertation develops the concept of carceral realism: a cognitive and literary mode that represents prisons and police as the only possible response to social disorder. As this dissertation illustrates, this form of consciousness is experienced as racial paranoia, and it is expressed literary texts, which reflect and help to reify it. Through this process of cultural reification, carceral realism increasingly insists on itself as the only possible mode of thinking. As I argue, however, carceral realism actually stands in a dialectical relationship to abolitionist speculation, or, the active imagining of a world without prisons and police and/or the conditions necessary to actualize such a world. In much the same way that carceral realism embeds itself in realist literary forms, abolitionist speculation plays a constitutive role in the utopian literary tradition. In order to elaborate these concepts, this dissertation begins with a meta-consideration of how cultural productions by incarcerated people are typically framed. Building upon the work of scholars and incarcerated authors’ own interventions in questions of consciousness, authorship, textual production, and study, this chapter contrasts that typical frame with a method of abolitionist reading. Chapter two applies this methodology to Edward Bunker’s 1977 novel The Animal Factory and Claudia Rankine’s 2010 poem Citizen in order to develop the concept of carceral realism and demonstrate how it has developed from the 1970s to the present. In order to lay out the historical foundations of the modern prison, chapter three looks back to the late 18th century and situates the emergence of the penitentiary within debates regarding race, citizenship, and state power. Returning to the 1970s, chapter four investigates the role universities have played in the formation of carceral realism and the complex relationship Chicanos and Asian Americans have to prisons and police by analogizing the institutionalization of prison literary study to the formation of ethnic studies. Chapter five draws this project to a conclusion by developing the concept of abolitionist speculation, or the active imagining of a world without prisons or the police and/or the conditions necessary to realize such a world, which I identify as both a constitutive generic feature of utopian literature and something that exceeds literature altogether. In doing so, this dissertation establishes an ongoing historical relationship between social reproduction of prisons and literary forms that cuts across time, geography, race, gender, and genre

    Evaluating automated and hybrid neural disambiguation for African historical named entities

    Get PDF
    Documents detailing South African history contain ambiguous names. Ambiguous names may be due to people having the same name or the same person being referred to by multiple different names. Thus when searching for or attempting to extract information about a particular person, the name used may affect the results. This problem may be alleviated by using a Named Entity Disambiguation (NED) system to disambiguate names by linking them to a knowledge base. In recent years, transformer-based language models have led to improvements in NED systems. Furthermore, multilingual language models have shown the ability to learn concepts across languages, reducing the amount of training data required in low-resource languages. Thus a multilingual language model-based NED system was developed to disambiguate people's names within a historical South African context using documents written in English and isiZulu from the 500 Year Archive (FHYA). The multilingual language model-based system substantially improved on a probability-based baseline and achieved a micro F1-score of 0.726. At the same time, the entity linking component was able to link 81.9% of the mentions to the correct entity. However, the system's performance on documents written in isiZulu was significantly lower than on the documents written in English. Thus the system was augmented with handcrafted rules to improve its performance. The addition of handcrafted rules resulted in a small but significant improvement in performance when compared to the unaugmented NED system

    Predicate Matrix: an interoperable lexical knowledge base for predicates

    Get PDF
    183 p.La Matriz de Predicados (Predicate Matrix en inglés) es un nuevo recurso léxico-semántico resultado de la integración de múltiples fuentes de conocimiento, entre las cuales se encuentran FrameNet, VerbNet, PropBank y WordNet. La Matriz de Predicados proporciona un léxico extenso y robusto que permite mejorar la interoperabilidad entre los recursos semánticos mencionados anteriormente. La creación de la Matriz de Predicados se basa en la integración de Semlink y nuevos mappings obtenidos utilizando métodos automáticos que enlazan el conocimiento semántico a nivel léxico y de roles. Asimismo, hemos ampliado la Predicate Matrix para cubrir los predicados nominales (inglés, español) y predicados en otros idiomas (castellano, catalán y vasco). Como resultado, la Matriz de predicados proporciona un léxico multilingüe que permite el análisis semántico interoperable en múltiples idiomas

    Viewpoint Diversity in Search Results

    Get PDF
    Adverse phenomena such as the search engine manipulation effect (SEME), where web search users change their attitude on a topic following whatever most highly-ranked search results promote, represent crucial challenges for research and industry. However, the current lack of automatic methods to comprehensively measure or increase viewpoint diversity in search results complicates the understanding and mitigation of such effects. This paper proposes a viewpoint bias metric that evaluates the divergence from a pre-defined scenario of ideal viewpoint diversity considering two essential viewpoint dimensions (i.e., stance and logic of evaluation). In a case study, we apply this metric to actual search results and find considerable viewpoint bias in search results across queries, topics, and search engines that could lead to adverse effects such as SEME. We subsequently demonstrate that viewpoint diversity in search results can be dramatically increased using existing diversification algorithms. The methods proposed in this paper can assist researchers and practitioners in evaluating and improving viewpoint diversity in search results.</p

    The Perception of K-12 Instrumental Directors in Low-Income Areas on Virtual Learning with Skill Development and Retention

    Get PDF
    Due to the extreme measures taken to protect students from COVID-19 during the pandemic, schools closed their doors, and educators struggled to continue teaching through virtual learning platforms. Performance-based classrooms were encouraged to discover new methods and strategies to motivate students to thrive even though face-to-face rehearsals were restricted. This study examined the experiences secondary music education instrumentalists faced while attempting to utilize synchronous and asynchronous instruction in a 100 percent virtual performance-based environment. This study aimed to understand the negative and positive effects placed on secondary instrumentalists’ performance abilities, fundamental development, and participation/retention since the introduction of virtual learning in low-income areas. The focus of this study also examined the possible benefits of enhancing pedagogical skills through the addition of technological advances to push instrumental instruction and performances on the secondary level. This study followed a qualitative hermeneutic phenomenology design. Music educators in low-income DeKalb County communities were interviewed for this study. Participants were requested to share their perspectives and experiences of performance-based virtual learning and results. The study raised the need for future discussions to create and implement a state and national virtual music education guideline that would assist music educators in turning a devastating situation into a blessing for all art programs and their stakeholders
    • …
    corecore