436 research outputs found
Sound Event Detection by Exploring Audio Sequence Modelling
Everyday sounds in real-world environments are a powerful source of information by which humans can interact with their environments. Humans can infer what is happening around them by listening to everyday sounds. At the same time, it is a challenging task for a computer algorithm in a smart device to automatically recognise, understand, and interpret everyday sounds. Sound event detection (SED) is the process of transcribing an audio recording into sound event tags with onset and offset time values. This involves classification and segmentation of sound events in the given audio recording. SED has numerous applications in everyday life which include security and surveillance, automation, healthcare monitoring, multimedia information retrieval, and assisted living technologies. SED is to everyday sounds what automatic speech recognition (ASR) is to speech and automatic music transcription (AMT) is to music. The fundamental questions in designing a sound recognition system are, which portion of a sound event should the system analyse, and what proportion of a sound event should the system process in order to claim a confident detection of that particular sound event. While the classification of sound events has improved a lot in recent years, it is considered that the temporal-segmentation of sound events has not improved in the same extent. The aim of this thesis is to propose and develop methods to improve the segmentation and classification of everyday sound events in SED models. In particular, this thesis explores the segmentation of sound events by investigating audio sequence encoding-based and audio sequence modelling-based methods, in an effort to improve the overall sound event detection performance. In the first phase of this thesis, efforts are put towards improving sound event detection by explicitly conditioning the audio sequence representations of an SED model using sound activity detection (SAD) and onset detection. To achieve this, we propose multi-task learning-based SED models in which SAD and onset detection are used as auxiliary tasks for the SED task. The next part of this thesis explores self-attention-based audio sequence modelling, which aggregates audio representations based on temporal relations within and between sound events, scored on the basis of the similarity of sound event portions in audio event sequences. We propose SED models that include memory-controlled, adaptive, dynamic, and source separation-induced self-attention variants, with the aim to improve overall sound recognition
Computer Vision and Architectural History at Eye Level:Mixed Methods for Linking Research in the Humanities and in Information Technology
Information on the history of architecture is embedded in our daily surroundings, in vernacular and heritage buildings and in physical objects, photographs and plans. Historians study these tangible and intangible artefacts and the communities that built and used them. Thus valuableinsights are gained into the past and the present as they also provide a foundation for designing the future. Given that our understanding of the past is limited by the inadequate availability of data, the article demonstrates that advanced computer tools can help gain more and well-linked data from the past. Computer vision can make a decisive contribution to the identification of image content in historical photographs. This application is particularly interesting for architectural history, where visual sources play an essential role in understanding the built environment of the past, yet lack of reliable metadata often hinders the use of materials. The automated recognition contributes to making a variety of image sources usable forresearch.<br/
Recommended from our members
Sonic heritage: listening to the past
History is so often told through objects, images and photographs, but the potential of sounds to reveal place and space is often neglected. Our research project âSonic Palimpsestâ1 explores the potential of sound to evoke impressions and new understandings of the past, to embrace the sonic as a tool to understand what was, in a way that can complement and add to our predominant visual understandings. Our work includes the expansion of the Oral History archives held at Chatham Dockyard to include womenâs voices and experiences, and the creation of sonic works to engage the public with their heritage. Our research highlights the social and cultural value of oral history and field recordings in the transmission of knowledge to both researchers and the public. Together these recordings document how buildings and spaces within the dockyard were used and experienced by those who worked there. We can begin to understand the social and cultural roles of these buildings within the community, both past and present
Machine Learning Algorithm for the Scansion of Old Saxon Poetry
Several scholars designed tools to perform the automatic scansion of poetry in many languages, but none of these tools
deal with Old Saxon or Old English. This project aims to be a first attempt to create a tool for these languages. We
implemented a Bidirectional Long Short-Term Memory (BiLSTM) model to perform the automatic scansion of Old Saxon
and Old English poems. Since this model uses supervised learning, we manually annotated the Heliand manuscript, and
we used the resulting corpus as labeled dataset to train the model. The evaluation of the performance of the algorithm
reached a 97% for the accuracy and a 99% of weighted average for precision, recall and F1 Score. In addition, we tested
the model with some verses from the Old Saxon Genesis and some from The Battle of Brunanburh, and we observed that
the model predicted almost all Old Saxon metrical patterns correctly misclassified the majority of the Old English input
verses
Cognition-Based Evaluation of Visualisation Frameworks for Exploring Structured Cultural Heritage Data
It is often claimed that Information Visualisation (InfoVis) tools improve the
audienceâs engagement with the display of cultural heritage (CH) collections, open
up CH content to new audiences and support teaching and learning through interactive experiences. But there is a lack of studies systematically evaluating these
claims, particularly from the perspective of modern educational theory. As far as
the author is aware no experimental investigation has been undertaken until now,
that attempts to measure deeper levels of user engagement and learning with InfoVis
tools. The investigation of this thesis complements InfoVis research by initiating a
human-centric approach since little previous research has attempted to incorporate
and integrate human cognition as one of the fundamental components of InfoVis.
In this thesis, using Bloomâs taxonomy of learning objectives as well as individual
learning characteristics (i.e. cognitive preferences), I have evaluated the visitor experience of an art collection both with and without InfoVis tools (between subjects
design). Results indicate that whilst InfoVis tools have some positive effect on the
lower levels of learning, they are less effective for higher levels. In addition, this
thesis shows that InfoVis tools seem to be more effective when they match specific cognitive preferences. These results have implications for both the designers of tools and for CH venues in terms of expectation of effectiveness and exhibition design; the proposed cognitive based evaluation framework and the results of this investigation could provide a valuable baseline for assessing the effectiveness of visitorsâ interaction with the artifacts of online and physical exhibitions where InfoVis tools such as Timelines and Maps along with storytelling techniques are being used
Computer Vision and Architectural History at Eye Level:Mixed Methods for Linking Research in the Humanities and in Information Technology
Information on the history of architecture is embedded in our daily surroundings, in vernacular and heritage buildings and in physical objects, photographs and plans. Historians study these tangible and intangible artefacts and the communities that built and used them. Thus valuableinsights are gained into the past and the present as they also provide a foundation for designing the future. Given that our understanding of the past is limited by the inadequate availability of data, the article demonstrates that advanced computer tools can help gain more and well-linked data from the past. Computer vision can make a decisive contribution to the identification of image content in historical photographs. This application is particularly interesting for architectural history, where visual sources play an essential role in understanding the built environment of the past, yet lack of reliable metadata often hinders the use of materials. The automated recognition contributes to making a variety of image sources usable forresearch.<br/
Operatic Pasticcios in 18th-Century Europe
In Early Modern times, techniques of assembling, compiling and arranging pre-existing material were part of the established working methods in many arts. In the world of 18th-century opera, such practices ensured that operas could become a commercial success because the substitution or compilation of arias fitting the singer's abilities proved the best recipe for fulfilling the expectations of audiences. Known as »pasticcios« since the 18th-century, these operas have long been considered inferior patchwork. The volume collects essays that reconsider the pasticcio, contextualize it, define its preconditions, look at its material aspects and uncover its aesthetical principles
Vielfalt und Integration - diversitĂĄ ed integrazione - diversitĂ© et intĂ©gration: Sprache(n) in sozialen und digitalen RĂ€umen: Eine Festschrift fĂŒr Elisabeth Burr
Diese Festschrift fĂŒr Elisabeth Burr stellt Vielfalt und Integration in der Sprachwissenschaft und in den Digital Humanities in den Mittelpunkt. Die BeitrĂ€ge berĂŒhren zentrale Fragen im Schaffen Burrs: Wie kann Sprache und ihre Variation in AbhĂ€ngigkeit von sozialen und geographischen Faktoren adĂ€quat beschrieben werden? Wie lassen sich informatische und digitale ZugĂ€nge dafĂŒr nutzen? VerknĂŒpft werden sie mit ihr wichtigen und aktuellen Themen aus Sozio-, Gender- und Korpuslinguistik, Dialektologie und Sprachgeographie sowie den digitalen Geisteswissenschaften.
Die Beitragenden sind u. a. Stefania Spina, Thomas Krefeld, Annette Gerstenberg, Lazslo Hinyadi, Carol Chiodo und Lauren Tilton, Manuel Burghardt, Ăyvind Eide, JĂŒrgen Hermes, Andreas Witt. Ray Siemens, Arianna Ciula, Alejandro BĂa sowie Rob Evans
METROPOLITAN ENCHANTMENT AND DISENCHANTMENT. METROPOLITAN ANTHROPOLOGY FOR THE CONTEMPORARY LIVING MAP CONSTRUCTION
We can no longer interpret the contemporary metropolis as we did in the last century. The thought of civil economy regarding the contemporary Metropolis conflicts more or less radically with the merely acquisitive dimension of the behaviour of its citizens. What is needed is therefore a new capacity for
imagining the economic-productive future of the city: hybrid social enterprises, economically sustainable, structured and capable of using technologies, could be a solution for producing value and distributing it fairly and inclusively.
Metropolitan Urbanity is another issue to establish. Metropolis needs new spaces where inclusion can occur, and where a repository of the imagery can be recreated. What is the ontology behind the technique of metropolitan planning and management, its vision and its symbols? Competitiveness,
speed, and meritocracy are political words, not technical ones. Metropolitan Urbanity is the characteristic of a polis that expresses itself in its public places. Today, however, public places are private ones that are destined for public use. The Common Good has always had a space of representation in the city, which was the public space. Today, the Green-Grey Infrastructure is the metropolitan city's monument that communicates a value for future generations and must therefore be recognised and imagined; it is the production of the metropolitan symbolic imagery, the new magic of the city
Recommended from our members
Pluriversal Fashions: Towards an Anti-Racist Fashion Design Pedagogy
This thesis explores ways of devising an anti-racist fashion design pedagogy. The research comprises a two-stage investigation: Part 1, a scoping study of undergraduate fashion design education in the UK and a case study analysing how racialised and gendered differences are currently represented in undergraduate fashion design studentsâ sketchbook research; Part 2, four case studies analysing counter-hegemonic fashion design classes to explore the possibility of an anti-racist fashion design process in fashion design education. The findings from the sketchbook analysis (Part 1) showed how a dominant two-step design tactic is employed in the fashion design process to construct racist and sexist representations by decontextualising and then recontextualising differences. This tactic was shown to reproduce asymmetric power relations built upon racist and colonial logic that reinforces white normativity. These findings were then used to reconceptualise pluralistic fashion concepts in four different pedagogical settings (Part 2) by centring my positionality as fashion design educator who is a woman of colour.
Overall, this thesis argues for incorporating decolonial feminist-informed fashion design processes into higher education to counter racism in fashion by centring heterogeneous concepts of fashion based on counter-hegemonic, non-universalist and non-linear systems of fashion knowledge, foregrounding embodied knowledge and differences. The research contributes to knowledge in fashion design pedagogy in three ways: it presents new empirical evidence to demonstrate how key design tactics privilege white normativity in the fashion design process; it tests alternative decolonial feminist pedagogical approaches to counter the coloniality of fashion design, and it provides a new framework for a pluriversal fashion design pedagogy and praxis in fashion design education
- âŠ