Search CORE

22 research outputs found

Audiovisual Moments in Time:A large-scale annotated dataset of audiovisual actions

Author: Joannou Michael
Noppeney Uta
Rotshtein Pia
Publication venue
Publication date: 01/04/2024
Field of study

We present Audiovisual Moments in Time (AVMIT), a large-scale dataset of audiovisual action events. In an extensive annotation task 11 participants labelled a subset of 3-second audiovisual videos from the Moments in Time dataset (MIT). For each trial, participants assessed whether the labelled audiovisual action event was present and whether it was the most prominent feature of the video. The dataset includes the annotation of 57,177 audiovisual videos, each independently evaluated by 3 of 11 trained participants. From this initial collection, we created a curated test set of 16 distinct action classes, with 60 videos each (960 videos). We also offer 2 sets of pre-computed audiovisual feature embeddings, using VGGish/YamNet for audio data and VGG16/EfficientNetB0 for visual data, thereby lowering the barrier to entry for audiovisual DNN research. We explored the advantages of AVMIT annotations and feature embeddings to improve performance on audiovisual event recognition. A series of 6 Recurrent Neural Networks (RNNs) were trained on either AVMIT-filtered audiovisual events or modality-agnostic events from MIT, and then tested on our audiovisual test set. In all RNNs, top 1 accuracy was increased by 2.71-5.94% by training exclusively on audiovisual events, even outweighing a three-fold increase in training data. Additionally, we introduce the Supervised Audiovisual Correspondence (SAVC) task whereby a classifier must discern whether audio and visual streams correspond to the same action label. We trained 6 RNNs on the SAVC task, with or without AVMIT-filtering, to explore whether AVMIT is helpful for cross-modal learning. In all RNNs, accuracy improved by 2.09-19.16% with AVMIT-filtered data. We anticipate that the newly annotated AVMIT dataset will serve as a valuable resource for research and comparative experiments involving computational models and human participants, specifically when addressing research questions where audiovisual correspondence is of critical importance

University of Birmingham Research Portal

Continuous multi-modal interaction causes human-robot alignment

Author: Belpaeme Tony
Joannou Michael
Wallkötter Sebastian
Westlake Samuel
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Ghent University Academic Bibliography

The UK National Thermal-Hydraulics Facility: Motivations, Design and Planning Status

Author: Brummitt Amanda
Dahlfors Marcus
Joannou Jason
Lee Bill
Middleburgh Simon
Rushton Michael
Publication venue: 'EDP Sciences'
Publication date: 01/01/2021
Field of study

Bangor University Research Portal

Recommended from our members

CHERI Concentrate: Practical Compressed Capabilities

Author: Chisnall David
Davis Brooks
Filardo Nathaniel W
Fox Anthony
Gudka Khilan
Joannou Alexandre
Markettos A Theodore
Moore Simon W
Neumann Peter G
Norton Robert M
Roe Michael
Watson Robert NM
Woodruff Jonathan
Xia Hongyan
Publication venue: IEEE TRANSACTIONS ON COMPUTERS
Publication date: 01/01/2019
Field of study

We present CHERI Concentrate, a new fat-pointer compression scheme applied to CHERI, the most developed capability-pointer system at present. Capability fat-pointers are a primary candidate for enforcing fine-grained and non-bypassable security properties in future computer systems, although increased pointer size can severely affect performance. Thus, several proposals for capability compression have been suggested but these did not support legacy instruction sets, ignored features critical to the existing software base, and also introduced design inefficiencies to RISC-style processor pipelines. CHERI Concentrate improves on the state-of-the-art region-encoding efficiency, solves important pipeline problems, and eases semantic restrictions of compressed encoding, allowing it to protect a full legacy software stack. We analyze and extend logic from the open-source CHERI prototype processor design on FPGA to demonstrate encoding efficiency, minimize delay of pointer arithmetic, and eliminate additional load-to-use delay. To verify correctness of our proposed high-performance logic, we present a HOL4 machine-checked proof of the decode and pointer-modify operations. Finally, we measure a 50%-75% reduction in L2 misses for many compiled C-language benchmarks running under a commodity operating system using compressed 128-bit and 64-bit formats, demonstrating both compatibility with and increased performance over the uncompressed, 256-bit format

Apollo (Cambridge)

Recommended from our members

CHERI JNI: Sinking the Java Security Model into the C

Author: Brazdil David
Chisnall David
Davis Brooks
Gudka Khilan
Joannou Alexandre
Laurie Ben
Markettos A Theodore
Maste J Edward
Moore Simon W
Neumann Peter G
Norton Robert
Roe Michael
Son Stacey
Watson Robert NM
Woodruff Jonathan
Publication venue: OPERATING SYSTEMS REVIEW
Publication date: 01/01/2017
Field of study

Java provides security and robustness by building a high- level security model atop the foundation of memory protection. Unfortunately, any native code linked into a Java program – including the million lines used to implement the standard library – is able to bypass both the memory protection and the higher-level policies. We present a hardware-assisted implementation of the Java native code interface, which extends the guarantees required for Java’s security model to native code. Our design supports safe direct access to buffers owned by the JVM, including hardware-enforced read-only access where appropriate. We also present Java language syntax to declaratively describe isolated compartments for native code. We show that it is possible to preserve the memory safety and isolation requirements of the Java security model in C code, allowing native code to run in the same process as Java code with the same impact on security as running equivalent Java code. Our approach has a negligible impact on performance, compared with the existing unsafe native code interface. We demonstrate a prototype implementation running on the CHERI microprocessor synthesized in FPGA.Defense Advanced Research Projects Agency Google, Inc. Isaac Newton Trust Thales E-Securit

Apollo (Cambridge)

Loss of endogenous thymosin β4 accelerates glomerular disease

Author: Asanuma
Babelova
Bock-Marquette
Bravo-Cordero
Brinkkoetter
Brown
Brown
Brunskill
Cavasin
Clemens D. Cohen
Cohen
Conte
David A. Long
Davis
Dessapt-Baradez
Duffield
Eckardt
Elisavet Vasilopoulou
Evans
Fan
Feng
Gee
Goldstein
Greenberg
Greka
Guinobert
Guo
Hannappel
Hidaka
Huang
Huang
Huang
Kathryn E. White
Khan
Knop
Kolatsi-Joannou
Kurts
Le Hir
Li
Li
Liao
Long
Maja T. Lindenmeyer
Man
Maria Kolatsi-Joannou
Martini
McWhorter
Michael G. Robson
Moeller
Morris
Mundel
Neil J. Sebire
Nemolato
Omata
Paul J. Winyard
Paul R. Riley
Peng
Pippin
Pollard
Raftopoulou
Rhaleb
Ridley
Rossdeutsch
Sanders
Santra
Schneider
Smart
Smart
Sosne
Sosne
Thorner
Tian
Tipping
Vasilopoulou
Wang
Welsh
Xu
Yates
Zhu
Zhu
Zuo
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

Glomerular disease is characterized by morphologic changes in podocyte cells accompanied by inflammation and fibrosis. Thymosin

\beta_4

regulates cell morphology, inflammation, and fibrosis in several organs and administration of exogenous thymosin

\beta_4

improves animal models of unilateral ureteral obstruction and diabetic nephropathy. However, the role of endogenous thymosin

\beta_4

in the kidney is unknown. We demonstrate that thymosin β4 is expressed prominently in podocytes of developing and adult mouse glomeruli. Global loss of thymosin

\beta_4

did not affect healthy glomeruli, but accelerated the severity of immune-mediated nephrotoxic nephritis with worse renal function, periglomerular inflammation, and fibrosis. Lack of thymosin

\beta_4

in nephrotoxic nephritis led to the redistribution of podocytes from the glomerular tuft toward the Bowman capsule suggesting a role for thymosin

\beta_4

in the migration of these cells. Thymosin

\beta_4

knockdown in cultured podocytes also increased migration in a wound-healing assay, accompanied by F-actin rearrangement and increased RhoA activity. We propose that endogenous thymosin

\beta_4

is a modifier of glomerular injury, likely having a protective role acting as a brake to slow disease progression

Elsevier - Publisher Connector

Crossref

PubMed Central

UCL Discovery

Oxford University Research Archive

Kent Academic Repository

King's Research Portal

Dual-stream recurrent convolutional neural networks as models of human audiovisual perception

Author: Joannou Michael
Publication venue
Publication date: 05/12/2022
Field of study

Multisensory perception allows humans to operate successfully in the world. Increasingly, deep neural networks (DNNs) are used as models of human unisensory perception. In this work, we take some of the first steps to extend this line of research from the unisensory to the multisensory domain, specifically, audiovisual perception. First, we produce a highly-controlled, large, labelled dataset of audiovisual action events for human vs DNN studies. Next, we introduce a novel deep neural network architecture that we name a ‘dual-stream recurrent convolutional neural network’ (DRCNN), consisting of 2 component CNNs joined by a novel ‘multimodal squeeze unit’ and fed into an RNN. We develop a series of these architectures, leveraging a number of pretrained state-of-the-art CNNs, and train a number of instances of each, producing a series of classifiers. We find that, after optimising 12 classifier instances on audiovisual action recognition, all classifiers are able to solve the audiovisual correspondence problem, indicating that this ability may be a consequence of the task constraints. Further, we find that these classifiers are highly affected by signals in the unattended to modality during unimodal classification tasks, demonstrating a high level of integration across modalities. Further experiments revealed that dual-stream RCNN classifiers perform significantly worse than humans on a visual-only action recognition task when stimuli was clean or distorted by Gaussian noise or Gaussian blur. Both classifiers and humans were able to leverage audio information to increase their levels of performance in the clean condition, and to significantly decrease the effect of visual distortion on their audiovisual performances. Indeed, 5/6 classifiers performed within the range of human performance on clean audiovisual stimuli, and 3/6 maintained human level performance when low levels of Gaussian noise were introduced

University of Birmingham Research Archive, E-theses Repository

Stereoselective synthesis and cyclisation of the acyclic precursor to auripyrone A and B

Author: Joannou John
Perkins Michael Victor
Sampson Rebecca K
Taylor Max Ronald
Publication venue: 'Elsevier BV'
Publication date: 01/01/2006
Field of study

Flinders Academic Commons

Book review: Emma Liggins, George Gissing, the working woman and urban culture / Susan Hamilton, Frances Power Cobbe and Victorian feminism

Author: Carol Jacobi
Derek Beales
Heathorn Stephen
Heathorn Stephen
Julian Rushton
Maroula Joannou
Michael Ledger-Lomas
Mike Sanders
Nina Lübbren
Rachel Dickinson
Steinmetz Willibald
Stephen Heathorn
Publication venue: 'Edinburgh University Press'
Publication date: 01/01/2008
Field of study

Reviews of Emma Liggins's book on George Gissing and Susan Hamilton's on Frances Power Cobb

Crossref

Anglia Ruskin Research

Leveraging domain expertise in architectural exploration

Author: Antara Bhatt (7207004)
Demetrios Joannou (7168745)
Evgeny Shindin (7206998)
Henry Broodny (7206992)
Imad Sanduka (7207007)
Michael Masin (7206995)
Roy Kalawsky (1252716)
Uri Shani (7207001)
Yingchun Tian (3184896)
Publication venue
Publication date: 01/01/2014
Field of study

Domain experience is a key driver behind design quality, especially during the early design phases of a product or service. Currently, the only practical way to bring such experience into a project is to directly engage subject matter experts, which means there is the potential for a resource availability bottleneck because the experts are not available when required. Whilst many domain specific tools have attempted to capture expert knowledge in embedded analytics thus allowing less experienced engineers to perform complex tasks, this is certainly not the case for highly complex systems of systems where their architectures can go far beyond what a single human being can comprehend. This paper proposes a new approach to leveraging design expertise in a manner that facilitates architectural exploration and architecture optimization by using pre-defined architecture patterns. In addition, we propose a means to streamline such a process by delineating the knowledge creation process and architectural exploration analytics with the means to facilitate information flow from the former to the latter through a carefuly designed integration framework

Loughborough University Institutional Repository