21,761 research outputs found
Author Correction: GLORIA - A globally representative hyperspectral in situ dataset for optical sensing of water quality
Correction to: Scientific Data https://doi.org/10.1038/s41597-023-01973-y, published online 16 February 2023.
An author of the paper was omitted in the original version (Ted Conroy, University of Waikato, New Zealand). This has been corrected in the pdf and HTML versions of the paper, and the associated metadata
Processing and Linking Audio Events in Large Multimedia Archives: The EU inEvent Project
In the inEvent EU project [1], we aim at structuring, retrieving, and sharing large archives of networked, and dynamically changing, multimedia recordings, mainly consisting of meetings, videoconferences, and lectures. More specifically, we are developing an integrated system that performs audiovisual processing of multimedia recordings, and labels them in terms of interconnected “hyper-events ” (a notion inspired from hyper-texts). Each hyper-event is composed of simpler facets, including audio-video recordings and metadata, which are then easier to search, retrieve and share. In the present paper, we mainly cover the audio processing aspects of the system, including speech recognition, speaker diarization and linking (across recordings), the use of these features for hyper-event indexing and recommendation, and the search portal. We present initial results for feature extraction from lecture recordings using the TED talks. Index Terms: Networked multimedia events; audio processing: speech recognition; speaker diarization and linking; multimedia indexing and searching; hyper-events. 1
Multichannel Attention Network for Analyzing Visual Behavior in Public Speaking
Public speaking is an important aspect of human communication and
interaction. The majority of computational work on public speaking concentrates
on analyzing the spoken content, and the verbal behavior of the speakers. While
the success of public speaking largely depends on the content of the talk, and
the verbal behavior, non-verbal (visual) cues, such as gestures and physical
appearance also play a significant role. This paper investigates the importance
of visual cues by estimating their contribution towards predicting the
popularity of a public lecture. For this purpose, we constructed a large
database of more than TED talk videos. As a measure of popularity of the
TED talks, we leverage the corresponding (online) viewers' ratings from
YouTube. Visual cues related to facial and physical appearance, facial
expressions, and pose variations are extracted from the video frames using
convolutional neural network (CNN) models. Thereafter, an attention-based long
short-term memory (LSTM) network is proposed to predict the video popularity
from the sequence of visual features. The proposed network achieves
state-of-the-art prediction accuracy indicating that visual cues alone contain
highly predictive information about the popularity of a talk. Furthermore, our
network learns a human-like attention mechanism, which is particularly useful
for interpretability, i.e. how attention varies with time, and across different
visual cues by indicating their relative importance
OpenTED Browser: Insights into European Public Spendings
We present the OpenTED browser, a Web application allowing to interactively
browse public spending data related to public procurements in the European
Union. The application relies on Open Data recently published by the European
Commission and the Publications Office of the European Union, from which we
imported a curated dataset of 4.2 million contract award notices spanning the
period 2006-2015. The application is designed to easily filter notices and
visualise relationships between public contracting authorities and private
contractors. The simple design allows for example to quickly find information
about who the biggest suppliers of local governments are, and the nature of the
contracted goods and services. We believe the tool, which we make Open Source,
is a valuable source of information for journalists, NGOs, analysts and
citizens for getting information on public procurement data, from large scale
trends to local municipal developments.Comment: ECML, PKDD, SoGood workshop 201
NanoFS: a hardware-oriented file system
NanoFS is a novel file system for embedded systems and storage-class memories
(like flash) and is specially designed to be directly implemented in hardware. NanoFS is based on an original internal layout intended to achieve an optimal
hardware implementation of the file system’s file lookup and data fetch operations. File system spe-cification on a sample reader module completely implemented in a pro-grammable device is introduced
- …