Search CORE

2,222 research outputs found

Virtual acoustics displays

Author: Fisher Scott S.
Foster Scott H.
Stone Philip K.
Wenzel Elizabeth M.
Publication venue
Publication date
Field of study

The real time acoustic display capabilities are described which were developed for the Virtual Environment Workstation (VIEW) Project at NASA-Ames. The acoustic display is capable of generating localized acoustic cues in real time over headphones. An auditory symbology, a related collection of representational auditory 'objects' or 'icons', can be designed using ACE (Auditory Cue Editor), which links both discrete and continuously varying acoustic parameters with information or events in the display. During a given display scenario, the symbology can be dynamically coordinated in real time with 3-D visual objects, speech, and gestural displays. The types of displays feasible with the system range from simple warnings and alarms to the acoustic representation of multidimensional data or events

NASA Technical Reports Server

Dynamic bandwidth allocation in ATM networks

Author: Ashibani Majdi Ali Atoomi
Publication venue: Department of Electrical Engineering
Publication date: 01/01/2002
Field of study

Includes bibliographical references.This thesis investigates bandwidth allocation methodologies to transport new emerging bursty traffic types in ATM networks. However, existing ATM traffic management solutions are not readily able to handle the inevitable problem of congestion as result of the bursty traffic from the new emerging services. This research basically addresses bandwidth allocation issues for bursty traffic by proposing and exploring the concept of dynamic bandwidth allocation and comparing it to the traditional static bandwidth allocation schemes

Cape Town University OpenUCT

YodaNN: An Architecture for Ultra-Low Power Binary-Weight CNN Acceleration

Author: Andri Renzo
Benini Luca
Cavigelli Lukas
Rossi Davide
Publication venue
Publication date: 24/02/2017
Field of study

Convolutional neural networks (CNNs) have revolutionized the world of computer vision over the last few years, pushing image classification beyond human accuracy. The computational effort of today's CNNs requires power-hungry parallel processors or GP-GPUs. Recent developments in CNN accelerators for system-on-chip integration have reduced energy consumption significantly. Unfortunately, even these highly optimized devices are above the power envelope imposed by mobile and deeply embedded applications and face hard limitations caused by CNN weight I/O and storage. This prevents the adoption of CNNs in future ultra-low power Internet of Things end-nodes for near-sensor analytics. Recent algorithmic and theoretical advancements enable competitive classification accuracy even when limiting CNNs to binary (+1/-1) weights during training. These new findings bring major optimization opportunities in the arithmetic core by removing the need for expensive multiplications, as well as reducing I/O bandwidth and storage. In this work, we present an accelerator optimized for binary-weight CNNs that achieves 1510 GOp/s at 1.2 V on a core area of only 1.33 MGE (Million Gate Equivalent) or 0.19 mm

^2

and with a power dissipation of 895 {\mu}W in UMC 65 nm technology at 0.6 V. Our accelerator significantly outperforms the state-of-the-art in terms of energy and area efficiency achieving 61.2 TOp/s/[email protected] V and 1135 GOp/s/[email protected] V, respectively

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Asynchronously Replicated Shared Workspaces for a Multi-Media Annotation Service over Internet

Author: Benz Hartmut
Lijding Maria Eva
Publication venue: Springer
Publication date: 01/01/1998
Field of study

This paper describes a world wide collaboration system through multimedia Post-its (user generated annotations). DIANE is a service to create multimedia annotations to every application output on the computer, as well as to existing multimedia annotations. Users collaborate by registering multimedia documents and user generated annotation in shared workspaces. However, DIANE only allows effective participation in a shared workspace over a high performance network (ATM, fast Ethernet) since it deals with large multimedia object. When only slow or unreliable connections are available between a DIANE terminal and server, useful work becomes impossible. To overcome these restrictions we need to replicate DIANE servers so that users do not suffer degradation in the quality of service. We use the asynchronous replication service ODIN to replicate the shared workspaces to every interested site in a transparent way to users. ODIN provides a cost-effective object replication by building a dynamic virtual network over Internet. The topology of this virtual network optimizes the use of network resources while it satisfies the changing requirements of the users

University of Twente Research Information

AudioGen: Textually Guided Audio Generation

Author: Adi Yossi
Copet Jade
Défossez Alexandre
Kreuk Felix
Parikh Devi
Polyak Adam
Singer Uriel
Synnaeve Gabriel
Taigman Yaniv
Publication venue
Publication date: 30/09/2022
Field of study

We tackle the problem of generating audio samples conditioned on descriptive text captions. In this work, we propose AaudioGen, an auto-regressive generative model that generates audio samples conditioned on text inputs. AudioGen operates on a learnt discrete audio representation. The task of text-to-audio generation poses multiple challenges. Due to the way audio travels through a medium, differentiating ``objects'' can be a difficult task (e.g., separating multiple people simultaneously speaking). This is further complicated by real-world recording conditions (e.g., background noise, reverberation, etc.). Scarce text annotations impose another constraint, limiting the ability to scale models. Finally, modeling high-fidelity audio requires encoding audio at high sampling rate, leading to extremely long sequences. To alleviate the aforementioned challenges we propose an augmentation technique that mixes different audio samples, driving the model to internally learn to separate multiple sources. We curated 10 datasets containing different types of audio and text annotations to handle the scarcity of text-audio data points. For faster inference, we explore the use of multi-stream modeling, allowing the use of shorter sequences while maintaining a similar bitrate and perceptual quality. We apply classifier-free guidance to improve adherence to text. Comparing to the evaluated baselines, AudioGen outperforms over both objective and subjective metrics. Finally, we explore the ability of the proposed method to generate audio continuation conditionally and unconditionally. Samples: https://tinyurl.com/audiogen-text2audi

arXiv.org e-Print Archive

An immersive system for browsing and visualizing surveillance video

Author: DeCamp Philip James
Kubat Rony Daniel
Roy Deb K.
Shaw George Macaulay
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

HouseFly is an interactive data browsing and visualization system that synthesizes audio-visual recordings from multiple sensors, as well as the meta-data derived from those recordings, into a unified viewing experience. The system is being applied to study human behavior in both domestic and retail situations grounded in longitudinal video recordings. HouseFly uses an immersive video technique to display multiple streams of high resolution video using a realtime warping procedure that projects the video onto a 3D model of the recorded space. The system interface provides the user with simultaneous control over both playback rate and vantage point, enabling the user to navigate the data spatially and temporally. Beyond applications in video browsing, this system serves as an intuitive platform for visualizing patterns over time in a variety of multi-modal data, including person tracks and speech transcripts.United States. Office of Naval Research (Award no. N000140910187

CiteSeerX

DSpace@MIT

Crossref

Exploring Energy Consumption Issues for video Streaming in Mobile Devices: a Review

Author: Deshpande A. (Aarti)
Publication venue: 'Arunai Publications Private Limited'
Publication date: 01/01/2017
Field of study

The proliferation of high-end mobile devices, such as smart phones, tablets, together have gained the popularity of multimedia streaming among the user. It is found from various studies and survey that at end of 2020 mobile devices will increase drastically and Mobile video streaming will also grow rapidly than overall average mobile traffic. The streaming application in Smartphone heavily depends on the wireless network activities substantially amount of data transfer server to the client. Because of very high energy requirement of data transmitted in wireless interface for video streaming application considered as most energy consuming application. Therefore to optimize the battery USAge of mobile device during video streaming it is essential to understand the various video streaming techniques and there energy consumption issues in different environment. In this paper we explore energy consumption in mobile device while experiencing video streaming and examine the solution that has been discussed in various research to improve the energy consumption during video streaming in mobile devices . We classify the investigation on a different layer of internet protocol stack they utilize and also compare them and provide proof of fact that already exist in modern Smartphone as energy saving mechanism

Neliti