Search CORE

14,379 research outputs found

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording

Author: Badlani Rohan
Elizalde Benjamin
Kumar Anurag
Lane Ian
Raj Bhiksha
Shah Ankit
Vincent Emmanuel
Publication venue
Publication date: 25/08/2016
Field of study

In this paper we present our work on Task 1 Acoustic Scene Classi- fication and Task 3 Sound Event Detection in Real Life Recordings. Among our experiments we have low-level and high-level features, classifier optimization and other heuristics specific to each task. Our performance for both tasks improved the baseline from DCASE: for Task 1 we achieved an overall accuracy of 78.9% compared to the baseline of 72.6% and for Task 3 we achieved a Segment-Based Error Rate of 0.76 compared to the baseline of 0.91

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

HAL-Rennes 1

Sharing criteria and performance standards for the 11.7-12.2 GHz band in region 2

Author
Publication venue
Publication date
Field of study

Possible criteria for sharing between the broadcasting-satellite and the fixed-satellite services are considered for each of several parameters in three categories: system, space station, and earth station. Criteria for sharing between the two satellite services and the three terrestrial services to which the 12-GHz band is allocated are discussed separately, first for the case of the fixed and mobile services and then for the broadcasting service

NASA Technical Reports Server

Proceedings of the 2nd Computer Science Student Workshop: Microsoft Istanbul, Turkey, April 9, 2011

Author
Publication venue: 'Sabanci University Information Center'
Publication date: 01/01/2011
Field of study

Sabanci University Research Database

Segregating Event Streams and Noise with a Markov Renewal Process Model

Author: Plumbley MD
Stowell D
Publication venue
Publication date: 01/08/2013
Field of study

DS and MP are supported by EPSRC Leadership Fellowship EP/G007144/1

Queen Mary Research Online

Surrey Research Insight

Learning to Read by Spelling: Towards Unsupervised Text Recognition

Author: Gupta Ankush
Vedaldi Andrea
Zisserman Andrew
Publication venue
Publication date: 23/09/2018
Field of study

This work presents a method for visual text recognition without using any paired supervisory data. We formulate the text recognition task as one of aligning the conditional distribution of strings predicted from given text images, with lexically valid strings sampled from target corpora. This enables fully automated, and unsupervised learning from just line-level text-images, and unpaired text-string samples, obviating the need for large aligned datasets. We present detailed analysis for various aspects of the proposed method, namely - (1) impact of the length of training sequences on convergence, (2) relation between character frequencies and the order in which they are learnt, (3) generalisation ability of our recognition network to inputs of arbitrary lengths, and (4) impact of varying the text corpus on recognition accuracy. Finally, we demonstrate excellent text recognition accuracy on both synthetically generated text images, and scanned images of real printed books, using no labelled training examples

arXiv.org e-Print Archive

Oxford University Research Archive

Proceedings of the Mobile Satellite System Architectures and Multiple Access Techniques Workshop

Author: Dessouky Khaled
Publication venue
Publication date
Field of study

The Mobile Satellite System Architectures and Multiple Access Techniques Workshop served as a forum for the debate of system and network architecture issues. Particular emphasis was on those issues relating to the choice of multiple access technique(s) for the Mobile Satellite Service (MSS). These proceedings contain articles that expand upon the 12 presentations given in the workshop. Contrasting views on Frequency Division Multiple Access (FDMA), Code Division Multiple Access (CDMA), and Time Division Multiple Access (TDMA)-based architectures are presented, and system issues relating to signaling, spacecraft design, and network management constraints are addressed. An overview article that summarizes the issues raised in the numerous discussion periods of the workshop is also included

NASA Technical Reports Server

Communications techniques and equipment: A compilation

Author
Publication venue
Publication date
Field of study

This Compilation is devoted to equipment and techniques in the field of communications. It contains three sections. One section is on telemetry, including articles on radar and antennas. The second section describes techniques and equipment for coding and handling data. The third and final section includes descriptions of amplifiers, receivers, and other communications subsystems

NASA Technical Reports Server

Recommended from our members

Video content analysis for automated detection and tracking of humans in CCTV surveillance applications

Author: Tawiah Thomas Andzi-Quainoo
Publication venue: Brunel University School of Engineering and Design PhD Theses
Publication date: 01/01/2010
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The problems of achieving high detection rate with low false alarm rate for human detection and tracking in video sequence, performance scalability, and improving response time are addressed in this thesis. The underlying causes are the effect of scene complexity, human-to-human interactions, scale changes, and scene background-human interactions. A two-stage processing solution, namely, human detection, and human tracking with two novel pattern classifiers is presented. Scale independent human detection is achieved by processing in the wavelet domain using square wavelet features. These features used to characterise human silhouettes at different scales are similar to rectangular features used in [Viola 2001]. At the detection stage two detectors are combined to improve detection rate. The first detector is based on shape-outline of humans extracted from the scene using a reduced complexity outline extraction algorithm. A Shape mismatch measure is used to differentiate between the human and the background class. The second detector uses rectangular features as primitives for silhouette description in the wavelet domain. The marginal distribution of features collocated at a particular position on a candidate human (a patch of the image) is used to describe statistically the silhouette. Two similarity measures are computed between a candidate human and the model histograms of human and non human classes. The similarity measure is used to discriminate between the human and the non human class. At the tracking stage, a tracker based on joint probabilistic data association filter (JPDAF) for data association, and motion correspondence is presented. Track clustering is used to reduce hypothesis enumeration complexity. Towards improving response time with increase in frame dimension, scene complexity, and number of channels; a scalable algorithmic architecture and operating accuracy prediction technique is presented. A scheduling strategy for improving the response time and throughput by parallel processing is also presented

Brunel University Research Archive