9,820 research outputs found
Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks
In this paper we propose and investigate a novel nonlinear unit, called the
L_p unit, for deep neural networks. The proposed unit receives signals from
several projections of a subset of units in the layer below and computes a
normalized L_p norm. We notice two interesting interpretations of the L_p
unit. First, the proposed unit can be understood as a generalization of a
number of conventional pooling operators such as average, root-mean-square and
max pooling widely used in, for instance, convolutional neural networks (CNN),
HMAX models and neocognitrons. Furthermore, the unit is, to a certain
degree, similar to the recently proposed maxout unit (Goodfellow et al., 2013)
which achieved the state-of-the-art object recognition results on a number of
benchmark datasets. Secondly, we provide a geometrical interpretation of the
activation function, based on which we argue that the L_p unit is more
efficient at representing complex, nonlinear separating boundaries. Each L_p
unit defines a superelliptic boundary, with its exact shape defined by the
order p. We claim that this makes it possible to model arbitrarily shaped,
curved boundaries more efficiently by combining a few units of different
orders. This insight justifies the need for learning different orders for each
L_p unit in the model. We empirically evaluate the proposed L_p units on a
number of datasets and show that multilayer perceptrons (MLP) consisting of
L_p units achieve state-of-the-art results on a number of benchmark datasets.
Furthermore, we evaluate the proposed unit on the recently proposed deep
recurrent neural networks (RNN).
Comment: ECML/PKDD 201
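The pooled quantity the abstract describes can be sketched directly. Below is a minimal illustration (not the authors' implementation); in particular, normalizing by the pool size is an assumption consistent with the phrase "normalized L_p norm":

```python
import numpy as np

def lp_pool(x, p):
    """Normalized L_p norm over a pooling region:
    u = (mean(|x_i|^p))^(1/p).
    Dividing by the pool size before taking the p-th root is an assumption
    consistent with the abstract's "normalized L_p norm"."""
    x = np.abs(np.asarray(x, dtype=float))
    return np.mean(x ** p) ** (1.0 / p)

pool = [1.0, 2.0, 3.0, 4.0]
print(lp_pool(pool, 1))   # 2.5 (average of magnitudes)
print(lp_pool(pool, 2))   # about 2.74 (root-mean-square)
print(lp_pool(pool, 64))  # close to 4.0 (approaches max pooling)
```

Varying p interpolates between the conventional pooling operators mentioned in the abstract, which is why learning a separate order per unit can be useful.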
New pixel-DCT domain coding technique for object based and frame based prediction error
2004-2005 > Academic research: refereed > Refereed conference paper. Version of Record, Published.
Cosine-Based Clustering Algorithm Approach
Since many applications require the management of spatial data, clustering large spatial databases is an important problem: one tries to find the densely populated regions of the feature space for use in data mining, knowledge discovery, or efficient information retrieval. A good clustering approach should be efficient, should detect clusters of arbitrary shape, and must be insensitive to outliers (noise) and to the order of the input data. In this paper, Cosine Cluster is proposed based on the cosine transform, which satisfies all of the above requirements. Using the multi-resolution property of cosine transforms, arbitrarily shaped clusters can be effectively identified at different degrees of accuracy. Cosine Cluster is also shown to be highly efficient in terms of time complexity. Experimental results on very large data sets are presented, which show the efficiency and effectiveness of the proposed approach compared to other recent clustering methods.
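A hedged one-dimensional sketch of the idea as the abstract describes it (the paper's actual algorithm may differ): histogram the data, keep only the lowest-frequency cosine coefficients to smooth the density at a chosen resolution, then count the connected dense regions. The function names and all parameter values here are illustrative choices, not taken from the paper.

```python
import numpy as np

def dct_ortho(n):
    """Orthonormal DCT-II matrix (rows are cosine basis vectors)."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    C[0] /= np.sqrt(2.0)
    return C

def cosine_cluster_1d(values, bins=64, keep=16, cut=0.5):
    """Illustrative 1-D density clustering via a truncated cosine transform:
    low-pass filtering in the DCT domain plays the role of choosing the
    resolution at which dense regions are identified."""
    hist, _ = np.histogram(values, bins=bins)
    C = dct_ortho(bins)
    coeffs = C @ hist
    coeffs[keep:] = 0.0            # drop fine-scale detail (low-pass)
    smooth = C.T @ coeffs          # inverse transform (C is orthogonal)
    dense = smooth > cut * smooth.max()
    # Each False -> True transition starts a new dense region (cluster).
    return int(dense[0]) + int(np.sum(dense[1:] & ~dense[:-1]))

rng = np.random.default_rng(1)
data = np.concatenate([rng.normal(-3, 0.4, 400), rng.normal(3, 0.4, 400)])
print(cosine_cluster_1d(data))  # 2
```

Raising `keep` sharpens the resolution (more, smaller clusters); lowering it merges nearby regions, which mirrors the "different degrees of accuracy" the abstract claims.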
Customized television: Standards compliant advanced digital television
This correspondence describes a European Union supported collaborative project called CustomTV, based on the premise that future TV sets will provide all sorts of multimedia information and interactivity, as well as manage all such services according to each user's or group of users' preferences/profiles. We have demonstrated the potential of recent standards (MPEG-4 and MPEG-7) to implement such a scenario by building
the following services: an advanced EPG, Weather Forecasting, and Stock Exchange/Flight Information.
Slowness and Sparseness Lead to Place, Head-Direction, and Spatial-View Cells
We present a model for the self-organized formation of place cells, head-direction cells, and spatial-view cells in the hippocampal formation based on unsupervised learning on quasi-natural visual stimuli. The model comprises a hierarchy of Slow Feature Analysis (SFA) nodes, which were recently shown to reproduce many properties of complex cells in the early visual system. The system extracts a distributed grid-like representation of position and orientation, which is transcoded into a localized place-field, head-direction, or view representation by sparse coding. The type of cells that develops depends solely on the relevant input statistics, i.e., the movement pattern of the simulated animal. The numerical simulations are complemented by a mathematical analysis that allows us to accurately predict the output of the top SFA layer.
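The SFA principle at the core of the model can be illustrated with a minimal linear version (a sketch of the general technique, not the paper's hierarchical model): find unit-variance projections of the input whose temporal derivative has minimal variance.

```python
import numpy as np

def linear_sfa(x, n_out=1):
    """Minimal linear Slow Feature Analysis: whiten the input, then pick
    the whitened directions whose discrete-time derivative varies least."""
    x = x - x.mean(axis=0)
    # Whiten so every projection starts with unit variance.
    evals, evecs = np.linalg.eigh(np.cov(x, rowvar=False))
    z = x @ (evecs / np.sqrt(evals))
    # Minimize the variance of the temporal derivative; eigh returns
    # eigenvalues in ascending order, so the first columns are slowest.
    dz = np.diff(z, axis=0)
    _, devecs = np.linalg.eigh(np.cov(dz, rowvar=False))
    return z @ devecs[:, :n_out]

# Toy check: a slow sine mixed with fast noise; SFA should recover the sine.
t = np.linspace(0, 2 * np.pi, 2000)
slow = np.sin(t)
rng = np.random.default_rng(0)
fast = rng.standard_normal(2000)
x = np.column_stack([slow + 0.1 * fast, fast])
y = linear_sfa(x)[:, 0]
corr = abs(np.corrcoef(y, slow)[0, 1])
print(round(corr, 2))  # close to 1.0
```

The paper's hierarchy stacks such nodes on (expanded) visual input and adds a sparse-coding stage on top; this fragment only shows why the slowness objective extracts slowly varying quantities such as position.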
MPEG-4 Software Video Encoding
A thesis submitted in fulfillment of the requirements of the degree of Doctor of Philosophy in the University of London.
This thesis presents a software model that allows a parallel decomposition of the
MPEG-4 video encoder onto shared memory architectures, in order to reduce its
total video encoding time.
Since a video sequence consists of video objects, each of which is likely to have
different encoding requirements, the model incorporates a scheduler which
(a) always selects the most appropriate video object for encoding and
(b) employs a mechanism for dynamically allocating video objects onto
the system processors, based on video object size information.
Further spatial video object parallelism is exploited by applying the single program
multiple data (SPMD) paradigm within the different modules of the MPEG-4
video encoder. Because not all macroblocks have the same processing
requirements, the model also introduces a data partition scheme that generates tiles
with identical processing requirements. Since macroblock data dependencies
preclude data parallelism at the shape encoder, the model also introduces a new
mechanism that allows parallelism using a circular pipeline macroblock technique.
The encoding time depends partly on an encoder's computational complexity. This
thesis also addresses the problem of motion estimation, as its complexity has a
significant impact on the encoder's complexity. In particular, two fast motion
estimation algorithms have been developed for the model which reduce the
computational complexity significantly.
The thesis includes experimental results on a four-processor shared memory
platform, the Origin200.
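The size-based allocation idea in the abstract can be sketched as a greedy longest-processing-time assignment: dispatch the largest remaining video object to the least loaded processor. This is a hypothetical stand-in for the thesis's scheduler; the function names, the use of object size as the load estimate, and the greedy policy are all illustrative assumptions.

```python
import heapq

def assign_objects(object_sizes, n_procs):
    """Illustrative size-based scheduler (greedy LPT, not the thesis's
    actual mechanism): largest object goes to the least loaded processor."""
    # Heap entries: (current load, processor id, assigned objects).
    loads = [(0, p, []) for p in range(n_procs)]
    heapq.heapify(loads)
    for obj, size in sorted(object_sizes.items(), key=lambda kv: -kv[1]):
        load, p, objs = heapq.heappop(loads)   # least loaded processor
        objs.append(obj)
        heapq.heappush(loads, (load + size, p, objs))
    return {p: (load, objs) for load, p, objs in loads}

# Four video objects of differing sizes on a two-processor machine.
plan = assign_objects({"bg": 90, "person": 60, "logo": 20, "text": 10}, 2)
for p in sorted(plan):
    print(p, plan[p])
```

Balancing on object size is a proxy for encoding cost, consistent with the abstract's "video object size information"; a real scheduler would update the estimates as encoding times are measured.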