256 research outputs found
Recommended from our members
Rapid preconditioning of data for accelerating convex hull algorithms
Given a dataset of two-dimensional points in the plane with integer
coordinates, the method proposed reduces a set of n points down to
a set of s points s ≤ n, such that the convex hull on the set of s
points is the same as the convex hull of the original set of n points.
The method is O(n). It helps any convex hull algorithm run faster.
The empirical analysis of a practical case shows a percentage reduction
in points of over 98%, that is reflected as a faster computation with a
speedup factor of at least 4
Recommended from our members
Median architecture by accumulative parallel counters
The time to process each of W/B processing blocks of a median calculation method on a set of N W-bit integers is improved here by a factor of three compared to the literature. Parallelism uncovered in blocks containing B-bit slices are exploited by independent accumulative parallel counters so that the median is calculated faster than any known previous method for any N, W values. The improvements to the method are discussed in the context of calculating the median for a moving set of N integers for which a pipelined architecture is developed. An extra benefit of smaller area for the architecture is also reported
Recommended from our members
Acceleration and visualization of Dynamic Network Optimization
With the emerging prevalence of smart phones and 4G LTE networks, the demand for faster-better-cheaper mobile services anytime and anywhere is ever growing. The Dynamic Network Optimization (DNO) concept emerged as a solution that optimally and continuously tunes the network settings, in response to varying network conditions and subscriber needs. Yet, the DNO realization is still at infancy, largely hindered by the bottleneck of the lengthy optimization runtime. This paper presents the design and prototype of a novel cloud based parallel solution that further enhances the scalability of our prior work on various parallel solutions that accelerate network optimization algorithms. The solution aims to satisfy the high performance required by DNO, preliminarily on a sub-hourly basis. The paper subsequently visualizes a design and a full cycle of a DNO system. A set of potential solutions to large network and real-time DNO are also proposed. Overall, this work creates a breakthrough towards the realization of DNO
Recommended from our members
Fast median calculation method
The ever increasing demand for high image quality requires fast and efficient methods for noise reduction. The best-known order-statistics filter is the median filter. A method is presented to calculate the median on a set of N W-bit integers in W/B time steps. Blocks containing B-bit slices are used to find B-bits of the median; using a novel quantum-like representation allowing the median to be computed in an accelerated manner compared to the best-known method (W time steps). The general method allows a variety of designs to be synthesised systematically. A further novel architecture to calculate the median for a moving set of N integers is also discussed
Recommended from our members
Parallel pipelined histogram architecture via c-slow retiming
A parallel pipelined array of cells suitable for realtime computation of histograms is proposed. The cell architecture builds on previous work to now allow operating on a stream of data at 1 pixel per clock cycle. This new cell is more suitable for interfacing to camera sensors or to microprocessors of 8-bit data buses which are common in consumer digital cameras. Arrays using the new proposed cells are obtained via C-slow retiming techniques and can be clocked at a 65% faster frequency than previous arrays. This achieves over 80% of the performance of two-pixel per clock cycle parallel pipelined arrays
Recommended from our members
C-slow retimed parallel histogram architectures for consumer imaging devices
A parallel pipelined array of cells suitable for real-time computation of histograms is proposed. The cell architecture builds on previous work obtained via C-slow retiming techniques and can be clocked at 65 percent faster frequency than previous arrays. The new arrays can be exploited for higher throughput particularly when dual data rate sampling techniques are used to operate on single streams of data from image sensors. In this way, the new cell operates on a p-bit data bus which is more convenient for interfacing to camera sensors or to microprocessors in consumer digital cameras
Recommended from our members
Generic systolic array for genetic algorithms
The authors present a systolic design for a simple GA mechanism which provides high throughput and unidirectional pipelining by exploiting the inherent parallelism in the genetic operators. The design computes in O(N+G) time steps using O(N2) cells where N is the population size and G is the chromosome length. The area of the device is independent of the chromosome length and so can be easily scaled by replicating the arrays or by employing fine-grain migration. The array is generic in the sense that it does not rely on the fitness function and can be used as an accelerator for any GA application using uniform crossover between pairs of chromosomes. The design can also be used in hybrid systems as an add-on to complement existing designs and methods for fitness function acceleration and island-style population managemen
Transputer implementation of systolic arrays for model reduction
In control theory the dynamics representing a physical system to be controlled is analyzed in terms of a model built up from a mathematical description of the system components. Unfortunately the resulting model can have high or infinite dimensional state equations, so it is often necessary from a practical and computational viewpoint to find a reduced model of the system which mimics closely the behaviour of the high order system. The technique for producing these simplified descriptions is termed model reduction and a number of methods are known. The author is interested in applying computational parallelism to the generation of reduced models to provide an acceleration mechanism for use in a CAD environment. In the paper the author develops systolic algorithms for simple single input-output control problems and implements them on a network of transputers. The performance of this network is then evaluated with respect to a sequential algorithm for the same problems. Finally, the author points out that systolic arrays are normally considered as special purpose methods geared towards exploiting VLSI techniques
- …
