Parallel VLSI architecture emulation and the organization of APSA/MPP

Author: Odonnell John T.

The Applicative Programming System Architecture (APSA) combines an applicative language interpreter with a novel parallel computer architecture that is well suited for Very Large Scale Integration (VLSI) implementation. The Massively Parallel Processor (MPP) can simulate VLSI circuits by allocating one processing element in its square array to an area on a square VLSI chip. As long as there are not too many long data paths, the MPP can simulate a VLSI clock cycle very rapidly. The APSA circuit contains a binary tree with a few long paths and many short ones. A skewed H-tree layout allows every processing element to simulate a leaf cell and up to four tree nodes, with no loss in parallelism. Emulation of a key APSA algorithm on the MPP resulted in performance 16,000 times faster than a VAX. This speed will make it possible for the APSA language interpreter to run fast enough to support research in parallel list processing algorithms.

Data broadcasting and reduction, prefix computation, and sorting on reduced hypercube (RH) parallel computers

Author: Mukherjee Arup
Publication venue: Digital Commons @ NJIT
Publication date: 31/10/1994

The binary hypercube parallel computer has been very popular due to its rich interconnection structure and small average internode distance, which allow the efficient embedding of frequently used topologies. Communication patterns of many parallel algorithms also match the hypercube topology. The hypercube has high VLSI complexity, however, due to the logarithmic increase in the number of connections to each node with the increase in the number of dimensions of the hypercube. The reduced hypercube (RH) interconnection network, which is obtained by a uniform reduction in the number of links for each hypercube node, yields lower-complexity interconnection networks when compared to hypercubes with the same number of nodes. It has been shown elsewhere that the RH interconnection network achieves performance comparable to that of the hypercube, at lower hardware cost. The reduced VLSI complexity of the RH also permits the construction of larger systems, thus making the RH suitable for massively parallel processing. This thesis proposes algorithms for data broadcasting and reduction, prefix computation, and sorting on the RH parallel computer. All these operations are fundamental to many parallel algorithms. A worst case analysis of each algorithm is given and compared with equivalent algorithms for the regular hypercube. It is shown that the proposed algorithms for the RH yield performance comparable to that for the regular hypercube.

Digital Commons @ New Jersey Institute of Technology (NJIT)
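
For readers unfamiliar with the prefix computation named in the abstract above, the following is a minimal sketch of the textbook prefix-sum scheme on a regular binary hypercube, simulated sequentially. It is not the thesis's RH algorithm, which the abstract does not describe; the function name `hypercube_prefix_sum` and the list-based simulation of nodes are assumptions made purely for illustration.

```python
def hypercube_prefix_sum(values):
    """Simulate an inclusive prefix sum on a regular binary hypercube.

    Illustrative sketch only: node i holds values[i], and in round d every
    node exchanges its running subcube total with the neighbour whose
    index differs in bit d. This is the standard hypercube scheme, not
    the reduced-hypercube (RH) algorithm from the thesis.
    """
    n = len(values)
    if n == 0 or n & (n - 1):
        raise ValueError("number of nodes must be a power of two")
    dims = n.bit_length() - 1

    prefix = list(values)  # inclusive prefix known so far at each node
    total = list(values)   # sum over the subcube each node has seen so far

    for d in range(dims):
        bit = 1 << d
        # All nodes exchange simultaneously, so snapshot the totals first.
        received = [total[i ^ bit] for i in range(n)]
        for i in range(n):
            if i & bit:
                # The neighbour's subcube holds lower-indexed nodes,
                # so its total belongs in this node's prefix.
                prefix[i] += received[i]
            total[i] += received[i]
    return prefix
```

With 2^k nodes the loop runs k rounds, one per hypercube dimension, which is why link count per node matters for this class of algorithm.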

Transient Stability Simulation by Waveform Relaxation Methods

Author: Crow Mariesa
Ilić Marija D.
Pai M. A.
Publication venue: Scholars' Mine
Publication date: 01/11/1987

In this paper, a new methodology for power system dynamic response calculations is presented. The technique, known as waveform relaxation, has been used extensively in transient analysis of VLSI circuits, and it can take advantage of new computer architectures such as parallel processors. The application in this paper is limited to the swing equations of a large power system. Computational results are presented.

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine
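
As a rough illustration of the waveform relaxation idea named in the abstract above, the sketch below applies a Gauss-Jacobi sweep to a two-machine classical swing model: each machine is integrated over the whole time window while the other machine's angle waveform is held at its previous iterate. Everything specific here is an assumption for illustration and not taken from the paper: the two-machine system, the parameter values, the forward-Euler integrator, and the function names.

```python
import math

# Hypothetical two-machine parameters (illustrative, not from the paper).
P_MECH = (0.5, -0.5)   # mechanical power input
INERTIA = (1.0, 1.5)   # inertia constants M
DAMPING = (0.2, 0.2)   # damping coefficients D
COUPLING = 1.0         # transfer susceptance B between the machines
DELTA0 = (0.2, 0.0)    # initial rotor angles
OMEGA0 = (0.0, 0.0)    # initial speed deviations


def integrate_machine(i, other_delta, dt):
    """Forward-Euler solve of machine i over the whole window,
    treating the other machine's angle waveform as a known input."""
    delta, omega = [DELTA0[i]], [OMEGA0[i]]
    for k in range(len(other_delta) - 1):
        p_elec = COUPLING * math.sin(delta[k] - other_delta[k])
        accel = (P_MECH[i] - DAMPING[i] * omega[k] - p_elec) / INERTIA[i]
        delta.append(delta[k] + dt * omega[k])
        omega.append(omega[k] + dt * accel)
    return delta


def waveform_relaxation(steps=200, dt=0.01, tol=1e-10):
    """Gauss-Jacobi waveform relaxation; returns (angle waveforms, sweeps)."""
    # Initial guess: each angle stays at its initial value.
    waves = [[DELTA0[0]] * (steps + 1), [DELTA0[1]] * (steps + 1)]
    for sweep in range(1, steps + 2):
        # The two solves are independent and could run in parallel.
        new = [integrate_machine(0, waves[1], dt),
               integrate_machine(1, waves[0], dt)]
        change = max(abs(a - b) for w_new, w_old in zip(new, waves)
                     for a, b in zip(w_new, w_old))
        waves = new
        if change < tol:
            break
    return waves, sweep


def coupled_reference(steps=200, dt=0.01):
    """Conventional coupled forward-Euler solve, for comparison."""
    d, w = [list(DELTA0)], [list(OMEGA0)]
    for k in range(steps):
        nd, nw = [], []
        for i, j in ((0, 1), (1, 0)):
            p_elec = COUPLING * math.sin(d[k][i] - d[k][j])
            accel = (P_MECH[i] - DAMPING[i] * w[k][i] - p_elec) / INERTIA[i]
            nd.append(d[k][i] + dt * w[k][i])
            nw.append(w[k][i] + dt * accel)
        d.append(nd)
        w.append(nw)
    return [[row[0] for row in d], [row[1] for row in d]]
```

The appeal for parallel hardware is that the per-machine solves inside one sweep do not depend on each other, at the cost of repeating sweeps until the waveforms stop changing.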

Communications for Next Generation single chip computers

Author: Chan Douglas
Smith David R.
Publication venue: California Institute of Technology Library
Publication date: 01/01/1981

It is the thesis of this report that much of what is presently thought to require specialized VLSI functions might instead be achieved by combinations of fast general purpose single chip computers with upgraded communication facilities. To this end, the characteristics of applications of this nature are first surveyed briefly and some working principles established. In the light of these, three different chip philosophies are explored in some detail. This study shows that some upgrading of typical single chip I/O will definitely be necessary, but that this upgrading does not have to be complex and that true multiprocessor-multibus operation could be achieved without excessive cost.