Search CORE

722,674 research outputs found

Exploiting the Parallelism Exposed by Partial Evaluation

Author: Surati Rajeev
Publication venue
Publication date: 01/05/1994
Field of study

We describe the key role played by partial evaluation in the Supercomputing Toolkit, a parallel computing system for scientific applications that effectively exploits the vast amount of parallelism exposed by partial evaluation. The Supercomputing Toolkit parallel processor and its associated partial evaluation-based compiler have been used extensively by scientists at MIT, and have made possible recent results in astrophysics showing that the motion of the planets in our solar system is chaotically unstable

DSpace@MIT

Learning-Based Approach to Estimation of Morphable Model Parameters

Author: Kumar Vinay
Poggio Tomaso
Publication venue
Publication date: 01/09/2000
Field of study

DSpace@MIT

Partial Evaluation for Scientific Computing: The Supercomputer Toolkit Experience

Author: Berlin Andrew
Surati Rajeev
Publication venue
Publication date: 01/01/1994
Field of study

We describe the key role played by partial evaluation in the Supercomputer Toolkit, a parallel computing system for scientific applications that effectively exploits the vast amount of parallelism exposed by partial evaluation. The Supercomputer Toolkit parallel processor and its associated partial evaluation-based compiler have been used extensively by scientists at M.I.T., and have made possible recent results in astrophysics showing that the motion of the planets in our solar system is chaotically unstable

CiteSeerX

DSpace@MIT

Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems

Author: Albutiu Martina-Cezara
Kemper Alfons
Neumann Thomas
Publication venue
Publication date: 01/01/2012
Field of study

Two emerging hardware trends will dominate the database system technology in the near future: increasing main memory capacities of several TB per server and massively parallel multi-core processing. Many algorithmic and control techniques in current database technology were devised for disk-based systems where I/O dominated the performance. In this work we take a new look at the well-known sort-merge join which, so far, has not been in the focus of research in scalable massively parallel multi-core data processing as it was deemed inferior to hash joins. We devise a suite of new massively parallel sort-merge (MPSM) join algorithms that are based on partial partition-based sorting. Contrary to classical sort-merge joins, our MPSM algorithms do not rely on a hard to parallelize final merge step to create one complete sort order. Rather they work on the independently created runs in parallel. This way our MPSM algorithms are NUMA-affine as all the sorting is carried out on local memory partitions. An extensive experimental evaluation on a modern 32-core machine with one TB of main memory proves the competitive performance of MPSM on large main memory databases with billions of objects. It scales (almost) linearly in the number of employed cores and clearly outperforms competing hash join proposals - in particular it outperforms the "cutting-edge" Vectorwise parallel query engine by a factor of four.Comment: VLDB201

arXiv.org e-Print Archive

CiteSeerX

Impact of inter-cell interference on ﬂow level performance of scheduling schemes for the UMTS EUL

Author: Berg J.L. van den
Dimitrova D.C.
Heijenk G.J.
Litjens R.
Publication venue: IEEE
Publication date: 01/01/2008
Field of study

The UMTS Enhanced Uplink (EUL) is expected to provide higher capacity, increased data rates, and smaller latency on the communication link from users towards the network. A key mechanism in EUL trafﬁc handling is the packet scheduler, for which a number of basic schemes can be identiﬁed (one-by- one, partial parallel, and full parallel). In this paper we analyze the interaction between the EUL scheduling scheme deployed in the network and the inter-cell interference. On the one hand, different scheduling schemes cause different inter-cell interference patterns on neighbouring cells. On the other hand, the different schemes are affected by inter-cell interference in different ways. The scheduling schemes are evaluated and compared under different approaches for reserving part of the allowed noise rise at the base station for inter-cell interference. For our analysis, we have developed a hybrid analytical/simulation approach allowing for fast evaluation of performance measures such as the mean ﬂow transfer time and fairness expressing how the performance depends on the user’s location. This approach takes into account both the packet-level characteristics and the ﬂow-level dynamics due to the random user behaviour

University of Twente Research Information

Efficient approximation of functions of some large matrices by partial fraction expansions

Author: Bertaccini Daniele
Durastante Fabio
Popolizio Marina
Publication venue: 'Informa UK Limited'
Publication date: 12/04/2018
Field of study

Some important applicative problems require the evaluation of functions

\Psi

of large and sparse and/or \emph{localized} matrices

A

. Popular and interesting techniques for computing

\Psi(A)

and

\Psi(A)\mathbf{v}

, where

\mathbf{v}

is a vector, are based on partial fraction expansions. However, some of these techniques require solving several linear systems whose matrices differ from

A

by a complex multiple of the identity matrix

I

for computing

\Psi(A)\mathbf{v}

or require inverting sequences of matrices with the same characteristics for computing

\Psi(A)

. Here we study the use and the convergence of a recent technique for generating sequences of incomplete factorizations of matrices in order to face with both these issues. The solution of the sequences of linear systems and approximate matrix inversions above can be computed efficiently provided that

A^{-1}

shows certain decay properties. These strategies have good parallel potentialities. Our claims are confirmed by numerical tests

arXiv.org e-Print Archive

Archivio della Ricerca - Università di Pisa

ART