75 research outputs found

Deterministic 1-k routing on meshes with applications to worm-hole routing

Author: Kaufmann M.
Sibeyn J.
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1993

In $1$-$k$ routing each of the $n^2$ processing units of an $n \times n$ mesh connected computer initially holds $1$ packet which must be routed such that any processor is the destination of at most $k$ packets. This problem reflects the practical desire for routing better than the popular routing of permutations. $1$-$k$ routing also has implications for hot-potato worm-hole routing, which is of great importance for real world systems. We present a near-optimal deterministic algorithm running in $\sqrt{k} \cdot n / 2 + \mathcal{O}(n)$ steps. We give a second algorithm with slightly worse routing time but working queue size three. Applying this algorithm considerably reduces the routing time of hot-potato worm-hole routing. Non-trivial extensions are given to the general $l$-$k$ routing problem and for routing on higher dimensional meshes. Finally we show that $k$-$k$ routing can be performed in $\mathcal{O}(k \cdot n)$ steps with working queue size four. Hereby the hot-potato worm-hole routing problem can be solved in $\mathcal{O}(k^{3/2} \cdot n)$ steps.

MPG.PuRe

Matrix transpose on meshes with buses

Author: Békési József
Galambos Gábor
Publication date: 01/01/2016

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

A Glossary of Provincial and Local Words Used in England by Francis Grose, Esq., F. R. & A.S.S. To which is now first incorporated the Supplement by Samuel Pegge, Esq. F.S.A.

Author: Grose Francis, 1731-1791
Pegge Samuel, 1733-1800
Publication venue: John Russell Smith (London, United Kingdom)
Publication date: 01/01/1839

Glossary. -- Part of the Varia 1800-1950 collection of the Salamanca Corpus. -- Francis Grose, 1731-1791. -- Samuel Pegge, 1733-1800. -- A Glossary of Provincial and Local Words Used in England by Francis Grose, Esq., F. R. & A.S.S. To which is now first incorporated the Supplement by Samuel Pegge, Esq. F.S.A. -- 1839. 1839 edition of Francis Grose's glossary incorporating Samuel Pegge's supplement.

Gestion del Repositorio Documental de la Universidad de Salamanca

Progress Report : 1991 - 1994

Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/1994

MPG.PuRe

Optimizing Communication for Massively Parallel Processing

Author: Kumar Sameer
Publication date: 01/04/2005

The current trends in high performance computing show that large machines with tens of thousands of processors will soon be readily available. The IBM Blue Gene/L machine with 128k processors (which is currently being deployed) is an important step in this direction. In this scenario, it is going to be a significant burden for the programmer to manually scale his applications. This task of scaling involves addressing issues like load imbalance and communication overhead. In this thesis, we explore several communication optimizations to help parallel applications scale easily on a large number of processors. We also present automatic runtime techniques to relieve the programmer from the burden of optimizing communication in his applications. This thesis explores processor virtualization to improve communication performance in applications. With processor virtualization, the computation is mapped to virtual processors (VPs). After one VP has finished computation and is waiting for responses to its messages, another VP can compute, thus overlapping communication with computation. This overlap is only effective if the processor overhead of the communication operation is a small fraction of the total communication time. Fortunately, with network interfaces having co-processors, this happens to be true and processor virtualization has a natural advantage on such interconnects. The communication optimizations we present in this thesis are motivated by applications such as NAMD (a classical molecular dynamics application) and CPAIMD (a quantum chemistry application). Applications like NAMD and CPAIMD consume a fair share of the time available on supercomputers, so improving their performance would be of great value. We have successfully scaled NAMD to 1 TF of peak performance on 3000 processors of PSC Lemieux, using the techniques presented in this thesis. We study both point-to-point communication and collective communication (specifically all-to-all communication).
On a large number of processors, all-to-all communication can take several milliseconds to finish. With synchronous collectives defined in MPI, the processor idles while the collective messages are in flight. Therefore, we demonstrate an asynchronous collective communication framework to let the CPU compute while the all-to-all messages are in flight. We also show that the best strategy for all-to-all communication depends on the message size, number of processors and other dynamic parameters. This suggests that these parameters can be observed at runtime and used to choose the optimal strategy for all-to-all communication. In this thesis, we demonstrate adaptive strategy switching for all-to-all communication. The communication optimization framework presented in this thesis has been designed to optimize communication in the context of processor virtualization and dynamic migrating objects. We present the streaming strategy to optimize fine-grained object-to-object communication. In this thesis, we motivate the need for hardware collectives, as processor-based collectives can be delayed by intermediate processors that are busy with computation. We explore a next generation interconnect that supports collectives in the switching hardware. We show the performance gains of hardware collectives through synthetic benchmarks.

Illinois Digital Environment for Access to Learning and Scholarship Repository

Cot and Cradle Stories

Author: Traill Catherine Parr
Publication venue: UND Scholarly Commons
Publication date: 01/01/1895

UND Scholarly Commons (University of North Dakota)

Scalable and Secure Multicast Routing for Mobile Ad-hoc Networks

Author: Schmittner Milan
Publication date: 30/09/2014

Mobile Ad-Hoc Networks (MANETs) are decentralized and autonomous communication systems: They can be used to provide connectivity when a natural disaster has brought down the infrastructure, or they can support freedom of speech in countries with governmental Internet restrictions. MANET design requires careful attention to scalability and security due to low-capacity and error-prone wireless links as well as the openness of these systems. In this thesis, we address the issue of multicast as a means to efficiently support the MANET application of group communication on the network layer. To this aim, we first survey the research literature on the current state of the art in MANET routing, and we identify a gap between scalability and security in multicast routing protocols, two aspects that were only considered in isolation until now. We then develop an explicit multicast protocol based on the design of a secure unicast protocol, aiming to maintain its security properties while introducing minimal overhead. Our simulation results reveal that our protocol reduces bandwidth utilization in group communication scenarios by up to 45% compared to the original unicast protocol, while providing significantly better resilience under blackhole attacks. A comparison with pure flooding allows us to identify a practical group size limit, and we present ideas for better large-group support.

TUbiblio

tuprints

Polyvalent Parallelizations for Hierarchical Block Matching Motion Estimation

Author: Andreas I. Svolos
Charalampos G. Konstantopoulos
Christos Kaklamanis
Publication venue: University of Zagreb - University Computing Centre
Publication date: 01/01/2000

Block matching motion estimation algorithms are widely used in video coding schemes. In this paper, we design an efficient hierarchical block matching motion estimation (HBMME) algorithm on a hypercube multiprocessor. Unlike systolic array designs, this solution is not tied down to specific values of algorithm parameters and thus offers increased flexibility. Moreover, the hypercube network can efficiently handle the non-regular data flow of the HBMME algorithm. Our techniques nearly eliminate the occurrence of “difficult” communication patterns, namely many-to-many personalized communication, by replacing them with simple shift operations. These operations have an efficient implementation on most interconnection networks and thus our techniques can be adapted to other networks as well. With regard to the employed multiprocessor, we make no specific assumption about the amount of local memory residing in each processor. Instead, we introduce a free parameter S and assume that each processor has O(S) local memory. By doing so, we handle all the cases of modern multiprocessors, that is, fine-grained, medium-grained and coarse-grained multiprocessors, and thus our design is quite general.

Crossref

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Truly efficient parallel algorithms: 1-optimal multisearch for an extension of the BSP model

Author: Bäumker Armin
Meyer auf der Heide Friedhelm
Dittrich Wolfgang
Publication venue: Published by Elsevier B.V.

In this paper we design and analyse parallel algorithms with the goal of obtaining exact bounds on their speed-ups on real machines. For this purpose we define an extension of Valiant's BSP model, BSP*, that rewards blockwise communication, and use Valiant's notion of 1-optimality. Intuitively, a 1-optimal parallel algorithm for $p$ processors achieves speed-up close to $p$. We consider the Multisearch Problem: Assume a strip in 2D to be partitioned into $m$ segments. Given $n$ query points in the strip, the task is to locate, for each query, its segment. For $m \le n$ and $n \ge p$ we present a deterministic BSP* algorithm that is 1-optimal if $n/p \ge \log^2 n$. For $m > n \ge p$, we present a randomized BSP* algorithm that is 1-optimal with high probability if $m \le 2^p$ and $n/p \ge \log^3 n$. Both results hold for a wide range of BSP* parameters, where the range becomes larger with growing input size $n$. We further report on implementation work. Previous parallel algorithms for Multisearch were far away from being 1-optimal in our model and did not consider blockwise communication.

Elsevier - Publisher Connector

A Dictionary of the Kentish Dialect and Provincialisms in Use in the County of Kent.

Author: Parish William Douglas, 1833-1904
Shaw William Francis, 1839-1904
Publication venue: Trübner & Co. (London, United Kingdom)
Publication date: 01/01/1887

Glossary. -- Kent. -- Part of the Varia 1800-1950 collection of The Salamanca Corpus. -- William Douglas Parish (1833-1904). -- William Francis Shaw (1839-1904). -- A Dictionary of the Kentish Dialect and Provincialisms in Use in the County of Kent. -- 1887. Glossary of the Kentish dialect.

Gestion del Repositorio Documental de la Universidad de Salamanca