25 research outputs found

    The Use of Refined Descriptive Sampling and Applications in Parallel Monte Carlo Simulation

    Get PDF
    Refined descriptive sampling is designed to improve upon the descriptive sampling method for experimentation in simulation. The former reduces significantly the risk of sampling bias generated by descriptive sampling and eliminates its problem related to the sample size. In this paper, we propose an optimal parallel Monte Carlo simulation algorithm using refined descriptive sampling and evaluate in parallel architecture, performance measures of a stable M/M/1 queueing system, a Pert network and the Newsboy problem

    Developing a Benchmark for Evaluating the Performance of Parallel Computers

    Get PDF
    This paper discusses the development of a portable suite of benchmarking programs for parallel computers. Comparative measurement of the performance of parallel computing systems has been limited because of the great diversity of architectures and of processor interconnection schemes. One solution is to translate benchmark codes into a consistent and portable parallel language. This paper reports on progress in developing such a portable suite of benchmarks. An extensive introduction to parallel computing is included as an appendix, to provide a thorough understanding of the factors complicating development of the performance suite. Key to the development was the use of p4, a library of tools developed at Argonne National Laboratory. The benchmark codes were translated successfully using p4 and were run on a variety of parallel machines. Conclusions and suggestions for future work are given

    Divide-and-conquer algorithms for multiprocessors

    Get PDF
    During the past decade there has been a tremendous surge in understanding the nature of parallel computation. A number of parallel computers are commercially available. However, there are some problems in developing application programs on these computers;This dissertation considers various issues involved in implementing parallel algorithms on Multiple Instruction Multiple Data (MIMD) machines with a bounded number of processors. Strategies for implementing divide-and-conquer algorithms on MIMD machines are proposed. Results linking time complexity, communication complexity and the complexity of divide-and-combine functions of divide-and-conquer algorithms are analyzed. An efficient criterion for partitioning a parallel program is proposed and a method for obtaining a closed form expression for time complexity of a parallel program in terms of problem size and number of processors is developed

    Context flow architecture

    Get PDF

    Simulation, Analysis, and Optimization of Heterogeneous CPU-GPU Systems

    Get PDF
    With the computing industry\u27s recent adoption of the Heterogeneous System Architecture (HSA) standard, we have seen a rapid change in heterogeneous CPU-GPU processor designs. State-of-the-art heterogeneous CPU-GPU processors tightly integrate multicore CPUs and multi-compute unit GPUs together on a single die. This brings the MIMD processing capabilities of the CPU and the SIMD processing capabilities of the GPU together into a single cohesive package with new HSA features comprising better programmability, coherency between the CPU and GPU, shared Last Level Cache (LLC), and shared virtual memory address spaces. These advancements can potentially bring marked gains in heterogeneous processor performance and have piqued the interest of researchers who wish to unlock these potential performance gains. Therefore, in this dissertation I explore the heterogeneous CPU-GPU processor and application design space with the goal of answering interesting research questions, such as, (1) what are the architectural design trade-offs in heterogeneous CPU-GPU processors and (2) how do we best maximize heterogeneous CPU-GPU application performance on a given system. To enable my exploration of the heterogeneous CPU-GPU design space, I introduce a novel discrete event-driven simulation library called KnightSim and a novel computer architectural simulator called M2S-CGM. M2S-CGM includes all of the simulation elements necessary to simulate coherent execution between a CPU and GPU with shared LLC and shared virtual memory address spaces. I then utilize M2S-CGM for the conduct of three architectural studies. First, I study the architectural effects of shared LLC and CPU-GPU coherence on the overall performance of non-collaborative GPU-only applications. Second, I profile and analyze a set of collaborative CPU-GPU applications to determine how to best optimize them for maximum collaborative performance. Third, I study the impact of varying four key architectural parameters on collaborative CPU-GPU performance by varying GPU compute unit coalesce size, GPU to memory controller bandwidth, GPU frequency, and system wide switching fabric latency

    Uma ferramenta para o desenvolvimento de modelos de simulação integrada ao ambiente grid

    Get PDF
    Dissertação (mestrado) - Universidade Federal de Santa Catarina, Centro Tecnológico. Programa de Pós-Graduação em Ciência da Computação.Com o atual aumento da capacidade computacional, pode-se montar ambientes de alto desempenho com computadores de baixo custo. Dessa forma, esses ambientes têm se tornado cada vez mais populares. Existem vários tipos de configurações que podem prover o alto desempenho. No entanto, há baixa oferta de ambientes de modelagem integrados com grid para o público-alvo (modeladores). Nesta dissertação foi implementada uma ferramenta que possibilita a criação de modelos de simulação discreta e a execução destes em um grid computacional. Essa ferramenta é composta de duas partes: editor e processador de modelos. O editor pode criar e alterar modelos de simulação discreta. O processador foi desenvolvido para interpretá-los e executá-los. As partes se comunicam por meio de um arquivo que contém a representação do modelo, o qual está descrito em XML. Alguns testes foram realizados com o intuito de validar as estatísticas geradas pelo processador de modelos. Esses testes foram satisfatórios e mostram a possibilidade da integração das ferramentas de construção de modelos com os ambientes em grid

    Development of a parallel database environment

    Get PDF

    The exploitation of parallelism on shared memory multiprocessors

    Get PDF
    PhD ThesisWith the arrival of many general purpose shared memory multiple processor (multiprocessor) computers into the commercial arena during the mid-1980's, a rift has opened between the raw processing power offered by the emerging hardware and the relative inability of its operating software to effectively deliver this power to potential users. This rift stems from the fact that, currently, no computational model with the capability to elegantly express parallel activity is mature enough to be universally accepted, and used as the basis for programming languages to exploit the parallelism that multiprocessors offer. To add to this, there is a lack of software tools to assist programmers in the processes of designing and debugging parallel programs. Although much research has been done in the field of programming languages, no undisputed candidate for the most appropriate language for programming shared memory multiprocessors has yet been found. This thesis examines why this state of affairs has arisen and proposes programming language constructs, together with a programming methodology and environment, to close the ever widening hardware to software gap. The novel programming constructs described in this thesis are intended for use in imperative languages even though they make use of the synchronisation inherent in the dataflow model by using the semantics of single assignment when operating on shared data, so giving rise to the term shared values. As there are several distinct parallel programming paradigms, matching flavours of shared value are developed to permit the concise expression of these paradigms.The Science and Engineering Research Council

    Mapping Signal Processing Algorithms on Parallel Arcidtectures

    Get PDF
    Electrical Engineerin
    corecore