Distributed Finite Element Analysis Using a Transputer Network

Author: Baehmann Peggy
Danial Albert
Favenesi James
Reynolds Brian
Shephard Mark
Tombrello Joseph
Turrentine Ronald
Watson James
Yang Dabby

The principal objective of this research effort was to demonstrate the extraordinarily cost-effective acceleration of finite element structural analysis problems using a transputer-based parallel processing network. This objective was accomplished in the form of a commercially viable parallel processing workstation. The workstation is a desktop-size, low-maintenance computing unit capable of supercomputer performance yet costing two orders of magnitude less. To achieve the principal research objective, a transputer-based structural analysis workstation termed XPFEM was implemented with linear static structural analysis capabilities resembling those of commercially available NASTRAN. Finite element model files, generated using the on-line preprocessing module or external preprocessing packages, are downloaded to a network of 32 transputers for accelerated solution. The system currently executes at about one third of Cray X-MP24 speed, but additional acceleration appears likely. For the NASA-selected demonstration problem of a Space Shuttle main engine turbine blade model with about 1500 nodes and 4500 independent degrees of freedom, the Cray X-MP24 required 23.9 seconds to obtain a solution while the transputer network, operated from an IBM PC-AT compatible host computer, required 71.7 seconds. Consequently, the $80,000 transputer network demonstrated a cost-performance ratio about 60 times better than the $15,000,000 Cray X-MP24 system.
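The "about 60 times" figure can be checked from the numbers quoted in the abstract. The sketch below assumes that 80,000 and 15,000,000 are system costs in US dollars and that cost-performance is compared as cost multiplied by solution time; the function and variable names are illustrative only.

```python
def cost_time_product(cost_usd, seconds):
    """Cost multiplied by time-to-solution; lower is better."""
    return cost_usd * seconds

# Figures quoted in the abstract (currency assumed to be US dollars).
transputer_cost, transputer_seconds = 80_000, 71.7
cray_cost, cray_seconds = 15_000_000, 23.9

# How much slower the transputer network was on the demonstration problem.
slowdown = transputer_seconds / cray_seconds

# How many times better the transputer network's cost-time product is.
advantage = (cost_time_product(cray_cost, cray_seconds)
             / cost_time_product(transputer_cost, transputer_seconds))

print(f"slowdown: {slowdown:.2f}x, cost-performance advantage: {advantage:.1f}x")
```

Under these assumptions the network is about three times slower but roughly 62 times better in cost-performance, consistent with the rounded figure in the abstract.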

OpenACC Based GPU Parallelization of Plane Sweep Algorithm for Geometric Intersection

Author: AB Khlopotine
JL Bentley
M McKenney
MT Goodrich
Publication venue: e-Publications@Marquette
Publication date: 01/01/2019

Line segment intersection is one of the elementary operations in computational geometry. Complex problems in Geographic Information Systems (GIS), such as finding map overlays or spatial joins using polygonal data, require solving segment intersections. The plane sweep paradigm is used for finding geometric intersections efficiently. However, it is difficult to parallelize because it processes spatial events in order. We present a new fine-grained parallel algorithm for geometric intersection and its CPU and GPU implementations using OpenMP and OpenACC. To the best of our knowledge, this is the first work demonstrating an effective parallelization of plane sweep on GPUs. We chose a compiler-directive-based approach for implementation because it makes parallelizing sequential code simple. Using an Nvidia Tesla P100 GPU, our implementation achieves around 40X speedup for the line segment intersection problem on 40K and 80K data sets compared to the sequential CGAL library.
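For readers unfamiliar with the underlying operation, the following is a minimal sketch of the pairwise segment intersection test, applied by brute force to every pair. It is not the paper's plane sweep algorithm and uses neither OpenMP nor OpenACC; it only shows the O(n^2) baseline that sweep-based methods improve on. All function names are hypothetical.

```python
def orient(p, q, r):
    """Sign of the turn p -> q -> r: 1 counterclockwise, -1 clockwise, 0 collinear."""
    v = (q[0] - p[0]) * (r[1] - p[1]) - (q[1] - p[1]) * (r[0] - p[0])
    return (v > 0) - (v < 0)

def on_segment(p, q, r):
    """True if r lies in the bounding box of segment pq (r assumed collinear with pq)."""
    return (min(p[0], q[0]) <= r[0] <= max(p[0], q[0])
            and min(p[1], q[1]) <= r[1] <= max(p[1], q[1]))

def segments_intersect(s, t):
    """True if closed segments s and t share at least one point."""
    p1, q1 = s
    p2, q2 = t
    o1, o2 = orient(p1, q1, p2), orient(p1, q1, q2)
    o3, o4 = orient(p2, q2, p1), orient(p2, q2, q1)
    if o1 != o2 and o3 != o4:          # endpoints straddle each other's line
        return True
    # Collinear special cases: an endpoint lies on the other segment.
    return ((o1 == 0 and on_segment(p1, q1, p2))
            or (o2 == 0 and on_segment(p1, q1, q2))
            or (o3 == 0 and on_segment(p2, q2, p1))
            or (o4 == 0 and on_segment(p2, q2, q1)))

def brute_force_intersections(segments):
    """Index pairs (i, j), i < j, of all intersecting segments."""
    n = len(segments)
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if segments_intersect(segments[i], segments[j])]

segments = [((0, 0), (2, 2)), ((0, 2), (2, 0)), ((3, 3), (4, 4)), ((1, 1), (3, 3))]
pairs = brute_force_intersections(segments)
print(pairs)
```

Integer coordinates are used here so that the orientation test is exact; with floating-point input the sign can be wrong for nearly collinear points.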

R Package distrMod: S4 Classes and Methods for Probability Models

Author: Matthias Kohl
Peter Ruckdeschel

Package distrMod provides an object-oriented (more specifically, S4-style) implementation of probability models. It also contains functions and methods to compute minimum criterion estimators, in particular maximum likelihood and minimum distance estimators.
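The idea of a minimum criterion estimator can be sketched outside R: choose a criterion function of the parameter and minimize it numerically. The Python example below does not use or mirror the distrMod API; it assumes an exponential model, takes the negative log-likelihood as the criterion, and minimizes it with a simple golden-section search. All names are hypothetical.

```python
import math

def golden_section_minimize(criterion, lo, hi, iterations=100):
    """Minimize a unimodal function on [lo, hi] by golden-section search."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(iterations):
        c = hi - inv_phi * (hi - lo)
        d = lo + inv_phi * (hi - lo)
        if criterion(c) < criterion(d):
            hi = d      # minimum lies in [lo, d]
        else:
            lo = c      # minimum lies in [c, hi]
    return (lo + hi) / 2.0

def exponential_neg_log_likelihood(data):
    """Criterion for an exponential model with rate parameter `rate`."""
    n, total = len(data), sum(data)
    return lambda rate: -n * math.log(rate) + rate * total

data = [0.5, 1.0, 1.5, 2.0]
rate_hat = golden_section_minimize(exponential_neg_log_likelihood(data), 1e-6, 10.0)
print(rate_hat, 1.0 / (sum(data) / len(data)))  # numerical vs closed-form MLE
```

Swapping the criterion for a distance between the empirical and model distribution functions would give a minimum distance estimator with the same minimization step.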

Recent progress in exact geometric computation

Author: Li C.
Pion S.
Yap C.K.
Publication venue: Published by Elsevier Inc.
Publication date: 01/01/2005

Computational geometry has produced an impressive wealth of efficient algorithms. The robust implementation of these algorithms remains a major issue. Among the many proposed approaches for solving numerical non-robustness, Exact Geometric Computation (EGC) has emerged as one of the most successful. This survey describes recent progress in EGC research in three key areas: constructive zero bounds, approximate expression evaluation and numerical filters.
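A numerical filter can be illustrated with the 2D orientation predicate: evaluate the determinant in floating point, accept its sign only if the magnitude exceeds a rounding-error bound, and otherwise recompute exactly. The sketch below is an illustration and is not taken from the survey. The error-bound constant is the one commonly used for the first stage of Shewchuk-style orientation filters, and it assumes IEEE double inputs with no overflow or underflow. The exact fallback uses Python's `fractions.Fraction`.

```python
from fractions import Fraction

EPS = 2.0 ** -53                      # half the spacing of doubles near 1.0
ERR_COEFF = (3.0 + 16.0 * EPS) * EPS  # first-stage bound coefficient (assumption)

def orientation(a, b, c):
    """Exact sign of the orientation of points a, b, c given as float pairs.

    Returns 1 for counterclockwise, -1 for clockwise, 0 for collinear.
    """
    det_left = (a[0] - c[0]) * (b[1] - c[1])
    det_right = (a[1] - c[1]) * (b[0] - c[0])
    det = det_left - det_right
    err_bound = ERR_COEFF * (abs(det_left) + abs(det_right))
    if abs(det) > err_bound:
        # Filter succeeds: rounding error cannot have flipped the sign.
        return 1 if det > 0 else -1
    # Filter fails: redo the computation in exact rational arithmetic.
    ax, ay, bx, by, cx, cy = (Fraction(v) for v in (*a, *b, *c))
    exact = (ax - cx) * (by - cy) - (ay - cy) * (bx - cx)
    return (exact > 0) - (exact < 0)

print(orientation((0.0, 0.0), (1.0, 0.0), (0.0, 1.0)))   # clearly counterclockwise
print(orientation((0.1, 0.1), (0.2, 0.2), (0.3, 0.3)))   # collinear, needs exact path
```

The fast path handles most inputs, so the cost of exact arithmetic is paid only for nearly degenerate configurations.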

INRIA a CCSD electronic archive server

Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging

Author: Bernstein Gilbert
DeVito Zachary
Fisher Matthew
Hanrahan Pat
Mara Michael
Nießner Matthias
Ragan-Kelley Jonathan
Theobalt Christian
Zollhöfer Michael
Publication date: 01/01/2016

Many graphics and vision problems can be expressed as non-linear least squares optimizations of objective functions over visual data, such as images and meshes. The mathematical descriptions of these functions are extremely concise, but their implementation in real code is tedious, especially when optimized for real-time performance on modern GPUs in interactive applications. In this work, we propose a new language, Opt (available under http://optlang.org), for writing these objective functions over image- or graph-structured unknowns concisely and at a high level. Our compiler automatically transforms these specifications into state-of-the-art GPU solvers based on Gauss-Newton or Levenberg-Marquardt methods. Opt can generate different variations of the solver, so users can easily explore tradeoffs in numerical precision, matrix-free methods, and solver approaches. In our results, we implement a variety of real-world graphics and vision applications. Their energy functions are expressible in tens of lines of code and produce highly optimized GPU solver implementations. These solvers have performance competitive with the best published hand-tuned, application-specific GPU solvers, and orders of magnitude beyond a general-purpose auto-generated solver.
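To make the Gauss-Newton method mentioned above concrete, here is a minimal CPU-only sketch on a toy two-parameter curve fit. It is not written in Opt and does not reflect the solvers Opt generates; the model y = a * exp(b * x), the data, and all names are assumptions chosen for illustration.

```python
import math

def gauss_newton_exp_fit(xs, ys, a, b, iterations=20):
    """Fit y ~ a * exp(b * x) by Gauss-Newton on the sum of squared residuals."""
    for _ in range(iterations):
        jtj_aa = jtj_ab = jtj_bb = 0.0   # entries of J^T J (2x2, symmetric)
        jtr_a = jtr_b = 0.0              # entries of J^T r
        for x, y in zip(xs, ys):
            e = math.exp(b * x)
            r = y - a * e                # residual: data minus model
            ja, jb = e, a * x * e        # model derivatives w.r.t. a and b
            jtj_aa += ja * ja
            jtj_ab += ja * jb
            jtj_bb += jb * jb
            jtr_a += ja * r
            jtr_b += jb * r
        # Solve the 2x2 normal equations (J^T J) delta = J^T r by Cramer's rule.
        det = jtj_aa * jtj_bb - jtj_ab * jtj_ab
        a += (jtj_bb * jtr_a - jtj_ab * jtr_b) / det
        b += (jtj_aa * jtr_b - jtj_ab * jtr_a) / det
    return a, b

xs = [0.0, 1.0, 2.0, 3.0]
ys = [2.0 * math.exp(0.5 * x) for x in xs]   # noise-free synthetic data
a_hat, b_hat = gauss_newton_exp_fit(xs, ys, a=1.8, b=0.45)
print(a_hat, b_hat)
```

Plain Gauss-Newton like this has no damping or line search, so it relies on a reasonable starting point; Levenberg-Marquardt adds a damping term to the normal equations to make the step more robust.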

arXiv.org e-Print Archive

Fourier Transform of the Stretched Exponential Function: Analytic Error Bounds, Double Exponential Transform, and Open-Source Implementation libkww

Author: Wuttke Joachim
Publication venue: 'MDPI AG'
Publication date: 01/01/2012

The C library libkww provides functions to compute the Kohlrausch-Williams-Watts function, i.e. the Laplace-Fourier transform of the stretched (or compressed) exponential function exp(-t^β) for exponents β between 0.1 and 1.9 with sixteen-digit accuracy. Analytic error bounds are derived for the low and high frequency series expansions. For intermediate frequencies the numeric integration is enormously accelerated by using the Ooura-Mori double exponential transformation. The source code is available from the project home page http://apps.jcns.fz-juelich.de/doku/sc/kww. Comment: Version 3. 11 pages, 4 figures. Describes software version 2.
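As an illustration of a low-frequency series, expanding the cosine in the cosine transform of exp(-t^β) term by term gives Q(ω) = (1/β) Σ_k (-1)^k Γ((2k+1)/β) ω^(2k) / (2k)!. The Python sketch below sums this series directly. It is not libkww's code or API and carries no error bounds. The fixed term count is an arbitrary assumption, and the sum converges only for β > 1 (any ω) or for β = 1 with |ω| < 1.

```python
import math

def kww_cos_low_freq(omega, beta, n_terms=40):
    """Truncated low-frequency series for the cosine transform of exp(-t**beta).

    Illustrative only: fixed number of terms, no error control.
    """
    if omega == 0.0:
        return math.gamma(1.0 / beta) / beta
    total = 0.0
    for k in range(n_terms):
        # Work with logarithms so large Gamma values do not overflow.
        log_term = (math.lgamma((2 * k + 1) / beta)
                    - math.lgamma(2 * k + 1)          # log((2k)!)
                    + 2 * k * math.log(abs(omega)))
        total += (-1) ** k * math.exp(log_term)
    return total / beta

# Sanity checks against closed forms:
# beta = 1 gives 1 / (1 + omega^2); beta = 2 gives (sqrt(pi) / 2) * exp(-omega^2 / 4).
q1 = kww_cos_low_freq(0.5, 1.0)
q2 = kww_cos_low_freq(1.0, 2.0)
print(q1, q2)
```

The two closed-form cases are useful regression checks for any implementation of the transform.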

arXiv.org e-Print Archive

Juelich Shared Electronic Resources