21 research outputs found

    SVD-DIP: Overcoming the Overfitting Problem in DIP-based CT Reconstruction

    The deep image prior (DIP) is a well-established unsupervised deep learning method for image reconstruction; yet it is far from flawless. The DIP overfits to noise unless it is early-stopped or optimized via a regularized objective. We build on the regularized fine-tuning of a pretrained DIP by adopting a novel strategy that restricts learning to the adaptation of singular values. The proposed SVD-DIP uses ad hoc convolutional layers whose pretrained parameters are decomposed via the singular value decomposition. Optimizing the DIP then consists solely of fine-tuning the singular values, while keeping the left and right singular vectors fixed. We thoroughly validate the proposed method on real-measured μCT data of a lotus root as well as two medical datasets (LoDoPaB and Mayo). We report significantly improved stability of the DIP optimization, overcoming the overfitting to noise.
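
    As a minimal sketch of the idea (assuming PyTorch; the wrapper name SVDConv2d and its details are hypothetical, not the authors' code), one can decompose a pretrained convolution's weight once and expose only its singular values to the optimizer:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SVDConv2d(nn.Module):
    """Hypothetical sketch: fine-tune only the singular values of a
    pretrained conv layer's weight, keeping singular vectors frozen."""
    def __init__(self, conv: nn.Conv2d):
        super().__init__()
        w = conv.weight.detach()                 # (out_ch, in_ch, kh, kw)
        self.w_shape = w.shape
        # Flatten to a matrix and decompose once: W = U diag(s) V^T
        U, s, Vh = torch.linalg.svd(w.flatten(1), full_matrices=False)
        self.register_buffer("U", U)             # frozen left singular vectors
        self.register_buffer("Vh", Vh)           # frozen right singular vectors
        self.s = nn.Parameter(s)                 # trainable singular values
        if conv.bias is not None:
            self.register_buffer("bias", conv.bias.detach())
        else:
            self.bias = None
        self.stride, self.padding = conv.stride, conv.padding

    def forward(self, x):
        # Reassemble the weight from the (partly frozen) factors.
        w = ((self.U * self.s) @ self.Vh).view(self.w_shape)
        return F.conv2d(x, w, self.bias, stride=self.stride, padding=self.padding)
```

    Because U and Vh are registered as buffers, an optimizer built from model.parameters() sees only s, which is what restricts the DIP's capacity during fine-tuning.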

    FINJ: A Fault Injection Tool for HPC Systems

    We present FINJ, a high-level fault injection tool for High-Performance Computing (HPC) systems, with a focus on the management of complex experiments. FINJ provides support for custom workloads and allows the generation of anomalous conditions through the use of fault-triggering executable programs. FINJ can also be integrated seamlessly with most other lower-level fault injection tools, allowing users to create and monitor a variety of highly complex and diverse fault conditions in HPC systems that would be difficult to recreate in practice. FINJ is suitable for experiments involving many, potentially interacting nodes, making it a very versatile design and evaluation tool. (Comment: To be presented at the 11th Resilience Workshop in the 2018 Euro-Par conference.)
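
    To illustrate what a fault-triggering executable program can look like (a generic sketch only; this is not FINJ's actual interface or code), the following standalone script saturates all cores for a fixed duration, emulating a CPU-interference anomaly that an injector could launch on schedule:

```python
#!/usr/bin/env python3
"""Generic fault-trigger sketch (illustrative; not part of FINJ):
saturate every core for a fixed duration to emulate CPU interference."""
import multiprocessing as mp
import sys
import time

def burn(stop_at: float) -> None:
    # Busy-loop until the deadline to keep one core fully loaded.
    while time.time() < stop_at:
        pass

if __name__ == "__main__":
    duration = float(sys.argv[1]) if len(sys.argv) > 1 else 30.0
    stop_at = time.time() + duration
    workers = [mp.Process(target=burn, args=(stop_at,))
               for _ in range(mp.cpu_count())]
    for w in workers:
        w.start()
    for w in workers:
        w.join()
```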

    The Scalable Commutativity Rule: Designing Scalable Software for Multicore Processors

    What fundamental opportunities for scalability are latent in interfaces, such as system call APIs? Can scalability opportunities be identified even before any implementation exists, simply by considering interface specifications? To answer these questions, this paper introduces the following rule: whenever interface operations commute, they can be implemented in a way that scales. This rule aids developers in building more scalable software starting from interface design and carrying on through implementation, testing, and evaluation. To help developers apply the rule, a new tool named Commuter accepts high-level interface models and generates tests of operations that commute and hence could scale. Using these tests, Commuter can evaluate the scalability of an implementation. We apply Commuter to 18 POSIX calls and use the results to guide the implementation of a new research operating system kernel called sv6. Linux scales for 68% of the 13,664 tests generated by Commuter for these calls, and Commuter finds many problems that have been observed to limit application scalability. sv6 scales for 99% of the tests.
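
    A minimal sketch of the commutativity check at the heart of the rule (illustrative only; the real Commuter explores symbolic interface models rather than concrete Python state): two operations commute in a given state if both execution orders yield the same return values and the same final state.

```python
import copy

def commute(state, op_a, op_b):
    """True if op_a and op_b commute when started from `state`: both
    orders must produce the same return values and final state."""
    s1 = copy.deepcopy(state)
    res_ab = (op_a(s1), op_b(s1))          # order A, then B
    s2 = copy.deepcopy(state)
    res_b, res_a = op_b(s2), op_a(s2)      # order B, then A
    return s1 == s2 and res_ab == (res_a, res_b)

# Example: POSIX-style "lowest available fd" allocation does not commute
# with itself, because both orders fight over the smallest free number.
def alloc_fd(fs):
    fd = min(set(range(16)) - fs["fds"])
    fs["fds"].add(fd)
    return fd

print(commute({"fds": set()}, alloc_fd, alloc_fd))  # False
```

    Non-commutative pairs like this are exactly where an implementation must communicate between cores, which is why tests of commuting operations flag the cases that could, in principle, scale.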

    Reliable scalable symbolic computation: The design of SymGridPar2

    Symbolic computation is an important area of both Mathematics and Computer Science, with many large computations that would benefit from parallel execution. Symbolic computations are, however, challenging to parallelise as they have complex data and control structures, and both dynamic and highly irregular parallelism. The SymGridPar framework (SGP) has been developed to address these challenges on small-scale parallel architectures. However, the multicore revolution means that the number of cores and the number of failures are growing exponentially, and that the communication topology is becoming increasingly complex. Hence an improved parallel symbolic computation framework is required. This paper presents the design and initial evaluation of SymGridPar2 (SGP2), a successor to SymGridPar that is designed to provide scalability to 10^5 cores and hence must also provide fault tolerance. We present the SGP2 design goals, principles and architecture. We describe how scalability is achieved using layering and by allowing the programmer to control task placement. We outline how fault tolerance is provided by supervising remote computations, and outline higher-level fault tolerance abstractions. We describe the SGP2 implementation status and development plans. We report the scalability and efficiency, including weak scaling to about 32,000 cores, and investigate the overheads of tolerating faults for simple symbolic computations.
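
    The supervision idea can be sketched as follows (a language-agnostic illustration in Python; SGP2 itself is built on Haskell technologies, and `supervise`, `timeout`, and `retries` here are hypothetical names): a supervisor re-dispatches a remote computation when its placement fails or stalls.

```python
from concurrent.futures import ProcessPoolExecutor

def supervise(pool, task, arg, timeout=5.0, retries=3):
    """Re-dispatch a remote computation if it fails or times out.
    Assumes `task` is idempotent, so re-execution is safe; a real
    system would re-dispatch to a different node than the failed one."""
    for _ in range(retries):
        future = pool.submit(task, arg)
        try:
            return future.result(timeout=timeout)
        except Exception:
            # Timeout or worker failure: presume the placement is lost.
            future.cancel()
    raise RuntimeError(f"task failed after {retries} attempts")

def square(x):
    return x * x

if __name__ == "__main__":
    with ProcessPoolExecutor() as pool:
        print(supervise(pool, square, 7))  # 49
```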

    NUMA (Non-Uniform Memory Access): An Overview

    No full text

    An overview of non-uniform memory access

    No full text

    Adaptive integration of hardware and software lock elision techniques

    No full text

    NUMA-Awareness as a Plug-In for an Eventify-Based Fast Multipole Method

    No full text
    Following the trend towards Exascale, today’s supercomputers consist of increasingly complex and heterogeneous compute nodes. To exploit the performance of these systems, research software in HPC needs to keep up with the rapid development of hardware architectures. Since manually tuning software to each and every architecture is neither sustainable nor viable, we aim to tackle this challenge through appropriate software design. In this article, we improve the performance and sustainability of FMSolvr, a parallel Fast Multipole Method for Molecular Dynamics, by adapting it to Non-Uniform Memory Access (NUMA) architectures in a portable and maintainable way. The parallelization of FMSolvr is based on Eventify, an event-based tasking framework we co-developed with FMSolvr. We describe a layered software architecture that separates the Fast Multipole Method from its parallelization. The focus of this article is on the development and analysis of a reusable NUMA module that improves performance while keeping both layers separated to preserve maintainability and extensibility. By means of the NUMA module, we introduce diverse NUMA-aware data distribution, thread pinning, and work stealing policies for FMSolvr. During the performance analysis, the modular design of the NUMA module proved advantageous, since it facilitated the combination, interchange, and redesign of the developed policies. The analysis reveals that these policies reduce the runtime of FMSolvr by 21%, from 1.48 ms to 1.16 ms.
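
    A minimal thread-pinning sketch (illustrative only; FMSolvr and Eventify are not Python, and the article's actual policies are richer): pinning each worker thread to a fixed core means the memory it first touches tends to be allocated on that core's local NUMA node.

```python
"""Thread-pinning sketch on Linux: one worker per core, each pinned."""
import os
import threading

def pinned_worker(core_id, work):
    # Linux-only: restrict the calling thread to a single core, keeping
    # its first-touched memory on that core's local NUMA node.
    os.sched_setaffinity(0, {core_id})
    work()

if __name__ == "__main__":
    cores = sorted(os.sched_getaffinity(0))
    threads = [threading.Thread(target=pinned_worker,
                                args=(c, lambda: sum(range(10**6))))
               for c in cores]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
```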