
    Management of an Academic HPC Cluster: The UL Experience

    The intensive growth of processing power, data storage and transmission capabilities has revolutionized many aspects of science. These resources are essential to achieve high-quality results in many application areas. In this context, the University of Luxembourg (UL) has operated a High Performance Computing (HPC) facility, together with the related storage, since 2007. Bridging computing and storage is a requirement of the UL service, for reasons that are both legal (certain data may not move) and performance related. Nowadays, people from the three faculties and the two interdisciplinary centers of the UL are users of this facility. More specifically, key research priorities such as Systems Biomedicine (by LCSB) and Security, Reliability & Trust (by SnT) require access to such HPC facilities in order to work in an adequate environment. The management of HPC solutions is a complex enterprise and a constant area for discussion and improvement. The UL HPC facility and the services deployed on top of it form a computing system that is complex to manage by its sheer scale: at the time of writing, it consists of 150 servers, 368 nodes (3880 computing cores) and 1996 TB of shared raw storage, all configured, monitored and operated by three persons using advanced IT automation solutions based on Puppet [1], FAI [2] and Capistrano [3]. This paper covers all aspects of the management of such a complex infrastructure, whether technical or administrative. Most design choices and implemented approaches have been motivated by several years of experience in addressing research needs, mainly in the HPC area but also in complementary services (typically Web-based). In this context, we have tried to address many technological issues in a flexible and convenient way. This experience report may be of interest to other research centers, in either the public or the private sector, looking for good, if not best, practices in cluster architecture and management.
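
    As a rough, back-of-the-envelope illustration of the operational scale quoted above (not part of the original report), the figures from the abstract can be combined as follows; the derived ratios and variable names are purely illustrative.

```python
# Scale figures quoted in the abstract above (UL HPC facility).
servers = 150        # infrastructure servers
nodes = 368          # compute nodes
cores = 3880         # computing cores
storage_tb = 1996    # shared raw storage, in TB
admins = 3           # persons operating the whole platform

# Rough derived ratios (illustrative only).
print(f"average cores per compute node : {cores / nodes:.1f}")
print(f"raw storage per compute node   : {storage_tb / nodes:.2f} TB")
print(f"machines per administrator     : {(servers + nodes) / admins:.0f}")
```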

    Aggregating and Consolidating two High Performant Network Topologies: The ULHPC Experience

    High Performance Computing (HPC) encompasses advanced computation over parallel processing. The execution time of a given simulation depends upon many factors, such as the number of CPU/GPU cores, their utilisation factor and, of course, the interconnect performance, efficiency and scalability. In practice, this last component and the associated topology remain the most significant differentiators between HPC systems and less performant ones. Since 2007, the University of Luxembourg has operated a large academic HPC facility, which remains one of the reference implementations within the country and offers a cutting-edge research infrastructure to Luxembourg public research. The main high-bandwidth, low-latency network of the operated facility relies on the dominant interconnect technology in the HPC market, i.e., Infiniband (IB) over a Fat-tree topology. It is complemented by an Ethernet-based network defined for management tasks, external access and interactions with users' applications that do not support Infiniband natively. The recent acquisition of a new cutting-edge supercomputer, Aion, which was federated with the previous flagship cluster Iris, was the occasion to aggregate and consolidate the two types of networks. This article depicts the architecture and the solutions designed to expand and consolidate the existing networks beyond their seminal capacity limits while preserving their bisection bandwidth as far as possible. At the IB level, and despite moving away from a non-blocking configuration, the proposed approach defines a blocking topology that maintains the previous Fat-Tree height. The leaf connection capacity is more than tripled (moving from 216 to 672 end-points) while exhibiting very marginal penalties, i.e., less than 3% (resp. 0.3%) Read (resp. Write) bandwidth degradation against reference parallel I/O benchmarks, and a stable and sustainable point-to-point bandwidth efficiency among all possible pairs of nodes (measured above 95.45% for bi-directional streams). With regard to the Ethernet network, a novel 2-layer topology aimed at improving the availability, maintainability and scalability of the interconnect is described. It was deployed together with consistent network VLANs and subnets enforcing strict security policies via ACLs defined at layer 3, offering isolated and secure network environments. The implemented approaches are applicable to a broad range of HPC infrastructures and may thus help other HPC centres consolidate their own interconnect stacks when designing or expanding their network infrastructures.
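
    As a brief reminder of the trade-off discussed above (a generic sketch, not taken from the article): in a two-level Fat-Tree, if every leaf switch devotes d ports to compute nodes and u ports to uplinks towards the spine, the oversubscription (blocking) factor and the per-node share of bisection bandwidth are commonly written as

```latex
% Generic Fat-Tree oversubscription relation (illustrative symbols d, u).
\[
  B \;=\; \frac{d}{u},
  \qquad
  \text{per-node bisection share} \;=\; \frac{u}{d} \;=\; \frac{1}{B}.
\]
```

    A non-blocking fabric corresponds to B = 1 (d = u); the consolidated IB network described here accepts B > 1 at the leaf level in exchange for roughly tripling the number of attachable end-points (216 to 672) at constant tree height, with the measured penalties quoted in the abstract.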