592 research outputs found

    EPSILOD: efficient parallel skeleton for generic iterative stencil computations in distributed GPUs

    Get PDF
    Producción CientíficaIterative stencil computations are widely used in numerical simulations. They present a high degree of parallelism, high locality and mostly-coalesced memory access patterns. Therefore, GPUs are good candidates to speed up their computa- tion. However, the development of stencil programs that can work with huge grids in distributed systems with multiple GPUs is not straightforward, since it requires solv- ing problems related to the partition of the grid across nodes and devices, and the synchronization and data movement across remote GPUs. In this work, we present EPSILOD, a high-productivity parallel programming skeleton for iterative stencil computations on distributed multi-GPUs, of the same or different vendors that sup- ports any type of n-dimensional geometric stencils of any order. It uses an abstract specification of the stencil pattern (neighbors and weights) to internally derive the data partition, synchronizations and communications. Computation is split to better overlap with communications. This paper describes the underlying architecture of EPSILOD, its main components, and presents an experimental evaluation to show the benefits of our approach, including a comparison with another state-of-the-art solution. The experimental results show that EPSILOD is faster and shows good strong and weak scalability for platforms with both homogeneous and heterogene- ous types of GPUJunta de Castilla y León, Ministerio de Economía, Industria y Competitividad, y Fondo Europeo de Desarrollo Regional (FEDER): Proyecto PCAS (TIN2017-88614-R) y Proyecto PROPHET-2 (VA226P20).Ministerio de Ciencia e Innovación, Agencia Estatal de Investigación y “European Union NextGenerationEU/PRTR” : (MCIN/ AEI/10.13039/501100011033) - grant TED2021-130367B-I00CTE-POWER and Minotauro and the technical support provided by Barcelona Supercomputing Center (RES-IM-2021-2-0005, RES-IM-2021-3-0024, RES- IM-2022-1-0014).Publicación en abierto financiada por el Consorcio de Bibliotecas Universitarias de Castilla y León (BUCLE), con cargo al Programa Operativo 2014ES16RFOP009 FEDER 2014-2020 DE CASTILLA Y LEÓN, Actuación:20007-CL - Apoyo Consorcio BUCL

    Ethnographies of Collaborative Economies across Europe: Understanding Sharing and Caring

    Get PDF
    "Sharing economy" and "collaborative economy" refer to a proliferation of initiatives, business models, digital platforms and forms of work that characterise contemporary life: from community-led initiatives and activist campaigns, to the impact of global sharing platforms in contexts such as network hospitality, transportation, etc. Sharing the common lens of ethnographic methods, this book presents in-depth examinations of collaborative economy phenomena. The book combines qualitative research and ethnographic methodology with a range of different collaborative economy case studies and topics across Europe. It uniquely offers a truly interdisciplinary approach. It emerges from a unique, long-term, multinational, cross-European collaboration between researchers from various disciplines (e.g., sociology, anthropology, geography, business studies, law, computing, information systems), career stages, and epistemological backgrounds, brought together by a shared research interest in the collaborative economy. This book is a further contribution to the in-depth qualitative understanding of the complexities of the collaborative economy phenomenon. These rich accounts contribute to the painting of a complex landscape that spans several countries and regions, and diverse political, cultural, and organisational backdrops. This book also offers important reflections on the role of ethnographic researchers, and on their stance and outlook, that are of paramount interest across the disciplines involved in collaborative economy research

    Towards Exascale Computation for Turbomachinery Flows

    Full text link
    A state-of-the-art large eddy simulation code has been developed to solve compressible flows in turbomachinery. The code has been engineered with a high degree of scalability, enabling it to effectively leverage the many-core architecture of the new Sunway system. A consistent performance of 115.8 DP-PFLOPs has been achieved on a high-pressure turbine cascade consisting of over 1.69 billion mesh elements and 865 billion Degree of Freedoms (DOFs). By leveraging a high-order unstructured solver and its portability to large heterogeneous parallel systems, we have progressed towards solving the grand challenge problem outlined by NASA, which involves a time-dependent simulation of a complete engine, incorporating all the aerodynamic and heat transfer components.Comment: SC23, November, 2023, Denver, CO., US

    Current issues of the management of socio-economic systems in terms of globalization challenges

    Get PDF
    The authors of the scientific monograph have come to the conclusion that the management of socio-economic systems in the terms of global challenges requires the use of mechanisms to ensure security, optimise the use of resource potential, increase competitiveness, and provide state support to economic entities. Basic research focuses on assessment of economic entities in the terms of global challenges, analysis of the financial system, migration flows, logistics and product exports, territorial development. The research results have been implemented in the different decision-making models in the context of global challenges, strategic planning, financial and food security, education management, information technology and innovation. The results of the study can be used in the developing of directions, programmes and strategies for sustainable development of economic entities and regions, increasing the competitiveness of products and services, decision-making at the level of ministries and agencies that regulate the processes of managing socio-economic systems. The results can also be used by students and young scientists in the educational process and conducting scientific research on the management of socio-economic systems in the terms of global challenges

    Inkjet printing digital image generation and compensation for surface chemistry effects

    Get PDF
    Additive manufacturing (AM) of electronic materials using digital inkjet printing (DIJP) is of research interests nowadays because of its potential benefits in the semiconductor industry. Current trends in manufacturing electronics feature DIJP as a key technology to enable the production of customised and microscale functional devices. However, the fabrication of microelectronic components at large scale demands fast printing of tight features with high dimensional accuracy on substrates with varied surface topography which push inkjet printing process to its limits. To understand the DIJP droplet deposition on such substrates, generally requires computational fluid dynamics modelling which is limited in its physics approximation of surface interactions. Otherwise, a kind of “trial and error” approach to determining how the ink spreads, coalesce and solidifies over the substrate is used, often a very time-consuming process. Consequently, this thesis aims to develop new modelling techniques to predict fast and accurately the surface morphology of inkjet-printed features, enabling the optimisation of DIJP control parameters and the compensation of images for better dimensional accuracy of printed electronics devices. This investigation explored three categories of modelling techniques to predict the surface morphology of inkjet-printed features: physics-based, data-driven and hybrid physics-based and data-driven. Two physics-based numerical models were developed to reproduce the inkjet printing droplet deposition and solidification processes using a lattice Boltzmann (LB) multiphase flow model and a finite element (FE) chemo-mechanical model, respectively. The LB model was limited to the simulation of single tracks and small square films and the FE model was mainly employed for the distortion prediction of multilayer objects. Alternatively, two data-driven models were implemented to reconstruct the surface morphology of single tracks and free-form films using images from experiments: image analysis (IA) and shape from shading (SFS). IA assumed volume conservation and minimal energy drop shape to reconstruct the surface while SFS resolved the height of the image using a reflection model. Finally, a hybrid physics-based and data-driven approach was generated which incorporates the uncertainty of droplet landing position and footprint, hydrostatic analytical models, empirical correlations derived from experiments, and relationships derived from physics-based models to predict fast and accurately any free-form layer pattern as a function of physical properties, printing parameters and wetting characteristics. Depending on the selection of the modelling technique to predict the deformed geometry, further considerations were required. For the purely physics-based and data-driven models, a surrogate model using response surface equations was employed to create a transfer function between printing parameters, substrate wetting characteristics and the resulting surface morphology. The development of a transfer function significantly decreased the computational time required by purely physics-based models and enabled the parameter optimisation using a multi-objective genetic algorithm approach to attain the best film dimensional accuracy. Additionally, for multilayer printing applications, a layer compensation approach was achieved utilizing a convolutional neural network trained by the predicted (deformed) geometry to reduce the out of plane error to target shape. The optimal combination of printing parameters and input image compensation helped with the generation of fine features that are traditionally difficult for inkjet, improved resolution of edges and corners by reducing the amount of overflow from material, accounted for varied topography and capillary effects thereof on the substrate surface and considered the effect of multiple layers built up on each other. This study revealed for the first time to the best of our knowledge the role of the droplet location and footprint diameter uncertainty in the stability and uniformity of printed features. Using a droplet overlap map which was proposed as a universal technique to assess the effect of printing parameters on pattern geometry, it was shown that reliable limits for break-up and bulging of printed features were obtained. Considering droplet position and diameter size uncertainties, predicted optimal printing parameters improved the quality of printed films on substrates with different wettability. Finally, a stability diagram illustrating the onset of bulging and separation for lines and films as well as the optimal drop spacing, printing frequency and stand-off distance was generated to inform visually the results. This investigation has developed a predictive physics-based model of the surface morphology of DIJP features on heterogeneous substrates and a methodology to find the printing parameters and compensate the layer geometry required for optimum part dimensional accuracy. The simplicity of the proposed technique makes it a promising tool for model driven inkjet printing process optimization, including real time process control and paves the way for better quality devices in the printed electronics industry

    SoC-based FPGA architecture for image analysis and other highly demanding applications

    Get PDF
    Al giorno d'oggi, lo sviluppo di algoritmi si concentra su calcoli efficienti in termini di prestazioni ed efficienza energetica. Tecnologie come il field programmable gate array (FPGA) e il system on chip (SoC) basato su FPGA (FPGA/SoC) hanno dimostrato la loro capacità di accelerare applicazioni di calcolo intensive risparmiando al contempo il consumo energetico, grazie alla loro capacità di elevato parallelismo e riconfigurazione dell'architettura. Attualmente, i cicli di progettazione esistenti per FPGA/SoC sono lunghi, a causa della complessità dell'architettura. Pertanto, per colmare il divario tra le applicazioni e le architetture FPGA/SoC e ottenere un design hardware efficiente per l'analisi delle immagini e altri applicazioni altamente demandanti utilizzando lo strumento di sintesi di alto livello, vengono prese in considerazione due strategie complementari: tecniche ad hoc e stima delle prestazioni. Per quanto riguarda le tecniche ad-hoc, tre applicazioni molto impegnative sono state accelerate attraverso gli strumenti HLS: discriminatore di forme di impulso per i raggi cosmici, classificazione automatica degli insetti e re-ranking per il recupero delle informazioni, sottolineando i vantaggi quando questo tipo di applicazioni viene attraversato da tecniche di compressione durante il targeting dispositivi FPGA/SoC. Inoltre, in questa tesi viene proposto uno stimatore delle prestazioni per l'accelerazione hardware per prevedere efficacemente l'utilizzo delle risorse e la latenza per FPGA/SoC, costruendo un ponte tra l'applicazione e i domini architetturali. Lo strumento integra modelli analitici per la previsione delle prestazioni e un motore design space explorer (DSE) per fornire approfondimenti di alto livello agli sviluppatori di hardware, composto da due motori indipendenti: DSE basato sull'ottimizzazione a singolo obiettivo e DSE basato sull'ottimizzazione evolutiva multiobiettivo.Nowadays, the development of algorithms focuses on performance-efficient and energy-efficient computations. Technologies such as field programmable gate array (FPGA) and system on chip (SoC) based on FPGA (FPGA/SoC) have shown their ability to accelerate intensive computing applications while saving power consumption, owing to their capability of high parallelism and reconfiguration of the architecture. Currently, the existing design cycles for FPGA/SoC are time-consuming, owing to the complexity of the architecture. Therefore, to address the gap between applications and FPGA/SoC architectures and to obtain an efficient hardware design for image analysis and highly demanding applications using the high-level synthesis tool, two complementary strategies are considered: ad-hoc techniques and performance estimator. Regarding ad-hoc techniques, three highly demanding applications were accelerated through HLS tools: pulse shape discriminator for cosmic rays, automatic pest classification, and re-ranking for information retrieval, emphasizing the benefits when this type of applications are traversed by compression techniques when targeting FPGA/SoC devices. Furthermore, a comprehensive performance estimator for hardware acceleration is proposed in this thesis to effectively predict the resource utilization and latency for FPGA/SoC, building a bridge between the application and architectural domains. The tool integrates analytical models for performance prediction, and a design space explorer (DSE) engine for providing high-level insights to hardware developers, composed of two independent sub-engines: DSE based on single-objective optimization and DSE based on evolutionary multi-objective optimization

    Undergraduate and Graduate Course Descriptions, 2023 Spring

    Get PDF
    Wright State University undergraduate and graduate course descriptions from Spring 2023

    Co-designing Collaborative Care Work through Ethnography

    Get PDF

    Low Power Memory/Memristor Devices and Systems

    Get PDF
    This reprint focusses on achieving low-power computation using memristive devices. The topic was designed as a convenient reference point: it contains a mix of techniques starting from the fundamental manufacturing of memristive devices all the way to applications such as physically unclonable functions, and also covers perspectives on, e.g., in-memory computing, which is inextricably linked with emerging memory devices such as memristors. Finally, the reprint contains a few articles representing how other communities (from typical CMOS design to photonics) are fighting on their own fronts in the quest towards low-power computation, as a comparison with the memristor literature. We hope that readers will enjoy discovering the articles within

    Sustainable Value Co-Creation in Welfare Service Ecosystems : Transforming temporary collaboration projects into permanent resource integration

    Get PDF
    The aim of this paper is to discuss the unexploited forces of user-orientation and shared responsibility to promote sustainable value co-creation during service innovation projects in welfare service ecosystems. The framework is based on the theoretical field of public service logic (PSL) and our thesis is that service innovation seriously requires a user-oriented approach, and that such an approach enables resource integration based on the service-user’s needs and lifeworld. In our findings, we identify prerequisites and opportunities of collaborative service innovation projects in order to transform these projects into sustainable resource integration once they have ended
    corecore