Search CORE

444 research outputs found

Prefetching techniques for client server object-oriented database systems

Author: Knafla Nils
Publication venue: The University of Edinburgh
Publication date: 01/01/1999
Field of study

The performance of many object-oriented database applications suffers from the page fetch latency which is determined by the expense of disk access. In this work we suggest several prefetching techniques to avoid, or at least to reduce, page fetch latency. In practice no prediction technique is perfect and no prefetching technique can entirely eliminate delay due to page fetch latency. Therefore we are interested in the trade-off between the level of accuracy required for obtaining good results in terms of elapsed time reduction and the processing overhead needed to achieve this level of accuracy. If prefetching accuracy is high then the total elapsed time of an application can be reduced significantly otherwise if the prefetching accuracy is low, many incorrect pages are prefetched and the extra load on the client, network, server and disks decreases the whole system performance. Access pattern of object-oriented databases are often complex and usually hard to predict accurately. The ..

CiteSeerX

Edinburgh Research Archive

Doctor of Philosophy

Author: Nellans David
Publication venue: University of Utah
Publication date: 01/12/2014
Field of study

dissertationWith the explosion of chip transistor counts, the semiconductor industry has struggled with ways to continue scaling computing performance in line with historical trends. In recent years, the de facto solution to utilize excess transistors has been to increase the size of the on-chip data cache, allowing fast access to an increased portion of main memory. These large caches allowed the continued scaling of single thread performance, which had not yet reached the limit of instruction level parallelism (ILP). As we approach the potential limits of parallelism within a single threaded application, new approaches such as chip multiprocessors (CMP) have become popular for scaling performance utilizing thread level parallelism (TLP). This dissertation identifies the operating system as a ubiquitous area where single threaded performance and multithreaded performance have often been ignored by computer architects. We propose that novel hardware and OS co-design has the potential to significantly improve current chip multiprocessor designs, enabling increased performance and improved power efficiency. We show that the operating system contributes a nontrivial overhead to even the most computationally intense workloads and that this OS contribution grows to a significant fraction of total instructions when executing several common applications found in the datacenter. We demonstrate that architectural improvements have had little to no effect on the performance of the OS over the last 15 years, leaving ample room for improvements. We specifically consider three potential solutions to improve OS execution on modern processors. First, we consider the potential of a separate operating system processor (OSP) operating concurrently with general purpose processors (GPP) in a chip multiprocessor organization, with several specialized structures acting as efficient conduits between these processors. Second, we consider the potential of segregating existing caching structures to decrease cache interference between the OS and application. Third, we propose that there are components within the OS itself that should be refactored to be both multithreaded and cache topology aware, which in turn, improves the performance and scalability of many-threaded applications

The University of Utah: J. Willard Marriott Digital Library

Hardware-Oriented Cache Management for Large-Scale Chip Multiprocessors

Author: Hammoud Mohammad Hussein
Publication venue
Publication date: 30/09/2010
Field of study

One of the key requirements to obtaining high performance from chip multiprocessors (CMPs) is to effectively manage the limited on-chip cache resources shared among co-scheduled threads/processes. This thesis proposes new hardware-oriented solutions for distributed CMP caches. Computer architects are faced with growing challenges when designing cache systems for CMPs. These challenges result from non-uniform access latencies, interference misses, the bandwidth wall problem, and diverse workload characteristics. Our exploration of the CMP cache management problem suggests a CMP caching framework (CC-FR) that defines three main approaches to solve the problem: (1) data placement, (2) data retention, and (3) data relocation. We effectively implement CC-FR's components by proposing and evaluating multiple cache management mechanisms.Pressure and Distance Aware Placement (PDA) decouples the physical locations of cache blocks from their addresses for the sake of reducing misses caused by destructive interferences. Flexible Set Balancing (FSB), on the other hand, reduces interference misses via extending the life time of cache lines through retaining some fraction of the working set at underutilized local sets to satisfy far-flung reuses. PDA implements CC-FR's data placement and relocation components and FSB applies CC-FR's retention approach.To alleviate non-uniform access latencies and adapt to phase changes in programs, Adaptive Controlled Migration (ACM) dynamically and periodically promotes cache blocks towards L2 banks close to requesting cores. ACM lies under CC-FR's data relocation category. Dynamic Cache Clustering (DCC), on the other hand, addresses diverse workload characteristics and growing non-uniform access latencies challenges via constructing a cache cluster for each core and expands/contracts all clusters synergistically to match each core's cache demand. DCC implements CC-FR's data placement and relocation approaches. Lastly, Dynamic Pressure and Distance Aware Placement (DPDA) combines PDA and ACM to cooperatively mitigate interference misses and non-uniform access latencies. Dynamic Cache Clustering and Balancing (DCCB), on the other hand, combines DCC and FSB to employ all CC-FR's categories and achieve higher system performance. Simulation results demonstrate the effectiveness of the proposed mechanisms and show that they compare favorably with related cache designs

D-Scholarship@Pitt

KOLAM : human computer interfaces fro visual analytics in big data imagery

Author: Haridas Anoop
Publication venue: 'University of Missouri Libraries'
Publication date
Field of study

In the present day, we are faced with a deluge of disparate and dynamic information from multiple heterogeneous sources. Among these are the big data imagery datasets that are rapidly being generated via mature acquisition methods in the geospatial, surveillance (specifically, Wide Area Motion Imagery or WAMI) and biomedical domains. The need to interactively visualize these imagery datasets by using multiple types of views (as needed) into the data is common to these domains. Furthermore, researchers in each domain have additional needs: users of WAMI datasets also need to interactively track objects of interest using algorithms of their choice, visualize the resulting object trajectories and interactively edit these results as needed. While software tools that fulfill each of these requirements individually are available and well-used at present, there is still a need for tools that can combine the desired aspects of visualization, human computer interaction (HCI), data analysis, data management, and (geo-)spatial and temporal data processing into a single flexible and extensible system. KOLAM is an open, cross-platform, interoperable, scalable and extensible framework for visualization and analysis that we have developed to fulfil the above needs. The novel contributions in this thesis are the following: 1) Spatio-temporal caching for animating both giga-pixel and Full Motion Video (FMV) imagery, 2) Human computer interfaces purposefully designed to accommodate big data visualization, 3) Human-in-the-loop interactive video object tracking - ground-truthing of moving objects in wide area imagery using algorithm assisted human-in-the-loop coupled tracking, 4) Coordinated visualization using stacked layers, side-by-side layers/video sub-windows and embedded imagery, 5) Efficient one-click manual tracking, editing and data management of trajectories, 6) Efficient labeling of image segmentation regions and passing these results to desired modules, 7) Visualization of image processing results generated by non-interactive operators using layers, 8) Extension of interactive imagery and trajectory visualization to multi-monitor wall display environments, 9) Geospatial applications: Providing rapid roam, zoom and hyper-jump spatial operations, interactive blending, colormap and histogram enhancement, spherical projection and terrain maps, 10) Biomedical applications: Visualization and target tracking of cell motility in time-lapse cell imagery, collecting ground-truth from experts on whole-slide imagery (WSI) for developing histopathology analytic algorithms and computer-aided diagnosis for cancer grading, and easy-to-use tissue annotation features.Includes bibliographical reference

University of Missouri: MOspace

Design and implementation of a tool for memory access pattern visualization

Author: Ντόγκας Χρήστος Κ.
Publication venue
Publication date: 01/01/2017
Field of study

University of Thessaly Institutional Repository

A Roadmap for HEP Software and Computing R&D for the 2020s

Author: Albrecht Johannes
Alves Antonio Augusto, Jr
Amadio Guilherme
Andronico Giuseppe
Anh-Ky Nguyen
Aphecetche Laurent
Apostolakis John
Asai Makoto
Atzori Luca
Babik Marian
Bagliesi Giuseppe
Bandieramonte Marilena
Banerjee Sunanda
Barisits Martin
Bauerdick Lothar A. T.
Belforte Stefano
Benjamin Douglas
Bernius Catrin
Bhimji Wahid
Bianchi Riccardo Maria
Bird Ian
Biscarat Catherine
Blomer Jakob
Bloom Kenneth
Boccali Tommaso
Bockelman Brian
Bold Tomasz
Bonacorsi Daniele
Boveia Antonio
Bozzi Concezio
Bracko Marko
Britton David
Buckley Andy
Buncic Predrag
Calafiura Paolo
Campana Simone
Canal Philippe
Canali Luca
Carlino Gianpaolo
Castro Nuno
Cattaneo Marco
Cerminara Gianluca
Cervantes Villanueva Javier
Chang Philip
Chapman John
Chen Gang
Childers Taylor
Clarke Peter
Clemencic Marco
Cogneras Eric
Coles Jeremy
Collier Ian
Colling David
Corti Gloria
Cosmo Gabriele
Costanzo Davide
Couturier Ben
Cranmer Kyle
Cranshaw Jack
Cristella Leonardo
Crooks David
Crépé-Renaudin Sabine
Currie Robert
Dallmeier-Tiessen Sünje
De Cian Michel
De Roeck Albert
De Kaushik
Delgado Peris Antonio
Derue Frédéric
Di Girolamo Alessandro
Di Guida Salvatore
Dimitrov Gancho
Doglioni Caterina
Dotti Andrea
Duellmann Dirk
Duflot Laurent
Dykstra Dave
Dziedziniewicz-Wojcik Katarzyna
Dziurda Agnieszka
Egede Ulrik
Elmer Peter
Elmsheuser Johannes
Elvira V. Daniel
Eulisse Giulio
Farrell Steven
Ferber Torben
Filipcic Andrej
Fisk Ian
Fitzpatrick Conor
Flix José
Formica Andrea
Forti Alessandra
Foundation HEP Software
Franzoni Giovanni
Frost James
Fuess Stu
Gaede Frank
Ganis Gerardo
Gardner Robert
Garonne Vincent
Gellrich Andreas
Genser Krzysztof
George Simon
Geurts Frank
Gheata Andrei
Gheata Mihaela
Giacomini Francesco
Giagu Stefano
Giffels Manuel
Gingrich Douglas
Girone Maria
Gligorov Vladimir V.
Glushkov Ivan
Gohn Wesley
Gonzalez Lopez Jose Benito
González Caballero Isidro
González Fernández Juan R.
Govi Giacomo
Grandi Claudio
Grasland Hadrien
Gray Heather
Grillo Lucia
Guan Wen
Gutsche Oliver
Gyurjyan Vardan
Hanushevsky Andrew
Hariri Farah
Hartmann Thomas
Harvey John
Hauth Thomas
Hegner Benedikt
Heinemann Beate
Heinrich Lukas
Heiss Andreas
Hernández José M.
Hildreth Michael
Hodgkinson Mark
Hoeche Stefan
Holzman Burt
Hristov Peter
Huang Xingtao
Ivanchenko Vladimir N.
Ivanov Todor
Iven Jan
Jashal Brij
Jayatilaka Bodhitha
Jones Roger
Jouvin Michel
Jun Soon Yung
Kagan Michael
Kalderon Charles William
Kane Meghan
Karavakis Edward
Katz Daniel S.
Kcira Dorian
Keeble Oliver
Kersevan Borut Paul
Kirby Michael
Klimentov Alexei
Klute Markus
Komarov Ilya
Konstantinov Dmitri
Koppenburg Patrick
Kowalkowski Jim
Kreczko Luke
Kuhr Thomas
Kutschke Robert
Kuznetsov Valentin
Lampl Walter
Lancon Eric
Lange David
Lassnig Mario
Laycock Paul
Leggett Charles
Letts James
Lewendel Birgit
Li Teng
Lima Guilherme
Linacre Jacob
Linden Tomas
Livny Miron
Lo Presti Giuseppe
Lopienski Sebastian
Love Peter
Lyon Adam
Magini Nicolò
Marshall Zachary L
Martelli Edoardo
Martin-Haugh Stewart
Mato Pere
Mazumdar Kajari
McCauley Thomas
McFayden Josh
McKee Shawn
McNab Andrew
Mehdiyev Rashid
Meinhard Helge
Menasce Dario
Mendez Lorenzo Patricia
Mete Alaettin Serhan
Michelotto Michele
Mitrevski Jovan
Moneta Lorenzo
Morgan Ben
Mount Richard
Moyse Edward
Murray Sean
Nairz Armin
Neubauer Mark S
Norman Andrew
Novaes Sérgio
Novak Mihaly
Oyanguren Arantza
Ozturk Nurcan
Pacheco Pages Andres
Paganini Michela
Pansanel Jerome
Pascuzzi Vincent R.
Patrick Glenn
Pearce Alex
Pearson Ben
Pedro Kevin
Perdue Gabriel
Perez-Calero Yzquierdo Antonio
Perrozzi Luca
Petersen Troels
Petric Marko
Petzold Andreas
Piedra Jónatan
Piilonen Leo
Piparo Danilo
Pivarski Jim
Pokorski Witold
Polci Francesco
Potamianos Karolos
Psihas Fernanda
Puig Navarro Albert
Quast Günter
Raven Gerhard
Reuter Jürgen
Ribon Alberto
Rinaldi Lorenzo
Ritter Martin
Robinson James
Rodrigues Eduardo
Roiser Stefan
Rousseau David
Roy Gareth
Rybkine Grigori
Sailer Andre
Sakuma Tai
Santana Renato
Sartirana Andrea
Schellman Heidi
Schovancová Jaroslava
Schramm Steven
Schulz Markus
Sciabà Andrea
Seidel Sally
Sekmen Sezen
Serfon Cedric
Severini Horst
Sexton-Kennedy Elizabeth
Seymour Michael
Sgalaberna Davide
Shapoval Illya
Shiers Jamie
Shiu Jing-Ge
Short Hannah
Siroli Gian Piero
Skipsey Sam
Smith Tim
Snyder Scott
Sokoloff Michael D
Spentzouris Panagiotis
Stadie Hartmut
Stark Giordon
Stewart Gordon
Stewart Graeme
Sánchez Arturo
Sánchez-Hernández Alberto
Taffard Anyes
Tamponi Umberto
Templon Jeff
Tenaglia Giacomo
Tsulaia Vakhtang
Tunnell Christopher
Vaandering Eric
Valassi Andrea
Vallecorsa Sofia
Valsan Liviu
Van Gemmeren Peter
Vernet Renaud
Viren Brett
Vlimant Jean-Roch
Voss Christian
Votava Margaret
Vuosalo Carl
Vázquez Sierra Carlos
Wartel Romain
Watts Gordon T.
Wenaus Torre
Wenzel Sandro
Williams Mike
Winklmeier Frank
Wissing Christoph
Wuerthwein Frank
Wynne Benjamin
Xiaomei Zhang
Yang Wei
Yazgan Efe
Publication venue
Publication date: 18/12/2017
Field of study

Particle physics has an ambitious and broad experimental programme for the coming decades. This programme requires large investments in detector hardware, either to build new facilities and experiments, or to upgrade existing ones. Similarly, it requires commensurate investment in the R&D of software to acquire, manage, process, and analyse the shear amounts of data to be recorded. In planning for the HL-LHC in particular, it is critical that all of the collaborating stakeholders agree on the software goals and priorities, and that the efforts complement each other. In this spirit, this white paper describes the R&D activities required to prepare for this software upgrade.Peer reviewe

Universidade do Minho: RepositoriUM

Hal - Université Grenoble Alpes

HAL Clermont Université

Helsingin yliopiston digitaalinen arkisto

HAL-CEA

Hal-Diderot

arXiv.org e-Print Archive

HAL-IN2P3

DSpace@MIT

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

Archivio della ricerca- Università di Roma La Sapienza

CERN Document Server

HAL-Polytechnique

Explore Bristol Research

ANALYTICAL MODEL FOR CHIP MULTIPROCESSOR MEMORY HIERARCHY DESIGN AND MAMAGEMENT

Author: Oh Tae Cheol
Publication venue
Publication date: 30/01/2011
Field of study

Continued advances in circuit integration technology has ushered in the era of chip multiprocessor (CMP) architectures as further scaling of the performance of conventional wide-issue superscalar processor architectures remains hard and costly. CMP architectures take advantageof Moore¡¯s Law by integrating more cores in a given chip area rather than a single fastyet larger core. They achieve higher performance with multithreaded workloads. However,CMP architectures pose many new memory hierarchy design and management problems thatmust be addressed. For example, how many cores and how much cache capacity must weintegrate in a single chip to obtain the best throughput possible? Which is more effective,allocating more cache capacity or memory bandwidth to a program?This thesis research develops simple yet powerful analytical models to study two newmemory hierarchy design and resource management problems for CMPs. First, we considerthe chip area allocation problem to maximize the chip throughput. Our model focuses onthe trade-off between the number of cores, cache capacity, and cache management strategies.We find that different cache management schemes demand different area allocation to coresand cache to achieve their maximum performance. Second, we analyze the effect of cachecapacity partitioning on the bandwidth requirement of a given program. Furthermore, ourmodel considers how bandwidth allocation to different co-scheduled programs will affect theindividual programs¡¯ performance. Since the CMP design space is large and simulating only one design point of the designspace under various workloads would be extremely time-consuming, the conventionalsimulation-based research approach quickly becomes ineffective. We anticipate that ouranalytical models will provide practical tools to CMP designers and correctly guide theirdesign efforts at an early design stage. Furthermore, our models will allow them to betterunderstand potentially complex interactions among key design parameters

D-Scholarship@Pitt