89 research outputs found

    Extending the reach of microprocessors : column and curious caching

    Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1999. Includes bibliographical references (p. 162-167). By Derek T. Chiou. Ph.D.

    A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services

    Datacenter workloads demand high computational capabilities, flexibility, power efficiency, and low cost. It is challenging to improve all of these factors simultaneously. To advance datacenter capabilities beyond what commodity server designs can provide, we designed and built a composable, reconfigurable hardware fabric based on field programmable gate arrays (FPGAs). Each server in the fabric contains one FPGA, and all FPGAs within a 48-server rack are interconnected over a low-latency, high-bandwidth network. We describe a medium-scale deployment of this fabric on a bed of 1632 servers, and measure its effectiveness in accelerating the ranking component of the Bing web search engine. We describe the requirements and architecture of the system, detail the critical engineering challenges and solutions needed to make the system robust in the presence of failures, and measure the performance, power, and resilience of the system. Under high load, the large-scale reconfigurable fabric improves the ranking throughput of each server by 95% at a desirable latency distribution or reduces tail latency by 29% at a fixed throughput. In other words, the reconfigurable fabric enables the same throughput using only half the number of servers.

    Phylogeographical analysis of the dominant multidrug-resistant H58 clade of Salmonella Typhi identifies inter- and intracontinental transmission events.

    The emergence of multidrug-resistant (MDR) typhoid is a major global health threat affecting many countries where the disease is endemic. Here whole-genome sequence analysis of 1,832 Salmonella enterica serovar Typhi (S. Typhi) identifies a single dominant MDR lineage, H58, that has emerged and spread throughout Asia and Africa over the last 30 years. Our analysis identifies numerous transmissions of H58, including multiple transfers from Asia to Africa and an ongoing, unrecognized MDR epidemic within Africa itself. Notably, our analysis indicates that H58 lineages are displacing antibiotic-sensitive isolates, transforming the global population structure of this pathogen. H58 isolates can harbor a complex MDR element residing either on transmissible IncHI1 plasmids or within multiple chromosomal integration sites. We also identify new mutations that define the H58 lineage. This phylogeographical analysis provides a framework to facilitate global management of MDR typhoid and is applicable to similar MDR lineages emerging in other bacterial species.

    Atrial fibrillation genetic risk differentiates cardioembolic stroke from other stroke subtypes

    Objective: We sought to assess whether genetic risk factors for atrial fibrillation (AF) can explain cardioembolic stroke risk. Methods: We evaluated genetic correlations between a prior genetic study of AF and AF in the presence of cardioembolic stroke using genome-wide genotypes from the Stroke Genetics Network (N = 3,190 AF cases, 3,000 cardioembolic stroke cases, and 28,026 referents). We tested whether a previously validated AF polygenic risk score (PRS) was associated with cardioembolic and other stroke subtypes after accounting for AF clinical risk factors. Results: We observed strong correlation between previously reported genetic risk for AF, AF in the presence of stroke, and cardioembolic stroke (Pearson's r = 0.77 and 0.76, respectively, across SNPs with p < 4.4 × 10⁻⁴ in the prior AF meta-analysis). An AF PRS, adjusted for clinical AF risk factors, was associated with cardioembolic stroke (odds ratio (OR) per standard deviation (sd) = 1.40, p = 1.45 × 10⁻⁴⁸), explaining ~20% of the heritable component of cardioembolic stroke risk. The AF PRS was also associated with stroke of undetermined cause (OR per sd = 1.07, p = 0.004), but no other primary stroke subtypes (all p > 0.1). Conclusions: Genetic risk for AF is associated with cardioembolic stroke, independent of clinical risk factors. Studies are warranted to determine whether AF genetic risk can serve as a biomarker for strokes caused by AF.

    Department of Energy DE-FG02-05ER25686 Early Career Principal Investigator Award Final Report

    This project is researching FAST, a methodology to build very fast, cycle-accurate full-system computer simulators, and is building the first set of such simulators and the first set of tools to help construct them. The methodology relies on a functional model, which is a fast, full-system but not cycle-accurate simulator, coupled with a timing model that models the micro-architectural structure and arbitration of a computer system, but not its functionality. FAST simulators differ from other simulators partitioned in the same way in that the interface between the functional and timing models is optimized to minimize the need for round-trip communication. The optimized communication enables FAST timing models to be implemented in an FPGA and the functional model in software while still achieving extremely high performance. Our general strategy is to design a methodology and then prototype an example using that methodology to ensure the methodology is sound before committing to it. This strategy ensures that the selected methodology has been tested and provides an early example of the output of that methodology.

    Using GCC as an Efficient, Portable Back-End

    In order to generate efficient object code for modern microprocessors, aggressive compilers customize their code generators (the back-end of the compiler) for each target processor. Customized back-ends, however, are major undertakings that most experimental compiler writers cannot afford. We describe a method to easily generate efficient object code for virtually any UNIX platform. The method relies on GNU's GCC compiler and its extensions to the C language. These techniques have been applied to pHluid[1], a pH/Id compiler. The techniques, their application to pHluid and some performance results are reported in this paper. 1 Introduction. Making the best use of modern microprocessors requires very sophisticated instruction scheduling due to their multiple instruction issuing capabilities and longer pipelines. To generate the best possible code, aggressive compilers customize back-ends for each supported architecture. Such back-end customization requires substantial time as well as s..

    Issues in Building a Cache-Coherent Distributed Shared Memory Machine using Commercial SMPs

    StarT-ng is a parallel machine that supports both fine-grained user-level message passing and cache-coherent distributed shared memory (CCDSM). A site in StarT-ng is an enhanced commercial PowerPC 620-based symmetric multiprocessor (SMP) connected to an MPP-class network. With the aid of a simple address capture device, a designated processor at each site executes cache-coherence protocols. Given the variety of memory models and ever-growing complexity of coherence protocols, StarT-ng should provide a relatively cheap, low-risk and flexible system to explore shared memory issues. This paper describes several difficulties we have encountered in implementing CCDSM using commercial SMPs. It offers some insights into the causes of these difficulties and possible remedies. 1 Introduction. This paper describes the challenges we have faced in designing a parallel machine based on commercial SMPs. We, like most other researchers[7], believe that to support a general programming model, a machin..