31 research outputs found

    Improving low latency applications for reconfigurable devices

    Get PDF
    This thesis seeks to improve low latency application performance via architectural improvements in reconfigurable devices. This is achieved by improving resource utilisation and access, and by exploiting the different environments within which reconfigurable devices are deployed. Our first contribution leverages devices deployed at the network level to enable the low latency processing of financial market data feeds. Financial exchanges transmit messages via two identical data feeds to reduce the chance of message loss. We present an approach to arbitrate these redundant feeds at the network level using a Field-Programmable Gate Array (FPGA). With support for any messaging protocol, we evaluate our design using the NASDAQ TotalView-ITCH, OPRA, and ARCA data feed protocols, and provide two simultaneous outputs: one prioritising low latency, and one prioritising high reliability with three dynamically configurable windowing methods. Our second contribution is a new ring-based architecture for low latency, parallel access to FPGA memory. Traditional FPGA memory is formed by grouping block memories (BRAMs) together and accessing them as a single device. Our architecture accesses these BRAMs independently and in parallel. Targeting memory-based computing, which stores pre-computed function results in memory, we benefit low latency applications that rely on: highly-complex functions; iterative computation; or many parallel accesses to a shared resource. We assess square root, power, trigonometric, and hyperbolic functions within the FPGA, and provide a tool to convert Python functions to our new architecture. Our third contribution extends the ring-based architecture to support any FPGA processing element. We unify E heterogeneous processing elements within compute pools, with each element implementing the same function, and the pool serving D parallel function calls. Our implementation-agnostic approach supports processing elements with different latencies, implementations, and pipeline lengths, as well as non-deterministic latencies. Compute pools evenly balance access to processing elements across the entire application, and are evaluated by implementing eight different neural network activation functions within an FPGA.Open Acces

    A Modular Approach to Adaptive Reactive Streaming Systems

    Get PDF
    The latest generations of FPGA devices offer large resource counts that provide the headroom to implement large-scale and complex systems. However, there are increasing challenges for the designer, not just because of pure size and complexity, but also in harnessing effectively the flexibility and programmability of the FPGA. A central issue is the need to integrate modules from diverse sources to promote modular design and reuse. Further, the capability to perform dynamic partial reconfiguration (DPR) of FPGA devices means that implemented systems can be made reconfigurable, allowing components to be changed during operation. However, use of DPR typically requires low-level planning of the system implementation, adding to the design challenge. This dissertation presents ReShape: a high-level approach for designing systems by interconnecting modules, which gives a ‘plug and play’ look and feel to the designer, is supported by tools that carry out implementation and verification functions, and is carried through to support system reconfiguration during operation. The emphasis is on the inter-module connections and abstracting the communication patterns that are typical between modules – for example, the streaming of data that is common in many FPGA-based systems, or the reading and writing of data to and from memory modules. ShapeUp is also presented as the static precursor to ReShape. In both, the details of wiring and signaling are hidden from view, via metadata associated with individual modules. ReShape allows system reconfiguration at the module level, by supporting type checking of replacement modules and by managing the overall system implementation, via metadata associated with its FPGA floorplan. The methodology and tools have been implemented in a prototype for a broad domain-specific setting – networking systems – and have been validated on real telecommunications design projects

    Evolvable Smartphone-Based Point-of-Care Systems For In-Vitro Diagnostics

    Get PDF
    Recent developments in the life-science -omics disciplines, together with advances in micro and nanoscale technologies offer unprecedented opportunities to tackle some of the major healthcare challenges of our time. Lab-on-Chip technologies coupled with smart-devices in particular, constitute key enablers for the decentralization of many in-vitro medical diagnostics applications to the point-of-care, supporting the advent of a preventive and personalized medicine. Although the technical feasibility and the potential of Lab-on-Chip/smart-device systems is repeatedly demonstrated, direct-to-consumer applications remain scarce. This thesis addresses this limitation. System evolvability is a key enabler to the adoption and long-lasting success of next generation point-of-care systems by favoring the integration of new technologies, streamlining the reengineering efforts for system upgrades and limiting the risk of premature system obsolescence. Among possible implementation strategies, platform-based design stands as a particularly suitable entry point. One necessary condition, is for change-absorbing and change-enabling mechanisms to be incorporated in the platform architecture at initial design-time. Important considerations arise as to where in Lab-on-Chip/smart-device platforms can these mechanisms be integrated, and how to implement them. Our investigation revolves around the silicon-nanowire biological field effect transistor, a promising biosensing technology for the detection of biological analytes at ultra low concentrations. We discuss extensively the sensitivity and instrumentation requirements set by the technology before we present the design and implementation of an evolvable smartphone-based platform capable of interfacing lab-on-chips embedding such sensors. We elaborate on the implementation of various architectural patterns throughout the platform and present how these facilitated the evolution of the system towards one accommodating for electrochemical sensing. Model-based development was undertaken throughout the engineering process. A formal SysML system model fed our evolvability assessment process. We introduce, in particular, a model-based methodology enabling the evaluation of modular scalability: the ability of a system to scale the current value of one of its specification by successively reengineering targeted system modules. The research work presented in this thesis provides a roadmap for the development of evolvable point-of-care systems, including those targeting direct-to-consumer applications. It extends from the early identification of anticipated change, to the assessment of the ability of a system to accommodate for these changes. Our research should thus interest industrials eager not only to disrupt, but also to last in a shifting socio-technical paradigm

    Reservoir Computing in Materio

    Get PDF
    Reservoir Computing first emerged as an efficient mechanism for training recurrent neural networks and later evolved into a general theoretical model for dynamical systems. By applying only a simple training mechanism many physical systems have become exploitable unconventional computers. However, at present, many of these systems require careful selection and tuning by hand to produce usable or optimal reservoir computers. In this thesis we show the first steps to applying the reservoir model as a simple computational layer to extract exploitable information from complex material substrates. We argue that many physical substrates, even systems that in their natural state might not form usable or "good" reservoirs, can be configured into working reservoirs given some stimulation. To achieve this we apply techniques from evolution in materio whereby configuration is through evolved input-output signal mappings and targeted stimuli. In preliminary experiments the combined model and configuration method is applied to carbon nanotube/polymer composites. The results show substrates can be configured and trained as reservoir computers of varying quality. It is shown that applying the reservoir model adds greater functionality and programmability to physical substrates, without sacrificing performance. Next, the weaknesses of the technique are addressed, with the creation of new high input-output hardware system and an alternative multi-substrate framework. Lastly, a substantial effort is put into characterising the quality of a substrate for reservoir computing, i.e its ability to realise many reservoirs. From this, a methodological framework is devised. Using the framework, radically different computing substrates are compared and assessed, something previously not possible. As a result, a new understanding of the relationships between substrate, tasks and properties is possible, outlining the way for future exploration and optimisation of new computing substrates

    Mu2e Technical Design Report

    Full text link
    The Mu2e experiment at Fermilab will search for charged lepton flavor violation via the coherent conversion process mu- N --> e- N with a sensitivity approximately four orders of magnitude better than the current world's best limits for this process. The experiment's sensitivity offers discovery potential over a wide array of new physics models and probes mass scales well beyond the reach of the LHC. We describe herein the preliminary design of the proposed Mu2e experiment. This document was created in partial fulfillment of the requirements necessary to obtain DOE CD-2 approval.Comment: compressed file, 888 pages, 621 figures, 126 tables; full resolution available at http://mu2e.fnal.gov; corrected typo in background summary, Table 3.

    Measurement of charged particle multiplicity distribution in Au + Au collisions up to 200 GeV

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Physics, 2003.Vita.Includes bibliographical references (p. 155-165).This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Au+Au collisions in the Relativistic Heavy Ion Collider (RHIC) herald a new era of opportunities for studying hadronic matter under conditions of high energy density and nucleon density. The theory of strong interactions, Quantum Chromodynamics (QCD), tells us very little about the dynamics of a strongly interacting many-body system in this non-perturbative regime. Therefore, characterizing the collisions using global extrinsic observables becomes important. The distribution of charged particles produced in an ultra-relativistic heavy ion collision, being proportional to the initial energy density in the collision volume, is one such global observable. An anomalous change in particle multiplicity from lower energy collisions and/or peripheral to central collisions could indicate the onset of non-perturbative effects like deconfinement of quarks and gluons, or non-linear dynamics such as parton saturation. We determine the pseudo-rapidity distribution of charged particles from hits recorded in pixels of the PHOBOS silicon multiplicity detector, after applying corrections for detector acceptance, occupancy and background particles. The collision centrality is estimated by measuring the energy deposited in scintillator trigger detectors and comparing it to Monte Carlo simulations for events with different impact parameters. We find the total number of charged particles produced in central Au+Au collisions at [the square root of] sNN = 130 GeV and [the square root of] sNN = 200 GeV to be 4160+/-210 and 5050+/-250 respectively. The pseudo-rapidity distributions of charged particles show two remarkable features. In the fragmentation region, the distribution follows a limiting curve independent of the collision energy, similar to the limiting fragmentation behaviour seen earlier in p + p and p + A collisions.(cont.) In the mid-rapidity region, the yield of charged particles evolves smoothly as a function of [the square root of] sNN and collision centrality. We compare our results with a compilation of data from lower energy p + p, p + A and A + A collisions and discuss their implications for various phenomenological models of particle production.by Pradeep Sarin.Ph.D

    Optical techniques for broadband in-building networks

    Get PDF
    Optical fibres, which can easily handle any bandwidth demand, have been rolled out to more than 32 million consumer’s homes and professional buildings worldwide up to 2010. The basic technological and economical challenges of fibre-to-the-home (FTTH) has been solved. The current FTTH technology can now providing baseband Gbit Ethernet and high definition TV services to the gates of homes. Thus, the bottleneck for delivery of broadband services to the end users is shifting from the access network to the in-building network. In the meantime, the need for high-capacity transmission between devices inside the building, e.g. between desktop PC and data services, are also rapidly increase. How to bring high bandwidth to the mobile terminals such as laptops, PDAs or cell phones as well as to the fixed terminals such as desktop PCs and HDTV equipment in an all-in-one network infrastructure is a challenge we are facing. Building on the flexibility of the wireless access networks and the latent vast bandwidth of a fibre infrastructure, radio-over-fibre (RoF) techniques have been proposed as a cost-effective solution to the future integrated broadband services in in-building networks. This thesis investigates techniques to deliver high data rate wireless services via in-building networks: high capacity RoF links employing optical frequency multiplication (OFM) and sub-carrier multiplexing (SCM) techniques, with single- or multi-carrier signal formats. The orthogonal frequency division multiplexing (OFDM) format is investigated for the RoF transmission system, particularly with regard to the optical system nonlinearity. For low-cost short-range optical backbone networks, RoF transmission over large-core diameter plastic optical fibre (POF) links has been studied, including the transmission of the WiMedia-compliant multiband OFDM UWB signal over bandwidth-limited large-core POF as well as a full-duplex bi-directional UWB transmission over POF. In order to improve the functionalities for delivery of wireless services of in-building networks, techniques to introduce flexibility into the network architecture and to create dynamic capacity allocation have been investigated. By employing optical switching techniques based on optical semiconductor amplifiers (SOA), an optically routed RoF system has been studied. The dynamic capacity allocation is addressed by investigating one-dimensional and two-dimensional routing using electrical SCM and optical wavelengths. In addition, next to RoF networking, this thesis explores techniques for wired delivery of baseband high capacity services over POF links by employing a multi-level signal modulation format, in particular discrete multi-tone (DMT) modulation. Transmission of 10 Gbit/s data over 1 mm core diameter PMMA POF links is demonstrated, as a competitor to more expensive fibre solutions such as silica single and multimode fibre. A record transmission rate of more than 40 Gbit/s is presented for POF whose core diameter is comparable with silica multimode fibre. Finally, from the network perspective, the convergence of wired and wireless multi-standard services into a single fibre-based infrastructure has been studied. Techniques have been designed and demonstrated for in-building networks, which can convey both high capacity baseband services and broadband radio frequency (RF) services over a POF backbone link. The multi-standard RoF signals carry different wireless services at different radio frequencies and with different bandwidths, including WiFi, WiMax, UMTS and UWB. System setups to carry them together over the same multimode optical fibre based network have been designed and experimentally shown. All the concepts, designs and system experiments presented in this thesis underline the strong potential of multimode (silica and plastic) optical fibre techniques for the delivery of broadband services to wired and wireless devices in in-building networks, in order to extend to the end user the benefits of the broadband FTTH networks which are being installed and deployed worldwide
    corecore