435,680 research outputs found
Field-based branch prediction for packet processing engines
Network processors have exploited many aspects of architecture design, such as employing multi-core, multi-threading and hardware accelerator, to support both the ever-increasing line rates and the higher complexity of network applications. Micro-architectural techniques like superscalar, deep pipeline and speculative execution provide an excellent method of improving performance without limiting either the scalability or flexibility, provided that the branch penalty is well controlled. However, it is difficult for traditional branch predictor to keep increasing the accuracy by using larger tables, due to the fewer variations in branch patterns of packet processing. To improve the prediction efficiency, we propose a flow-based prediction mechanism which caches the branch histories of packets with similar header fields, since they normally undergo the same execution path. For packets that cannot find a matching entry in the history table, a fallback gshare predictor is used to provide branch direction. Simulation results show that the our scheme achieves an average hit rate in excess of 97.5% on a selected set of network applications and real-life packet traces, with a similar chip area to the existing branch prediction architectures used in modern microprocessors
Recommended from our members
Branch prediction apparatus, systems, and methods
An apparatus and a system, as well as a method and article, may operate to predict a branch within a first operating context, such as a user context, using a first strategy; and to predict a branch within a second operating context, such as an operating system context, using a second strategy. In some embodiments, apparatus and systems may comprise one or more first storage locations to store branch history information associated with a first operating context, and one ore more second storage locations to store branch history information associated with a second operating context.Board of Regents, University of Texas Syste
Research highlights of the global modeling and simulation branch for 1986-1987
This document provides a summary of the research conducted in the Global Modeling and Simulation Branch and highlights the most significant accomplishments in 1986 to 1987. The Branch has been the focal point for global weather and climate prediction research in the Laboratory for Atmospheres through the retrieval and use of satellite data, the development of global models and data assimilation techniques, the simulation of future observing systems, and the performance of atmospheric diagnostic studies
Influence of bond stress-slip relationship on bond strength prediction
The study of the bond stress-slip relationship of FRP (fibre reinforced polymer) adhered to concrete has been a key point to understand the bond behaviour of externally bonded reinforcement (EBR) and near surface mounted (NSM) systems. Researchers have made an effort to determine bond-slip relationships through experimental and analytical/numerical methods, although they have not obtained univocal results.
The area under the bond stress-slip relationship, representing the fracture energy, is one of the main parameters to make bond strength predictions. The fracture energy may be divided in two parts: elastic and softening contribution. These parts act both in a different way in predicting the failure load and the effective transfer length.
In this paper the influence of the shape of the bond stress-slip relationship on the prediction of the bond strength and transfer length is investigated. Hereby, a comparison is made between the bilinear bond stress-slip relationship (linear elastic ascending branch-linear softening branch) and the elastic-exponential bond stress-slip relationship (linear elastic ascending branch-exponential softening branch)
Collective resonance modes of Josephson vortices in sandwiched stack of BiSrCaCuO intrinsic Josephson junctions
We observed splitting of the low-bias vortex-flow branch in a
dense-Josephson-vortex state into multiple sub-branches in current-voltage
characteristics of intrinsic Josephson junctions (IJJs) of
BiSrCaCuO single crystals in the long-junction limit.
Each sub-branch corresponds to a plasma mode in serially coupled Josephson
junctions. Splitting into low-bias linear sub-branches with a spread in the
slopes and the inter-sub-branch mode-switching character are in good
quantitative agreement with the prediction of the weak but finite
inter-junction capacitive-coupling model incorporated with the inductive
coupling. This suggests the importance of the role of the capacitive coupling
in accurately describing the vortex dynamics in serially stacked IJJs.Comment: 4 pages, 3 figures, 1 tabl
Enlarging instruction streams
The stream fetch engine is a high-performance fetch architecture based on the concept of an instruction stream. We call a sequence of instructions from the target of a taken branch to the next taken branch, potentially containing multiple basic blocks, a stream. The long length of instruction streams makes it possible for the stream fetch engine to provide a high fetch bandwidth and to hide the branch predictor access latency, leading to performance results close to a trace cache at a lower implementation cost and complexity. Therefore, enlarging instruction streams is an excellent way to improve the stream fetch engine. In this paper, we present several hardware and software mechanisms focused on enlarging those streams that finalize at particular branch types. However, our results point out that focusing on particular branch types is not a good strategy due to Amdahl's law. Consequently, we propose the multiple-stream predictor, a novel mechanism that deals with all branch types by combining single streams into long virtual streams. This proposal tolerates the prediction table access latency without requiring the complexity caused by additional hardware mechanisms like prediction overriding. Moreover, it provides high-performance results which are comparable to state-of-the-art fetch architectures but with a simpler design that consumes less energy.Peer ReviewedPostprint (published version
- …
