
    Predicting Conditional Branches With Fusion-Based Hybrid Predictors

    Researchers have studied hybrid branch predictors that leverage the strengths of multiple stand-alone predictors. The common theme among the proposed techniques is a selection mechanism that chooses a prediction from among several component predictors. We make the observation that singling out one particular component predictor ignores the information of the non-selected components. We propose Branch Prediction Fusion, originally inspired by work in the machine learning field, which combines or fuses the information from all of the components to arrive at a final prediction. Our 32KB predictor achieves the same overall prediction accuracy as the 188KB versions of the previous best performing predictors (the Multi-Hybrid and the global-local perceptron).
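
    The difference between selecting one component and fusing all of them can be illustrated with a toy model. The sketch below is an assumption-laden illustration, not the predictor from the paper: the class name `FusionTable`, the use of 2-bit saturating counters, and indexing the table by the component predictions alone are all choices made here for brevity. It shows that a fused table can learn the right outcome even when every component agrees on the wrong one, a case no selector can handle.

```python
class FusionTable:
    """Toy fusion predictor (hypothetical): one 2-bit saturating counter
    per combination of component predictions."""

    def __init__(self, n_components):
        # Counter values 0..3; values >= 2 mean "predict taken".
        self.counters = [2] * (1 << n_components)

    def _index(self, preds):
        # Pack the component predictions into a table index, first one as MSB.
        idx = 0
        for p in preds:
            idx = (idx << 1) | int(bool(p))
        return idx

    def predict(self, preds):
        return self.counters[self._index(preds)] >= 2

    def update(self, preds, taken):
        i = self._index(preds)
        if taken:
            self.counters[i] = min(3, self.counters[i] + 1)
        else:
            self.counters[i] = max(0, self.counters[i] - 1)


fusion = FusionTable(n_components=2)
# Both components say "taken", but the branch is repeatedly not taken.
for _ in range(2):
    fusion.update((True, True), taken=False)
# The components disagree and the branch is taken.
fusion.update((True, False), taken=True)
```

    A selector given the inputs `(True, True)` can only output "taken", whichever component it picks; the fused table has learned to output "not taken" for that combination.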

    Shader optimization and specialization

    In the field of real-time graphics for computer games, performance has a significant effect on the player’s enjoyment and immersion. Graphics processing units (GPUs) are hardware accelerators that run small parallelized shader programs to speed up computationally expensive rendering calculations. This thesis examines optimizing shader programs and explores ways in which data patterns on both the CPU and GPU can be analyzed to automatically speed up rendering in games. Initially, the effect of traditional compiler optimizations on shader source code was explored. Techniques such as loop unrolling or arithmetic reassociation provided speed-ups on several devices, but different GPU hardware responded differently to each set of optimizations. Analyzing execution traces from numerous popular PC games revealed that much of the data passed from CPU-based API calls to GPU-based shaders is either unused or remains constant. A system was developed to capture this constant data and fold it into the shaders’ source code. Re-running the game’s rendering code using these specialized shader variants resulted in performance improvements in several commercial games without impacting their visual quality.
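
    Folding constant data into shader source can be pictured as a source-to-source rewrite. The abstract does not describe how the thesis system performs this step, so the following is only a minimal sketch under stated assumptions: the function name `specialize_shader` is hypothetical, the shader is assumed to be GLSL text, and only scalar `uniform float` declarations are handled. Each uniform whose value was observed to be constant is rewritten as a `const` declaration, which lets the shader compiler propagate the value.

```python
import re


def specialize_shader(source, constants):
    """Hypothetical helper: rewrite `uniform float NAME;` as
    `const float NAME = VALUE;` for every NAME found in `constants`."""

    def fold(match):
        name = match.group(1)
        if name in constants:
            # float() ensures the literal has a decimal point or exponent.
            return f"const float {name} = {float(constants[name])!r};"
        return match.group(0)  # leave non-constant uniforms untouched

    return re.sub(r"uniform\s+float\s+(\w+)\s*;", fold, source)


shader = (
    "uniform float gain;\n"
    "uniform float time;\n"
    "void main() { gl_FragColor = vec4(gain * time); }"
)
# Assume a trace showed that `gain` is always 0.5 while `time` varies.
specialized = specialize_shader(shader, {"gain": 0.5})
```

    Here `gain` becomes a compile-time constant while `time` remains a uniform supplied by the CPU at run time.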