5,373 research outputs found
A fault-tolerant multiprocessor architecture for aircraft, volume 1
A fault-tolerant multiprocessor architecture is reported. This architecture, together with a comprehensive information system architecture, has important potential for future aircraft applications. A preliminary definition and assessment of a suitable multiprocessor architecture for such applications is developed
Validation of multiprocessor systems
Experiments that can be used to validate fault free performance of multiprocessor systems in aerospace systems integrating flight controls and avionics are discussed. Engineering prototypes for two fault tolerant multiprocessors are tested
A design methodology for portable software on parallel computers
This final report for research that was supported by grant number NAG-1-995 documents our progress in addressing two difficulties in parallel programming. The first difficulty is developing software that will execute quickly on a parallel computer. The second difficulty is transporting software between dissimilar parallel computers. In general, we expect that more hardware-specific information will be included in software designs for parallel computers than in designs for sequential computers. This inclusion is an instance of portability being sacrificed for high performance. New parallel computers are being introduced frequently. Trying to keep one's software on the current high performance hardware, a software developer almost continually faces yet another expensive software transportation. The problem of the proposed research is to create a design methodology that helps designers to more precisely control both portability and hardware-specific programming details. The proposed research emphasizes programming for scientific applications. We completed our study of the parallelizability of a subsystem of the NASA Earth Radiation Budget Experiment (ERBE) data processing system. This work is summarized in section two. A more detailed description is provided in Appendix A ('Programming Practices to Support Eventual Parallelism'). Mr. Chrisman, a graduate student, wrote and successfully defended a Ph.D. dissertation proposal which describes our research associated with the issues of software portability and high performance. The list of research tasks are specified in the proposal. The proposal 'A Design Methodology for Portable Software on Parallel Computers' is summarized in section three and is provided in its entirety in Appendix B. We are currently studying a proposed subsystem of the NASA Clouds and the Earth's Radiant Energy System (CERES) data processing system. This software is the proof-of-concept for the Ph.D. dissertation. We have implemented and measured the performance of a portion of this subsystem on the Intel iPSC/2 parallel computer. These results are provided in section four. Our future work is summarized in section five, our acknowledgements are stated in section six, and references for published papers associated with NAG-1-995 are provided in section seven
Optimization techniques for a 2D Engine
This document contains the description of the development of a small 2D
engine written in C++. The focus of this project is to implement various
optimization techniques to have a performant application.
The result is a small 2D engine that can be executed in any Windows
machine and can handle more than 10.000 entities interacting with each
other in real time.
The source code for this project is public and under the MIT License and
can be found in the Github repository in the following link:
https://github.com/DavidTello1/2D-Rendere
Ordered reduced set successive detector for low complexity, quasi-ML MIMO detection
An Ordered Reduced Set Successive Detector (RSSD) for the V-BLAST spatial multiplexing scheme that uses a general two-dimensional non-uniform set partitioning for different symbols. The detector provides improved diversity and SNR gains at reduced complexity compared to a uniform set partitioning based detector. The detector can be used to reduce the complexity, with a small tradeoff in performance. Further, it is possible to obtain a quasi-ML performance using the disclosed detector at a reduced, yet fixed, complexity
Automatic generation of hardware/software interfaces
Enabling new applications for mobile devices often requires the use of specialized hardware to reduce power consumption. Because of time-to-market pressure, current design methodologies for embedded applications require an early partitioning of the design, allowing the hardware and software to be developed simultaneously, each adhering to a rigid interface contract. This approach is problematic for two reasons: (1) a detailed hardware-software interface is difficult to specify until one is deep into the design process, and (2) it prevents the later migration of functionality across the interface motivated by efficiency concerns or the addition of features. We address this problem using the Bluespec Codesign Language~(BCL) which permits the designer to specify the hardware-software partition in the source code, allowing the compiler to synthesize efficient software and hardware along with transactors for communication between the partitions. The movement of functionality across the hardware-software boundary is accomplished by simply specifying a new partitioning, and since the compiler automatically generates the desired interface specifications, it eliminates yet another error-prone design task. In this paper we present BCL, an extension of a commercially available hardware design language (Bluespec SystemVerilog), a new software compiling scheme, and preliminary results generated using our compiler for various hardware-software decompositions of an Ogg Vorbis audio decoder, and a ray-tracing application.National Science Foundation (U.S.) (NSF (#CCF-0541164))National Research Foundation of Korea (grant from the Korean Government (MEST) (#R33-10095)
Specifications and programs for computer software validation
Three software products developed during the study are reported and include: (1) FORTRAN Automatic Code Evaluation System, (2) the Specification Language System, and (3) the Array Index Validation System
Fast Calculation of the Lomb-Scargle Periodogram Using Graphics Processing Units
I introduce a new code for fast calculation of the Lomb-Scargle periodogram,
that leverages the computing power of graphics processing units (GPUs). After
establishing a background to the newly emergent field of GPU computing, I
discuss the code design and narrate key parts of its source. Benchmarking
calculations indicate no significant differences in accuracy compared to an
equivalent CPU-based code. However, the differences in performance are
pronounced; running on a low-end GPU, the code can match 8 CPU cores, and on a
high-end GPU it is faster by a factor approaching thirty. Applications of the
code include analysis of long photometric time series obtained by ongoing
satellite missions and upcoming ground-based monitoring facilities; and
Monte-Carlo simulation of periodogram statistical properties.Comment: Accepted by ApJ. Accompanying program source (updated since
acceptance) can be downloaded from
http://www.astro.wisc.edu/~townsend/resource/download/code/culsp.tar.g
- …