Search CORE

2,082 research outputs found

Development and evaluation of a fault-tolerant multiprocessor (FTMP) computer. Volume 1: FTMP principles of operation

Author: Lala J. H.
Smith T. B., Jr.
Publication venue
Publication date
Field of study

The basic organization of the fault tolerant multiprocessor, (FTMP) is that of a general purpose homogeneous multiprocessor. Three processors operate on a shared system (memory and I/O) bus. Replication and tight synchronization of all elements and hardware voting is employed to detect and correct any single fault. Reconfiguration is then employed to repair a fault. Multiple faults may be tolerated as a sequence of single faults with repair between fault occurrences

NASA Technical Reports Server

Intelligent editor/printer enhancements

Author: Pheanis D. C.
Woodfill M. C.
Publication venue
Publication date
Field of study

Microprocessor support hardware, software, and cross assemblers relating to the Motorola 6800 and 6809 process systems were developed. Pinter controller and intelligent CRT development are discussed. The user's manual, design specifications for the MC6809 version of the intelligent printer controller card, and a 132-character by 64-line intelligent CRT display system using a Motorola 6809 MPU, and a one-line assembler and disassembler are provided

NASA Technical Reports Server

On the safe deployment of matrix multiplication in massively parallel safety-related systems

Author: Abella Ferrer Jaume
Agirre Irune
Calderón Torres Alejandro Josué
Cazorla Almeida Francisco Javier
Fernández Muñoz Javier
Pérez Cerrolaza Jon
Publication venue: 'MDPI AG'
Publication date: 01/04/2022
Field of study

Deep learning technology has enabled the development of increasingly complex safety-related autonomous systems using high-performance computers, such as graphics processing units (GPUs), which provide the required high computing performance for the execution of parallel computing algorithms, such as matrix–matrix multiplications (a central computing element of deep learning software libraries). However, the safety certification of parallel computing software algorithms and GPU-based safety-related systems is a challenge to be addressed. For example, achieving the required fault-tolerance and diagnostic coverage for random hardware errors. This paper contributes with a safe matrix–matrix multiplication software implementation for GPUs with random hardware error-detection capabilities (permanent, transient) that can be used with different architectural patterns for fault-tolerance, and which serves as a foundation for the implementation of safe deep learning libraries for GPUs. The proposed contribution is complementary and can be combined with other techniques, such as algorithm-based fault tolerance. In particular, (i) we provide the high-performance matrix multiplication CUTLASS library with a catalog of diagnostic mechanisms to detect random hardware errors down to the arithmetic operation level; and (ii) we measure the performance impact incurred by the adoption of these mechanisms and their achievable diagnostic coverage with a set of representative matrix dimensions. To that end, we implement these algebraic operations, targeting CUDA cores with single instructions and multiple-thread math instructions in an NVIDIA Xavier NX GPU.The research of this paper has received funding from the European Union’s Horizon 2020 research and innovation programme (grant agreement No 871465 (UP2DATE)).Peer ReviewedPostprint (published version

Multidisciplinary Digital Publishing Institute

UPCommons. Portal del coneixement obert de la UPC

Directory of Open Access Journals

Recommended from our members

Machine division

Author: Earl E. Swartzlander Jr.
Inwook Kong
Publication venue: United States Patent and Trademark Office
Publication date: 21/05/2010
Field of study

Techniques are generally described that include methods, devices, systems and/or apparatus for dividing a numerator by a denominator. Some example methods may include selecting a first numerical factor stored in an electronic storage media. The first numerical factor may be multiplied by a numerator at least in part using a first logic circuit configured to perform multiplication. The first numerical factor may also be multiplied by a denominator. A second numerical factor may be calculated based, at least in part, on an approximation of a square of the difference between unity and the product of the first numerical factor and the denominator. The second numerical factor may be multiplied by the product of the numerator and the first numerical factor at least in part using the first logic circuit, to generate an approximation of a quotient of the numerator and the denominator.Board of Regents, University of Texas Syste

Texas ScholarWorks

An algorithm for fast route lookup and update

Author: Yilmaz Pinar Altin
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/2000
Field of study

Increase in routing table sizes, number of updates, traffic, speed of links and migration to IPv6 have made IP address lookup, based on longest prefix matching, a major bottleneck for high performance routers. Several schemes are evaluated and compared based on complexity analysis and simulation results. A trie based scheme, called Linked List Cascade Addressable Trie (LLCAT) is presented. The strength of LLCAT comes from the fact that it is easy to be implemented in hardware, and also routing table update operations are performed incrementally requiring very few memory operations guaranteed for worst case to satisfy requirements of dynamic routing tables in high speed routers. Application of compression schemes to this algorithm is also considered to improve memory consumption and search time. The algorithm is implemented in C language and simulation results with real-life data is presented along with detailed description of the algorithm

Digital Commons @ New Jersey Institute of Technology (NJIT)

Bae: Before Anyone Else; The Answer to Mobile Dating for the African Diaspora

Author: Kunzika Jordan S
Publication venue: Dartmouth Digital Commons
Publication date: 01/06/2016
Field of study

As online dating becomes increasingly popular, millions of people across the country utilize various dating services that allow them to meet, chat, date, and develop meaningful relationships with people they would probably not have met otherwise. While mass market dating apps work well for the majority of online daters, and while many niche dating apps have been created to target a number of specific demographics such as the Jewish population, Southeast Asian population, farmers, homosexuals, etc., the African-American market has been largely underserved. African-Americans have the worst experience on mass market dating apps due to negative racial bias, and before Bae, there was no modern, ubiquitous dating app that specifically targeted the African American population. Bae attempts to fill the market gap by providing a mobile first, modern dating app curated specifically for African-Americans. I will discuss our process of building our MVP product, and the technical challenges of improving on it to provide the best user experience and achieve scale with a growing user base. Finally, I will discuss Bae’s successes, areas for improvement, and possible next steps

Dartmouth Digital Commons (Dartmouth College)

A lisp oriented architecture

Author: McClain John W. F. (John Wesley Ferguson)
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/1994
Field of study

Thesis (M.S.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 1994.Includes bibliographical references (p. 63-67).by John W.F. McClain.M.S

DSpace@MIT

Modelling lexical phrases acquisition in L2

Author: Abeillé Anne
Chanier Thierry
Colmerauer Colette
Fouqueré Christophe
Picard Francoise
Zock Michael
Publication venue: HAL CCSD
Publication date: 01/01/1992
Field of study

The study focuses on the following points. It compares the views psycholinguists and computational linguists have concerning the processes of lexical access and lexical choice. Then it shows the similarities holding between the structure of the lexicon in L1 and in L2. It tries to offer a pedagogically realistic approach in vocabulary teaching based on these results

HAL AMU

HAL Clermont Université

HAL-Paris 13

Onboard Experiment Data Support Facility

Author
Publication venue
Publication date
Field of study

An onboard array structure has been devised for end to end processing of data from multiple spaceborne sensors. The array constitutes sets of programmable pipeline processors whose elements perform each assigned function in 0.25 microseconds. This space shuttle computer system can handle data rates from a few bits to over 100 megabits per second

NASA Technical Reports Server