240 research outputs found

    Optimal cube-connected cube multiprocessors

    Get PDF
    Many CFD (computational fluid dynamics) and other scientific applications can be partitioned into subproblems. However, in general the partitioned subproblems are very large. They demand high performance computing power themselves, and the solutions of the subproblems have to be combined at each time step. The cube-connect cube (CCCube) architecture is studied. The CCCube architecture is an extended hypercube structure with each node represented as a cube. It requires fewer physical links between nodes than the hypercube, and provides the same communication support as the hypercube does on many applications. The reduced physical links can be used to enhance the bandwidth of the remaining links and, therefore, enhance the overall performance. The concept and the method to obtain optimal CCCubes, which are the CCCubes with a minimum number of links under a given total number of nodes, are proposed. The superiority of optimal CCCubes over standard hypercubes was also shown in terms of the link usage in the embedding of a binomial tree. A useful computation structure based on a semi-binomial tree for divide-and-conquer type of parallel algorithms was identified. It was shown that this structure can be implemented in optimal CCCubes without performance degradation compared with regular hypercubes. The result presented should provide a useful approach to design of scientific parallel computers

    Processor allocation strategies for modified hypercubes

    Get PDF
    Parallel processing has been widely accepted to be the future in high speed computing. Among the various parallel architectures proposed/implemented, the hypercube has shown a lot of promise because of its poweful properties, like regular topology, fault tolerance, low diameter, simple routing, and ability to efficiently emulate other architectures. The major drawback of the hypercube network is that it can not be expanded in practice because the number of communication ports for each processor grows as the logarithm of the total number of processors in the system. Therefore, once a hypercube supercomputer of a certain dimensionality has been built, any future expansions can be accomplished only by replacing the VLSI chips. This is an undesirable feature and a lot of work has been under progress to eliminate this stymie, thus providing a platform for easier expansion. Modified hypercubes (MHs) have been proposed as the building blocks of hypercube-based systems supporting incremental growth techniques without introducing extra resources for individual hypercubes. However, processor allocation on MHs proves to be a challenge due to a slight deviation in their topology from that of the standard hypercube network. This thesis addresses the issue of processor allocation on MHs and proposes various strategies which are based, partially or entirely, on table look-up approaches. A study of the various task allocation strategies for standard hypercubes is conducted and their suitability for MHs is evaluated. It is shown that the proposed strategies have a perfect subcube recognition ability and a superior performance. Existing processor allocation strategies for pure hypercube networks are demonstrated to be ineffective for MHs, in the light of their inability to recognize all available subcubes. A comparative analysis that involves the buddy strategy and the new strategies is carried out using simulation results

    Interconnection Networks Embeddings and Efficient Parallel Computations.

    Get PDF
    To obtain a greater performance, many processors are allowed to cooperate to solve a single problem. These processors communicate via an interconnection network or a bus. The most essential function of the underlying interconnection network is the efficient interchanging of messages between processes in different processors. Parallel machines based on the hypercube topology have gained a great respect in parallel computation because of its many attractive properties. Many versions of the hypercube have been introduced by many researchers mainly to enhance communications. The twisted hypercube is one of the most attractive versions of the hypercube. It preserves the important features of the hypercube and reduces its diameter by a factor of two. This dissertation investigates relations and transformations between various interconnection networks and the twisted hypercube and explore its efficiency in parallel computation. The capability of the twisted hypercube to simulate complete binary trees, complete quad trees, and rings is demonstrated and compared with the hypercube. Finally, the fault-tolerance of the twisted hypercube is investigated. We present optimal algorithms to simulate rings in a faulty twisted hypercube environment and compare that with the hypercube

    Alternately-twisted cube as an interconnection network.

    Get PDF
    by Wong Yiu Chung.Thesis (M.Phil.)--Chinese University of Hong Kong, 1991.Bibliography: leaves [100]-[101]AcknowledgementAbstractChapter 1. --- Introduction --- p.1-1Chapter 2. --- Alternately-Twisted Cube: Definition & Graph-Theoretic Properties --- p.2-1Chapter 2.1. --- Construction --- p.2-1Chapter 2.2. --- Topological Properties --- p.2-12Chapter 2.2.1. --- "Node Degree, Link Count & Diameter" --- p.2-12Chapter 2.2.2. --- Node Symmetry --- p.2-13Chapter 2.2.3. --- Sub cube Partitioning --- p.2-18Chapter 2.2.4. --- Distinct Paths --- p.2-23Chapter 2.2.5. --- Embedding other networks --- p.2-24Chapter 2.2.5.1. --- Rings --- p.2-25Chapter 2.2.5.2. --- Grids --- p.2-29Chapter 2.2.5.3. --- Binary Trees --- p.2-35Chapter 2.2.5.4. --- Hypercubes --- p.2-42Chapter 2.2.6. --- Summary of Comparison with the Hypercube --- p.2-44Chapter 3. --- Network Properties --- p.3-1Chapter 3.1. --- Routing Algorithms --- p.3-1Chapter 3.2. --- Message Transmission: Static Analysis --- p.3-5Chapter 3.3. --- Message Transmission: Dynamic Analysis --- p.3-13Chapter 3.4. --- Broadcasting --- p.3-17Chapter 4. --- Parallel Processing on the Alternately-Twisted Cube --- p.4-1Chapter 4.1. --- Ascend/Descend class algorithms --- p.4-1Chapter 4.2. --- Combining class algorithms --- p.4-7Chapter 4.3. --- Numerical algorithms --- p.4-8Chapter 5. --- "Summary, Comparison & Conclusion" --- p.5-1Chapter 5.1. --- Summary --- p.5-1Chapter 5.2. --- Comparison with other hypercube-like networks --- p.5-2Chapter 5.3. --- Conclusion --- p.5-7Chapter 5.4. --- Possible future research --- p.5-7Bibliograph

    Routing in Mobius Cubes

    Get PDF
    Möbius cube je zajímavou topologií, která vznikla z topologie hypercube. Největší výhoda oproti hypercube je v přibližně polovičním průměru Möbius cube. V této práci je popsán algoritmus nejkratšího routování a jsou popsány i jeho klady a zápory. Velkou nevýhodou je možnost pádu do stavu uzamknutí (deadlock). Proto je v práci představen nový deadlock-free algoritmus a porovnán s předchozím algoritmem. Dále je v práci popsána možnost použití hypercubického multicast 1-portového wormhole algoritmu na Möbius cube.The Möbius cube is an interesting topology created from the hypercube. Its main advantage is the which that is around one half of the diameter of the hypercube. In this thesis, the shortest path algorithm is described as well as its properties and drawbacks. One major drawback is the possibility of a deadlock. Therefore, a new deadlock-free routing algorithm is introduced and compared to the previous algorithm. Later, usage of hypercube's multicast 1-port wormhole algorithm on the Möbius cube is described

    Hypercube-Based Topologies With Incremental Link Redundancy.

    Get PDF
    Hypercube structures have received a great deal of attention due to the attractive properties inherent to their topology. Parallel algorithms targeted at this topology can be partitioned into many tasks, each of which running on one node processor. A high degree of performance is achievable by running every task individually and concurrently on each node processor available in the hypercube. Nevertheless, the performance can be greatly degraded if the node processors spend much time just communicating with one another. The goal in designing hypercubes is, therefore, to achieve a high ratio of computation time to communication time. The dissertation addresses primarily ways to enhance system performance by minimizing the communication time among processors. The need for improving the performance of hypercube networks is clearly explained. Three novel topologies related to hypercubes with improved performance are proposed and analyzed. Firstly, the Bridged Hypercube (BHC) is introduced. It is shown that this design is remarkably more efficient and cost-effective than the standard hypercube due to its low diameter. Basic routing algorithms such as one to one and broadcasting are developed for the BHC and proven optimal. Shortcomings of the BHC such as its asymmetry and limited application are clearly discussed. The Folded Hypercube (FHC), a symmetric network with low diameter and low degree of the node, is introduced. This new topology is shown to support highly efficient communications among the processors. For the FHC, optimal routing algorithms are developed and proven to be remarkably more efficient than those of the conventional hypercube. For both BHC and FHC, network parameters such as average distance, message traffic density, and communication delay are derived and comparatively analyzed. Lastly, to enhance the fault tolerance of the hypercube, a new design called Fault Tolerant Hypercube (FTH) is proposed. The FTH is shown to exhibit a graceful degradation in performance with the existence of faults. Probabilistic models based on Markov chain are employed to characterize the fault tolerance of the FTH. The results are verified by Monte Carlo simulation. The most attractive feature of all new topologies is the asymptotically zero overhead associated with them. The designs are simple and implementable. These designs can lead themselves to many parallel processing applications requiring high degree of performance

    Automorphisms generating disjoint Hamilton cycles in star graphs

    Get PDF
    In the first part of the thesis we define an automorphism φn for each star graph Stn of degree n − 1, which yields permutations of labels for the edges of Stn taken from the set of integers {1, . . . , bn/2c}. By decomposing these permutations into permutation cycles, we are able to identify edge-disjoint Hamilton cycles that are automorphic images of a known two-labelled Hamilton cycle H1 2(n) in Stn. Our main result is an improvement from the existing lower bound of bϕ(n)/10c to b2ϕ(n)/9c, where ϕ is Euler’s totient function, for the known number of edge-disjoint Hamilton cycles in Stn for all odd integers n. For prime n, the improvement is from bn/8c to bn/5c. We extend this result to the cases when n is the power of a prime other than 3 and 7. The second part of the thesis studies ‘symmetric’ collections of edge-disjoint Hamilton cycles in Stn, i.e. collections that comprise images of H1 2(n) under general label-mapping automorphisms. We show that, for all even n, there exists a symmetric collection of bϕ(n)/2c edge-disjoint Hamilton cycles, and Stn cannot have symmetric collections of greater than bϕ(n)/2c such cycles for any n. Thus, Stn is not symmetrically Hamilton decomposable if n is not prime. We also give cases of even n, in terms of Carmichael’s reduced totient function λ, for which ‘strongly’ symmetric collections of edge-disjoint Hamilton cycles, which are generated from H1 2(n) by a single automorphism, can and cannot attain the optimum bound bϕ(n)/2c for symmetric collections. In particular, we show that if n is a power of 2, then Stn has a spanning subgraph with more than half of the edges of Stn, which is strongly symmetrically Hamilton decomposable. For odd n, it remains an open problem as to whether the bϕ(n)/2c can be achieved for symmetric collections, but we are able to show that, for certain odd n, a ϕ(n)/4 bound is achievable and optimal for strongly symmetric collections. The search for edge-disjoint Hamilton cycles in star graphs is important for the design of interconnection network topologies in computer science. All our results improve on the known bounds for numbers of any kind of edge-disjoint Hamilton cycles in star graphs

    Properties and Algorithms of the KCube Interconnection Networks

    Get PDF
    The KCube interconnection network was first introduced in 2010 in order to exploit the good characteristics of two well-known interconnection networks, the hypercube and the Kautz graph. KCube links up multiple processors in a communication network with high density for a fixed degree. Since the KCube network is newly proposed, much study is required to demonstrate its potential properties and algorithms that can be designed to solve parallel computation problems. In this thesis we introduce a new methodology to construct the KCube graph. Also, with regard to this new approach, we will prove its Hamiltonicity in the general KC(m; k). Moreover, we will find its connectivity followed by an optimal broadcasting scheme in which a source node containing a message is to communicate it with all other processors. In addition to KCube networks, we have studied a version of the routing problem in the traditional hypercube, investigating this problem: whether there exists a shortest path in a Qn between two nodes 0n and 1n, when the network is experiencing failed components. We first conditionally discuss this problem when there is a constraint on the number of faulty nodes, and subsequently introduce an algorithm to tackle the problem without restrictions on the number of nodes
    corecore