# Some Theoretical Results of Hypercube for Parallel Architecture 

Nagata, M.
IIASA Working Paper
WP-92-018

February 1992

Nagata, M. (1992) Some Theoretical Results of Hypercube for Parallel Architecture. IIASA Working Paper. WP-92-018 Copyright © 1992 by the author(s). http://pure.iiasa.ac.at/3680/

Working Papers on work of the International Institute for Applied Systems Analysis receive only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute, its National Member Organizations, or other organizations supporting the work. All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage. All copies must bear this notice and the full citation on the first page. For other purposes, to republish, to post on servers or to redistribute to lists, permission must be sought by contacting repository@iiasa.ac.at

# Working Paper 

Some Theoretical Results of Hypercube for Parallel Architecture

Motoyasu Nagata

WP-92-18
February 1992

# Some Theoretical Results of Hypercube for Parallel Architecture 

Motoyasu Nagata

WP-92-18
February 1992

Working Papers are interim reports on work of the International Institute for Applied Systems Analysis and have received only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute or of its National Member Organizations.

International Institute for Applied Systems Analysis a A-2361 Laxenburg a Austria
Telephone: +432236715210 - Telex: 079137 iiasa a 口 Telefax: +43 223671313

## Foreword

This paper surveys some theoretical results of the hypercube for design of VLSI architecture. The parallel computer including the hypercube multiprocessor will become a leading technology that supports efficient computation for large uncertain systems.

# Some Theoretical Results of Hypercube for Parallel <br> Architecture 

Motoyasu Nagata

## 1 Introduction

The parallel computer is one of the remarkable developments of methodology and technology in computer science in recent years. Due to multiprocessor structure of the computer architecture, this computer has a capability to execute multiple instructions or multiple data simultaneously. The parallel computer not only provides support for efficient computation of mathematical, economical, industrial, and ecological problems but also aims new computer architecture beyond the traditional von Neumann type. Parallel processing will become a leading technology in the 1990s.

The technologies of the parallel computer mainly depends on the development of the VLSI. The parallel computer with hypercube interconnection is regarded as the most promising technology due to its topological structure and its surprising computing power [1] [4] [12] [26], and [31]. Examples of VLSI chips for the hypercube are iPSC/1, iPSC/2, and NCube/+4. On the other hand, methodologies of the hypercube have been developed based on graph theory, data structure and combinatorial optimization, taking into consideration computer architecture [5], and [12]. There are many theoretical problems in the design of the hypercube computer, for example, topological structure, embedding into hypercube, communication synchronization, and fault tolerance.

The purposes of the hypercube computer are parallel processing, parallel communication, and synchronization in all processors which are interconnected by hypercube topology. Fundamental hardware configuration of the hypercube processor, which is sometimes called the node, consists of the CPU, local memory, buffer, and interfaces [20]. Every node is connected to adjacent nodes via interfaces. Data structure for task or data is decomposed into pieces and allocated to the processor of the hypercube. Processor allocation is desirable if adjacent data of the data structure are mapped on adjacent nodes of the hypercube. Next, parallel internode communication is necessary for decomposing the unit
of the task or for transmitting data to another node. After data have been transmitted to all allocated nodes, parallel processing is carried out. Synchronization is necessary for parallel processing and communication. Processing results are also gathered by parallel internode communication. This paper surveys some theoretical results in VLSI architecture for hypercube.

## 2 Topological Structure of Hypercube

This section treats topological structure of the hypercube from viewpoints of addressing and graph.
Definition 2.1 [24] The $n$-dimensional hypercube $Q_{n}$ has $2^{n}$ nodes. Addresses of these nodes are from 0 to $2^{n}-1$. Any two nodes are adjacent if and only if two binary addresses differ by one and only one bit.

Definition 2.2 Let $a_{n} a_{n-1} \ldots . . a_{1}, b_{n} b_{n-1} \ldots . b_{1}$ be binary addresses of two nodes $a$ and $b$ of $n$-dimensional hypercube $Q_{n}$. Nodes $a$ and $b$ are adjacent if the Hamming distance between two binary addresses $H(a, b)=\sum_{i=1}^{n}\left|a_{i}-b_{i}\right|$ is one.
Definition 2.3 [24] Addresses of the $n$-dimensional hypercube $Q_{n}$ are recursively constructed as follows:
(1) Addresses of two nodes of one-dimensional hypercube $Q_{1}$ are 0 and 1.
(2) Let $a_{n-1} \ldots . a_{1}$ be binary address of any node of $(n-1)$-dimensional hypercube $Q_{n-1}$. For two $Q_{n-1} \mathrm{~s}$, concatenate 0 and 1 to the leftmost bit positions of two nodes with the same address $a_{n-1} \ldots . . a_{1}$, and connect these two nodes.
Remark 2.4 The two nodes addressed by $0 a_{n-1} \ldots . . a_{1}, 1 a_{n-1} \ldots . . a_{1}$ of $Q_{n}$ in Definition 2.3 are adjacent.

The hypercube is recursively defined using a graph.
Definition $2.5[8]$ Let $G=(V, E)$ be a graph where $V$ is a set of nodes and $E$ is a set of edges. Let $G_{p}=\left(V_{p}, E_{p}\right)$ be a product of two graphs $G_{1}=\left(V_{1}, E_{1}\right)$ and $G_{2}=\left(V_{2}, E_{2}\right)$, denoted by $G_{1} \times G_{2}$, where set of nodes $V_{p}=V_{1} \times V_{2}$. Two nodes $u=\left(u_{1}, u_{2}\right)$, $a n d v=$ $\left(v_{1}, v_{2}\right)$ are adjacent in $G_{p}$ if and only if $u_{1}=v_{1}\left(u_{2}=v_{2}\right)$ and $u_{2}\left(u_{1}\right)$ is adjacent to $v_{2}\left(v_{1}\right)$. Definition 2.6 The graph is called a complete graph if there exists only one edge that connects any two nodes.
Definition 2.7 [8] The graph of $n$-dimensional hypercube $Q_{n}$ is recursively constructed as follows:
(1) $Q_{0}$ is a trivial graph with one node.
(2) $Q_{n}=K_{2} \times Q_{n-1}$, where $K_{2}$ is a complete graph which consists of two nodes.

Some lemmas about topological properties of the hypercube have been proposed from these definitions.

Lemma 2.8 [24] There are $n$ different ways for decomposition of $Q_{n}$ into two $Q_{n-1}$ s.
Proof By Definition 2.3, there are $n$ concatenations of 0 (or 1 ) to address $Q_{n-1}$, that is, $0 a_{n-1} \ldots . . a_{1}, a_{n-1} 0 a_{n-2} \ldots . . a_{1}, a_{n-1} \ldots . . a_{1} 0$. .
Lemma 2.9 [24] There are $n!2^{n}$ different ways of addressing $Q_{n}$.
Proof The result is trivial for $n=1$. Assume that Lemma 2.9 holds for $n-1$. Since there are $(n-1)!2^{n-1}$ ways of addressing $Q_{n-1}$ and there are $n$ concatenations of 0 (or 1) to any address of the first (or second) $Q_{n-1}$, the number of ways of addressing $Q_{n}$ is $n \times(n-1)!2^{n-1}+n \times(n-1)!2^{n-1}=n!2^{n}$. ㅁ
Lemma 2.10 [24] Let $A$ and $B$ be any adjacent node of $Q_{n}$. Then the nodes adjacent to $A$ and the nodes adjacent to $B$ are conneced in one-to-one fashion.

Proof Let addresses of $A$ and $B$ be $0 a_{n-1} \ldots . . a_{1}, 1 a_{n-1} \ldots . . a_{1}$ without loss of generality. Since the address of any node that is adjacent to $A$ (or $B$ ) is obtained by reversing one and only one bit of the address of $A$ (or $B$ ), $0 a_{n-1} \ldots \bar{a}_{i} \ldots a_{1}$ and $1 a_{n-1} \ldots \bar{a}_{i} \ldots a_{1}$ are adjacent for $1 \leq i \leq n$.

Lemma 2.11 [24] There are no cycles of odd length in $Q_{n}$.
Proof Consider a cycle $A_{1}, A_{2}, \ldots ., A_{m}$ of $Q_{n}$, where $A_{1}=A_{m}$. The length of the cycle, $m-1$, is the sum of bit reversing and its reversing again for some $a_{i}(1 \leq i \leq n)$.
Definition 2.12 The graph is called a connected graph if there exists a path that connects any two nodes of the graph. Maximum distance between two nodes of the graph is called the diameter.

Lemma 2.13 [24] $Q_{n}$ is a connected graph of diameter $n$.
Proof Consider a node $A$ with address $0 \ldots . .0$ of $Q_{n}$ and a node $B$ whose address has $k$ 1 s and $(n-k) 0$ s. By Definition 2.3 there exists a path with $k$ edges that connects $A$ and $B$, then $Q_{n}$ is a connected graph and $k \leq n$.

Lemma 2.14 [29] $Q_{n}$ has $n 2^{n-1}$ edges that connect $2^{n}$ nodes.
Proof Since any node has $n$ adjacent nodes, the number of edges is $n 2^{n} \div 2=n 2^{n-1}$. $\square$
Definition 2.15 The number of edges connected to the node of the graph is called degree of the node.

Theorem 2.16 [24] A graph $G=(V, E)$ is $Q_{n}$ if and only if
(1) $V$ has $2^{n}$ vertices.
(2) Every vertex has degree $n$.
(3) $G$ is connected.
(4) For any two adjacent nodes $A$ and $B$, the nodes adjacent to $A$ and the nodes adjacent to $B$ are connected in one-to-one fashion.

Lemma 2.17 [24] The minimum distance between any two nodes $A$ and $B$ of $Q_{n}$ is the Hamming distance $H(A, B)$.

Proof Assume, without loss of generality, that two addresses of nodes $A$ and $B$ differ in $k$ leading bits. Then one path from $A$ to $B$ is the following:

$$
\begin{gathered}
a_{n} a_{n-1} a_{n-2} \ldots . . a_{n-k+1} a_{n-k} \ldots . . a_{1}=A \\
b_{n} a_{n-1} a_{n-2} \ldots . a_{n-k+1} a_{n-k} \ldots . . a_{1} \\
b_{n} b_{n-1} a_{n-2} \ldots \ldots . . a_{n-k+1} a_{n-k} \ldots \ldots . . a_{1}
\end{gathered}
$$

$$
b_{n} b_{n-1} b_{n-2} \ldots . . b_{n-k+1} a_{n-k} \ldots \ldots a_{1}=B .
$$

Then the result holds.
Lemma 2.18 [24] Let $A$ and $B$ be any two nodes of $Q_{n}$ and assume that $H(A, B)<n$. Then there are $H(A, B)$ parallel paths of length $H(A, B)$ between the nodes $A$ and $B$. Proof Assume that the addresses of nodes $A$ and $B$ are the same as those in the proof of Lemma 2.17. We construct parallel paths from $A$ to $B$ with length $H(A, B)$. The $i$-th path can be constructed as follows:

$$
\begin{aligned}
& a_{n} \ldots \ldots . a_{n-i+2} a_{n-i+1} a_{n-i} \ldots \ldots . a_{n-k+1} a_{n-k} \ldots \ldots . a_{1} \\
& a_{n} \ldots \ldots . . a_{n-i+2} b_{n-i+1} a_{n-i} \ldots \ldots . a_{n-k+1} a_{n-k} \ldots \ldots . . a_{1} \\
& a_{n} \ldots \ldots . . a_{n-i+2} b_{n-i+1} b_{n-i} \ldots \ldots . a_{n-k+1} a_{n-k} \ldots \ldots . . a_{1} \\
& \text {.................................................................. } \\
& b_{n} a_{n-1} a_{n-2} \ldots a_{n-i+2} b_{n-i+1} b_{n-i} \ldots b_{n-k+1} a_{n-k} \ldots a_{1} \\
& b_{n} b_{n-1} a_{n-2} \ldots a_{n-i+2} b_{n-i+1} b_{n-i} \ldots b_{n-k+1} a_{n-k} \ldots a_{1} \\
& b_{n} b_{n-1} \ldots \ldots \ldots \ldots \ldots . . b_{n-k+2} b_{n-k+1} a_{n-k} \ldots \ldots \ldots \ldots \ldots \ldots a_{1}
\end{aligned}
$$

Then there exists $H(A, B)$ parallel paths. $\square$
Lemma 2.19 [24] Let $A$ and $B$ be any two nodes of $Q_{n}$ and assume that $H(A, B)<n$. Then there are $n$ parallel paths between the nodes $A$ and $B$. Moreover, the length of each path is at most $H(A, B)+2$.

Proof Assume that addresses of nodes $A$ and $B$ are the same as those in the proof of Lemma 2.17. We have already proved that there exist $k$ parallel paths with length $H(A, B)$ in Lemma 2.18. We prove the existence of $(n-k)$ parallel paths with length $H(A, B)+2$ besides $k$ parallel paths with length $H(A, B)$. At first reverse $a_{i}$ of address of node $A$, next take a procedure of Lemma 2.17, and finally reverse $a_{i}$ again. We apply these procedures for $a_{i}, 2 \leq i \leq n-k$. Then the result holds.

Definition 2.20 [9] [10] The subcube with $m$-dimension $(m<n)$ is a subset of $Q_{n}$ that satisfies four properties of Theorem 2.16. The subcube is addressed using notations $0,1, *$, where $*$ is don't care symbol that stands for 0 or 1 . The dimension of the subcube is represented by number of $*$ s.

Definition 2.21 [9] The distance between two subcubes $a$ and $b$ is represented by Hamming distance:

$$
H(a, b)=\sum\left|a_{i}-b_{i}\right|,
$$

where $\left|a_{i}-b_{i}\right|=1$ if $\left(a_{i}, b_{i}\right)=(1,0)$ or $(0,1)$.
Example 2.22 Consider a subcube of $Q_{3}$ with nodes $000,001,011,010$. This subcube is with address $0 * *$ and has two dimensions. The distance between two subcubes $00 * *$ and $011 *$ is one due to $H(00 * *, 011 *)=1$.

Lemma 2.23 [9] The $m$-dimensional subcube $q$ of $Q_{n}$ is adjacent to, at most, $(n-m) 2^{m}$ subcubes in $Q_{n}$.

Proof Assume that the address of a subcube $q$ is $00 \ldots . .0 * * \ldots . . *$ that is represented by $(n-m) 0$ s and $m * s$. If the addresses of subcubes $Q_{m}$ s adjacent to a subcube $q$ are

then there exist $(n-m) 2^{m} Q_{0}$ s that are adjacent to subcube $q$. $\square$

## 3 Gray Code

The Gray code consists of binary codes such that adjacent codes differ by one bit. The Gray code plays an important role when the data structure is allocated to processors of the hypercube.

Definition 3.1 [12] The $n$-dimensional binary-reflected Gray code (BRGC) $G_{n}$ is recursively constructed as follows:

$$
\begin{gathered}
G_{1}=(0,1) \\
G_{n}=\left(0 G_{n-1}, 1 \bar{G}_{n-1}\right),
\end{gathered}
$$

where $0 G_{n-1}$ is a concatenation of 0 and $G_{n-1}$ and $\bar{G}_{n-1}$ is a backward-sorted code of $G_{n-1}$.
Example 3.2 Two and three-dimensional BRGCs are shown

$$
\begin{gathered}
G_{2}=\left(0 G_{1}, 1 \bar{G}_{1}\right)=(00,01,11,10) \\
G_{3}=\left(0 G_{2}, 1 \bar{G}_{2}\right)=(000,001,011,010,110,111,101,100)
\end{gathered}
$$

Remark 3.3 We can intuitively understand that there exists a mirror between $G_{n-1}$ and $\overline{G_{n-1}}$.
The relationship between the BRGC and its ordering is formulated by the following famous theorem discovered by Gray.

Theorem 3.4 [12] [15] Let binary representation of the $n$-dimensional BRGC $G_{n}$ and its ordering be $g_{n-1} \ldots . . g_{0}$ and $b_{n-1} \ldots . . b_{1}$, respectively. Then the relationship between the BRGC and its ordering is formulated by the following equations:

$$
\begin{gathered}
g_{i}=b_{i}+b_{i+1}, \bmod 2, i<n-1 \\
g_{n-1}=b_{n-1}
\end{gathered}
$$

The equivalent inverse relation also holds.

$$
b_{i}=g_{i}+b_{i+1}, \bmod 2, i<n-1
$$

$$
b_{n-1}=g_{n-1}
$$

Furthermore, $b_{i}$ is represented by $g_{j}$, where $i \leq j \leq n-1$

$$
b_{i}=g_{i}+\ldots . .+g_{n-1}, \bmod 2
$$

## 4 Embedding Data Structure into Hypercube

Definition 4.1 [16] Let $d(i, j)$ be the distance between two nodes $i$ and $j$. Let $d\left(G_{i}, G_{j}\right)$ be the distance between two addresses $G_{i}$ and $G_{j}$ that are encoded using the BRGC. Then $\operatorname{maxd}\left(G_{i}, G_{i+1}\right)$ is called dilation, where $G_{i}$ and $G_{i+1}$ are adjacent BRGCs.
Definition 4.2 Consider an embedding of a graph $G=(V, E)$ into another graph $G^{\prime}=$ $\left(V^{\prime}, E^{\prime}\right)$. Then $n\left(V^{\prime}\right) / n(V)$ is called expansion, where $n(V)$ is the number of elements of node set $V$.

### 4.1 Loop

Lemma 4.3 [16] The loop of length $|L|=2^{n-1}+2 k, k=\left\{1,2, \ldots \ldots \ldots ., 2^{n-2}\right\}$ can be embedded into an $n$-dimensional hypercube $Q_{n}$ with dilation 1 by BRGC.
Proof Assume that nodes $\left\{0,1, \ldots \ldots . ., 2^{n-1}+k-1\right\}$ can be embedded using the BRGC of the loop node index. And assume that loop nodes $\left\{2^{n-1}+k, 2^{n-1}+k+1, \ldots \ldots . ., 2^{n-1}+2 k-1\right\}$ can be embedded corresponding to the BRGCs of $\left\{2^{n}-k, 2^{n}-k+1, \ldots \ldots . ., 2^{n}-1\right\}$. Then the Gray code of $2^{n-1}+k-1$ is $\left(1 G_{n-1}\left(2^{n-1}-k\right)\right)$ and Gray code of $2^{n}-k$ is $\left(1 G_{n-1}(k-1)\right)$, where $G_{n-1}\left(2^{n-1}-k\right)$ is the $\left(2^{n-1}-k\right)$-th code of $(n-1)$-dimensional BRGC. If

$$
\begin{gathered}
G_{n-1}\left(2^{n-1}-k\right)=\left(1 G_{n-2}(k-1)\right) \\
G_{n-1}(k-1)=\left(0 G_{n-2}(k-1)\right)
\end{gathered}
$$

then

$$
1 G_{n-1}\left(2^{n-1}-k\right)=\left(11 G_{n-2}(k-1)\right)
$$

$$
1 G_{n-1}(k-1)=\left(10 G_{n-2}(k-1)\right) .
$$

Thus both codes are adjacent.
Lemma 4.4 [16] The loop of length $|L|=2^{n-1}+2 k+1, k=\left\{1,2, \ldots \ldots \ldots ., 2^{n-2}-1\right\}$ can be embedded into an $n$-dimensional hypercube $Q_{n}$ with at least one edge of length 2 .

Lemma 4.5 [16] The BRGCs that encode $i$ and $i+2^{k} \bmod 2^{n}$ differ in 2 bits.
Proof For two integers $i$ and $j$ whose binary representations are

$$
\begin{aligned}
& i_{n} i_{n-1} \ldots \ldots . . i_{1} \\
& j_{n} j_{n-1} \ldots \ldots . . j_{1},
\end{aligned}
$$

two BRGCs that correspond to $i$ and $j$ are assumed as follows:

$$
\begin{gathered}
g_{n} g_{n-1} \ldots \ldots . . g_{1} \\
h_{n} h_{n-1} \ldots \ldots . h_{1} .
\end{gathered}
$$

Then

$$
\begin{gathered}
i_{m}=j_{m}, m=\{1,2, \ldots \ldots ., k\} \\
j_{m}=\bar{i}_{m}, m=\{k+1, \ldots \ldots \ldots, s\},
\end{gathered}
$$

where carry stops propagation at bit position $s$. From Gray's theorem, the result is obtained

$$
\begin{gathered}
h_{m}=g_{m}, m=\{1,2, \ldots ., k-1, k+1, \ldots \ldots, s-1\} \\
h_{k}=\bar{g}_{k}, h_{s}=\bar{g}_{s},
\end{gathered}
$$

where the overline stands for the complement of the bit.

### 4.2 Mesh

Johnsson presented an example for the naive embedding of a multidimensional array. We generalize his example by the following algorithm that guarantees adjacencies with wraparound.

## Algorithm 4.6: Embedding of a $2^{n} \times 2^{n}$ mesh in $Q_{2 n}$ by BRGC

Suppose the matrix $A^{2^{i}}$ stands for $2^{i} \times 2^{i}$ mesh whose nodes are addressed using a binaryreflected Gray code (BRGC).

Step 1 For embedding of a $2 \times 2$ mesh, allocate two-dimensional BRGC $G_{2}(i)(i=0, \cdots, 3)$, $00,01,11,10$ to four nodes of this mesh in a clockwise manner, that is,

$$
A^{2^{1}}=\left(\begin{array}{ll}
00 & 01 \\
10 & 11
\end{array}\right)
$$

Step 2 Embedding of a $2^{i+1} \times 2^{i+1}$ mesh can be done using the following recursive formula

$$
A^{2^{i+1}}=\left(\begin{array}{cc}
00 A^{2^{i}} & 01\left(A_{\pi / 2}^{2^{i}}\right)^{T} \\
10\left(A_{3 \pi / 2}^{2^{i}}\right)^{T} & 11 A_{\pi}^{2^{i}}
\end{array}\right)
$$

where $A_{\pi / 2}^{2^{i}}$ indicates rotation of $A^{2^{i}}$ by $\pi / 2$ radian and $\left(A_{\pi / 2}^{2^{i}}\right)^{T}$ means transpose of $A_{\pi / 2}^{2^{i}}$. Furthermore, $00 A^{2^{i}}$ stands for concatenation of 00 to the leftmost bits of all nodes of $A^{2^{i}}$.

## Example 4.7

$$
A^{2^{2}}=\left(\begin{array}{llll}
0000 & 0001 & 0101 & 0100 \\
0010 & 0011 & 0111 & 0110 \\
1010 & 1011 & 1111 & 1110 \\
1000 & 1001 & 1101 & 1100
\end{array}\right)
$$

The adjacency with wraparound is guaranteed, for example, $(0000,0100),(0000,1000)$. Furthermore, the adjacency in an ordinary sense is also guaranteed, for example, ( 0011,0111 ), $(0011,1011)$. We also show $A^{2^{3}}$..

$$
A^{2^{3}}=\left(\begin{array}{llllllll}
000000 & 000001 & 000101 & 000100 & 010100 & 010101 & 010001 & 010000 \\
000010 & 000011 & 000111 & 000110 & 010110 & 010111 & 010011 & 010010 \\
001010 & 001011 & 001111 & 001110 & 011110 & 011111 & 011011 & 011010 \\
001000 & 001001 & 001101 & 001100 & 011100 & 011101 & 011001 & 011000 \\
101000 & 101001 & 101101 & 101100 & 111100 & 111101 & 111001 & 111000 \\
101010 & 101011 & 101111 & 101110 & 111110 & 111111 & 111011 & 111010 \\
100010 & 100011 & 100111 & 100110 & 110110 & 110111 & 110011 & 110010 \\
100000 & 100001 & 100101 & 100100 & 110100 & 110101 & 110001 & 110000
\end{array}\right) .
$$

Remark 4.8 We can intuitively interpret the algorithm about embedding of the multidimensional array from the viewpoint of the mirror. Images of $A^{2^{i}}$ reflected in two mirrors are $\left(A_{\pi / 2}^{2^{i}}\right)^{T}$ and $\left(A_{3 \pi / 2}^{2^{i}}\right)^{T}$, respectively. Furthermore, images of $\left(A_{\pi / 2}^{2^{i}}\right)^{T}$ and $\left(A_{3 \pi / 2}^{2^{i}}\right)^{T}$ reflected in mirrors result in $A_{\pi}^{2^{i}}$. Thus, the two-dimensional Gray code can be generated using Algorithm 4.6.
Lemma 4.9 [16] The naive embedding of the multidimensional array into the hypercube can be done such that an $N_{1} \times N_{2} \times \cdots \times N_{\tau}$ mesh can be embedded into an $N=\left(\left[\log _{2} N_{1}\right]+\left[\log _{2} N_{2}\right]+\cdots \cdots+\left[\log _{2} N_{r}\right]\right)$-dimensional hypercube $Q_{N}$ by assigning
[ $\log _{2} N_{i}$ ] cube dimensions to dimension $i$ of the mesh. Here [•] stands for a notation such that $[x]=\sup x \in Z, x \in R$, where $Z$ and $R$ are integer and real spaces.
Lemma 4.10 [16] The naive embedding of the multidimensional arrays is efficient for $N_{i}=2^{n_{i}}$, but for $2^{n_{i}}<N_{i}<2^{n_{i}+1}$ there exists the expansion

$$
e=\frac{2\left(\left[\log _{2} N_{1}\right]+\cdots \cdots+\left[\log _{2} N_{r}\right]\right)}{N_{1} \times \cdots \times N_{r}}
$$

Modified mesh embedding methods are found in the literature [6], [7], [14], [19], and [27].

### 4.3 Graph

Definition 4.11 [9] Relaxed-squashed (RS) embedding is a node-to-subcube distance preserving mapping from source graph to the hypercube.
Definition $\mathbf{4 . 1 2}$ [9] The dimension of the minimal cube required for the RS embedding of a source graph $G=(V, E)$ is called the weak cubical dimension $w d(G)$ of the graph.
Definition 4.13 For graph $G=(V, E)$, an induced subgraph $\operatorname{ind}_{G}\left(V_{S}\right)$ of $G$ with a node set $V_{S} \subseteq V$ is the maximal subgraph with the node set $V_{S}$.

Lemma 4.14 [9] Let $G$ be a graph with $n$ nodes. Then the following inequality holds: $\left[\log _{2} n\right] \leq w d(G) \leq n-1$.
Proof Graph $G$ is a subgraph of the complete graph $K_{n}$. If $\operatorname{wd}\left(K_{n}\right)=n-1$, then $w d(G) \leq n-1$ holds. By Definition 4.12, $\left[\log _{2} n\right] \leq w d(G)$ is obvious.
Lemma 4.15 [9] Let $G=(V, E)$ be a connected graph and let $G_{S}=\left(V_{S}, E_{S}\right)$ be a subgraph of $G$. Suppose that the induced subgraph $\operatorname{ind}_{G}\left(V_{S}\right)$ can be RS embedded into $m$-dimensional hypercube $Q_{m}$, and the removal of all edges in $E_{S}$ from $G$ results in $\left|V_{S}\right|$ disjoint graphs, $G_{i}=\left(V_{i}, E_{i}\right), 1 \leq i \leq\left|V_{S}\right|$. Then $w d(G) \leq \max _{1 \leq i \leq\left|V_{S}\right|} w d\left(G_{i}\right)+m$.
Proof Let $v_{i}, 1 \leq i \leq\left|V_{S}\right|=k$ be the nodes in $V_{S} \cap V_{i}$ of $G_{S} \cap G_{i}$. The notation address $G_{S}\left(v_{i}\right)$ represents the encoding of $v_{i}$ in $V_{S}$ so that $G_{S}$ can be RS embedded into $Q_{m}$. The notation address $_{G_{i}}(v)$ represents the encoding of $v \in V_{i}$ for the RS embedding of $G_{i}$ into $Q_{w d\left(G_{i}\right)}$, where $\operatorname{address}{ }_{G_{i}}^{j}(v)$ is the $j$-th bit of $\operatorname{address~}_{G_{i}}(v)$. The RS embedding generates $\operatorname{address}_{G}(w)$ for each $w$ in $G$ by the following procedures:

## Algorithm 4.16: RS Embedding of single graph

Step 1 For each $w \in V_{i}, 1 \leq i \leq k=\left|V_{S}\right|$, address ${ }_{G}^{j}(w) \longleftarrow a d d r e s s_{G_{S}}^{j}\left(v_{i}\right)$, where $1 \leq j \leq m$.
Step 2 For each $w \in V_{i}, 1 \leq i \leq k=\left|V_{S}\right|$,
if address $_{G_{S} \cap G_{i}}^{j}\left(v_{i}\right)=1$
then address $_{G}^{j}(w) \longleftarrow \overline{\text { address }_{G_{i}}^{j-m}(v)}$
(overline indicates complement of binary code)
else $\operatorname{address}{ }_{G}^{j}(w) \longleftarrow \operatorname{address}{ }_{G_{i}}^{j-m}(v)$
where $m+1 \leq j \leq w d\left(G_{i}\right)+m$.
Step 3 For each $w \in V_{i}, 1 \leq i \leq k=\left|V_{S}\right|$,
$\operatorname{address}{ }_{G}^{j}(w) \longleftarrow *$ (don't care symbol),
where $w d\left(G_{i}\right)+m+1 \leq j \leq \max _{1 \leq i \leq k} w d\left(G_{i}\right)+m$.
As shown in the Algorithm 4.16, the weak cubical dimension $w d(G)$ that is the dimension of the minimum cube required for the RS embedding of the graph $G=(V, E)$ is smaller than $\max _{1 \leq i \leq\left|V_{s}\right|} w d\left(G_{i}\right)+m$.

### 4.4 Tree

### 4.4.1 Embedding with expansion 1

Theorem 4.17 [16] An embedding of a complete binary tree of $n$-height in a $Q_{n}$, by labeling the tree nodes in inorder and embedding the tree by a binary encoding of the node indices, yields an embedding in which a parent node and its left descendant are at distance 1 , the parent and its right descendant are at distance 2 , and the right and left descendants are at distance 1 from each other.

Theorem 4.18 [16] An embedding of a complete binary tree of $n$-height in a $Q_{n}$, by labeling the tree nodes in inorder and embedding the tree by a BRGC encoding of the node indices, yields an embedding in which a leaf node is at distance 1 from its parent node and all other nodes are at distance 2 from their respective parent node. Left and right descendants of a node are always at distance 2 from each other.

### 4.4.2 Embedding with dilation 1

Wu [30] presents the embeddability of binary tree of $n$-height into $n+1$-dimendional hypercube using a definition of the free-free neighbor, which is outlined in Proof 1 of Theorem 4.20. Johnsson [16] presents alternative proof of the same theorem. We reformulate his proof.
Definition 4.19 [30] Let $f_{p}$ be a map from binary tree of height $p$ into hypercube $Q_{p+1}$. The free-free neighbor property is defined by if $R=f_{p}$ (binary tree of height $p$ ) has a free
neighbor $R_{1}$ and $R_{1}$ has a free neighbor $R_{2}$, then $\left\{R_{1}, R_{2}\right\}$ is not a subset of $f_{p}$ (nodes of binary tree of height $p$ ).
Theorem 4.20 [16] [30] The binary tree of height $n$ can be embedded into an ( $n+1$ )dimensional hypercube $Q_{n+1}$ with dilation 1 .

Proof 1 The following inductive algorithm proves the result.
Algorithm 4.21: Tree embedding [30]
Step 1 Left subtree of height $(n-1)$ is embedded by $f_{n-1}$ into $0 Q_{n}$ of $Q_{n+1}$. A node $0 L=f_{n-1}$ (root of left subtree) has a free neighbor $0 L_{1}$, which has a free neighbor $0 L_{2}$.
Step 2 Right subtree of height $(n-1)$ is also embedded by $g_{n-1}$ into $1 Q_{n}$ of $Q_{n+1}$. A node $1 R=g_{n-1}$ (root of right subtree) has a free neighbor $1 R_{1}$, which has a free neighbor $1 R_{2}$.

Step 3 A hypercube $Q_{n+1}$ can be constructed by combining $0 Q_{n}$ and $1 Q_{n}$ in such a way that $0 L_{1}$ is a neighbor of $1 R$ and $0 L_{2}$ is a neighbor of $1 R_{1}$. The node $0 L_{1}$ of $Q_{n+1}$ corresponds to an embedded node from the root of the tree.

Proof 2 The addressing for embedding complete binary tree of height $n$ into an $(n+1)$ dimensional hypercube $Q_{n+1}$ can be realized by the following procedure:

## Algorithm 4.22 Addressing of tree embedding

Step 1 Allocate root node of the tree of height 1 to an address 00 in $Q_{2}$. For embedding of the tree of height 2 into $Q_{3}$, allocate left-leaf and right-leaf nodes to 010 (in subcube $01 *$ in $Q_{3}$ ) and 100 (in subcube $10 *$ in $Q_{3}$ ), respectively. The root address 00 in $Q_{2}$ can be readdressed into 000 in $Q_{3}$.

Step 2 For embedding the tree of $(n+1)$ height into $(n+2)$-dimensional hypercube $Q_{n+2}$ using embedded addresses about the $n$-height tree into $Q_{n+1}$, if leaf-tree node of the $(n+1)$-height tree is left child of the leaf node of the $n$-height tree, allocate these left children to nodes in a subcube $0 Q_{n+1}$ of $Q_{n+2}$ such that nodes in a subcube $01(0 * \cdots * 1 * \cdots *$ in $Q_{n+1}$ ) are mapped on a subcube 11 (in $Q_{n+1}$ ), nodes in a subcube 11 (in $Q_{n+1}$ ) are mapped on subcube 10 (in $Q_{n+1}$ ), and nodes in a subcube 10 (in $Q_{n+1}$ ) are mapped on the same subcube 10 (in $Q_{n+1}$ ) without conflicting with previously allocated addresses.

Step 3 If the leaf-tree node of the $(n+1)$-height tree is right child of the leaf node of the $n$-height tree, allocate these right children to nodes in a subcube $1 Q_{n+1}$ of $Q_{n+2}$ such that nodes in a subcube 01 (in $Q_{n+1}$ ) are mapped on 01 (in $Q_{n+1}$ ), nodes in 11 on 11, and nodes in 10 on 10.

Step 4 Concatenate 0 to the leftmost bit of addresses of all nodes except for embedded
leaf nodes in order to embed these nodes into a subcube $0 Q_{n+1}$ of $Q_{n+2}$.
Lemma 4.23 Assume that $L_{n}$ represent leaf nodes in the $n$-height tree. By Algorithm 4.22, leaf nodes ( $L_{n} / 2, L_{n} / 4, L_{n} / 4$ ) can be respectively allocated to subcubes $(01,11,10)$ in $Q_{n+1}$ for every natural number $n$.

Proof We prove by induction. Using Step 1 of Algorithm 4.21, ( $\left.L_{n} / 2, L_{n} / 4, L_{n} / 4\right)$ are mapped from subcubes $(01,11,10)$ of $Q_{n+1}$ on subcubes $(11,10,10)$ of $Q_{n+1}$ in $0 Q_{n+1}$. Similarly using Step 2 of Algorithm $4.21,\left(L_{n} / 2, L_{n} / 4, L_{n} / 4\right)$ are mapped from subcubes $(01,11,10)$ of $Q_{n+1}$ on subcubes $(01,11,10)$ of $Q_{n+1}$ in $1 Q_{n+1}$. Since, in $0 Q_{n+1}$, allocated leaf nodes are ( $L_{n} / 2, L_{n} / 4, L_{n} / 4$ ) in subcubes $(011,010,010)$ of $0 Q_{n+1}$, and $\left(L_{n} / 2, L_{n} / 4, L_{n} / 4\right)$ in subcubes $(101,111,110)$ of $1 Q_{n+1}$, then allocated leaf nodes are $\left(L_{n+1} / 2, L_{n+1} / 4, L_{n+1} / 4\right)=\left(L_{n} / 2+L_{n} / 4+L_{n} / 4, L_{n} / 4+L_{n} / 4, L_{n} / 2\right)$ in subcubes (01*, 11*, $\left.10 *\right)$, that is, $(01,11,10)$, of $Q_{n+2}$. ㅁ
Lemma 4.24 Algorithm 4.22 enables addressing without conflict with previously addressed nodes.

Proof Suppose that $\boldsymbol{\mu}_{\text {s }}$ represent $2^{n}$ leaf nodes of $(n+1)$-height tree, $\delta$ s represent $2^{n-1}$ leaf nodes of $n$-height tree, $\nabla_{\mathrm{s}}$ represent $2^{n-2}$ leaf nodes of $(n-1)$-height tree, and $\boldsymbol{\omega}$ represent $2^{n-3}$ leaf nodes of $(n-2)$-height tree. At Step 2 of Algorithm 4.22, consider four subcubes $00,01,11,10$ in $0 Q_{n+1}$. Assume that $n$-height tree is embedded into $Q_{n+1}$ without conflict. For mapping from 01 to 11 in $0 Q_{n+1}$, since there is no conflict between $\diamond s$ and $\nabla_{s}$ in 01 , then $\boldsymbol{\varphi}_{\mathrm{s}}$ in 11 (mapped from $\nabla_{s}$ in 01 ) and $\nabla_{s}$ in 11 (from $\nabla_{\text {in }} 01$ ) do not conflict. Next consider mapping from 11 to 10 in $0 Q_{n+1}$. Locations of $\nabla$ s in 11 are equivalent to locations of $\nabla_{s}$ in 01 , and equivalent to locations of $\boldsymbol{\omega}_{\mathrm{s}}$ in 00 . Since $\omega_{\mathrm{s}}$ and $\boldsymbol{\omega}$ s do not conflict in 00 , then $\rangle s$ in $10\left(\right.$ from $\nabla_{s}$ in 00$)$ and $\boldsymbol{\phi}_{\mathrm{s}}$ in 10 (from $\diamond \mathrm{s}$ in 11 ) do not conflict. Next consider mapping from 10 to 10 in $0 Q_{n+1}$. Locations of previously allocated $\nabla_{s}$ in 10 and previously allocated $\boldsymbol{\mu}_{\mathrm{s}}$ in 10 are locations of $\nabla_{s}$ and $\boldsymbol{\phi}_{\mathrm{s}}$ in 00 , respectively. Since in 00 mapping is possible from $V_{s}$ to other nodes without conflict with $\boldsymbol{\omega}$ (see Lemma 4.25), then in 10 mapping is possible from $\diamond s$ to other nodes without conflict with previously allocated $\boldsymbol{\$}^{\mathbf{s}}$. For mapping from 01 in $0 Q_{n+1}$ to 01 in $1 Q_{n+1}$, mapping from $\diamond s$ to $\boldsymbol{Q}_{\mathrm{s}}$ is obviously without conflict. Mappings from 11 (or 10 ) in $0 Q_{n+1}$ to 11 (or 10 ) in $1 Q_{n+1}$ is similarly without conflict.

Lemma 4.25 In a subcube 00 of Lemma 4.24, mapping is possible from $V_{s}$ to other nodes without conflict with $\boldsymbol{\uparrow}$.

Proof Divide a subcube 00 into four smaller subcubes $00,01,11,10$. By mapping rules
$10 \rightarrow 00,11 \rightarrow 10$ or 11 , mapping is possible from $V_{\mathrm{s}}$ to other nodes without conflict with $\mathrm{m}_{\mathrm{s}}$.

From Lemma 4.23 and Lemma 4.24, Proof 2 of Theorem 4.20 is completed.

## 5 Concluding Remarks

This paper surveys topological structure of the hypercube and embedding of several data structures into the hypercube. Topics in complexity of tree embedding [28] [29], odd-even shift [13] [23], communication [17], synchronization [21] [25], relevant interconnection [1] [31], fault tolerance [11], and applications [2] [3] [18] of the hypercube can be found in the respective literature.

## References

[1] F. Annexstein, M. Baumslag, and A. L. Rosenberg, Group Action Graphs and Parallel Architectures, SIAM J. Computing, 19, (3), 544-569, 1990.
[2] C. K. Baru and O. Frieder, Database Operations in a Cube-Connected Multiprocessor System, IEEE Transaction on Computers, C-38, (6), 920-927, 1989.
[3] C. K. Baru and P. Goel, Squashed Embedding of E-R Schemas in Hypercubes, Journal of Parallel and Distributed Computing 8, 340-348, 1990.
[4] J. C. Bermond and C. Peyrat, De Bruijn and Kautz Networks: A Competer for the Hypercube and Distributed Computers, Edited by A. Andre and J. P. Verjus, Elsevier Science Publishers B. V. (North Holland), 1989.
[5] L. N. Bhuyan and D. P. Agrawal, Generalized Hypercube and Hyperbus Structures for a Computer Network, IEEE Transactions on Computers, C-33, 4, 323-333, 1984.
[6] M. Y. Chan and F. Y. L. Chin, On Embedding Rectangular Grids in Hypercubes, IEEE Transactions on Computers, 37, (10), 1285-1288, 1988.
[7] T. F. Chan and Y. Saad, Multigrid Algorithms on the Hypercube Multiprocessor, IEEE Transaction on Computers, C-35, (11), 969-977, 1986.
[8] M. S. Chen and K. Shin, Processor Allocation in an N-Cube Multiprocessor Using Gray Codes, IEEE Transaction on Computers, C-36, (12), 1396-1407, 1987.
[9] M. S. Chen and K. Shin, On Relaxed Squashed Embedding of Graphs into a Hypercube, SIAM J. Computing, 8, (6), 1226-1244, 1989.
[10] M. S. Chen and K. Shin, Subcube Allocation and Task Migration in Hypercube Multiprocessors, IEEE Transaction on Computers, 39, (9), 1146-1155, 1990.
[11] M. S. Chen and K. Shin, Depth-First Search Approach for Fault Tolerant Routing in Hypercube Multicomputers, IEEE Transaction on Parallel and Distributed Systems, 1, (2), 152-159, 1990.
[12] W. J. Dally, A VLSI Architecture for Concurrent Data Structures, Kluwer Academic Publishers, 1987.
[13] Z. Fang, X. Li, and L. M. Ni, Parallel Algorithms for Image Template Matching on Hypercube SIMD Computers, Proceedings IEEE Computer Society Workshop
on Computer Architecture for Pattern Analysis and Image Database Management, 1985.
[14] C. T. Ho and S. L. Johnsson, Embedding Meshes in Boolean Cubes by Graph Decomposition, Journal of Parallel and Distributed Computing, 8, 325-339, 1990.
[15] H. J. Gray and P. V. Lavonian, An Analog-to-Digital Converter for Serial Computing Machines, Proceedings of the I.R.E., 41, (10), 1462-1465, October, 1953.
[16] S. L. Johnsson, Communication Efficient Basic Linear Algebra Computations on Hypercube Architectures, Journal of Parallel and Distributed Computing, 4, 133172, 1987.
[17] S. L. Johnsson and C. T. Ho, Optimum Broadcasting and Personalized Communication in Hypercubes, IEEE Transaction on Computers, C-38, (9), 1249-1267, 1989.
[18] A. J. Laub, Hypercube Implementation of Some Parallel Algorithms in Control, NATO ASI Series, Vol. F47: Advanced Computing Concepts and Techniques in Control Engineering, Edited by M. J. Denham and A. J. Laub, Springer-Verlag Berlin, Heiderberg, 1988.
[19] S. Matic, Emulation of Hypercube Architecture on Nearest Neighbor Mesh-Connected Processing Elements, IEEE Transaction on Computers, 39, (5), 698-700, 1990.
[20] M. Nagata, S. Fukuda, and K. Kihara, Design and Implementation of a Multiprocessor with a Hypercube Interconnection Network, Proceedings of IECON'89, 1989.
[21] D. Peleg and J. Ullman, An Optimal Synchronizer for the Hypercube, SIAM J. Computing, 15, (4), 740-747, 1989.
[22] F. P. Preparata and J. Vuillemin, The Cube-Connected Cycles: A Versatile Network for Parallel Computation, Communication of the ACM, 24, (5), 300-309, 1981.
[23] S. Ranka and S. Sahni, Odd Even Shifts in SIMD Hypercubes, 1, (1), 1990.
[24] Y. Saad and M. H. Schultz, Topological Properties of Hypercubes, IEEE Transaction on Computers, 37, (7), 867-872, 1988.
[25] Y. Saad and M. H. Schultz, Data Communication in Hypercubes, Journal of Parallel and Distributed Computing, 6, 115-135, 1989.
[26] M. R. Samatham and D. K. Pradhan, The De Bruijn Multiprocessor Network: A Versatile Parallel Processing and Sorting Network for VLSI, IEEE Transaction on Computers, 38, (4), 567-581, 1989.
[27] D. S. Scott and J. Brandenburg, Minimal Mesh Embeddings in Binary Hypercubes, IEEE Transaction on Computers, 37, (10), 1284-1285, 1988.
[28] A. Wagner, Embedding Arbitrary Binary Trees in a Hypercube, Journal of Parallel and Distributed Computing, 7, 503-520, 1989.
[29] A. Wagner and D. G. Corneil, Embedding Trees in a Hypercube is NP-Complete, SIAM J. Computing, 19, (4), 570-590, 1990.
[30] A. Y. Wu, Embedding of Tree Networks into Hypercubes, Journal of Parallel and Distributed Computing, 2, 238-249, 1985.
[31] A. S. Youssef and B. Narahari, The Banyan-Hypercube Networks, IEEE Transaction on Parallel and Distributed Systems, 1, (2), 160-169, 1990.

