# A Comparatively Analysis and Performance of Logical Topologies of Embedded Hypercube (LTOEH) Interconnection Network

Mohd. Kalamuddin Ahmad Research Scholar Department of CS Mewar University, India Mohd. Husain, PhD Department of CS, APJ Tech. University, Lucknow,India (Formerly UP Tech. University, Lucknow)

# ABSTRACT

This paper is considered an important issue in the design of PE-Interconnection networks for massively parallel computing scalability. A detailed analysis shows that the double loop hypercube DLH (m,d) network is compared with mesh and torus embedded hypercube, torus embedded hypercube a better interconnection network in the properties of small diameter and highly connectivity, simple routing, scalability, constant node degree of topology, and the performance of communication. Show how to design the good interconnection network in parallel Architecture for less density as well as diameter to route the data to different path. Also it has been proved with the computational results of entire system that the embedded hypercube interconnection networks build is highly scale up in terms of communication. A complete design analysis and comparison of network with various other networks is given using different network parameters optimal of torus architecture rather than mesh architecture.

#### **Keywords**

Hypercube network, Topology, DLH, Data routing path, Embedded network, Network parameters and Scalability, Network metrics.

## **1. INTRODUCTION**

Solve a given problem more rapidly, or to enable the solution of a problem that would otherwise be impracticable by a single worker. The way nodes are connected to one another nodes in the machines, in direct network architecture, each node has a point to point, or direct, connection to some number of nodes, called the neighboring nodes. Direct network have become popular architecture for constructing massively parallel computers because they scale well, that is the number of nodes in the system increases as well as communication bandwidth and processing capability of the system also increase [1], [3], and [11].

In direct network architecture, neighboring nodes may send packets to one another directly, while nodes that are not directly connected must rely on other nodes in the network to transfer packets from source to destination. Although a router's function could be performed by the corresponding local processor, dedicated routers are used to allow overlapped.

Computation as well as communication within each node, router supports some number of input and output channels. Internal channels connect the local processor memory to the

router. External channels are used for communication between routers and nodes. By connecting the input channels of one node to the output channels of other nodes, the topology of the direct network will be defined. For topologies in which packets may have to traverse some intermediate nodes, the routing algorithm determines the path selected by a packet to reach its destination. At each node, the routing algorithm indicates the next channel to be used. Efficient routing is critical to the performance of interconnection network [1], [6], [7], [9], [10], and [12].

In this paper the proposed work, how to route the data in massive parallel computer architecture with respect to reduce the diameter.

# 2. PROPERTIES OF EMBEDDED NETWORKS

The interconnection network is a vital role in a parallel processing. A good interconnection network is expected to have least number of links, topological network cost and more reliable. The interconnection network must be able to built scale up .The data routing functions in embedded hypercube network could be analyzed [3],[4,[5],[9], [11].and [12].

#### 2.1 Mesh embedded

A Single stage recirculating represent network. In network, each PE i is allowed to send to any one of PE i+1, PE i-1, PE i+r and PE i-r, Where r = N in one circulating steps through entire network [5], [8], and [11].

The following routing function from (1) to (5) apply on simple mesh network [8] –

$$R_{+1}(i) = (i+1) \mod N$$
 (1)

$$R_{-1}(i) = (i-1) \mod N \tag{2}$$

$$R_{+r}(i) = (i+r) \mod N \tag{3}$$

$$R_{-r}(i) = (i-r) \mod N \tag{4}$$

$$R_{C}(k_{n-1},\ldots,k_{d+1},k_{d},k_{d-1},\ldots,k_{0})$$

$$= (k_{n-1} \dots k_{d+1} \ k'_{d} \ k_{d-1} \ \dots k_{0})$$
(5)

Where  $0 \le i \le N-1$ , Commonly N know as perfect Square.



Figure: 1. (4, 4, 8) Mesh Embedded Architecture

### 2.2 Torus embedded

Now suppose consider  $a \times b$  be the size of some concurrent torus networks with a number of rows and b number of columns and N being the number of nodes connected in the hypercube, the torus embedded hypercube network can be designed with the size of (a, b, N). Nodes with identical positions in the torus networks will be a group of N number of nodes connected in the hypercube configuration and can be addressed with three parameters such as row number i, column number j of torus and address of node k in hypercube where the addressed node is residing. Hence, a (a, b, N)–torus embedded hypercube network will have  $a \times b \times N$  number of nodes and a node with address as (i, j, k) where  $0 \le i < a, 0 \le j$ 

< b and  $0 \le k < N$ . The data routing functions of torus embedded hypercube as follow-

- $R_{1}(i, j, k) = (i, (j+1) \mod b, k)$  (6)
- $R_{2}(i, j, k) = (i, (b+j-1) \mod b, k)$  (7)
- $R_{3}(i, j, k) = ((i+1) \mod a, j, k)$  (8)
- $R_4(i, j, k) = ((a+i-1) \mod a, j, k)$  (9)

And also used hypercube routing function equation ( 5 ) as above mentioned in the mesh embedded.



Figure: 2. (4, 4, 8) Torus Embedded Hypercube Network

The end to end connections of row and column of each connected in torus but are not represent in Figure 2. A wraparound connection is making along each row or column if

#### 2.3 Double loop embedded hypercube

The total number of nodes in the DLH (m, d) is  $4m\times 2d$ . When m=2, the DLH(m,d) can be constructed by combining the positive features of the hypercube topology, such as small diameter, high connectivity, symmetry and simple routing, and the scalability and constant node degree of the DL(2m) topology as follows [2],[14], and [15]:

- The 2<sup>d</sup> nodes can be connected to be a d dimensional hypercube network according routing equation (5), where d is denoted the dimensional hypercube networks.
- 2) The 4m such kinds of d dimensional hypercube networks can be divided into 2m groups, in which any group can be coded with a group-id, which adopts Johnson code from 0 to 2m, and any d

they have equal label as a complete of torus embedded hypercube network (4, 4, 8).

dimensional hypercube network in a same group can also be coded with a net-id using 0 or 1.

- 3) The nodes with both the same node-id in different groups can be connected to a DL (2*m*) according to *d*.
- 4) The code of nodes in the DLH (m, d) when there is just one bit different between any two nodes, there will exist a link between them, that is to say, these two nodes are neighboring to each other.

Figure 3. Illustrate a DLH (m, d) interconnection where solid lines represent hypercube links and thin lines represent



Figure: 3. Double Loop Embedded Hypercube

DL (2m) links.. The size of the DLH(m,d) can grow without altering the number of links per node by expanding the size of the TL(2m); for example, by adding four three-cubes on the perimeter of the TL(2m) in figure . A DLH (4, 3) consists of  $4 \times 4 \times 23 = 128$  nodes. It can be viewed as eight concurrent DL

(2m) where eight nodes having identical DL (2m) addresses form one three-cube. Alternatively, it can be viewed as 16 concurrent three-cubes in which 16 nodes having identical hypercube addresses form a DL (2m).

# 3. RESULTS AND DISCUSSION

## 3.1 Node degree

Table-1 and figure 4, Gives the comparison of node degree as a function of the number of processors of various other popular networks for parallel computing system.

| Types of Network | Number of Processors |        |         |        |        |         |         |  |
|------------------|----------------------|--------|---------|--------|--------|---------|---------|--|
|                  | 128                  | 256    | 512     | 1024   | 2048   | 4096    | 8192    |  |
| Ring             | 2                    | 2      | 2       | 2      | 2      | 2       | 2       |  |
| Linear Array     | 2                    | 2      | 2       | 2      | 2      | 2       | 2       |  |
| 2D-Mesh          | 4                    | 4      | 4       | 4      | 4      | 4       | 4       |  |
| Illiac-Mesh      | 4                    | 4      | 4       | 4      | 4      | 4       | 4       |  |
| 2D-Torus         | 4                    | 4      | 4       | 4      | 4      | 4       | 4       |  |
| Hypercube        | 7                    | 8      | 9       | 10     | 11     | 12      | 13      |  |
| k-ary-n Cube     | 14, k=2              | 16,k=2 | 18, k=2 | 20,k=2 | 22,k=2 | 24, k=2 | 26, k=2 |  |

#### Table 1: Comparative Results of Node Degree, For Different Topology

The comparison indicates that the hypercube and k-ary n-cube interconnection networks are more expensive used in parallel computer, and it's not suitable for parallel architecture. For these two networks, the node degree increases drastically as the system expansion takes place. Obviously the cost per node also increases tremendously for these two networks as the system is scaled up shown in figure 4.



Figure: 4. Node degrees Analysis

# 3.2 Total number of links

Table 2: Comparative Results of Number of Links for Network Topology

| Types of Network | Number of Processors |          |          |           |           |           |           |  |  |
|------------------|----------------------|----------|----------|-----------|-----------|-----------|-----------|--|--|
|                  | 128                  | 256      | 512      | 1024      | 2048      | 4096      | 8192      |  |  |
| Ring             | 128                  | 256      | 512      | 1024      | 2048      | 4096      | 8192      |  |  |
| Linear Array     | 127                  | 255      | 511      | 1023      | 2047      | 4095      | 8191      |  |  |
| 2D-Mesh          | 234                  | 480      | 980      | 1984      | 4006      | 8064      | 16203     |  |  |
| Illiac -Mesh     | 256                  | 512      | 1024     | 2048      | 4096      | 8196      | 16384     |  |  |
| 2D-Torus         | 256                  | 512      | 1012     | 2048      | 4096      | 8192      | 16384     |  |  |
| Hypercube        | 64                   | 128      | 256      | 512       | 1024      | 2048      | 53248     |  |  |
| k-ary-n Cube     | 896,k=2              | 2048,k=2 | 4608,k=2 | 10240,k=2 | 22528,k=2 | 28672,k=2 | 1.06e+005 |  |  |

Table-2 and Figure 5 gives the comparison of Total number of wires or links for a PE-Interconnection network is expected to be in above table. In table 2 to reflects link complexity and directly related to the cost of the networks.

Table 2 shows that the number of links w.r.t. the design of simple network scaling in the parallel computing. It is

observed that the total number of connection of two dimensional static topologies like linear array, 2D-mesh, 2Dtorus, and hypercube lies in between ring and k-ary n-cube. It is to be noted that the k-ary n-cube offers larger number of links network as illustrate in figure 5.





# 3.3 Network diameter analysis

Table-3 and Figure 6 shows the comparison of network diameter at different topology, and also Table 4 and figure 7 shows the comparison of network as a function of the number of processors of various other popular networks along with the embedded hypercube network worth to mention that due to their bottleneck performance as per the last comparisons, in the table 3 indicates lower diameter of the topology are

#### mesh, torus Iliac mesh and hypercube. Therefore lower diameter topologies are used in parallel computing for PE-Interconnecting device but table 4 and figure 7 gives comparative analysis between mesh and torus embedded hypercube networks, mesh embedded hypercube are dropped because due to high diameter.

| Types of<br>Network | Number of Processors |     |     |      |      |      |      |  |  |
|---------------------|----------------------|-----|-----|------|------|------|------|--|--|
|                     | 128                  | 256 | 512 | 1024 | 2048 | 4096 | 8192 |  |  |
| Ring                | 64                   | 128 | 256 | 512  | 1024 | 2048 | 4096 |  |  |
| Linear Array        | 127                  | 255 | 511 | 1023 | 2047 | 4095 | 8191 |  |  |
| 2D-Mesh             | 21                   | 30  | 43  | 62   | 89   | 126  | 1179 |  |  |
| Illiac -Mesh        | 10                   | 15  | 22  | 31   | 44   | 63   | 90   |  |  |
| 2D-Torus            | 12                   | 16  | 24  | 32   | 11   | 64   | 92   |  |  |

Table 3: Comparative Results of Diameter at different network topology

| Hypercube    | 7      | 8     | 9      | 10     | 11     | 12      | 13      |
|--------------|--------|-------|--------|--------|--------|---------|---------|
| k-ary-n Cube | 7, k=2 | 8,k=2 | 9, k=2 | 10,k=2 | 11,k=2 | 12, k=2 | 13, k=2 |



# Analysis of Diameter at different topologies

Figure: 6. Analysis of Diameter of different Topology

| Table 4: Comparative | <b>Results of Diameter fo</b> | or Embedded Network |
|----------------------|-------------------------------|---------------------|
|----------------------|-------------------------------|---------------------|

| Types of Network                       | Number of Processors |         |         |         |          |          |
|----------------------------------------|----------------------|---------|---------|---------|----------|----------|
|                                        | 512                  | 1024    | 2048    | 4096    | 8192     | 16384    |
| (8,8, N) Mesh Embedded Hypercube       | 17,N=8               | 18,N=16 | 19,N=32 | 20,N=64 | 21,N=128 | 22,N=256 |
| (16,16, N) Mesh Embedded<br>Hypercube  | 31,N=2               | 32,N=4  | 33,N=8  | 34,N=16 | 35,N=32  | 36,N=64  |
| (32,32, N) Mesh Embedded<br>Hypercube  |                      | 64,N=1  | 65,N=2  | 66,N=4  | 67,N=8   | 68,N=16  |
| (8,8, N) Torus Embedded Hypercube      | 11,N=8               | 12,N=16 | 13,N=32 | 14,N=64 | 15,N=128 | 16,N=256 |
| (16,16, N) Torus Embedded<br>Hypercube | 17,N=2               | 18,N=4  | 19,N=8  | 20,N=16 | 21,N=32  | 22,N=64  |

| (32,32,N)Torus Embedded          |         | 32,N=1  | 33,N=2   | 34,N=4     | 35,N=8    | 36,N=16   |
|----------------------------------|---------|---------|----------|------------|-----------|-----------|
| Hypercube                        |         |         |          |            |           |           |
|                                  |         |         |          |            |           |           |
| (m,d) Double Loop Hypercube, d=3 | 20,m=16 | 36,m=32 | 68 ,m=64 | 132, m=128 | 260,m=256 | 516,m=512 |
|                                  |         |         |          |            |           |           |



#### Analysis of Diameter at different Embedded Hypercube Networks



In the results of embedded architecture given in Table 4 and Figure 7 as far as the network diameter is concerned, the torus embedded hypercube network required needs minimum network diameter to get interconnected between a source node and a destination node.

We mentioned the required the total diameter from source to destination is-

Mesh D s-d = Diameter of two dimensional mesh Diameter of hypercube

$$= 2(\sqrt{p} - 1) + \log p'$$
 (10)

Where p is no. of processor in two dimensional mesh, and p' is no. of processor in hypercube.

We are also mentioned here Torus embedded hypercube diameter from source to destination-

Torus  $D \text{ s-d} = Diameter of two dimensional torus}$ Diameter of hypercube

$$= \sqrt{p} + \log p' \tag{11}$$

The diameter of the Double loop hypercube DLH (m, d) [12] is-

)

Diameter of DLH (m, d) = m + d + 1(12)

Where p is no. of processor in two dimensional Torus, p' is no. of processor in hypercube. (Here we mentioned the total

number of processors in two dimensional mesh or torus embedded hypercube = p + p')

In comparatively analysis -diameter of (8, 8, N) mesh embedded hypercube system are same as (16, 16, N) torus embedded hypercube system, and also diameter of (16, 16, N) mesh embedded hypercube system are same as (32, 32, N) torus embedded hypercube system, shown in figure 5.

Hence the torus embedded hypercube network is much better than mesh embedded hypercube network and double loop hypercube DLH (m, d) as far as performance metrics and speed up than mesh embedded hypercube as well as Double Loop Hypercube.

#### 4. CONCLUSION

This paper has considered a PE-Interconnection network is a combination of hypercube and torus network topologies. In this paper is discussing the node degree, no. of links, and diameter of static network without and with embedding of hypercube. The analysis of results provide an idea about that torus embedded hypercube interconnection network is highly scalable, this network structure provides a immense architectural support for massive parallel computing. The growth of the network is more efficient in terms of communication and speed up. The comparison shows that the n-cube hypercube and K-ary n-cube interconnection networks are expensive and not suitable for parallel architecture. For these two networks, the node degree increases dramatically as the system spreading out takes place. The number of links with respect to the scaling design of simple network. It is observed that the total number of connection for two Dimensional static network like Mesh, Torus, Illiac-Mesh, and Ring lies in between Linear array and k-ary n-cube

# 5. REFERENCES

- K. Hwang, Advanced Computer Architecture: Parallelism, Scalability, Programmability. New York McGraw-Hill, 1993.
- [2] Kim Jong-Seok, Lee Hyeong-Ok and Heo Yeong-Nam, "Embedding among HCN(n,n), HFN(n,n) and hypercube", Proceedings of Eighth International conference on parallel and distributed systems (ICPADS)-2001 pp 533 – 540, 26-29 June 2001.
- [3] A. Louri and H. Sung, "An Optical Multi-Mesh Hypercube: A Scalable Optical Interconnection Network for Massively Parallel Computing," *IEEE J. Lightwave Technology*, vol. 12, pp. 704–716, Apr. 1994.
- [4] N. Gopalakrishna Kini, M. Sathish Kumar and Mruthyunjaya H.S., "A Torus Embedded Hypercube Scalable Interconnection Network for Parallel Architecture," IEEE explore conference publications, Mar.2009, pp.858-861.
- [5] Ahmed Louri and Hongki Sung, "An Optical Multi-Mesh Hypercube: A Scalable Optical interconnection Network for Massively Parallel Computing," Journal of Light wave Technology, Vol.12, No.4, Apr.1994, pp.704-716.
- [6] Ahmed Louri and Hongki Sung, "An OpticalMultiMeshHypercube:AScalableOpticalinterconn ection Network for Massively Parallel Computing," Journal of Light wave Technology, Vol.12, No.4, Apr.1994, pp.704-716.
- [7] L.M. Ni, and P.K. McKinley, "A Survey of Wormhole Routing Techniques in Direct Networks", IEEE Computer, vol.26, no.2, pp. 62-76, Feb. 1993.
- [8] M. K. Ahamad, M. Husain." Required Delay of Packet Transfer Model for Embedded Interconnection Network" International Journal of Engineering Research &

network. It is preferred to have a network with a network diameter of minimum value. The result of comparison shows that Torus and mesh network, mesh has the large network diameter. Further, for this network, the network diameter increases tremendously as the system is scaled up.

Technology (IJERT) Vol. 2 Issue 1, January- 2013 ISSN: 2278-0181.

- [9] M. K. Ahamad, M. Huasain, A.A.Zilli ,"A Statistical Analysis and Comparative Study of Embedded Hypercube ", International Journal of Computer Applications (0975 – 8887) Volume 103 – No 16, October 2014
- [10] G.M. Chiu, "The Odd-Even Turn Model for Adaptive Routing,"IEEE Trans. Parallel and Distributed Systems, vol. 11, no. 7, pp. 729738, July 2000., No.11, Nov.1994.
- [11] W.J. Dally, "Performance Analysis of k-ary n-cube Interconnection Networks," IEEE Trans. Computers, vol. 39, no. 6, pp. 775-785, June 1990.
- [12] Zhaoyang Li, Yi Zhang, Yu Chen and Ruichun Tang, "Design and implementation of a high-performance interconnection network," Proceedings of the Fourth International Conference on Parallel and Distributed Computing, Applications and Technologies, 2003, PDCAT'2003, 27-29 Aug.2003, pp.17-20.
- [13] Hesham El-Rewini and Mostafa Abd-El-Barr, "Advanced Computer Architecture and Parallel Processing," John Wiley & Sons, Inc., Hoboken, New Jersey, 2005.
- [14] Ahmed Louri and Hongki Sung, "A scalable optical hypercube-based interconnection network for massively parallel computing," Applied optics, Vol.33.
- [15] Y. Saad and M.H. Schultz, "Data Communication in Parallel Architectures," Parallel Computing, vol. 11, pp. 131-150, 1989.
- [16] S.L. Johnsson and C.T. Ho, "Optimum Broadcasting and Personalized Communication in Hypercubes", IEEE Trans. Computers, vol. 38, no. 9, pp. 1249-1268, 1989.