Skip to main content
Article thumbnail
Location of Repository

Efficient routing mechanisms for Dragonfly networks

By Marina García, Enrique Vallejo, Julio Ramón Beivide Palacio, Miguel Odriozola and Mateo Valero Cortés

Abstract

High-radix hierarchical networks are cost-effective topologies for large scale computers. In such networks, routers are organized in super nodes, with local and global interconnections. These networks, known as Dragonflies, outperform traditional topologies such as multi-trees or tori, in cost and scalability. However, depending on the traffic pattern, network congestion can lead to degraded performance. Misrouting (non-minimal routing) can be employed to avoid saturated global or local links. Nevertheless, with the current deadlock avoidance mechanisms used for these networks, supporting misrouting implies routers with a larger number of virtual channels. This exacerbates the buffer memory requirements that constitute one of the main constraints in high-radix switches. In this paper we introduce two novel deadlock-free routing mechanisms for Dragonfly networks that support on-the-fly adaptive routing. Using these schemes both global and local misrouting are allowed employing the same number of virtual channels as in previous proposals. Opportunistic Local Misrouting obtains the best performance by providing the highest routing freedom, and relying on a deadlock-free escape path to the destination for every packet. However, it requires Virtual Cut-Through flow-control. By contrast, Restricted Local Misrouting prevents the appearance of cycles thanks to a restriction of the possible routes within super nodes. This makes this mechanism suitable for both Virtual Cut-Through and Wormhole networks. Evaluations show that the proposed deadlock-free routing mechanisms prevent the most frequent pathological issues of Dragonfly networks. As a result, they provide higher performance than previous schemes, while requiring the same area devoted to router buffers.This work has been supported by the Spanish Ministry of Science under contracts TIN2010-21291-C02-02, TIN2012-34557, and by the European HiPEAC Network of Excellence. The research leading to these results has received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP/2007-2013) / ERC Grant Agreement n. ERC-2012-Adg-321253-RoMoL. M. Garc´ıa and M. Odriozola participated\ud in this research work while they were affiliated\ud with the University of Cantabria.Peer ReviewedPostprint (author's final draft

Topics: Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors, Routing (Computer network management), Telecommunication -- Traffic -- Management, Deadlock avoidance, Dragonfly networks, Routing, System recovery, Ports (Computers), Topology, Network topology, Proposals, Adaptive systems, Encaminadors (Xarxes d'ordinadors), Telecomunicació -- Tràfic -- Gestió
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Year: 2013
DOI identifier: 10.1109/ICPP.2013.72
OAI identifier: oai:upcommons.upc.edu:2117/90617

Suggested articles


To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.