Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJune 2023
GRAP: Group-level Resource Allocation Policy for Reconfigurable Dragonfly Network in HPC
ICS '23: Proceedings of the 37th ACM International Conference on SupercomputingPages 437–449https://doi.org/10.1145/3577193.3593732Dragonfly is a highly scalable, low-diameter, and cost-efficient network topology, which has been adopted in new exascale High Performance Computing (HPC) systems. However, Dragonfly topology suffers from the limited direct links between groups. The ...
- research-articleApril 2023
Modeling Coordinate Transformations in the Dragonfly Nervous System
NICE '23: Proceedings of the 2023 Annual Neuro-Inspired Computational Elements ConferencePages 6–10https://doi.org/10.1145/3584954.3584959Coordinate transformations are a fundamental operation that must be performed by any animal relying upon sensory information to interact with the external world. We present a neural network model that performs a coordinate transformation from the ...
- research-articleJune 2022
Optimized MPI collective algorithms for dragonfly topology
ICS '22: Proceedings of the 36th ACM International Conference on SupercomputingArticle No.: 14, Pages 1–11https://doi.org/10.1145/3524059.3532380The Message Passing Interface (MPI) is the most prominent and dominant programming model for scientific computing in super-computing systems today. Although many general and efficient algorithms have been proposed for MPI collective operations, there is ...
- research-articleJune 2022
A software-defined tensor streaming multiprocessor for large-scale machine learning
- Dennis Abts,
- Garrin Kimmell,
- Andrew Ling,
- John Kim,
- Matt Boyd,
- Andrew Bitar,
- Sahil Parmar,
- Ibrahim Ahmed,
- Roberto DiCecco,
- David Han,
- John Thompson,
- Michael Bye,
- Jennifer Hwang,
- Jeremy Fowers,
- Peter Lillian,
- Ashwin Murthy,
- Elyas Mehtabuddin,
- Chetan Tekur,
- Thomas Sohmers,
- Kris Kang,
- Stephen Maresh,
- Jonathan Ross
ISCA '22: Proceedings of the 49th Annual International Symposium on Computer ArchitecturePages 567–580https://doi.org/10.1145/3470496.3527405We describe our novel commercial software-defined approach for large-scale interconnection networks of tensor streaming processing (TSP) elements. The system architecture includes packaging, routing, and flow control of the interconnection network of ...
- research-articleJanuary 2022
An intelligent hybrid technique for optimal generator rescheduling for congestion management in a deregulated power market
Journal of Intelligent & Fuzzy Systems: Applications in Engineering and Technology (JIFS), Volume 43, Issue 1Pages 1331–1345https://doi.org/10.3233/JIFS-213138Congestion not only affects the power flow, but also leads certain issues, like market power, market inefficiency and security. When the transmission line exceeds their limits congestion is occurred (voltage, thermal, stability). Congestion management is ...
- research-articleJune 2021
Q-adaptive: A Multi-Agent Reinforcement Learning Based Routing on Dragonfly Network
HPDC '21: Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed ComputingPages 189–200https://doi.org/10.1145/3431379.3460650High-radix interconnects such as Dragonfly and its variants rely on adaptive routing to balance network traffic for optimum performance. Ideally, adaptive routing attempts to forward packets between minimal and non-minimal paths with the least ...
- research-articleNovember 2020
An in-depth analysis of the slingshot interconnect
SC '20: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 35, Pages 1–14The interconnect is one of the most critical components in large scale computing systems, and its impact on the performance of applications is going to increase with the system size. In this paper, we will describe Slingshot, an interconnection network ...
- research-articleJanuary 2020
A privacy preservation model for big data in map-reduced framework based on k-anonymisation and swarm-based algorithms
International Journal of Intelligent Engineering Informatics (IJIEI), Volume 8, Issue 1Pages 38–53https://doi.org/10.1504/ijiei.2020.105433In recent years, two mainstream technologies have become the centre of IT world, big data and cloud computing. Both these fields are fundamentally different but used together generally. The big-data deals with huge scales of data however cloud-computing ...
- research-articleJanuary 2020
Nanoindentation analysis comparing dragonfly-inspired biomimetic micro-aerial vehicle (BMAV) wings
International Journal of Bio-Inspired Computation (IJBIC), Volume 16, Issue 2Pages 111–120https://doi.org/10.1504/ijbic.2020.109715Biomimetic micro-aerial vehicle (BMAV) are micro-scaled, unmanned aircraft based on flying biological organisms, generating thrust and lift by flapping their wings. This study investigates and compares the nano mechanical mechanical properties of four ...
- research-articleNovember 2019
Topology-custom UGAL routing on dragonfly
SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 17, Pages 1–15https://doi.org/10.1145/3295500.3356208The Dragonfly network has been deployed in the current generation supercomputers and will be used in the next generation supercomputers. The Universal Globally Adaptive Load-balance routing (UGAL) is the state-of-the-art routing scheme for Dragonfly. In ...
Mitigating network noise on Dragonfly networks through application-aware routing
SC '19: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 16, Pages 1–32https://doi.org/10.1145/3295500.3356196System noise can negatively impact the performance of HPC systems, and the interconnection network is one of the main factors contributing to this problem. To mitigate this effect, adaptive routing sends packets on non-minimal paths if they are less ...
- research-articleNovember 2017
A comparative study of SDN and adaptive routing on dragonfly networks
SC '17: Proceedings of the International Conference for High Performance Computing, Networking, Storage and AnalysisArticle No.: 51, Pages 1–11https://doi.org/10.1145/3126908.3126959The OpenFlow-style Software Defined Networking (SDN) technology has shown promising performance in data centers and campus networks; and the HPC community is significantly interested in adopting the SDN technology. However, while OpenFlow-style SDN ...
- ArticleNovember 2012
Modeling a Million-Node Dragonfly Network Using Massively Parallel Discrete-Event Simulation
SCC '12: Proceedings of the 2012 SC Companion: High Performance Computing, Networking Storage and AnalysisPages 366–376https://doi.org/10.1109/SC.Companion.2012.56A low-latency and low-diameter interconnection network will be an important component of future exascale architectures. The dragonfly network topology, a two-level directly connected network, is a candidate for exascale architectures because of its low ...
- ArticleSeptember 2012
Collectives on two-tier direct networks
EuroMPI'12: Proceedings of the 19th European conference on Recent Advances in the Message Passing InterfacePages 67–77https://doi.org/10.1007/978-3-642-33518-1_12Collectives are an important component of parallel programs, and have a significant impact on performance and scalability of an application. To obtain best performance, platform specific implementations of various parallel programming frameworks, such ...
- ArticleSeptember 2012
Comparison Study of Scalable and Cost-Effective Interconnection Networks for HPC
ICPPW '12: Proceedings of the 2012 41st International Conference on Parallel Processing WorkshopsPages 594–595https://doi.org/10.1109/ICPPW.2012.85This work attempts to compare size and cost of two network topologies proposed for large-radix routers: concentrated torus and dragonflies. We study and compare the scalability, cost and fault tolerance of each network. On average, we found that a ...
- ArticleSeptember 2012
On-the-Fly Adaptive Routing in High-Radix Hierarchical Networks
- Marina Garcia,
- Enrique Vallejo,
- Ramon Beivide,
- Miguel Odriozola,
- Cristobal Camarero,
- Mateo Valero,
- German Rodriguez,
- Jesus Labarta,
- Cyriel Minkenberg
ICPP '12: Proceedings of the 2012 41st International Conference on Parallel ProcessingPages 279–288https://doi.org/10.1109/ICPP.2012.46Dragonfly networks have been recently proposed for the interconnection network of forthcoming exascale supercomputers. Relying on large-radix routers, they build a topology with low diameter and high throughput, divided into multiple groups of routers. ...
- research-articleJune 2009
Indirect adaptive routing on large scale interconnection networks
ISCA '09: Proceedings of the 36th annual international symposium on Computer architecturePages 220–231https://doi.org/10.1145/1555754.1555783Recently proposed high-radix interconnection networks [10] require global adaptive routing to achieve optimum performance. Existing direct adaptive routing methods are slow to sense congestion remote from the source router and hence misroute many ...
Also Published in:
ACM SIGARCH Computer Architecture News: Volume 37 Issue 3 - ArticleJune 2008
Technology-Driven, Highly-Scalable Dragonfly Topology
ISCA '08: Proceedings of the 35th Annual International Symposium on Computer ArchitecturePages 77–88https://doi.org/10.1109/ISCA.2008.19Evolving technology and increasing pin-bandwidth motivate the use of high-radix routers to reduce the diameter, latency, and cost of interconnection networks. High-radix networks, however, require longer cables than their low-radix counterparts. Because ...
Also Published in:
ACM SIGARCH Computer Architecture News: Volume 36 Issue 3