Results
Found 34 publication records. Showing 34 according to the selection in the facets
| Hits | Authors | Title | Venue | Year | Link | Author keywords |
| 75 | Eunseuk Oh, Hongsik Choi, David Primeaux | An All-Reduce Operation in Star Networks Using All-to-All Broadcast Communication Pattern. | International Conference on Computational Science (1) | 2005 | DBLP DOI BibTeX RDF | all-reduce, distributed memory parallel computing systems, all-to-all broadcast, star network, inter-processor communication |
| 24 | Pitch Patarasuk, Xin Yuan 0001 | Bandwidth Efficient All-reduce Operation on Tree Topologies. | IPDPS | 2007 | DBLP DOI BibTeX RDF | |
| 16 | Lin Zhang, Shaohuai Shi, Xiaowen Chu 0001, Wei Wang 0030, Bo Li 0001, Chengjian Liu | Decoupling the All-Reduce Primitive for Accelerating Distributed Deep Learning. | CoRR | 2023 | DBLP DOI BibTeX RDF | |
| 16 | Lin Zhang, Shaohuai Shi, Xiaowen Chu 0001, Wei Wang 0030, Bo Li 0001, Chengjian Liu | DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining. | ICDCS | 2023 | DBLP DOI BibTeX RDF | |
| 16 | Fei Dai, Yawen Chen 0001, Zhiyi Huang 0001, Haibo Zhang 0001 | Wrht: Efficient All-reduce for Distributed DNN Training in Optical Interconnect Systems. | ICPP | 2023 | DBLP DOI BibTeX RDF | |
| 16 | Fei Dai, Yawen Chen 0001, Zhiyi Huang 0001, Haibo Zhang 0001, Fangfang Zhang 0002 | Efficient All-Reduce for Distributed DNN Training in Optical Interconnect Systems. | PPoPP | 2023 | DBLP DOI BibTeX RDF | |
| 16 | HyungJun Kim, Chunggeon Song, HwaMin Lee, Heonchang Yu | Addressing Straggler Problem Through Dynamic Partial All-Reduce for Distributed Deep Learning in Heterogeneous GPU Clusters. | ICCE | 2023 | DBLP DOI BibTeX RDF | |
| 16 | Menglu Yu, Ye Tian, Bo Ji 0001, Chuan Wu 0001, Hridesh Rajan, Jia Liu 0002 | GADGET: Online Resource Optimization for Scheduling Ring-All-Reduce Learning Jobs. | CoRR | 2022 | DBLP BibTeX RDF | |
| 16 | Fei Dai, Yawen Chen 0001, Zhiyi Huang 0001, Haibo Zhang 0001, Fangfang Zhang 0002 | WRHT: Efficient All-reduce for Distributed DNN Training in Optical Interconnect System. | CoRR | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Menglu Yu, Bo Ji 0001, Hridesh Rajan, Jia Liu 0002 | On Scheduling Ring-All-Reduce Learning Jobs in Multi-Tenant GPU Clusters with Communication Contention. | CoRR | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Feijie Wu, Shiqi He, Song Guo 0001, Zhihao Qu, Haozhao Wang, Weihua Zhuang, Jie Zhang 0076 | Sign Bit is Enough: A Learning Synchronization Framework for Multi-hop All-reduce with Ultimate Compression. | CoRR | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Menglu Yu, Bo Ji 0001, Hridesh Rajan, Jia Liu 0002 | On scheduling ring-all-reduce learning jobs in multi-tenant GPU clusters with communication contention. | MobiHoc | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Menglu Yu, Ye Tian, Bo Ji 0001, Chuan Wu 0001, Hridesh Rajan, Jia Liu 0002 | GADGET: Online Resource Optimization for Scheduling Ring-All-Reduce Learning Jobs. | INFOCOM | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Feijie Wu, Shiqi He, Song Guo 0001, Zhihao Qu, Haozhao Wang, Weihua Zhuang, Jie Zhang 0076 | Sign bit is enough: a learning synchronization framework for multi-hop all-reduce with ultimate compression. | DAC | 2022 | DBLP DOI BibTeX RDF | |
| 16 | Junchao Ma, Dezun Dong, Cunlu Li, Ke Wu, Liquan Xiao | Evaluation of Topology-Aware All-Reduce Algorithm for Dragonfly Networks. | NPC | 2021 | DBLP DOI BibTeX RDF | |
| 16 | Mohsen Gavahi, Abu Naser, Cong Wu, Mehran Sadeghi Lahijani, Zhi Wang 0004, Xin Yuan 0001 | Encrypted All-reduce on Multi-core Clusters. | IPCCC | 2021 | DBLP DOI BibTeX RDF | |
| 16 | Junchao Ma, Dezun Dong, Cunlu Li, Ke Wu, Liquan Xiao | PAARD: Proximity-Aware All-Reduce Communication for Dragonfly Networks. | ISPA/BDCloud/SocialCom/SustainCom | 2021 | DBLP DOI BibTeX RDF | |
| 16 | Youhe Jiang, Huaxi Gu, Yunfeng Lu, Xiaoshan Yu | 2D-HRA: Two-Dimensional Hierarchical Ring-Based All-Reduce Algorithm in Large-Scale Distributed Machine Learning. | IEEE Access | 2020 | DBLP DOI BibTeX RDF | |
| 16 | Yixin Bao, Yanghua Peng, Yangrui Chen, Chuan Wu 0001 | Preemptive All-reduce Scheduling for Expediting Distributed DNN Training. | INFOCOM | 2020 | DBLP DOI BibTeX RDF | |
| 16 | Jerzy Proficz, Piotr Sumionka, Jaroslaw Skomial, Marcin Semeniuk, Karol Niedzielewski, Maciej Walczak | Investigation into MPI All-Reduce Performance in a Distributed Cluster with Consideration of Imbalanced Process Arrival Patterns. | AINA | 2020 | DBLP DOI BibTeX RDF | |
| 16 | Jinho Lee, Inseok Hwang 0001, Soham Shah, Minsik Cho | FlexReduce: Flexible All-reduce for Distributed Deep Learning on Asymmetric Network Topology. | DAC | 2020 | DBLP DOI BibTeX RDF | |
| 16 | Minsik Cho, Ulrich Finkler, Mauricio J. Serrano, David S. Kung 0001, Hillery C. Hunter | BlueConnect: Decomposing all-reduce for deep learning on heterogeneous network hierarchy. | IBM J. Res. Dev. | 2019 | DBLP DOI BibTeX RDF | |
| 16 | Marc Casas, Wilfried N. Gansterer, Elias Wimmer | Resilient gossip-inspired all-reduce algorithms for high-performance computing: Potential, limitations, and open questions. | Int. J. High Perform. Comput. Appl. | 2019 | DBLP DOI BibTeX RDF | |
| 16 | Minsik Cho, Ulrich Finkler, David S. Kung 0001, Hillery C. Hunter | BlueConnect: Decomposing All-Reduce for Deep Learning on Heterogeneous Network Hierarchy. | MLSys | 2019 | DBLP BibTeX RDF | |
| 16 | Jerzy Proficz | Improving all-reduce collective operations for imbalanced process arrival patterns. | J. Supercomput. | 2018 | DBLP DOI BibTeX RDF | |
| 16 | Jerzy Proficz | Improving all-reduce collective operations for imbalanced process arrival patterns. | CoRR | 2018 | DBLP BibTeX RDF | |
| 16 | Zhenyu Li 0005, James Davis 0006, Stephen A. Jarvis | An Efficient Task-based All-Reduce for Machine Learning Applications. | MLHPC@SC | 2017 | DBLP DOI BibTeX RDF | |
| 16 | Karl E. Prikopa, Wilfried N. Gansterer, Elias Wimmer | Parallel iterative refinement linear least squares solvers based on all-reduce operations. | Parallel Comput. | 2016 | DBLP DOI BibTeX RDF | |
| 16 | Pitch Patarasuk, Xin Yuan 0001 | Bandwidth optimal all-reduce algorithms for clusters of workstations. | J. Parallel Distributed Comput. | 2009 | DBLP DOI BibTeX RDF | |
| 7 | Wi Bing Tan, Peter E. Strazdins | The Analysis and Optimization of Collective Communications on a Beowulf Cluster. | ICPADS | 2002 | DBLP DOI BibTeX RDF | |
| 6 | Qasim Ali 0001, Samuel P. Midkiff, Vijay S. Pai | Modeling advanced collective communication algorithms on cell-based systems. | PPoPP | 2010 | DBLP DOI BibTeX RDF | modeling, algorithms, collective communication |
| 5 | Qasim Ali 0001, Samuel P. Midkiff, Vijay S. Pai | Efficient high performance collective communication for the cell blade. | ICS | 2009 | DBLP DOI BibTeX RDF | algorithms, reductions, collective communication, cell processor |
| 3 | Vladimir G. Deineko, George Steiner, Zhihui Xue | Robotic-Cell Scheduling: Special Polynomially Solvable Cases of the Traveling Salesman Problem on Permuted Monge Matrices. | J. Comb. Optim. | 2005 | DBLP DOI BibTeX RDF | robotic-cell scheduling, permuted Monge matrix, traveling salesman problem, polynomial-time algorithm |
| 3 | Vijay Moorthy, Dhabaleswar K. Panda 0001, P. Sadayappan | Fast Collective Communication Algorithms for Reflective Memory Network Clusters. | CANPC | 2000 | DBLP DOI BibTeX RDF | |