Accelerating model synchronization for distributed machine learning in an optical wide area network

Ling Liu; Liangjun Song; Xi Chen; Hongfang Yu; Hongfang Yu; Gang Sun

doi:10.1364/JOCN.462286

Journal of Optical Communications and Networking
Vol. 14,
Issue 10,
pp. 852-865
(2022)
•https://doi.org/10.1364/JOCN.462286

Accelerating model synchronization for distributed machine learning in an optical wide area network

Ling Liu, Liangjun Song, Xi Chen, Hongfang Yu, and Gang Sun

Not Accessible

Your library or personal account may give you access

Get PDF
Email
Share
Get Citation
Copy Citation Text
Ling Liu, Liangjun Song, Xi Chen, Hongfang Yu, and Gang Sun, "Accelerating model synchronization for distributed machine learning in an optical wide area network," J. Opt. Commun. Netw. 14, 852-865 (2022)

Export Citation
- BibTex
- Endnote (RIS)
- HTML
- Plain Text
Citation alert
Save article

Check for updates

Abstract

Geo-distributed machine learning (Geo-DML) adopts a hierarchical training architecture that includes local model synchronization within the data center and global model synchronization (GMS) across data centers. However, the scarce and heterogeneous wide area network (WAN) bandwidth can become the bottleneck of training performance. An intelligent optical device (i.e., reconfigurable optical all-drop multiplexer) makes the modern WAN topology reconfigurable, which has been ignored by most approaches to speed up Geo-DML training. Therefore, in this paper, we study scheduling algorithms to accelerate model synchronization for Geo-DML training with consideration of the reconfigurable optical WAN topology. Specifically, we use an aggregation tree for each Geo-DML training job, which helps to reduce model synchronization communication overhead across the WAN, and propose two efficient algorithms to accelerate GMS for Geo-DML: MOptree, a model-based algorithm for single job scheduling, and MMOptree for multiple job scheduling, aiming to reconfigure the WAN topology and trees by reassigning wavelengths on each fiber. Based on the current WAN topology and job information, mathematical models are built to guide the topology reconstruction, wavelength, and bandwidth allocation for each edge of the trees. The simulation results show that MOptree completes the GMS stage up to 56.16% on average faster than the traditional tree without optical-layer reconfiguration, and MMOptree achieves up to 54.6% less weighted GMS time.

Full Article | PDF Article

More Like This

Fast and scalable all-optical network architecture for distributed deep learning

Wenzhe Li, Guojun Yuan, Zhan Wang, Guangming Tan, Peiheng Zhang, and George N. Rouskas
J. Opt. Commun. Netw. 16(3) 342-357 (2024)

Flexible silicon photonic architecture for accelerating distributed deep learning

Zhenguo Wu, Liang Yuan Dai, Yuyang Wang, Songli Wang, and Keren Bergman
J. Opt. Commun. Netw. 16(2) A157-A168 (2024)

Topology configuration scheme for accelerating coflows in a hyper-FleX-LION

Hao Yang and Zuqing Zhu
J. Opt. Commun. Netw. 14(10) 805-814 (2022)

Previous Article Next Article

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (13)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (6)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (29)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Symbol	Definition
$Q_{v}$	Wavelength capacity of node $v$
$E$	Edge set of the aggregation tree
$L$	Link set of the aggregation tree
$V$	Node set of the aggregation tree
$R_{e}$	Wavelength capacity of edge $e$
$S$	Model size
$C$	Bandwidth of a wavelength
$C_{v}$	Child set of $v$ in the aggregation tree
$w_{l}$	Allocated wavelength for the directed link $l$
$t$	Completion time of the aggregation phase
$t_{v}$	Completion time of node $v$ in the aggregation tree, which also represents the time when data on the node have been aggregated. Note that, for node $v$ with no child nodes, $t_{v} = 0$ .

Symbol	Definition
$J$	Job set
$W_{j}$	Weight of job $j$
$r_{i}^{j}$	Rate allocated to the $i$ th link of the aggregation tree $j$
$L$	Link set of all aggregation trees
$E$	Edge set of all aggregation trees
$V$	Node set of all aggregation trees
$L^{j}$	Link set of the aggregation tree $j$
$L_{i}^{j}$	$i$ th link of the aggregation tree $j$
$t_{v}^{j}$	Completion time of node $v$ of the aggregation tree $j$
$S^{j}$	Model size of job $j$
$s_{j}$	Time at which the global model synchronization process of job $j$ can start
$T_{j}$	Aggregation completion time of job $j$
$t_{v}^{j}$	Completion time of node $v$ of the aggregation tree $j$ . Note that $t_{o}^{j}$ represents the completion time of leaf node $o$ of tree $j$ .

Topology	Information
Internet2	ISP public network with 9 data centers and 18 inter-data center links
B4	Google’s WAN topology with 12 data centers and 19 inter-data center links.
Equnix	Equnix’s WAN topology with 20 data centers and 141 inter-data center links

(a) AlexNet
	$β = 0.2$	$β = 0.5$	$β = 1$
RandomOptree	507.87	656.92	905.35
MOptree	374.87	523.92	772.35
OriginalTree	596.22	745.27	993.7
(b) MobileNet
	$β = 0.2$	$β = 0.5$	$β = 1$
RandomOptree	45.75	63.75	93.75
MOptree	34.5	52.5	82.5
OriginalTree	72.0	90.0	120.0
(c) ResNet50
	$β = 0.2$	$β = 0.5$	$β = 1$
RandomOptree	213.18	295.26	432.06
MOptree	168.72	250.8	387.6
OriginalTree	328.32	410.4	547.2

Symbol	Definition
$Q_{v}$	Wavelength capacity of node $v$
$E$	Edge set of the aggregation tree
$L$	Link set of the aggregation tree
$V$	Node set of the aggregation tree
$R_{e}$	Wavelength capacity of edge $e$
$S$	Model size
$C$	Bandwidth of a wavelength
$C_{v}$	Child set of $v$ in the aggregation tree
$w_{l}$	Allocated wavelength for the directed link $l$
$t$	Completion time of the aggregation phase
$t_{v}$	Completion time of node $v$ in the aggregation tree, which also represents the time when data on the node have been aggregated. Note that, for node $v$ with no child nodes, $t_{v} = 0$ .

Accelerating model synchronization for distributed machine learning in an optical wide area network

Author Affiliations

Abstract

Cited By

Figures (13)

Tables (6)

Equations (29)

Journal of Optical Communications and Networking