DOI: 10.5555/3488766.3488818

Retiarii: a deep learning exploratory-training framework

Published: 04 November 2020

Abstract

Traditional deep learning frameworks such as TensorFlow and PyTorch support training a single deep neural network (DNN) model, which involves iteratively computing the weights of that model. Designing a DNN model for a task remains an experimental science and is typically a practice of deep learning model exploration, dovetailed with training and validation, aiming to find, among a set of candidate models, the one that yields the best result. Retrofitting such exploratory-training onto the single-model training process supported by current deep learning frameworks is unintuitive, cumbersome, and inefficient, because of the fundamental mismatch between exploring a set of models and training a single one.
Retiarii is the first framework to support deep learning exploratory-training. In particular, Retiarii (i) provides a new programming interface to specify a DNN model space for exploration, as well as an interface to describe the exploration strategy that decides the order in which to instantiate and train models, how to prioritize model training, and when to terminate the training of certain models; (ii) offers a Just-In-Time (JIT) engine that instantiates models, manages the training of the instantiated models, gathers the information for the exploration strategy to consume, and executes the decisions accordingly; (iii) identifies the correlations between the instantiated models and develops a set of cross-model optimizations to improve the overall exploratory-training process. Retiarii does so by introducing a key abstraction, Mutator, that connects the specifications of DNN model spaces and exploration strategies, while exposing the correlations between models for optimization. As a result, Retiarii's clean separation of DNN model space specification, exploration strategy, and cross-model optimizations, connected through the single mutator abstraction, leads to ease of programming, reuse of components, and vastly improved (up to 8.58x) overall exploratory-training efficiency.
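The mutator-centric design can be illustrated with a small sketch. The Python code below is not the actual Retiarii API; the names LayerChoice, BaseModel, and RandomMutator are illustrative assumptions showing how a base model with open choices and a mutator that instantiates concrete models from that space might fit together.

    # Minimal sketch (hypothetical API, not Retiarii's) of a mutator-style
    # abstraction: a base model leaves some layers open as choices, and a
    # mutator instantiates a concrete model by resolving every choice.
    import copy
    import random
    import torch
    import torch.nn as nn

    class LayerChoice(nn.Module):
        """Placeholder holding several candidate ops; a mutator picks one."""
        def __init__(self, candidates):
            super().__init__()
            self.candidates = nn.ModuleList(candidates)
            self.chosen = None  # filled in when a mutator instantiates the model

        def forward(self, x):
            assert self.chosen is not None, "model not yet instantiated by a mutator"
            return self.chosen(x)

    class BaseModel(nn.Module):
        """Base model whose conv block is left open for exploration."""
        def __init__(self):
            super().__init__()
            self.block = LayerChoice([
                nn.Conv2d(3, 16, kernel_size=3, padding=1),
                nn.Conv2d(3, 16, kernel_size=5, padding=2),
            ])
            self.head = nn.Sequential(
                nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(16, 10))

        def forward(self, x):
            return self.head(self.block(x))

    class RandomMutator:
        """A trivial exploration strategy: sample one candidate per choice."""
        def mutate(self, space: nn.Module) -> nn.Module:
            model = copy.deepcopy(space)
            for module in model.modules():
                if isinstance(module, LayerChoice):
                    module.chosen = random.choice(list(module.candidates))
            return model

    if __name__ == "__main__":
        space = BaseModel()
        mutator = RandomMutator()
        # Instantiate and evaluate a few concrete models from the space.
        for _ in range(3):
            model = mutator.mutate(space)
            out = model(torch.randn(1, 3, 32, 32))
            print(out.shape)  # torch.Size([1, 10])

In the framework described by the paper, a JIT engine would sit between such mutators and the trainer, managing the instantiated models and applying cross-model optimizations; the loop above merely samples and runs them one by one for illustration.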

        Published In

        OSDI'20: Proceedings of the 14th USENIX Conference on Operating Systems Design and Implementation
        November 2020
        1255 pages
        ISBN:978-1-939133-19-9

        Sponsors

        • ORACLE
        • VMware
        • Google Inc.
        • Amazon
        • Microsoft

        Publisher

        USENIX Association

        United States
