Scheduling Real-time Deep Learning Services as Imprecise Computations

Yao, Shuochao; Hao, Yifan; Zhao, Yiran; Shao, Huajie; Liu, Dongxin; Liu, Shengzhong; Wang, Tianshi; Li, Jinyang; Abdelzaher, Tarek

Computer Science > Machine Learning

arXiv:2011.01112 (cs)

[Submitted on 2 Nov 2020]

Title:Scheduling Real-time Deep Learning Services as Imprecise Computations

Authors:Shuochao Yao, Yifan Hao, Yiran Zhao, Huajie Shao, Dongxin Liu, Shengzhong Liu, Tianshi Wang, Jinyang Li, Tarek Abdelzaher

View PDF

Abstract:The paper presents an efficient real-time scheduling algorithm for intelligent real-time edge services, defined as those that perform machine intelligence tasks, such as voice recognition, LIDAR processing, or machine vision, on behalf of local embedded devices that are themselves unable to support extensive computations. The work contributes to a recent direction in real-time computing that develops scheduling algorithms for machine intelligence tasks with anytime prediction. We show that deep neural network workflows can be cast as imprecise computations, each with a mandatory part and (several) optional parts whose execution utility depends on input data. The goal of the real-time scheduler is to maximize the average accuracy of deep neural network outputs while meeting task deadlines, thanks to opportunistic shedding of the least necessary optional parts. The work is motivated by the proliferation of increasingly ubiquitous but resource-constrained embedded devices (for applications ranging from autonomous cars to the Internet of Things) and the desire to develop services that endow them with intelligence. Experiments on recent GPU hardware and a state of the art deep neural network for machine vision illustrate that our scheme can increase the overall accuracy by 10%-20% while incurring (nearly) no deadline misses.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Networking and Internet Architecture (cs.NI); Systems and Control (eess.SY)
Cite as:	arXiv:2011.01112 [cs.LG]
	(or arXiv:2011.01112v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.01112

Submission history

From: Shuochao Yao [view email]
[v1] Mon, 2 Nov 2020 16:43:04 UTC (2,372 KB)

Computer Science > Machine Learning

Title:Scheduling Real-time Deep Learning Services as Imprecise Computations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Scheduling Real-time Deep Learning Services as Imprecise Computations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators