Deep Action- and Context-Aware Sequence Learning for Activity Recognition and Anticipation

Aliakbarian, Mohammad Sadegh; Saleh, Fatemehsadat; Fernando, Basura; Salzmann, Mathieu; Petersson, Lars; Andersson, Lars

Computer Science > Computer Vision and Pattern Recognition

arXiv:1611.05520 (cs)

[Submitted on 17 Nov 2016 (v1), last revised 18 Nov 2016 (this version, v2)]

Title:Deep Action- and Context-Aware Sequence Learning for Activity Recognition and Anticipation

Authors:Mohammad Sadegh Aliakbarian, Fatemehsadat Saleh, Basura Fernando, Mathieu Salzmann, Lars Petersson, Lars Andersson

View PDF

Abstract:Action recognition and anticipation are key to the success of many computer vision applications. Existing methods can roughly be grouped into those that extract global, context-aware representations of the entire image or sequence, and those that aim at focusing on the regions where the action occurs. While the former may suffer from the fact that context is not always reliable, the latter completely ignore this source of information, which can nonetheless be helpful in many situations. In this paper, we aim at making the best of both worlds by developing an approach that leverages both context-aware and action-aware features. At the core of our method lies a novel multi-stage recurrent architecture that allows us to effectively combine these two sources of information throughout a video. This architecture first exploits the global, context-aware features, and merges the resulting representation with the localized, action-aware ones. Our experiments on standard datasets evidence the benefits of our approach over methods that use each information type separately. We outperform the state-of-the-art methods that, as us, rely only on RGB frames as input for both action recognition and anticipation.

Comments:	10 pages, 4 figures, 7 tables
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1611.05520 [cs.CV]
	(or arXiv:1611.05520v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1611.05520

Submission history

From: Mohammad Sadegh Aliakbarian [view email]
[v1] Thu, 17 Nov 2016 01:08:56 UTC (441 KB)
[v2] Fri, 18 Nov 2016 01:41:40 UTC (441 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Action- and Context-Aware Sequence Learning for Activity Recognition and Anticipation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Action- and Context-Aware Sequence Learning for Activity Recognition and Anticipation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators