Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Wu, Lin; Shen, Chunhua; Hengel, Anton van den

Computer Science > Computer Vision and Pattern Recognition

arXiv:1606.01609 (cs)

[Submitted on 6 Jun 2016 (v1), last revised 12 Jun 2016 (this version, v2)]

Title:Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Authors:Lin Wu, Chunhua Shen, Anton van den Hengel

View PDF

Abstract:In this paper, we present an end-to-end approach to simultaneously learn spatio-temporal features and corresponding similarity metric for video-based person re-identification. Given the video sequence of a person, features from each frame that are extracted from all levels of a deep convolutional network can preserve a higher spatial resolution from which we can model finer motion patterns. These low-level visual percepts are leveraged into a variant of recurrent model to characterize the temporal variation between time-steps. Features from all time-steps are then summarized using temporal pooling to produce an overall feature representation for the complete sequence. The deep convolutional network, recurrent layer, and the temporal pooling are jointly trained to extract comparable hidden-unit representations from input pair of time series to compute their corresponding similarity value. The proposed framework combines time series modeling and metric learning to jointly learn relevant features and a good similarity measure between time sequences of person.
Experiments demonstrate that our approach achieves the state-of-the-art performance for video-based person re-identification on iLIDS-VID and PRID 2011, the two primary public datasets for this purpose.

Comments:	11 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1606.01609 [cs.CV]
	(or arXiv:1606.01609v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1606.01609

Submission history

From: Chunhua Shen [view email]
[v1] Mon, 6 Jun 2016 04:29:16 UTC (265 KB)
[v2] Sun, 12 Jun 2016 10:52:09 UTC (792 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Deep Recurrent Convolutional Networks for Video-based Person Re-identification: An End-to-End Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators