skip to main content
10.1145/3538806.3538811acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbnConference Proceedingsconference-collections
research-article

An Improved Deep-learning Network for Abnormal Action Detection

Published: 14 July 2022 Publication History

Abstract

Detecting actions in untrimmed videos is a challenging task. In order to improve the accuracy of abnormal action detection, an improved network combining Boundary-Matching Network (BMN) and Structured Segment Network (SSN) is proposed in this paper to achieve the temporal detection of abnormal actions from public places. BMN is used to generate the temporal proposals of abnormal actions in the video, and then SSN acts on the proposals generated by BMN to classify them into specific categories. By modifying the feature dimensions to adapt to the length of the video, and transforming the output generated by the BMN network into a proposal, the BMN and SSN networks are well combined. The experimental results prove that the proposed method achieves good results on the abnormal action dataset collected from public places.

References

[1]
Talari S, Shafie-Khah M, and Siano P. 2017. A review of smart cities based on the internet of things concept J. Energies. 10(4): 421.
[2]
Miller M. 2015. The internet of things: How smart TVs, smart cars, smart homes, and smart cities are changing the world M. Pearson Education.
[3]
Ohnishi K, Hidaka M, and Harada T. 2016. Improved dense trajectory with cross streams. Proceedings of the 24th ACM international conference on Multimedia. pp. 257-261.
[4]
Wang H, Schmid C. 2013. Action recognition with improved trajectories. Proceedings of the IEEE international conference on computer vision. pp. 3551-58.
[5]
Dalal N, Triggs B, and Schmid. 2006. Human detection using oriented histograms of flow and appearance. European conference on computer vision, Springer. pp. 428-441.
[6]
Dalal N and Triggs B. 2005. Histograms of oriented gradients for human detection. IEEE computer society conference on computer vision and pattern recognition (CVPR'05). Ieee.1 pp. 886-893.
[7]
Simonyan K and Zisserman A. 2014. Two-stream convolutional networks for action recognition in videos. arXiv preprint arXiv. 1406 2199.
[8]
Tran D, Bourdev L D, and Fergus R. 2014. C3D: generic features for video analysis. CoRR 2(7): 8.
[9]
Zhu W, Lan C, and Xing J. 2016. Co-occurrence feature learning for skeleton based action recognition using regularized deep LSTM networks. Proceedings of the AAAI Conference on Artificial Intelligence, vol 30(1).
[10]
Gao J. 2017. Turn tap: Temporal unit regression network for temporal action proposals. Proceedings of the IEEE international conference on computer vision. pp. 3628-36.
[11]
Xiong Y. 2017. A pursuit of temporal accuracy in general activity detection. arXiv preprint arXiv.1703 02716.
[12]
Gao J. 2018. Ctap: Complementary temporal action proposal generation. Proceedings of the European conference on computer vision (ECCV). pp. 68-83.
[13]
Lin T. 2018. Bsn: Boundary sensitive network for temporal action proposal generation. Proceedings of the European Conference on Computer Vision (ECCV). pp. 3-19.
[14]
Liu Y. 2019. Multi-granularity generator for temporal action proposal. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 3604-13.
[15]
Lin T. 2019. Bmn: Boundary-matching network for temporal action proposal generation. Proceedings of the IEEE/CVF International Conference on Computer Vision. pp. 3889-98.
[16]
Zhao Y. 2017. Temporal action detection with structured segment networks. Proceedings of the IEEE International Conference on Computer Vision. pp. 2914-23.
[17]
Wang L, Xiong Y, and Wang Z. 2016. Temporal segment networks: Towards good practices for deep action recognition. European conference on computer vision, Springer. pp. 20-36.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCBN '22: Proceedings of the 10th International Conference on Communications and Broadband Networking
February 2022
82 pages
ISBN:9781450387439
DOI:10.1145/3538806
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 July 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Abnormal action detection
  2. Deep-learning
  3. Temporal detection

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCBN 2022

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 35
    Total Downloads
  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)1
Reflects downloads up to 28 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media