skip to main content
10.1145/2661821.2661827acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Fish Species Recognition from Video using SVM Classifier

Published: 07 November 2014 Publication History

Abstract

To build a detailed knowledge of the biodiversity, the geographical distribution and the evolution of the alive species is essential for a sustainable development and the preservation of this biodiversity. Massive databases of underwater video surveillance have been recently made available for supporting designing algorithms targeting the identification of fishes. However these video datasets are rather poor in terms of video resolution, pretty challenging regarding both the natural phenomena to be considered such as murky water, seaweed moving the water current, etc, and the huge amount of data to be processed. We have designed a processing chain based on background segmentation, selection keypoints with an adaptive scale, description with OpponentSift and learning of each species by a binary linear Support Vector Machines classifier.
Our algorithm has been evaluated in the context of our participation to the Fish task of the LifeCLEF2014 challenge. Compared to the baseline designed by the LifeCLEF challenge organizers, our approach reaches a better precision but a worse recall. Our performances in terms of species recognition (based only on the correctly detected bounding boxes) is comparable to the baseline, but our bounding boxes are often too large and our score is so penalized. Our results are thus really encouraging.

References

[1]
M. A. R. Ahad, T. Ogata, J. K. Tan, H. Kim, and S. Ishikawa. Motion recognition approach to solve overwriting in complex actions. In FG, pages 1--6. IEEE, 2008.
[2]
S. Ali and M. Shah. Human action recognition in videos using kinematic features and multiple instance learning. IEEE Trans. Pattern Anal. Mach. Intell., 32(2):288--303, Feb. 2010.
[3]
O. Barnich and M. Van Droogenbroeck. ViBe: A universal background subtraction algorithm for video sequences. IEEE Transactions on Image Processing, 20(6):1709--1724, June 2011.
[4]
S. Concetto, B. Fisher, and B. Boom. Lifeclef fish identification task 2014. In CLEF working notes 2014, 2014.
[5]
P. Dollar, V. Rabaud, G. Cottrell, and S. Belongie. Behavior recognition via sparse spatio-temporal features. In Proceedings of the 14th International Conference on Computer Communications and Networks, ICCCN '05, pages 65--72, Washington, DC, USA, 2005. IEEE Computer Society.
[6]
A. A. Efros, A. C. Berg, G. Mori, and J. Malik. Recognizing action at a distance. In Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2, ICCV '03, pages 726--, Washington, DC, USA, 2003. IEEE Computer Society.
[7]
A. Joly, H. Müller, H. Goëau, H. Glotin, C. Spampinato, A. Rauber, P. Bonnet, W.-P. Vellinga, and B. Fisher. Lifeclef 2014: multimedia life species identification challenges. In Proceedings of CLEF 2014, 2014.
[8]
A. Klaeser, M. Marszalek, and C. Schmid. A spatio-temporal descriptor based on 3d-gradients. In Proceedings of the British Machine Vision Conference, pages 99.1--99.10. BMVA Press, 2008.
[9]
I. Laptev. Local Spatio-Temporal Image Features for Motion Interpretation. PhD thesis, Department of Numerical Analysis and Computer Science (NADA), KTH, 2004.
[10]
I. Laptev and T. Lindeberg. Local descriptors for spatio-temporal recognition. In In First International Workshop on Spatial Coherence for Visual Motion Analysis, 2004.
[11]
M. Piccardi. Background subtraction techniques: a review. In IEEE International Conference on Systems, Man, and Cybernetics, pages 3099--3104. IEEE, 2004.
[12]
R. Polana and R. Nelson. Low level recognition of human motion (or how to get your man without finding his body parts). In In Proc. of IEEE Workshop on Motion of Non-Rigid and Articulated Objects, pages 77--82. Press, 1994.
[13]
M. D. Rodriguez, J. Ahmed, and M. Shah. Action mach a spatio-temporal maximum average correlation height filter for action recognition. In CVPR. IEEE Computer Society, 2008.
[14]
C. Tanase. Towards effective spatio-temporal analysis for content-based video retrieval. PhD thesis, Thesis, 04 2014.
[15]
C. Thurau and V. Hlavác. Pose primitive based human action recognition in videos or still images. In CVPR'08, pages --1--1, 2008.
[16]
K. E. A. van de Sande, T. Gevers, and C. G. M. Snoek. Evaluation of color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 32(9):1582--1596, Sept. 2010.
[17]
A. Vedaldi and B. Fulkerson. VLFeat: An open and portable library of computer vision algorithms, 2008.
[18]
G. Willems, T. Tuytelaars, and L. Gool. An efficient dense and scale-invariant spatio-temporal interest point detector. In Proceedings of the 10th European Conference on Computer Vision: Part II, ECCV '08, pages 650--663, Berlin, Heidelberg, 2008. Springer-Verlag.
[19]
L. Zelnik-Manor and M. Irani. Event-based analysis of video. In In Proc. of CVPR, pages 123--130, 2001.
[20]
Z. Zivkovic. Improved adaptive gaussian mixture model for background subtraction. In Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 2 - Volume 02, ICPR '04, pages 28--31, Washington, DC, USA, 2004. IEEE Computer Society.

Cited By

View all

Index Terms

  1. Fish Species Recognition from Video using SVM Classifier

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MAED '14: Proceedings of the 3rd ACM International Workshop on Multimedia Analysis for Ecological Data
    November 2014
    46 pages
    ISBN:9781450331234
    DOI:10.1145/2661821
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 07 November 2014

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. multimedia and multimodal retrieval
    2. specialized information retrieval
    3. video search

    Qualifiers

    • Research-article

    Conference

    MM '14
    Sponsor:
    MM '14: 2014 ACM Multimedia Conference
    November 7, 2014
    Florida, Orlando, USA

    Acceptance Rates

    MAED '14 Paper Acceptance Rate 6 of 11 submissions, 55%;
    Overall Acceptance Rate 13 of 23 submissions, 57%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)8
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 01 Feb 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media