skip to main content
10.1145/3474085.3478871acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
abstract

Trustworthy Multimedia Analysis

Published: 17 October 2021 Publication History

Abstract

This tutorial discusses the trustworthiness issue in multimedia analysis. Starting from introducing two types of spurious correlations learned from distilling human knowledge, we partition the (visual) feature space along two dimensions of task-relevance and semantic-orientation. Trustworthy multimedia analysis ideally relies on the task-relevant semantic features and consists of three modules as trainer, interpreter and tester. These three modules essentially form a closed loop, which respectively address goals of extracting task-relevant features, extracting task-relevant semantic features, and detecting spurious correlations to be corrected by the trainer and interpreter.

References

[1]
Moshe Bar. 2004. Visual objects in context. Nature Reviews Neuroscience, Vol. 5, 8 (2004), 617--629.
[2]
Benjamin Beyret, José Hernández-Orallo, Lucy Cheke, Marta Halina, Murray Shanahan, and Matthew Crosby. 2019. The animal-ai environment: Training and testing animal-like artificial cognition. arXiv preprint arXiv:1909.07483 (2019).
[3]
Chaz Firestone. 2020. Performance vs. competence in human--machine comparisons. Proceedings of the National Academy of Sciences, Vol. 117, 43 (2020), 26562--26571.
[4]
Robert Geirhos, Jörn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, and Felix A Wichmann. 2020. Shortcut learning in deep neural networks. Nature Machine Intelligence, Vol. 2, 11 (2020), 665--673.
[5]
Ian J Goodfellow, Jonathon Shlens, and Christian Szegedy. 2014. Explaining and harnessing adversarial examples. arXiv preprint arXiv:1412.6572 (2014).
[6]
Yue He, Zheyan Shen, and Peng Cui. 2021. Towards non-iid image classification: A dataset and baselines. Pattern Recognition, Vol. 110 (2021), 107383.
[7]
Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran, and Aleksander Madry. 2019. Adversarial examples are not bugs, they are features. arXiv preprint arXiv:1905.02175 (2019).
[8]
Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. 2021. Understanding deep learning (still) requires rethinking generalization. Commun. ACM, Vol. 64, 3 (2021), 107--115.
[9]
Xian Zhao, Jiaming Zhang, Zhiyu Lin, and Jitao Sang. 2021. Benign Adversarial Attack: Tricking Algorithm for Goodness. arXiv preprint arXiv:2107.11986 (2021).

Cited By

View all
  • (2022)Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training ModelsProceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3548396(4996-5004)Online publication date: 10-Oct-2022

Index Terms

  1. Trustworthy Multimedia Analysis

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '21: Proceedings of the 29th ACM International Conference on Multimedia
    October 2021
    5796 pages
    ISBN:9781450386517
    DOI:10.1145/3474085
    Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 17 October 2021

    Check for updates

    Author Tags

    1. multimedia analysis
    2. trustworthy ai

    Qualifiers

    • Abstract

    Funding Sources

    • the Fundamental Research Funds for the Central Universities
    • the Fundamental Research Funds for the Central Universities
    • Beijing Natural Science Foundation

    Conference

    MM '21
    Sponsor:
    MM '21: ACM Multimedia Conference
    October 20 - 24, 2021
    Virtual Event, China

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)15
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 14 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2022)Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training ModelsProceedings of the 30th ACM International Conference on Multimedia10.1145/3503161.3548396(4996-5004)Online publication date: 10-Oct-2022

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media