skip to main content
10.1145/3503161.3548267acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction

Published: 10 October 2022 Publication History

Abstract

Adaptation to out-of-distribution data is a meta-challenge for all statistical learning algorithms that strongly rely on the i.i.d. assumption. It leads to unavoidable labor costs and confidence crises in realistic applications. For that, domain generalization aims at mining domain-irrelevant knowledge from multiple source domains that can generalize to unseen target domains. In this paper, by leveraging the frequency domain of an image, we uniquely work with two key observations: (i) the high-frequency information of an image depicts object edge structure, which preserves high-level semantic information of the object is naturally consistent across different domains, and (ii) the low-frequency component retains object smooth structure, while this information is susceptible to domain shifts. Motivated by the above observations, we introduce (i) an encoder-decoder structure to disentangle high- and low-frequency features of an image, (ii) an information interaction mechanism to ensure the helpful knowledge from both two parts can cooperate effectively, and (iii) a novel data augmentation technique that works on the frequency domain to encourage the robustness of frequency-wise feature disentangling. The proposed method obtains state-of-the-art performance on three widely used domain generalization benchmarks (Digit-DG, Office-Home, and PACS).

Supplementary Material

MP4 File (MM22-mmfp2217.mp4)
Presentation video

References

[1]
Yogesh Balaji, Swami Sankaranarayanan, and Rama Chellappa. 2018. Metareg: Towards domain generalization using meta-regularization. Advances in neural information processing systems (2018).
[2]
Shai Ben-David, John Blitzer, Koby Crammer, and Fernando Pereira. 2006. Analysis of representations for domain adaptation. Advances in neural information processing systems (2006).
[3]
Fabio M Carlucci, Antonio D'Innocente, Silvia Bucci, Barbara Caputo, and Tatiana Tommasi. 2019. Domain generalization by solving jigsaw puzzles. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 2229--2238.
[4]
Dongliang Chang, Yifeng Ding, Jiyang Xie, Ayan Kumar Bhunia, Xiaoxu Li, Zhanyu Ma, Ming Wu, Jun Guo, and Yi-Zhe Song. 2020. The devil is in the channels: Mutual-channel loss for fine-grained image classification. IEEE Transactions on Image Processing (2020), 4683--4695.
[5]
Dongliang Chang, Kaiyue Pang, Yixiao Zheng, Zhanyu Ma, Yi-Zhe Song, and Jun Guo. 2021. Your" Flamingo" is My" Bird": Fine-Grained, or Not. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 11476--11485.
[6]
Qi Dou, Daniel Coelho de Castro, Konstantinos Kamnitsas, and Ben Glocker. 2019. Domain generalization via model-agnostic learning of semantic features. Advances in Neural Information Processing Systems (2019).
[7]
Ruoyi Du, Jiyang Xie, Zhanyu Ma, Dongliang Chang, Yi-Zhe Song, and Jun Guo. 2021b. Progressive Learning of Category-Consistent Multi-Granularity Features for Fine-Grained Visual Classification. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[8]
Zhekai Du, Jingjing Li, Ke Lu, Lei Zhu, and Zi Huang. 2021a. Learning Transferrable and Interpretable Representations for Domain Generalization. In Proceedings of the 29th ACM Int'l Conf. on Multimedia. 3340--3349.
[9]
Yaroslav Ganin and Victor Lempitsky. 2015. Unsupervised domain adaptation by backpropagation. In Int'l Conf. on Machine Learning. PMLR, 1180--1189.
[10]
Abel Gonzalez-Garcia, Joost Van De Weijer, and Yoshua Bengio. 2018. Image-to-image translation for cross-domain disentanglement. Advances in neural information processing systems (2018).
[11]
Melvyn A Goodale and A David Milner. 1992. Separate visual pathways for perception and action. Trends in neurosciences 1 (1992), 20--25.
[12]
Jiaxing Huang, Dayan Guan, Aoran Xiao, and Shijian Lu. 2021. Fsdr: Frequency space domain randomization for domain generalization. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 6891--6902.
[13]
Zeyi Huang, Haohan Wang, Eric P Xing, and Dong Huang. 2020. Self-challenging improves cross-domain generalization. In European Conf. on Computer Vision. Springer, 124--140.
[14]
Seogkyu Jeon, Kibeom Hong, Pilhyeon Lee, Jewook Lee, and Hyeran Byun. 2021. Feature stylization and domain-aware contrastive learning for domain generalization. In Proceedings of the 29th ACM Int'l Conf. on Multimedia. 22--31.
[15]
Mamta Juneja and Parvinder Singh Sandhu. 2009. Performance evaluation of edge detection techniques for images in spatial domain. Int'l Journal of Computer Theory and Engineering 5 (2009), 614.
[16]
Da Li, Yongxin Yang, Yi-Zhe Song, and Timothy M Hospedales. 2017. Deeper, broader and artier domain generalization. In Proceedings of the IEEE Int'l Conf. on Computer Vision. 5542--5550.
[17]
Da Li, Jianshu Zhang, Yongxin Yang, Cong Liu, Yi-Zhe Song, and Timothy M Hospedales. 2019. Episodic training for domain generalization. In Proceedings of the IEEE/CVF Int'l Conf. on Computer Vision. 1446--1455.
[18]
Haoliang Li, Sinno Jialin Pan, Shiqi Wang, and Alex C Kot. 2018. Domain generalization with adversarial feature learning. In Proceedings of the IEEE conf. on Computer Vision and Pattern Recognition. 5400--5409.
[19]
Pan Li, Da Li, Wei Li, Shaogang Gong, Yanwei Fu, and Timothy M Hospedales. 2021. A simple feature augmentation for domain generalization. In Proceedings of the IEEE/CVF Int'l Conf. on Computer Vision. 8886--8895.
[20]
Tsung-Yu Lin, Aruni RoyChowdhury, and Subhransu Maji. 2015. Bilinear cnn models for fine-grained visual recognition. In Proceedings of the IEEE Int'l Conf. on Computer Vision. 1449--1457.
[21]
Alexander H Liu, Yen-Cheng Liu, Yu-Ying Yeh, and Yu-Chiang Frank Wang. 2018. A unified feature disentangler for multi-domain image translation and manipulation. Advances in Neural Information Processing Systems (2018).
[22]
Chang Liu, Lichen Wang, Kai Li, and Yun Fu. 2021. Domain Generalization via Feature Variation Decorrelation. In Proceedings of the 29th ACM Int'l Conf. on Multimedia. 1683--1691.
[23]
Mingsheng Long, Yue Cao, Jianmin Wang, and Michael Jordan. 2015. Learning transferable features with deep adaptation networks. In Int'l Conf. on Machine Learning. PMLR, 97--105.
[24]
Divyat Mahajan, Shruti Tople, and Amit Sharma. 2021. Domain generalization using causal matching. In Int'l Conf. on Machine Learning. PMLR, 7313--7324.
[25]
Toshihiko Matsuura and Tatsuya Harada. 2020. Domain generalization using a mixture of multiple latent domains. In Proceedings of the AAAI Conf. on Artificial Intelligence. 11749--11756.
[26]
Saeid Motiian, Marco Piccirilli, Donald A Adjeroh, and Gianfranco Doretto. 2017. Unified deep supervised domain adaptation and generalization. In Proceedings of the IEEE Int'l Conf. on Computer Vision. 5715--5725.
[27]
Xingchao Peng, Qinxun Bai, Xide Xia, Zijun Huang, Kate Saenko, and Bo Wang. 2019a. Moment matching for multi-source domain adaptation. In Proceedings of the IEEE/CVF Int'l Conf. on Computer Vision. 1406--1415.
[28]
Xingchao Peng, Zijun Huang, Ximeng Sun, and Kate Saenko. 2019b. Domain agnostic learning with disentangled representations. In Int'l Conf. on Machine Learning. PMLR, 5102--5112.
[29]
Vihari Piratla, Praneeth Netrapalli, and Sunita Sarawagi. 2020. Efficient domain generalization via common-specific low-rank decomposition. In Int'l Conf. on Machine Learning. PMLR, 7728--7738.
[30]
Fengchun Qiao and Xi Peng. 2021. Uncertainty-guided model generalization to unseen domains. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 6790--6800.
[31]
Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conf. on computer vision and pattern recognition. 779--788.
[32]
Shiori Sagawa, Pang Wei Koh, Tatsunori B Hashimoto, and Percy Liang. 2020. Distributionally robust neural networks. In Int'l Conf. on Learning Representations.
[33]
Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, Siddhartha Chaudhuri, Preethi Jyothi, and Sunita Sarawagi. 2018a. Generalizing across domains via cross-gradient training. Int'l Conf. on Learning Representations (2018).
[34]
Shiv Shankar, Vihari Piratla, Soumen Chakrabarti, Siddhartha Chaudhuri, Preethi Jyothi, and Sunita Sarawagi. 2018b. Generalizing across domains via cross-gradient training. arXiv preprint arXiv:1804.10745 (2018).
[35]
Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).
[36]
Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, and Sethuraman Panchanathan. 2017. Deep hashing network for unsupervised domain adaptation. In Proceedings of the IEEE Conf. on Computer Vision and Pattern Recognition. 5018--5027.
[37]
Guoqing Wang, Hu Han, Shiguang Shan, and Xilin Chen. 2020a. Cross-domain face presentation attack detection via multi-domain disentangled representation learning. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 6678--6687.
[38]
Hao Wang, Hao He, and Dina Katabi. 2020b. Continuously indexed domain adaptation. arXiv preprint arXiv:2007.01807 (2020).
[39]
Jindong Wang, Wenjie Feng, Yiqiang Chen, Han Yu, Meiyu Huang, and Philip S Yu. 2018. Visual domain adaptation with manifold embedded distribution alignment. In Proceedings of the 26th ACM Int'l Conf. on Multimedia. 402--410.
[40]
Jindong Wang, Cuiling Lan, Chang Liu, Yidong Ouyang, Wenjun Zeng, and Tao Qin. 2021. Generalizing to unseen domains: A survey on domain generalization. arXiv preprint arXiv:2103.03097 (2021).
[41]
Shujun Wang, Lequan Yu, Caizi Li, Chi-Wing Fu, and Pheng-Ann Heng. 2020c. Learning from extrinsic and intrinsic supervisions for domain generalization. In European Conf. on Computer Vision. Springer, 159--176.
[42]
Kun Wei, Cheng Deng, Xu Yang, et al. 2020. Lifelong Zero-Shot Learning. In Int'l Joint Conf. on Artificial Intelligence. 551--557.
[43]
Kun Wei, Muli Yang, Hao Wang, Cheng Deng, and Xianglong Liu. 2019. Adversarial fine-grained composition learning for unseen attribute-object recognition. In Proceedings of the IEEE/CVF Int'l Conf. on Computer Vision. 3741--3749.
[44]
Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. 2018. Cbam: Convolutional block attention module. In Proceedings of the European Conf. on Computer Vision. 3--19.
[45]
Jiyang Xie, Zhanyu Ma, Dongliang Chang, Guoqiang Zhang, and Jun Guo. 2021. GPCA: A probabilistic framework for gaussian process embedded channel attention. IEEE Transactions on Pattern Analysis and Machine Intelligence (2021).
[46]
Qinwei Xu, Ruipeng Zhang, Ya Zhang, Yanfeng Wang, and Qi Tian. 2021. A fourier-based framework for domain generalization. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 14383--14392.
[47]
Yanchao Yang, Dong Lao, Ganesh Sundaramoorthi, and Stefano Soatto. 2020. Phase consistent ecological domain adaptation. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 9011--9020.
[48]
Yanchao Yang and Stefano Soatto. 2020. Fda: Fourier domain adaptation for semantic segmentation. In Proceedings of the IEEE/CVF Conf. on Computer Vision and Pattern Recognition. 4085--4095.
[49]
Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, and Tao Xiang. 2020a. Deep domain-adversarial image generation for domain generalisation. In Proceedings of the AAAI Conf. on Artificial Intelligence. 13025--13032.
[50]
Kaiyang Zhou, Yongxin Yang, Timothy Hospedales, and Tao Xiang. 2020b. Learning to generate novel domains for domain generalization. In European Conf. on Computer Vision. Springer, 561--578.
[51]
Kaiyang Zhou, Yongxin Yang, Yu Qiao, and Tao Xiang. 2021. Domain generalization with mixstyle. arXiv preprint arXiv:2104.02008 (2021).

Cited By

View all

Index Terms

  1. Domain Generalization via Frequency-domain-based Feature Disentanglement and Interaction

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '22: Proceedings of the 30th ACM International Conference on Multimedia
    October 2022
    7537 pages
    ISBN:9781450392037
    DOI:10.1145/3503161
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 10 October 2022

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. data augmentation
    2. domain generalization
    3. feature interaction
    4. representation disentanglement

    Qualifiers

    • Research-article

    Funding Sources

    • Beijing Natural Science Foundation Project
    • National Natural Science Foundation of China (NSFC)

    Conference

    MM '22
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)166
    • Downloads (Last 6 weeks)29
    Reflects downloads up to 13 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media