default search action
MMAsia 2020: Virtual Event / Singapore
- Tat-Seng Chua, Jingdong Wang, Qi Tian, Cathal Gurrin, Jia Jia, Hanwang Zhang, Qianru Sun:
MMAsia 2020: ACM Multimedia Asia, Virtual Event / Singapore, 7-9 March, 2021. ACM 2021, ISBN 978-1-4503-8308-0 - Zhaomeng Huang, Liyan Zhang, Xu Xu:
A treatment engine by multimodal EMR data. 1:1-1:7 - Boning Li, Xiangbo Shu, Rui Yan:
Storyboard relational model for group activity recognition. 2:1-2:7 - Yonghua Pan, Zechao Li, Liyan Zhang, Jinhui Tang:
Distilling knowledge in causal inference for unbiased visual question answering. 3:1-3:7 - Takashi Konno, Ayako Amma, Asako Kanezaki:
Incremental multi-view object detection from a moving camera. 4:1-4:7 - Xuan Feng, Lijuan Duan, Jie Chen:
An automated method with anchor-free detection and U-shaped segmentation for nuclei instance segmentation. 5:1-5:6 - Zhaozhen Ding, Qingfang Zheng, Chunhua Hou, Guang Shen:
Improving face recognition in surveillance video with judicious selection and fusion of representative frames. 6:1-6:7 - Jin Wang, Xi Zhang, Chen Wang, Qing Zhu, Baocai Yin:
Two-stage structure aware image inpainting based on generative adversarial networks. 7:1-7:6 - Zheng He, Xueli Wei, Kangli Zeng, Zhen Han, Qin Zou, Zhongyuan Wang:
Low-quality watermarked face inpainting with discriminative residual learning. 8:1-8:6 - Carmen Chai Wang Er, Bee Theng Lau, Abdullah Al Mahmud, Mark Tee Kit Tsun:
A multimedia solution to motivate childhood cancer patients to keep up with cancer treatment. 9:1-9:5 - Haihui Ye, Qiang Qi, Ying Wang, Yang Lu, Hanzi Wang:
Global and local feature alignment for video object detection. 10:1-10:7 - Xiang Guan, Yang Yang, Zheng Wang, Jingjing Li:
Semantic feature augmentation for fine-grained visual categorization with few-sample training. 11:1-11:7 - Thomas Petit, Pierre Letessier, Stefan Duffner, Christophe Garcia:
Unsupervised learning of co-occurrences for face images retrieval. 12:1-12:7 - Lianli Gao, Jingqiu Zhang, Jingkuan Song, Heng Tao Shen:
EvoGAN: an evolutionary GAN for face aging and rejuvenation. 13:1-13:7 - Yuting Ma, Fan Tang, Weiming Dong, Changsheng Xu:
Destylization of text with decorative elements. 14:1-14:7 - Xu Xu, Liyan Zhang, Zhaomeng Huang, Guodong Du:
Hierarchical clustering via mutual learning for unsupervised person re-identification. 15:1-15:7 - Yangchao Wang, Shiyuan He, Xing Xu, Yang Yang, Jingjing Li, Heng Tao Shen:
Self-supervised adversarial learning for cross-modal retrieval. 16:1-16:7 - Liang Peng, Yang Yang, Xing Xu, Jingjing Li, Xiaofeng Zhu:
Multi-level expression guided attention network for referring expression comprehension. 17:1-17:7 - Ruizhe Geng, Zhongyi Huang, Jie Chen:
Adaptive feature aggregation network for nuclei segmentation. 18:1-18:7 - Naoto Kashiwagi, Tokinori Suzuki, Jounghun Lee, Daisuke Ikeda:
Classification of multimedia SNS posts about tourist sites based on their focus toward predicting eco-friendly users. 19:1-19:7 - Jun Liang, Haosheng Chen, Kaiwen Du, Yan Yan, Hanzi Wang:
Learning intra-inter semantic aggregation for video object detection. 20:1-20:7 - Ying Wang, Luo Xiong, Kaiwen Du, Yan Yan, Hanzi Wang:
Robust visual tracking via scale-aware localization and peak response strength. 21:1-21:7 - Shu Naritomi, Keiji Yanai:
Hungry networks: 3D mesh reconstruction of a dish and a plate from a single dish image for estimating food volume. 22:1-22:7 - Xiaoyi Zhang, Zheng Wang, Xing Xu, Jiwei Wei, Yang Yang:
Scene graph generation via multi-relation classification and cross-modal attention coordinator. 23:1-23:7 - Yasuhiro Mochida, Daisuke Shirai, Takahiro Yamaguchi, Seiki Kuwabara, Hideki Nishizawa:
A novel system architecture and an automatic monitoring method for remote production. 24:1-24:7 - Ying Liu, Yanbo Lei, Sheikh Faisal Rashid:
Graph convolution network with node feature optimization using cross attention for few-shot learning. 25:1-25:7 - Taijin Zhao, Hongliang Li, Heqian Qiu, Qingbo Wu, King Ngi Ngan:
A multi-scale language embedding network for proposal-free referring expression comprehension. 26:1-26:7 - Tomoki Haruyama, Sho Takahashi, Takahiro Ogawa, Miki Haseyama:
Similar scene retrieval in soccer videos with weak annotations by multimodal use of bidirectional LSTM. 27:1-27:8 - Yutao Xu, Hanli Wang, Jian Zhu:
Patch assembly for real-time instance segmentation. 28:1-28:7 - Jie Ou, Mingjian Chen, Hong Wu:
Full-resolution encoder-decoder networks with multi-scale feature fusion for human pose estimation. 29:1-29:6 - Jiwei Wei, Yang Yang, Xing Xu, Yanli Ji, Xiaofeng Zhu, Heng Tao Shen:
Graph-based variational auto-encoder for generalized zero-shot learning. 30:1-30:7 - Chang Li, Qian Huang, Xing Li, Qianhan Wu:
A multi-scale human action recognition method based on Laplacian pyramid depth motion images. 31:1-31:6 - Ganfeng Lu, Jiping Zheng:
Fixed-size video summarization over streaming data via non-monotone submodular maximization. 32:1-32:7 - Pengyi Hao, Xuhang Xie, Tianxing Han, Cong Bai:
Overlap classification mechanism for skeletal bone age assessment. 33:1-33:7 - Xuanjing Shen, Yunqi Zhang, Haipeng Chen, Di Gai:
Multi-focus noisy image fusion based on gradient regularized convolutional sparse representatione. 34:1-34:7 - Zhe Cui, Li Su, Weigang Zhang, Qingming Huang:
Fixation guided network for salient object detection. 35:1-35:7 - Yi-Bin Cheng, Xipeng Chen, Dongyu Zhang, Liang Lin:
Motion-transformer: self-supervised pre-training for skeleton-based action recognition. 36:1-36:6 - Rintaro Yanagi, Ren Togo, Takahiro Ogawa, Miki Haseyama:
Interactive re-ranking for cross-modal retrieval based on object-wise question answering. 37:1-37:7 - Ping Wang, Li Liu, Huaxiang Zhang, Tianshi Wang:
A background-induced generative network with multi-level discriminator for text-to-image generation. 38:1-38:6 - Lexuan Sun, Xueliang Liu, Zhenzhen Hu, Richang Hong:
WFN-PSC: weighted-fusion network with poly-scale convolution for image dehazing. 39:1-39:7 - Yingjiao Pei, Zhongyuan Wang, Heling Chen, Baojin Huang, Weiping Tu:
Video scene detection based on link prediction using graph convolution network. 40:1-40:7 - Ichi Kanaya, Meina Tawaki, Keiko Yamamoto:
Cross-cultural design of facial expressions for humanoids: is there cultural difference between Japan and Denmark? 41:1-41:5 - Ying Liu, Heng Zhang, Xiao-Long Yun, Jun-Yu Ye, Cheng-Lin Liu:
Table detection and cell segmentation in online handwritten documents with graph attention networks. 42:1-42:6 - Abdullah Aman Khan, Saifullah Tumrani, Chunlin Jiang, Jie Shao:
RICAPS: residual inception and cascaded capsule network for broadcast sports video classification. 43:1-43:7 - Cheng Peng, Na Qi, Qing Zhu:
Transfer non-stationary texture with complex appearance. 44:1-44:7 - Heling Chen, Zhongyuan Wang, Yingjiao Pei, Baojin Huang, Weiping Tu:
Story segmentation for news broadcast based on primary caption. 45:1-45:6 - Yujia Cao, Zhichao Cui, Yuehu Liu, Xiaojun Lv, Kaibei Peng:
Intermediate coordinate based pose non-perspective estimation from line correspondences. 46:1-46:7 - Huan-Hua Chang, Wen-Cheng Chen, Wan-Lun Tsai, Min-Chun Hu, Wei-Ta Chu:
An autoregressive generation model for producing instant basketball defensive trajectory. 47:1-47:7 - Xingyu Liu, Zongxing Ji, Piao Huang, Tongwei Ren:
Real-time arbitrary video style transfer. 48:1-48:7 - Shagun Uppal, Anish Madan, Sarthak Bhagat, Yi Yu, Rajiv Ratn Shah:
C3VQG: category consistent cyclic visual question generation. 49:1-49:7 - Shota Ashida, Adam Jatowt, Antoine Doucet, Masatoshi Yoshikawa:
Determining image age with rank-consistent ordinal classification and object-centered ensemble. 50:1-50:8 - Dakai Ren, Xiangming Wen, Xiaoya Liu, Shuai Huang, Jiazhong Chen:
Cross-modal learning for saliency prediction in mobile environment. 51:1-51:6 - Ran Shi, Jian Xiong, Tong Qiao:
Objective object segmentation visual quality evaluation based on pixel-level and region-level characteristics. 52:1-52:7 - Fang Zhou, Bei Yin, Zanxia Jin, Heran Wu, Dongyan Zhang:
Text-based visual question answering with knowledge base. 53:1-53:6 - Qisheng Jiang:
Attention-constraint facial expression recognition. 54:1-54:7 - Yupeng Cheng, Xingxing Wei, Huazhu Fu, Shang-Wei Lin, Weisi Lin:
Defense for adversarial videos by self-adaptive JPEG compression and optical texture. 55:1-55:7 - Yaoqing Li, Sheng-hua Zhong, Tongwei Ren, Yan Liu:
Fusing CAMs-weighted features and temporal information for robust loop closure detection. 56:1-56:7 - Ran Shi, Gongyang Li, Weijie Wei, Zhi Liu:
Fixations based personal target objects segmentation. 57:1-57:7 - Miao Tian, Dongyan Guo, Ying Cui, Xiang Pan, Shengyong Chen:
Improving auto-encoder novelty detection using channel attention and entropy minimization. 58:1-58:6 - Yanan Li, Jun Yu, Yibing Zhan, Zhi Chen:
Relationship graph learning network for visual relationship detection. 59:1-59:7 - Yuying Cai, Jinfeng Li, Bao-Di Liu, Weifeng Liu, Kai Zhang, Changsheng Xu:
Local structure alignment guided domain adaptation with few source samples. 60:1-60:7 - Peng Zhang, Deqiang Ouyang, Feiyu Chen, Jie Shao:
Multiplicative angular margin loss for text-based person search. 61:1-61:7 - Xiaoye Wang, Xiaowen Zhou, Zan Gao, Peng Yang, Xianbin Wen, Hongyun Ning:
Integrating aspect-aware interactive attention and emotional position-aware for multi-aspect sentiment analysis. 62:1-62:7 - Yao Tang, Lin Zhao, Zhaoliang Yao, Chen Gong, Jian Yang:
Graph-based motion prediction for abnormal action detection. 63:1-63:7 - Haoyu Tang, Jihua Zhu, Zan Gao, Tao Zhuo, Zhiyong Cheng:
Attention feature matching for weakly-supervised video relocalization. 64:1-64:7 - Bohong Yang, Kai Meng, Hong Lu, Xinyao Nie, Guanhao Huang, Jingjing Luo, Xing Zhu:
Pulse localization networks with infrared camera. 65:1-65:5 - Yijun Liu, Zhengning Wang, Ruixu Geng, Hao Zeng, Yi Zeng:
Structure-preserving extremely low light image enhancement with fractional order differential mask guidance. 66:1-66:7 - Junjie Wang, Feng Gao, Junyu Dong:
Change detection from SAR images based on deformable residual convolutional neural networks. 67:1-67:7 - Hui Cui, Lei Zhu, Wentao Tan:
Efficient inter-image relation graph neural network hashing for scalable image retrieval. 68:1-68:8 - Aozhu Chen, Xinyi Huang, Hailan Lin, Xirong Li:
Towards annotation-free evaluation of cross-lingual image captioning. 69:1-69:7 - Anish Bhardwaj, Nikhil Chauhan, Rajiv Ratn Shah:
Synthesized 3D models with smartphone based MR to modify the PreBuilt environment: interior design. 70:1-70:3 - Aayush Jain, Meet Shah, Suraj Pandey, Mansi Agarwal, Rajiv Ratn Shah, Yifang Yin:
SeekSuspect: retrieving suspects from criminal datasets using visual memory. 71:1-71:3 - Arun Zachariah, Mohamed Gharibi, Praveen Rao:
A large-scale image retrieval system for everyday scenes. 72:1-72:3 - Klaus Schoeffmann, Jakub Lokoc, Werner Bailer:
10 years of video browser showdown. 73:1-73:3
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.