default search action
CVPR 2016: Las Vegas, NV, USA
- 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-8851-1
Oral & Spotlight Session 1-1A
O1-1A: Image Captioning and Question Answering
- Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell:
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. 1-10 - Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan L. Yuille, Kevin Murphy:
Generation and Comprehension of Unambiguous Object Descriptions. 11-20 - Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alexander J. Smola:
Stacked Attention Networks for Image Question Answering. 21-29 - Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han:
Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction. 30-38 - Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Neural Module Networks. 39-48
S1-1A: Language and Vision
- Scott E. Reed, Zeynep Akata, Honglak Lee, Bernt Schiele:
Learning Deep Representations of Fine-Grained Visual Descriptions. 49-58 - Zeynep Akata, Mateusz Malinowski, Mario Fritz, Bernt Schiele:
Multi-cue Zero-Shot Learning with Strong Supervision. 59-68 - Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele:
Latent Embeddings for Zero-Shot Classification. 69-77 - Roland Kwitt, Sebastian Hegenbart, Marc Niethammer:
One-Shot Learning of Scene Locations via Feature Trajectory Transfer. 78-86 - Chuang Gan, Tianbao Yang, Boqing Gong:
Learning Attributes Equals Multi-Source Domain Generalization. 87-97 - Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:
Anticipating Visual Representations from Unlabeled Video. 98-106
Oral & Spotlight Session 1-1B
O1-1B: Matching and Alignment
- Kwang Moo Yi, Yannick Verdie, Pascal Fua, Vincent Lepetit:
Learning to Assign Orientations to Feature Points. 107-116 - Tinghui Zhou, Philipp Krähenbühl, Mathieu Aubry, Qi-Xing Huang, Alexei A. Efros:
Learning Dense Correspondence via 3D-Guided Cycle Consistency. 117-126 - Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli:
The Global Patch Collider. 127-135 - Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony R. Dick, Ian D. Reid:
Joint Probabilistic Matching Using m-Best Solutions. 136-145 - Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li:
Face Alignment Across Large Poses: A 3D Solution. 146-155
S1-1B: Segmentation and Contour Detection
- Jie Feng, Brian L. Price, Scott Cohen, Shih-Fu Chang:
Interactive Segmentation on RGBD Images via Cue Selection. 156-164 - Chen Liu, Pushmeet Kohli, Yasutaka Furukawa:
Layered Scene Decomposition via the Occlusion-CRF. 165-173 - Michael Maire, Takuya Narihira, Stella X. Yu:
Affinity CNN: Learning Pixel-Centric Pairwise Relations for Figure/Ground Embedding. 174-182 - Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele:
Weakly Supervised Object Boundaries. 183-192 - Jimei Yang, Brian L. Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang:
Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. 193-202
Poster Session P1-1
- Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony R. Dick, Anton van den Hengel:
What Value Do Explicit High Level Concepts Have in Vision to Language Problems? 203-212 - Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri:
Fast Detection of Curved Edges at Low SNR. 213-221 - Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai:
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs. 222-230 - Yu Liu, Michael S. Lew:
Learning Relaxed Deep Supervision for Better Edge Detection. 231-240 - Huan Fu, Chaohui Wang, Dacheng Tao, Michael J. Black:
Occlusion Boundary Detection via Deep Exploration of Context. 241-250 - Zizhao Zhang, Fuyong Xing, Xiaoshuang Shi, Lin Yang:
SemiContour: A Semi-Supervised Learning Approach for Contour Detection. 251-259 - Saurabh Singh, Derek Hoiem, David A. Forsyth:
Learning to Localize Little Landmarks. 260-269 - Lingxi Xie, Liang Zheng, Jingdong Wang, Alan L. Yuille, Qi Tian:
InterActive: Inter-Layer Activeness Propagation. 270-279 - Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai:
Exploit Bounding Box Annotations for Multi-Label Object Recognition. 280-288 - Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys:
TI-POOLING: Transformation-Invariant Pooling for Feature Learning in Convolutional Neural Networks. 289-297 - Edgar Simo-Serra, Hiroshi Ishikawa:
Fashion Style in 128 Floats: Joint Ranking and Classification Using Weak Data for Feature Extraction. 298-307 - Yuhui Quan, Chenglong Bao, Hui Ji:
Equiangular Kernel Dictionary Learning with Applications to Dynamic Texture Analysis. 308-316 - Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell:
Compact Bilinear Pooling. 317-326 - Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang:
Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales. 327-335 - Swarna Kamlam Ravindran, Anurag Mittal:
CoMaL: Good Features to Match on Object Boundaries. 336-345 - Yuan-Ting Hu, Yen-Yu Lin:
Progressive Feature Matching with Alternate Descriptor Selection and Correspondence Enrichment. 346-354 - Da Chen, Jean-Marie Mirebeau, Laurent D. Cohen:
A New Finsler Minimal Path Model with Curvature Penalization for Image Segmentation and Closed Contour Detection. 355-363 - Yuhua Chen, Dengxin Dai, Jordi Pont-Tuset, Luc Van Gool:
Scale-Aware Alignment of Hierarchical Image Segmentation. 364-372 - Ning Xu, Brian L. Price, Scott Cohen, Jimei Yang, Thomas S. Huang:
Deep Interactive Object Selection. 373-381 - Danna Gurari, Suyog Dutt Jain, Margrit Betke, Kristen Grauman:
Pull the Plug? Predicting If Computers or Humans Should Segment Images. 382-391 - Yuka Kihara, Matvey Soloviev, Tsuhan Chen:
In the Shadows, Shape Priors Shine: Using Occlusion to Improve Multi-region Segmentation. 392-401 - Loïc Alain Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller:
Convexity Shape Constraints for Image Segmentation. 402-410 - Ertunc Erdil, Sinan Yildirim, Müjdat Çetin, Tolga Tasdizen:
MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors. 411-419 - Fengyuan Zhu, Guangyong Chen, Pheng-Ann Heng:
From Noise Modeling to Blind Image Denoising. 420-429 - Jaesik Park, Yu-Wing Tai, Sudipta N. Sinha, In-So Kweon:
Efficient and Robust Color Consistency for Community Photo Collections. 430-438 - Or Lotan, Michal Irani:
Needle-Match: Reliable Patch Matching under High Uncertainty. 439-448 - Kuldeep Kulkarni, Suhas Lohit, Pavan K. Turaga, Ronan Kerviche, Amit Ashok:
ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Measurements. 449-458 - Jin-shan Pan, Zhe Hu, Zhixun Su, Hsin-Ying Lee, Ming-Hsuan Yang:
Soft-Segmentation Guided Object Motion Deblurring. 459-468 - Dongliang Cheng, Abdelrahman Kamel, Brian L. Price, Scott Cohen, Michael S. Brown:
Two Illuminant Estimation and User Correction Preference. 469-477 - Guanbin Li, Yizhou Yu:
Deep Contrast Learning for Salient Object Detection. 478-487 - Seung-Hwan Baek, Inchang Choi, Min H. Kim:
Multiview Image Completion with Space Structure Propagation. 488-496 - Long Mai, Hailin Jin, Feng Liu:
Composition-Preserving Deep Photo Aesthetics Assessment. 497-506 - Jiansheng Chen, Gaocheng Bai, Shaoheng Liang, Zhengqin Li:
Automatic Image Cropping: A Computational Complexity Study. 507-515 - Neil D. B. Bruce, Christopher Catton, Sasa Janjic:
A Deeper Look at Saliency: Feature Contrast, Semantics, and Beyond. 516-524 - Calden Wloka, John K. Tsotsos:
Spatially Binned ROC: A Comprehensive Saliency Metric. 525-534 - Qiaosong Wang, Wen Zheng, Robinson Piramuthu:
GraB: Visual Saliency via Novel Graph Model and Background Priors. 535-543 - Anna Volokitin, Michael Gygli, Xavier Boix:
Predicting When Saliency Maps are Accurate and Eye Fixations Consistent. 544-552 - Oriel Frigo, Neus Sabater, Julie Delon, Pierre Hellier:
Split and Match: Example-Based Adaptive Patch Sampling for Unsupervised Style Transfer. 553-561 - Lilian Calvet, Pierre Gurdjos, Carsten Griwodz, Simone Gasparini:
Detection and Accurate Localization of Circular Fiducials under Highly Challenging Conditions. 562-570 - Luis Herranz, Shuqiang Jiang, Xiangyang Li:
Scene Recognition with CNNs: Objects, Scales and Dataset Bias. 571-579 - Nicholas Rhinehart, Kris Makoto Kitani:
Learning Action Maps of Large Environments via First-Person Vision. 580-588 - Yingying Zhang, Desen Zhou, Siqin Chen, Shenghua Gao, Yi Ma:
Single-Image Crowd Counting via Multi-Column Convolutional Neural Network. 589-597 - Junting Pan, Elisa Sayrol, Xavier Giró-i-Nieto, Kevin McGuinness, Noel E. O'Connor:
Shallow and Deep Convolutional Networks for Saliency Prediction. 598-606 - Mohammad Najafi, Sarah Taghavi Namin, Mathieu Salzmann, Lars Petersson:
Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering. 607-615 - Saumitro Dasgupta, Kuan Fang, Kevin Chen, Silvio Savarese:
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes. 616-624 - Siyu Zhu, Richard Zanibbi:
A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification. 625-632 - Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Zequn Jie, Jiashi Feng, Liang Lin, Shuicheng Yan:
Reversible Recursive Instance-Level Object Segmentation. 633-641 - Yao Lu, Xue Bai, Linda G. Shapiro, Jue Wang:
Coherent Parametric Contours for Interactive Video Object Segmentation. 642-650 - Yong-Jin Liu, Cheng-Chi Yu, Minjing Yu, Ying He:
Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels. 651-659 - Gayoung Lee, Yu-Wing Tai, Junmo Kim:
Deep Saliency with Encoded Low Level Distance Map and High Level Features. 660-668 - Ziyu Zhang, Sanja Fidler, Raquel Urtasun:
Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs. 669-677 - Nian Liu, Junwei Han:
DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection. 678-686 - Rong Quan, Junwei Han, Dingwen Zhang, Feiping Nie:
Object Co-segmentation via Graph Optimized-Flexible Manifold Ranking. 687-695 - Won-Dong Jang, Chulwoo Lee, Chang-Su Kim:
Primary Object Segmentation in Videos via Alternate Convex Optimization of Foreground and Background Distributions. 696-704 - Renjiao Yi, Jue Wang, Ping Tan:
Automatic Fence Segmentation in Videos of Dynamic Scenes. 705-713 - Luca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio Ferrari:
Discovering the Physical Parts of an Articulated Object Class from Multiple Videos. 714-723 - Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc Van Gool, Markus H. Gross, Alexander Sorkine-Hornung:
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. 724-732 - Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K. Roy-Chowdhury, Larry S. Davis:
Learning Temporal Regularity in Video Sequences. 733-742 - Nicolas Marki, Federico Perazzi, Oliver Wang, Alexander Sorkine-Hornung:
Bilateral Space Video Segmentation. 743-751 - Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li:
ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering. 752-760
Oral & Spotlight Session 1-2A
O1-2A: Object Recognition and Detection
- Abhinav Shrivastava, Abhinav Gupta, Ross B. Girshick:
Training Region-Based Object Detectors with Online Hard Example Mining. 761-769 - Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun:
Deep Residual Learning for Image Recognition. 770-778 - Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, Ali Farhadi:
You Only Look Once: Unified, Real-Time Object Detection. 779-788 - Spyros Gidaris, Nikos Komodakis:
LocNet: Improving Localization Accuracy for Object Detection. 789-798 - Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen Change Loy:
Sketch Me That Shoe. 799-807
S1-2A: Object Detection 1
- Shuran Song, Jianxiong Xiao:
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images. 808-816 - Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Object Detection from Video Tubelets with Convolutional Neural Networks. 817-825 - Judy Hoffman, Saurabh Gupta, Trevor Darrell:
Learning with Side Information through Modality Hallucination. 826-834 - Neelima Chavali, Harsh Agrawal, Aroma Mahendru, Dhruv Batra:
Object-Proposal Evaluation Protocol is 'Gameable'. 835-844 - Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun:
HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection. 845-853 - Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari:
We Don't Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification. 854-863 - Wanli Ouyang, Xiaogang Wang, Cong Zhang, Xiaokang Yang:
Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution. 864-873
Oral & Spotlight Session 1-2B
O1-2B: Vision with Alternative Sensors
- Guy Rosman, Daniela Rus, John W. Fisher III:
Information-Driven Adaptive Structured-Light Scanners. 874-883 - Patrick Bardow, Andrew J. Davison, Stefan Leutenegger:
Simultaneous Optical Flow and Intensity Estimation from an Event Camera. 884-892 - Achuta Kadambi, Jamie Schiel, Ramesh Raskar:
Macroscopic Interferometry: Rethinking Depth Estimation with Frequency-Domain Time-of-Flight. 893-902 - Huaijin G. Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha C. Molnar:
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels. 903-912 - Katherine L. Bouman, Michael D. Johnson, Daniel Zoran, Vincent L. Fish, Sheperd S. Doeleman, William T. Freeman:
Computational Imaging for VLBI Image Reconstruction. 913-922
S1-2B: Video Analysis 1
- Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei:
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images. 923-932 - Fanyi Xiao, Yong Jae Lee:
Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals. 933-942 - Gao Zhu, Fatih Porikli, Hongdong Li:
Beyond Local Search: Tracking Objects Everywhere with Instance-Specific Proposals. 943-951 - Hongkai Yu, Youjie Zhou, Jeff P. Simmons, Craig P. Przybyla, Yuewei Lin, Xiaochuan Fan, Yang Mi, Song Wang:
Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences. 952-960 - Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese:
Social LSTM: Human Trajectory Prediction in Crowded Spaces. 961-971 - Andrii Maksai, Xinchao Wang, Pascal Fua:
What Players do with the Ball: A Physically Constrained Interaction Modeling. 972-981 - Ting Yao, Tao Mei, Yong Rui:
Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization. 982-990
Poster Session P1-2
- Bugra Tekin, Artem Rozantsev, Vincent Lepetit, Pascal Fua:
Direct Prediction of 3D Body Poses from Motion Compensated Sequences. 991-1000 - Michael Gygli, Yale Song, Liangliang Cao:
Video2GIF: Automatic Generation of Animated GIFs from Video. 1001-1009 - Amir Shahroudy, Jun Liu, Tian-Tsong Ng, Gang Wang:
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. 1010-1019 - Bingbing Ni, Xiaokang Yang, Shenghua Gao:
Progressively Parsing Interactional Objects for Fine Grained Action Detection. 1020-1028 - Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang:
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning. 1029-1038 - Jingjing Meng, Hongxing Wang, Junsong Yuan, Yap-Peng Tan:
From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection. 1039-1048 - Zheng Shou, Dongang Wang, Shih-Fu Chang:
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs. 1049-1058 - Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman:
Summary Transfer: Exemplar-Based Subset Selection for Video Summarization. 1059-1067 - Yeong Jun Koh, Won-Dong Jang, Chang-Su Kim:
POD: Discovering Primary Objects in Videos Based on Evolutionary Refinement of Object Recurrence, Background, and Primary Object Models. 1068-1076 - Waqas Sultani, Mubarak Shah:
What If We Do Not have Multiple Videos of the Same Action? - Video Action Localization Using Web Images. 1077-1085 - Lu Zhang, Hayley Hung:
Beyond F-Formations: Determining Social Involvement in Free Standing Conversing Groups from Static Images. 1086-1095 - Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang:
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. 1096-1104 - Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Rui Wang, Xiaochun Cao:
SketchNet: Sketch Classification with Web Images. 1105-1113 - Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang:
Embedding Label Structures for Fine-Grained Feature Representation. 1114-1123 - Feng Zhou, Yuanqing Lin:
Fine-Grained Image Classification by Exploring Bipartite-Graph Labels. 1124-1133 - Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian:
Picking Deep Filter Responses for Fine-Grained Image Recognition. 1134-1142 - Han Zhang, Tao Xu, Mohamed Elhoseiny, Xiaolei Huang, Shaoting Zhang, Ahmed M. Elgammal, Dimitris N. Metaxas:
SPDA-CNN: Unifying Semantic Part Detection and Abstraction for Fine-Grained Recognition. 1143-1152 - Yin Cui, Feng Zhou, Yuanqing Lin, Serge J. Belongie:
Fine-Grained Categorization and Dataset Bootstrapping Using Deep Metric Learning with Humans in the Loop. 1153-1162 - Yaming Wang, Jonghyun Choi, Vlad I. Morariu, Larry S. Davis:
Mining Discriminative Triplets of Patches for Fine-Grained Classification. 1163-1172 - Shaoli Huang, Zhe Xu, Dacheng Tao, Ya Zhang:
Part-Stacked CNN for Fine-Grained Visual Categorization. 1173-1182 - Kevin Lin, Jiwen Lu, Chu-Song Chen, Jie Zhou:
Learning Compact Binary Descriptors with Unsupervised Deep Neural Networks. 1183-1192 - Kilho Son, Daniel Moreno, James Hays, David B. Cooper:
Solving Small-Piece Jigsaw Puzzles by Growing Consensus. 1193-1201 - Zhen Zhang, Qinfeng Shi, Julian J. McAuley, Wei Wei, Yanning Zhang, Anton van den Hengel:
Pairwise Matching through Max-Weight Bipartite Belief Propagation. 1202-1210 - Takumi Kobayashi:
Structured Feature Similarity with Explicit Feature Map. 1211-1219 - Mor Dar, Yael Moses:
Temporal Epipolar Regions. 1220-1228 - Albert Haque, Alexandre Alahi, Li Fei-Fei:
Recurrent Attention Models for Depth-Based Person Identification. 1229-1238 - Li Zhang, Tao Xiang, Shaogang Gong:
Learning a Discriminative Null Space for Person Re-identification. 1239-1248 - Tong Xiao, Hongsheng Li, Wanli Ouyang, Xiaogang Wang:
Learning Deep Feature Representations with Domain Guided Dropout for Person Re-identification. 1249-1258 - Shanshan Zhang, Rodrigo Benenson, Mohamed Omran, Jan Hendrik Hosang, Bernt Schiele:
How Far are We from Solving Pedestrian Detection? 1259-1267 - Dapeng Chen, Zejian Yuan, Badong Chen, Nanning Zheng:
Similarity Learning with Spatial Constraints for Person Re-identification. 1268-1277 - Ying Zhang, Baohua Li, Huchuan Lu, Atshushi Irie, Xiang Ruan:
Sample-Specific SVM Learning for Person Re-identification. 1278-1287 - Faqiang Wang, Wangmeng Zuo, Liang Lin, David Zhang, Lei Zhang:
Joint Learning of Single-Image and Cross-Image Representations for Person Re-identification. 1288-1296 - Haoxiang Li, Jonathan Brandt, Zhe Lin, Xiaohui Shen, Gang Hua:
A Multi-level Contextual Model for Person Recognition in Photo Albums. 1297-1305 - Peixi Peng, Tao Xiang, Yaowei Wang, Massimiliano Pontil, Shaogang Gong, Tiejun Huang, Yonghong Tian:
Unsupervised Cross-Dataset Transfer Learning for Person Re-identification. 1306-1315 - Jiale Cao, Yanwei Pang, Xuelong Li:
Pedestrian Detection Inspired by Appearance Constancy and Shape Symmetry. 1316-1324 - Niall McLaughlin, Jesús Martínez del Rincón, Paul Miller:
Recurrent Convolutional Network for Video-Based Person Re-identification. 1325-1334 - De Cheng, Yihong Gong, Sanping Zhou, Jinjun Wang, Nanning Zheng:
Person Re-identification by Multi-Channel Parts-Based CNN with Improved Triplet Loss Function. 1335-1344 - Jinjie You, Ancong Wu, Xiang Li, Wei-Shi Zheng:
Top-Push Video-Based Person Re-identification. 1345-1353 - Yeong-Jun Cho, Kuk-Jin Yoon:
Improving Person Re-identification via Pose-Aware Multi-shot Matching. 1354-1362 - Tetsu Matsukawa, Takahiro Okabe, Einoshin Suzuki, Yoichi Sato:
Hierarchical Gaussian Descriptor for Person Re-identification. 1363-1372 - Lijun Wang, Wanli Ouyang, Xiaogang Wang, Huchuan Lu:
STCT: Sequentially Training Convolutional Networks for Visual Tracking. 1373-1381 - Juan-Manuel Pérez-Rúa, Tomás Crivelli, Patrick Bouthemy, Patrick Pérez:
Determining Occlusions from Space and Time Image Reconstructions. 1382-1391 - Ju Hong Yoon, Chang-Ryeol Lee, Ming-Hsuan Yang, Kuk-Jin Yoon:
Online Multi-object Tracking via Structural Constraint Event Aggregation. 1392-1400 - Luca Bertinetto, Jack Valmadre, Stuart Golodetz, Ondrej Miksik, Philip H. S. Torr:
Staple: Complementary Learners for Real-Time Tracking. 1401-1409 - Jiaolong Yang, Hongdong Li, Yuchao Dai, Robby T. Tan:
Robust Optical Flow Estimation of Double-Layer Images under Transparency or Reflection. 1410-1419 - Ran Tao, Efstratios Gavves, Arnold W. M. Smeulders:
Siamese Instance Search for Tracking. 1420-1429 - Martin Danelljan, Gustav Häger, Fahad Shahbaz Khan, Michael Felsberg:
Adaptive Decontamination of the Training Set: A Unified Formulation for Discriminative Visual Tracking. 1430-1438 - Adel Bibi, Tianzhu Zhang, Bernard Ghanem:
3D Part-Based Sparse Tracker with Automatic Synchronization and Registration. 1439-1448 - Zhen Cui, Shengtao Xiao, Jiashi Feng, Shuicheng Yan:
Recurrently Target-Attending Tracking. 1449-1458 - Ferran Diego, Fred A. Hamprecht:
Structured Regression Gradient Boosting. 1459-1467 - Maksim Lapin, Matthias Hein, Bernt Schiele:
Loss Functions for Top-k Error: Analysis and Insights. 1468-1477 - Valentina Zantedeschi, Rémi Emonet, Marc Sebban:
Metric Learning as Convex Combinations of Local Models with Generalization Guarantees. 1478-1486 - Ziming Zhang, Yuting Chen, Venkatesh Saligrama:
Efficient Training of Very Deep Neural Networks for Supervised Hashing. 1487-1495 - Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto:
Information Bottleneck Learning Using Privileged Information for Visual Recognition. 1496-1505
Oral & Spotlight Session 2-1A
O2-1A: Recognition and Parsing in 3D
- Hossein Rahmani, Ajmal S. Mian:
3D Action Recognition from Novel Viewpoints. 1506-1515 - David F. Fouhey, Abhinav Gupta, Andrew Zisserman:
3D Shape Attributes. 1516-1524 - Zhile Ren, Erik B. Sudderth:
Three-Dimensional Object Detection and Layout Prediction Using Clouds of Oriented Gradients. 1525-1533 - Iro Armeni, Ozan Sener, Amir R. Zamir, Helen Jiang, Ioannis K. Brilakis, Martin Fischer, Silvio Savarese:
3D Semantic Parsing of Large-Scale Indoor Spaces. 1534-1543 - Lingyu Wei, Qixing Huang, Duygu Ceylan, Etienne Vouga, Hao Li:
Dense Human Body Correspondences Using Convolutional Networks. 1544-1553
S2-1A: Recognition Beyond Objects
- Joseph DeGol, Mani Golparvar Fard, Derek Hoiem:
Geometry-Informed Material Recognition. 1554-1562 - Abhijit Bendale, Terrance E. Boult:
Towards Open Set Deep Networks. 1563-1572 - Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen:
What's Wrong with That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution. 1573-1581 - Torsten Sattler, Michal Havlena, Konrad Schindler, Marc Pollefeys:
Large-Scale Location Recognition and the Geometric Burstiness Problem. 1582-1590 - Mark Wolff, Robert T. Collins, Yanxi Liu:
Regularity-Driven Building Facade Matching between Aerial and Street Views. 1591-1600 - R. T. Pramod, S. P. Arun:
Do Computational Models Differ Systematically from Human Object Perception? 1601-1609
Oral & Spotlight Session 2-1B
O2-1B: Image Processing and Restoration
- Timo Hackel, Jan Dirk Wegner, Konrad Schindler:
Contour Detection in Unstructured 3D Point Clouds. 1610-1618 - Yin Li, Manohar Paluri, James M. Rehg, Piotr Dollár:
Unsupervised Learning of Edges. 1619-1627 - Jin-shan Pan, Deqing Sun, Hanspeter Pfister, Ming-Hsuan Yang:
Blind Image Deblurring Using Dark Channel Prior. 1628-1636 - Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee:
Deeply-Recursive Convolutional Network for Image Super-Resolution. 1637-1645 - Jiwon Kim, Jung Kwon Lee, Kyoung Mu Lee:
Accurate Image Super-Resolution Using Very Deep Convolutional Networks. 1646-1654
S2-1B: Image Processing and Restoration
- Nguyen Ho Man Rang, Michael S. Brown:
RAW Image Reconstruction Using a Self-Contained sRGB-JPEG Image with Only 64 KB Overhead. 1655-1663 - Kede Ma, Qingbo Wu, Zhou Wang, Zhengfang Duanmu, Hongwei Yong, Hongliang Li, Lei Zhang:
Group MAD Competition? A New Methodology to Compare Objective Image Quality Models. 1664-1673 - Dana Berman, Tali Treibitz, Shai Avidan:
Non-local Image Dehazing. 1674-1682 - Seonghyeon Nam, Youngbae Hwang, Yasuyuki Matsushita, Seon Joo Kim:
A Holistic Approach to Cross-Channel Image Noise Modeling and Its Application to Image Denoising. 1683-1691 - Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu, Shuhang Gu, Wangmeng Zuo, Lei Zhang:
Multispectral Images Denoising by Intrinsic Tensor Sparsity Regularization. 1692-1700 - Wei-Sheng Lai, Jia-Bin Huang, Zhe Hu, Narendra Ahuja, Ming-Hsuan Yang:
A Comparative Study for Single Image Blind Deblurring. 1701-1709
Poster Session P2-1
- Minh Vo, Srinivasa G. Narasimhan, Yaser Sheikh:
Spatiotemporal Bundle Adjustment for Dynamic 3D Reconstruction. 1710-1718 - Ajad Chhatkuli, Daniel Pizarro, Toby Collins, Adrien Bartoli:
Inextensible Non-Rigid Shape-from-Motion by Second-Order Cone Programming. 1719-1727 - Johan Fredriksson, Viktor Larsson, Carl Olsson, Fredrik Kahl:
Optimal Relative Pose with Unknown Correspondences. 1728-1736 - Haifei Huang, Hui Zhang, Yiu-Ming Cheung:
Homography Estimation from the Common Self-Polar Triangle of Separate Ellipses. 1737-1744 - Maximilian Diebold, Bernd Jähne, Alexander Gatto:
Heterogeneous Light Fields. 1745-1753 - Anders P. Eriksson, John Bastian, Tat-Jun Chin, Mats Isaksson:
A Consensus-Based Framework for Distributed Bundle Adjustment. 1754-1762 - Kyungdon Joo, Tae-Hyun Oh, Junsik Kim, In-So Kweon:
Globally Optimal Manhattan Frame Estimation in Real-Time. 1763-1771 - Kai Han, Kwan-Yee K. Wong, Dirk Schnieders, Miaomiao Liu:
Mirror Surface Reconstruction under an Uncalibrated Camera. 1772-1780 - Guibo Luo, Yuesheng Zhu, Zhaotian Li, Liming Zhang:
A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video. 1781-1789 - Yinqiang Zheng, Laurent Kneip:
A Direct Least-Squares Solution to the PnP Problem with Unknown Focal Length. 1790-1798 - Zuzana Kukelova, Jan Heller, Andrew W. Fitzgibbon:
Efficient Intersection of Three Quadrics and Applications in Computer Vision. 1799-1808 - Lior Talker, Yael Moses, Ilan Shimshoni:
Using Spatial Order to Boost the Elimination of Incorrect Feature Matches. 1809-1817 - Martin Danelljan, Giulia Meneghetti, Fahad Shahbaz Khan, Michael Felsberg:
A Probabilistic Framework for Color-Based Point Set Registration. 1818-1826 - Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi:
Blind Image Deconvolution by Automatic Gradient Activation. 1827-1836 - Eduardo Pérez-Pellitero, Jordi Salvador, Javier Ruiz Hidalgo, Bodo Rosenhahn:
PSyCo: Manifold Span Reduction for Super Resolution. 1837-1845 - Jochen Gast, Anita Sellent, Stefan Roth:
Parametric Object Motion from Blur. 1846-1854 - Zhe Hu, Lu Yuan, Stephen Lin, Ming-Hsuan Yang:
Image Deblurring Using Smartphone Inertial Sensors. 1855-1864 - Radu Timofte, Rasmus Rothe, Luc Van Gool:
Seven Ways to Improve Example-Based Single Image Super Resolution. 1865-1873 - Wenzhe Shi, Jose Caballero, Ferenc Huszar, Johannes Totz, Andrew P. Aitken, Rob Bishop, Daniel Rueckert, Zehan Wang:
Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network. 1874-1883 - Xiaojun Chang, Yaoliang Yu, Yi Yang, Eric P. Xing:
They are Not Equally Reliable: Semantic Event Search Using Differentiated Concept Classifiers. 1884-1893 - Minghuang Ma, Haoqi Fan, Kris M. Kitani:
Going Deeper into First-Person Activity Recognition. 1894-1903 - Yang Zhou, Bingbing Ni, Richang Hong, Xiaokang Yang, Qi Tian:
Cascaded Interactional Targeting Network for Egocentric Video Analysis. 1904-1913 - Fabian Caba Heilbron, Juan Carlos Niebles, Bernard Ghanem:
Fast Temporal Activity Proposals for Efficient Detection of Human Actions in Untrimmed Videos. 1914-1923 - Basura Fernando, Peter Anderson, Marcus Hutter, Stephen Gould:
Discriminative Hierarchical Rank Pooling for Activity Recognition. 1924-1932 - Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman:
Convolutional Two-Stream Network Fusion for Video Action Recognition. 1933-1941 - Shugao Ma, Leonid Sigal, Stan Sclaroff:
Learning Activity Progression in LSTMs for Activity Detection and Early Detection. 1942-1950 - Yingwei Li, Weixin Li, Vijay Mahadevan, Nuno Vasconcelos:
VLAD3: Encoding Dynamics of Deep Features for Action Recognition. 1951-1960 - Bharat Singh, Tim K. Marks, Michael J. Jones, Oncel Tuzel, Ming Shao:
A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection. 1961-1970 - Mostafa S. Ibrahim, Srikanth Muralidharan, Zhiwei Deng, Arash Vahdat, Greg Mori:
A Hierarchical Deep Temporal Model for Group Activity Recognition. 1971-1980 - Ivan Lillo, Juan Carlos Niebles, Alvaro Soto:
A Hierarchical Pose-Based Approach to Complex Action Understanding Using Dictionaries of Actionlets and Motion Poselets. 1981-1990 - Wangjiang Zhu, Jie Hu, Gang Sun, Xudong Cao, Yu Qiao:
A Key Volume Mining Deep Framework for Action Recognition. 1991-1999 - Eng-Jon Ong, Miroslaw Bober:
Improved Hamming Distance Search Using Variable Length Hashing. 2000-2008 - Jae-Pil Heo, Zhe Lin, Xiaohui Shen, Jonathan Brandt, Sung-Eui Yoon:
Shortlist Selection with Residual-Aware Distance Estimator for K-Nearest Neighbor Search. 2009-2017 - Xiaojuan Wang, Ting Zhang, Guo-Jun Qi, Jinhui Tang, Jingdong Wang:
Supervised Quantization for Similarity Search. 2018-2026 - Patrick Wieschollek, Oliver Wang, Alexander Sorkine-Hornung, Hendrik P. A. Lensch:
Efficient Large-Scale Approximate Nearest Neighbor Search on the GPU. 2027-2035 - Ting Zhang, Jingdong Wang:
Collaborative Quantization for Cross-Modal Similarity Search. 2036-2045 - Thi Quynh Nhi Tran, Hervé Le Borgne, Michel Crucianu:
Aggregating Image and Text Quantized Correlated Components. 2046-2054 - Artem Babenko, Victor S. Lempitsky:
Efficient Indexing of Billion-Scale Datasets of Deep Descriptors. 2055-2063 - Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen:
Deep Supervised Hashing for Fast Image Retrieval. 2064-2072 - Ahmet Iscen, Michael G. Rabbat, Teddy Furon:
Efficient Large-Scale Similarity Search Using Matrix Factorization. 2073-2081 - Theodora Kontogianni, Markus Mathias, Bastian Leibe:
Incremental Object Discovery in Time-Varying Image Collections. 2082-2090 - Jia-Bin Huang, Rich Caruana, Andrew Farnsworth, Steve Kelling, Narendra Ahuja:
Detecting Migrating Birds at Night. 2091-2099 - Ilja Kuzborskij, Fabio Maria Carlucci, Barbara Caputo:
When Naïve Bayes Nearest Neighbors Meet Convolutional Neural Networks. 2100-2109 - Zhe Zhu, Dun Liang, Song-Hai Zhang, Xiaolei Huang, Baoli Li, Shi-Min Hu:
Traffic-Sign Detection and Classification in the Wild. 2110-2118 - Yuxing Tang, Josiah Wang, Boyang Gao, Emmanuel Dellandréa, Robert J. Gaizauskas, Liming Chen:
Large Scale Semi-Supervised Object Detection Using Visual and Semantic Knowledge Transfer. 2119-2128 - Fan Yang, Wongun Choi, Yuanqing Lin:
Exploit All the Layers: Fast and Accurate CNN Object Detector with Scale Dependent Pooling and Cascaded Rejection Classifiers. 2129-2137 - Keze Wang, Liang Lin, Wangmeng Zuo, Shuhang Gu, Lei Zhang:
Dictionary Pair Classifier Driven Convolutional Neural Networks for Object Detection. 2138-2146 - Xiaozhi Chen, Kaustav Kundu, Ziyu Zhang, Huimin Ma, Sanja Fidler, Raquel Urtasun:
Monocular 3D Object Detection for Autonomous Driving. 2147-2156 - Radu Tudor Ionescu, Bogdan Alexe, Marius Leordeanu, Marius Popescu, Dim P. Papadopoulos, Vittorio Ferrari:
How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image. 2157-2166 - Hongye Liu, Yonghong Tian, Yaowei Wang, Lu Pang, Tiejun Huang:
Deep Relative Distance Learning: Tell the Difference between Similar Vehicles. 2167-2175 - Kyle Krafka, Aditya Khosla, Petr Kellnhofer, Harini Kannan, Suchendra M. Bhandarkar, Wojciech Matusik, Antonio Torralba:
Eye Tracking for Everyone. 2176-2184 - Zorah Lähner, Emanuele Rodolà, Frank R. Schmidt, Michael M. Bronstein, Daniel Cremers:
Efficient Globally Optimal 2D-to-3D Deformable Shape Matching. 2185-2193 - Viktoriia Sharmanska, Daniel Hernández-Lobato, José Miguel Hernández-Lobato, Novi Quadrianto:
Ambiguity Helps: Classification with Disagreements in Crowdsourced Annotations. 2194-2202 - Roozbeh Mottaghi, Hannaneh Hajishirzi, Ali Farhadi:
A Task-Oriented Approach for Cost-Sensitive Recognition. 2203-2211 - Sukrit Shankar, Duncan P. Robertson, Yani Ioannou, Antonio Criminisi, Roberto Cipolla:
Refining Architectures of Deep Convolutional Neural Networks. 2212-2220 - Ali Borji, Saeed Izadi, Laurent Itti:
iLab-20M: A Large-Scale Controlled Object Dataset to Investigate Deep Learning. 2221-2230 - Chen-Yu Lee, Simon Osindero:
Recursive Recurrent Nets with Attention Modeling for OCR in the Wild. 2231-2239 - Venkatesh N. Murthy, Vivek K. Singh, Terrence Chen, R. Manmatha, Dorin Comaniciu:
Deep Decision Network for Multi-class Image Classification. 2240-2248 - Ruizhi Qiao, Lingqiao Liu, Chunhua Shen, Anton van den Hengel:
Less is More: Zero-Shot Learning from Online Textual Documents with Noise Suppression. 2249-2257 - Wen Li, Dengxin Dai, Mingkui Tan, Dong Xu, Luc Van Gool:
Fast Algorithms for Linear and Kernel SVM+. 2258-2266
Oral & Spotlight Session 2-2A
O2-2A: Recognition and Labeling
- Guo-Jun Qi:
Hierarchically Gated Deep Networks for Semantic Segmentation. 2267-2275 - Liang Lin, Guangrun Wang, Rui Zhang, Ruimao Zhang, Xiaodan Liang, Wangmeng Zuo:
Deep Structured Scene Parsing by Learning with Image Descriptions. 2276-2284 - Jiang Wang, Yi Yang, Junhua Mao, Zhiheng Huang, Chang Huang, Wei Xu:
CNN-RNN: A Unified Framework for Multi-label Image Classification. 2285-2294 - Jing Wang, Yu Cheng, Rogério Schmidt Feris:
Walk and Learn: Facial Attribute Representation Learning from Egocentric Video and Contextual Data. 2295-2304 - Arik Poznanski, Lior Wolf:
CNN-N-Gram for HandwritingWord Recognition. 2305-2314
2A: Object Detection 2
- Ankush Gupta, Andrea Vedaldi, Andrew Zisserman:
Synthetic Data for Text Localisation in Natural Images. 2315-2324 - Russell Stewart, Mykhaylo Andriluka, Andrew Y. Ng:
End-to-End People Detection in Crowded Scenes. 2325-2333 - Wei-Chih Tu, Shengfeng He, Qingxiong Yang, Shao-Yi Chien:
Real-Time Salient Object Detection with a Minimum Spanning Tree. 2334-2342 - David Feng, Nick Barnes, Shaodi You, Chris McCarthy:
Local Background Enclosure for RGB-D Salient Object Detection. 2343-2350 - Yongxi Lu, Tara Javidi, Svetlana Lazebnik:
Adaptive Object Detection Using Adjacency and Zoom Prediction. 2351-2359 - Arthur Daniel Costea, Sergiu Nedevschi:
Semantic Channels for Fast Pedestrian Detection. 2360-2368 - Mahyar Najibi, Mohammad Rastegari, Larry S. Davis:
G-CNN: An Iterative Grid Based Object Detector. 2369-2377
Oral & Spotlight Session 2-2B
O2-2B: Computational Photography and Faces
- Wei Wang, Zhen Cui, Yan Yan, Jiashi Feng, Shuicheng Yan, Xiangbo Shu, Nicu Sebe:
Recurrent Face Aging. 2378-2386 - Justus Thies, Michael Zollhöfer, Marc Stamminger, Christian Theobalt, Matthias Nießner:
Face2Face: Real-Time Face Capture and Reenactment of RGB Videos. 2387-2395 - Sergey Tulyakov, Xavier Alameda-Pineda, Elisa Ricci, Lijun Yin, Jeffrey F. Cohn, Nicu Sebe:
Self-Adaptive Matrix Completion for Heart Rate Estimation from Face Videos under Realistic Conditions. 2396-2404 - Andrew Owens, Phillip Isola, Josh H. McDermott, Antonio Torralba, Edward H. Adelson, William T. Freeman:
Visually Indicated Sounds. 2405-2413 - Leon A. Gatys, Alexander S. Ecker, Matthias Bethge:
Image Style Transfer Using Convolutional Neural Networks. 2414-2423
S2-2B: Computational Photography and Biomedical Applications
- Le Hou, Dimitris Samaras, Tahsin M. Kurç, Yi Gao, James E. Davis, Joel H. Saltz:
Patch-Based Convolutional Neural Network for Whole Slide Tissue Image Classification. 2424-2433 - Hossam N. Isack, Olga Veksler, Milan Sonka, Yuri Boykov:
Hedgehog Shape Priors for Multi-Object Segmentation. 2434-2442 - Won Hwa Kim, Hyunwoo J. Kim, Nagesh Adluru, Vikas Singh:
Latent Variable Graphical Model Selection Using Harmonic Analysis: Applications to the Human Connectome Project (HCP). 2443-2451 - Gyeongmin Choe, Srinivasa G. Narasimhan, In-So Kweon:
Simultaneous Estimation of Near IR BRDF and Fine-Scale Surface Geometry. 2452-2460 - Seoung Wug Oh, Michael S. Brown, Marc Pollefeys, Seon Joo Kim:
Do It Yourself Hyperspectral Imaging with Everyday Digital Cameras. 2461-2469 - Joon-Young Lee, Kalyan Sunkavalli, Zhe Lin, Xiaohui Shen, In-So Kweon:
Automatic Content-Aware Color and Tone Stylization. 2470-2478 - Chuan Li, Michael Wand:
Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis. 2479-2486
Poster Session P2-2
- Hao Chen, Xiaojuan Qi, Lequan Yu, Pheng-Ann Heng:
DCAN: Deep Contour-Aware Networks for Accurate Gland Segmentation. 2487-2496 - Hoo-Chang Shin, Kirk Roberts, Le Lu, Dina Demner-Fushman, Jianhua Yao, Ronald M. Summers:
Learning to Read Chest X-Rays: Recurrent Neural Cascade Model for Automated Image Annotation. 2497-2506 - Huu Le, Tat-Jun Chin, David Suter:
Conformal Surface Alignment with Optimal Möbius Search. 2507-2516 - Seong Jae Hwang, Nagesh Adluru, Maxwell D. Collins, Sathya N. Ravi, Barbara B. Bendlin, Sterling C. Johnson, Vikas Singh:
Coupled Harmonic Bases for Longitudinal Characterization of Brain Networks. 2517-2525 - Jae Y. Shin, Nima Tajbakhsh, R. Todd Hurst, Christopher B. Kendall, Jianming Liang:
Automating Carotid Intima-Media Thickness Video Interpretation with Convolutional Neural Networks. 2526-2535 - Deepak Pathak, Philipp Krähenbühl, Jeff Donahue, Trevor Darrell, Alexei A. Efros:
Context Encoders: Feature Learning by Inpainting. 2536-2544 - Chenyi Lei, Dong Liu, Weiping Li, Zheng-Jun Zha, Houqiang Li:
Comparative Deep Learning of Hybrid Representations for Image Recommendations. 2545-2553 - Vadim Lebedev, Victor S. Lempitsky:
Fast ConvNets Using Group-Wise Brain Damage. 2554-2564 - Zeeshan Hayder, Xuming He, Mathieu Salzmann:
Learning to Co-Generate Object Proposals with a Deep Structured Network. 2565-2573 - Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Pascal Frossard:
DeepFool: A Simple and Accurate Method to Fool Deep Neural Networks. 2574-2582 - Calvin Murdock, Zhen Li, Howard Zhou, Tom Duerig:
Blockout: Dynamic Model Selection for Hierarchical Deep Networks. 2583-2591 - Forrest N. Iandola, Matthew W. Moskewicz, Khalid Ashraf, Kurt Keutzer:
FireCaffe: Near-Linear Acceleration of Deep Neural Network Training on Compute Clusters. 2592-2600 - Sarah Rastegar, Mahdieh Soleymani Baghshah, Hamid R. Rabiee, Seyed Mohsen Shojaee:
MDL-CW: A Multimodal Deep Learning Framework with CrossWeights. 2601-2609 - Jörn-Henrik Jacobsen, Jan C. van Gemert, Zhongyu Lou, Arnold W. M. Smeulders:
Structured Receptive Fields in CNNs. 2610-2619 - Suriya Singh, Chetan Arora, C. V. Jawahar:
First Person Action Recognition Using Deep Learned Descriptors. 2620-2628 - Ryo Yonetani, Kris M. Kitani, Yoichi Sato:
Recognizing Micro-Actions and Reactions from Paired Egocentric Videos. 2629-2638 - Chun-yu Wang, Yizhou Wang, Alan L. Yuille:
Mining 3D Key-Pose-Motifs for Action Recognition. 2639-2647 - Khurram Soomro, Haroon Idrees, Mubarak Shah:
Predicting the Where and What of Actors and Actions through Online Action Localization. 2648-2657 - Xiaolong Wang, Ali Farhadi, Abhinav Gupta:
Actions ~ Transformations. 2658-2667 - Young Joon Yoo, Kimin Yun, Sangdoo Yun, Jonghee Hong, Hawook Jeong, Jin Young Choi:
Visual Path Prediction in Complex Scenes with Crowded Moving Objects. 2668-2677 - Serena Yeung, Olga Russakovsky, Greg Mori, Li Fei-Fei:
End-to-End Learning of Action Detection from Frame Glimpses in Videos. 2678-2687 - Analí Alfaro, Domingo Mery, Alvaro Soto:
Action Recognition in Video Using Sparse Coding and Relative Features. 2688-2697 - Yang Wang, Minh Hoai:
Improving Human Action Recognition by Non-action Classification. 2698-2707 - Limin Wang, Yu Qiao, Xiaoou Tang, Luc Van Gool:
Actionness Estimation Using Hybrid Fully Convolutional Networks. 2708-2717 - Bowen Zhang, Limin Wang, Zhe Wang, Yu Qiao, Hanli Wang:
Real-Time Action Recognition with Enhanced Motion Vector CNNs. 2718-2726 - Joo Ho Lee, Inchang Choi, Min H. Kim:
Laplacian Patch-Based Image Synthesis. 2727-2735 - Yu Li, Robby T. Tan, Xiaojie Guo, Jiangbo Lu, Michael S. Brown:
Rain Streak Removal Using Layer Priors. 2736-2744 - Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi:
Gradient-Domain Image Reconstruction Framework with Intensity-Range and Base-Structure Constraints. 2745-2753 - Jialei Wang, Peder A. Olsen, Andrew R. Conn, Aurélie C. Lozano:
Removing Clouds and Recovering Ground Observations in Satellite Image Sequences via Temporally Contiguous Robust Matrix Completion. 2754-2763 - Zhangyang Wang, Ding Liu, Shiyu Chang, Qing Ling, Yingzhen Yang, Thomas S. Huang:
D3: Deep Dual-Domain Based Fast Restoration of JPEG-Compressed Images. 2764-2772 - Vijay Rengarajan, A. N. Rajagopalan, Rangarajan Aravind:
From Bows to Arrows: Rolling Shutter Rectification of Urban Scenes. 2773-2781 - Xueyang Fu, Delu Zeng, Yue Huang, Xiao-Ping (Steven) Zhang, Xinghao Ding:
A Weighted Variational Model for Simultaneous Reflectance and Illumination Estimation. 2782-2790 - Tsung-Yu Lin, Subhransu Maji:
Visualizing and Understanding Deep Texture Representations. 2791-2799 - Jin-shan Pan, Zhouchen Lin, Zhixun Su, Ming-Hsuan Yang:
Robust Kernel Estimation with Outliers Handling for Image Deblurring. 2800-2808 - Hanwang Zhang, Xindi Shang, Wenzhuo Yang, Huan Xu, Huan-Bo Luan, Tat-Seng Chua:
Online Collaborative Learning for Open-Vocabulary Visual Classifiers. 2809-2817 - Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna:
Rethinking the Inception Architecture for Computer Vision. 2818-2826 - Saurabh Gupta, Judy Hoffman, Jitendra Malik:
Cross Modal Distillation for Supervision Transfer. 2827-2836 - Trung T. Pham, Seyed Hamid Rezatofighi, Ian D. Reid, Tat-Jun Chin:
Efficient Point Process Inference for Large-Scale Object Detection. 2837-2845 - Hakan Bilen, Andrea Vedaldi:
Weakly Supervised Deep Detection Networks. 2846-2854 - Jacob Chan, Jimmy Addison Lee, Kemao Qian:
BORDER: An Oriented Rectangles Approach to Texture-Less Object Recognition. 2855-2863 - Suyog Dutt Jain, Kristen Grauman:
Active Image Segmentation Propagation. 2864-2873 - Sean Bell, C. Lawrence Zitnick, Kavita Bala, Ross B. Girshick:
Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. 2874-2883 - Gong Cheng, Peicheng Zhou, Junwei Han:
RIFD-CNN: Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection. 2884-2893 - Stefan Mathe, Aleksis Pirinen, Cristian Sminchisescu:
Reinforcement Learning for Visual Object Detection. 2894-2902 - Inbar Huberman, Raanan Fattal:
Detecting Repeating Objects Using Patch Correlation Analysis. 2903-2911 - Sebastian Lapuschkin, Alexander Binder, Grégoire Montavon, Klaus-Robert Müller, Wojciech Samek:
Analyzing Classifiers: Fisher Vectors and Deep Neural Networks. 2912-2920 - Bolei Zhou, Aditya Khosla, Àgata Lapedriza, Aude Oliva, Antonio Torralba:
Learning Deep Features for Discriminative Localization. 2921-2929 - Ishan Misra, C. Lawrence Zitnick, Margaret Mitchell, Ross B. Girshick:
Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels. 2930-2939 - Lluís Castrejón, Yusuf Aytar, Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:
Learning Aligned Cross-Modal Representations from Weakly Aligned Data. 2940-2949 - Sijia Cai, Lei Zhang, Wangmeng Zuo, Xiangchu Feng:
A Probabilistic Collaborative Representation Based Approach for Pattern Classification. 2950-2959 - Hexiang Hu, Guang-Tong Zhou, Zhiwei Deng, Zicheng Liao, Greg Mori:
Learning Structured Inference Neural Networks with Label Relations. 2960-2968 - Hongyuan Zhu, Jean-Baptiste Weibel, Shijian Lu:
Discriminative Multi-modal Feature Fusion for RGBD Indoor Scene Recognition. 2969-2976 - Qiang Li, Maoying Qiao, Wei Bian, Dacheng Tao:
Conditional Graphical Lasso for Multi-label Image Classification. 2977-2986 - Zijun Wei, Minh Hoai:
Region Ranking SVM for Image Classification. 2987-2996 - Carl Vondrick, Deniz Oktay, Hamed Pirsiavash, Antonio Torralba:
Predicting Motivations of Actions by Leveraging Text. 2997-3005 - Jakub Sochor, Adam Herout, Jirí Havel:
BoxCars: 3D Boxes as CNN Input for Improved Fine-Grained Vehicle Recognition. 3006-3015 - Xu Liu, Zilei Wang, Jiashi Feng, Hongsheng Xi:
Highway Vehicle Counting in Compressed Domain. 3016-3024 - Shiyao Huang, Xianghua Ying, Jiangpeng Rong, Zeyu Shang, Hongbin Zha:
Camera Calibration from Periodic Motion of a Pedestrian. 3025-3033
Oral & Spotlight Session 3-1A
O3-1A: Actions and Human Pose
- Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi, Stephen Gould:
Dynamic Image Networks for Action Recognition. 3034-3042 - Vignesh Ramanathan, Jonathan Huang, Sami Abu-El-Haija, Alexander N. Gorban, Kevin Murphy, Li Fei-Fei:
Detecting Events and Key Actors in Multi-person Videos. 3043-3053 - Behrooz Mahasseni, Sinisa Todorovic:
Regularizing Long Short Term Memory with 3D Human-Skeleton Sequences for Action Recognition. 3054-3062 - James Charles, Tomas Pfister, Derek R. Magee, David C. Hogg, Andrew Zisserman:
Personalizing Human Video Pose Estimation. 3063-3072 - Wei Yang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
End-to-End Learning of Deformable Mixture of Parts and Deep Convolutional Neural Networks for Human Pose Estimation. 3073-3082
S3-1A: Activity Recognition
- Chenliang Xu, Jason J. Corso:
Actor-Action Semantic Segmentation with Grouping Process Models. 3083-3092 - Jun Yuan, Bingbing Ni, Xiaokang Yang, Ashraf A. Kassim:
Temporal Action Localization with Pyramid of Score Distribution Features. 3093-3102 - Katsunori Ohnishi, Atsushi Kanehira, Asako Kanezaki, Tatsuya Harada:
Recognizing Activities of Daily Living with a Wrist-Mounted Camera. 3103-3111 - Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal:
Harnessing Object and Scene Semantics for Large-Scale Video Understanding. 3112-3121 - Jinsoo Choi, Tae Hyun Oh, In-So Kweon:
Video-Story Composition via Plot Analysis. 3122-3130 - Alexander Richard, Juergen Gall:
Temporal Action Detection Using a Statistical Language Model. 3131-3140
Oral & Spotlight Session 3-1B
O3-1B: Semantic Segmentation
- Shu Liu, Xiaojuan Qi, Jianping Shi, Hong Zhang, Jiaya Jia:
Multi-scale Patch Aggregation (MPA) for Simultaneous Detection and Segmentation. 3141-3149 - Jifeng Dai, Kaiming He, Jian Sun:
Instance-Aware Semantic Segmentation via Multi-task Network Cascades. 3150-3158 - Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun:
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. 3159-3167 - Abhijit Kundu, Vibhav Vineet, Vladlen Koltun:
Feature Space Optimization for Semantic Video Segmentation. 3168-3175 - Maros Blaha, Christoph Vogel, Audrey Richard, Jan Dirk Wegner, Thomas Pock, Konrad Schindler:
Large-Scale Semantic 3D Reconstruction: An Adaptive Multi-resolution Model for Multi-class Volumetric Labeling. 3176-3184
S3-1B: Semantic Parsing and Segmentation
- Xiaodan Liang, Xiaohui Shen, Donglai Xiang, Jiashi Feng, Liang Lin, Shuicheng Yan:
Semantic Object Parsing with Local-Global Long Short-Term Memory. 3185-3193 - Guosheng Lin, Chunhua Shen, Anton van den Hengel, Ian D. Reid:
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation. 3194-3203 - Seunghoon Hong, Junhyuk Oh, Honglak Lee, Bohyung Han:
Learning Transferrable Knowledge for Semantic Segmentation with Deep Convolutional Neural Network. 3204-3212 - Marius Cordts, Mohamed Omran, Sebastian Ramos, Timo Rehfeld, Markus Enzweiler, Rodrigo Benenson, Uwe Franke, Stefan Roth, Bernt Schiele:
The Cityscapes Dataset for Semantic Urban Scene Understanding. 3213-3223 - Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu, Rama Chellappa:
Gaussian Conditional Random Field Network for Semantic Segmentation. 3224-3233 - Germán Ros, Laura Sellart, Joanna Materzynska, David Vázquez, Antonio M. López:
The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes. 3234-3243
Poster Session P3-1
- Alex Locher, Michal Perdoch, Luc Van Gool:
Progressive Prioritized Multi-view Stereo. 3244-3252 - Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker:
WarpNet: Weakly Supervised Matching for Single-View Reconstruction. 3253-3261 - Ole Johannsen, Antonin Sulc, Bastian Goldluecke:
What Sparse Light Field Coding Reveals about Scene Structure. 3262-3270 - Hao Wang, Jun Wang, Liang Wang:
Online Reconstruction of Indoor Scenes from RGB-D Streams. 3271-3279 - Ali Osman Ulusoy, Michael J. Black, Andreas Geiger:
Patches, Planes and Probabilities: A Non-Local Prior for Volumetric 3D Reconstruction. 3280-3289 - Ian Schillebeeckx, Robert Pless:
Single Image Camera Calibration with Lenticular Arrays for Augmented Reality. 3290-3298 - Diego Thomas, Rin-Ichiro Taniguchi:
Augmented Blendshapes for Real-Time Simultaneous 3D Head Modeling and Facial Motion Capture. 3299-3308 - Jin Xie, Meng Wang, Yi Fang:
Learned Binary Spectral Shape Descriptor for 3D Shape Correspondence. 3309-3317 - Luca Magri, Andrea Fusiello:
Multiple Models Fitting as a Set Coverage Problem. 3318-3326 - Cédric Verleysen, Christophe De Vleeschouwer:
Piecewise-Planar 3D Approximation from Wide-Baseline Stereo. 3327-3336 - Olivier Saurer, Marc Pollefeys, Gim Hee Lee:
Sparse to Dense 3D Reconstruction from Rolling Shutter Images. 3337-3345 - Matthew Trager, Martial Hebert, Jean Ponce:
Consistency of Silhouettes and Their Duals. 3346-3354 - Cenek Albl, Zuzana Kukelova, Tomás Pajdla:
Rolling Shutter Absolute Pose Problem with Known Vertical Direction. 3355-3363 - Eric Brachmann, Frank Michel, Alexander Krull, Michael Ying Yang, Stefan Gumhold, Carsten Rother:
Uncertainty-Driven 6D Pose Estimation of Objects and Scenes from a Single RGB Image. 3364-3372 - Andrey Bushnevskiy, Lorenzo Sorgi, Bodo Rosenhahn:
Multicamera Calibration from Visible and Mirrored Epipoles. 3373-3381 - Lazaros Zafeiriou, Epameinondas Antonakos, Stefanos Zafeiriou, Maja Pantic:
Joint Unsupervised Deformable Spatio-Temporal Alignment of Sequences. 3382-3390 - Kaili Zhao, Wen-Sheng Chu, Honggang Zhang:
Deep Region and Multi-label Learning for Facial Action Unit Detection. 3391-3399 - Yue Wu, Qiang Ji:
Constrained Joint Cascade Regression Framework for Simultaneous Facial Action Unit Recognition and Facial Landmark Detection. 3400-3408 - Shizhan Zhu, Cheng Li, Chen Change Loy, Xiaoou Tang:
Unconstrained Face Alignment via Cascaded Compositional Learning. 3409-3417 - Marcel Piotraschke, Volker Blanz:
Automated 3D Face Reconstruction from Multiple Images Using Quality Measures. 3418-3427 - Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen:
Occlusion-Free Face Alignment: Deep Regression Networks Coupled with De-Corrupt AutoEncoders. 3428-3437 - Zheng Zhang, Jeffrey M. Girard, Yue Wu, Xing Zhang, Peng Liu, Umur A. Ciftci, Shaun J. Canavan, Michael Reale, Andrew Horowitz, Huiyuan Yang, Jeffrey F. Cohn, Qiang Ji, Lijun Yin:
Multimodal Spontaneous Emotion Corpus for Human Behavior Analysis. 3438-3446 - Pei Yu, Jiahuan Zhou, Ying Wu:
Learning Reconstruction-Based Remote Gaze Estimation. 3447-3455 - Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu:
Joint Training of Cascaded CNN for Face Detection. 3456-3465 - Rui Zhao, Quan Gan, Shangfei Wang, Qiang Ji:
Facial Expression Intensity Estimation Using Ordinal Information. 3466-3474 - Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce:
Proposal Flow. 3475-3484 - Chen Sun, Manohar Paluri, Ronan Collobert, Ram Nevatia, Lubomir D. Bourdev:
ProNet: Learning to Propose Object-Specific Boxes for Cascaded Neural Networks. 3485-3493 - Christopher Thomas, Adriana Kovashka:
Seeing Behind the Camera: Identifying the Authorship of a Photograph. 3494-3502 - Shuochen Su, Felix Heide, Robin Swanson, Jonathan Klein, Clara Callenberg, Matthias B. Hullin, Wolfgang Heidrich:
Material Classification Using Raw Time-of-Flight Measurements. 3503-3511 - Dong Li, Jia-Bin Huang, Yali Li, Shengjin Wang, Ming-Hsuan Yang:
Weakly Supervised Object Localization with Progressive Domain Adaptation. 3512-3520 - Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi:
Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images. 3521-3529 - Ali Harakeh, Daniel C. Asmar, Elie A. Shammas:
Identifying Good Training Data for Self-Supervised Free Space Estimation. 3530-3538 - Hani Altwaijry, Eduard Trulls, James Hays, Pascal Fua, Serge J. Belongie:
Learning to Match Aerial Images with Deep Attentive Architectures. 3539-3547 - Krishna Kumar Singh, Fanyi Xiao, Yong Jae Lee:
Track and Transfer: Watching Videos to Simulate Strong Human Supervision for Weakly-Supervised Object Detection. 3548-3556 - Ali Diba, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc Van Gool:
DeepCAMP: Deep Convolutional Action & Attribute Mid-Level Patterns. 3557-3565 - Hojin Cho, Myung-Chul Sung, Bongjin Jun:
Canny Text Detector: Fast and Robust Scene Text Localization Algorithm. 3566-3573 - Di Hu, Xuelong Li, Xiaoqiang Lu:
Temporal Multimodal Learning in Audiovisual Speech Recognition. 3574-3582 - Andreas Doumanoglou, Rigas Kouskouridas, Sotiris Malassiotis, Tae-Kyun Kim:
Recovering 6D Object Pose and Predicting Next-Best-View in the Crowd. 3583-3592 - Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann:
Robust 3D Hand Pose Estimation in Single Depth Images: From Single-View CNN to Multi-View CNNs. 3593-3601 - Gedas Bertasius, Jianbo Shi, Lorenzo Torresani:
Semantic Segmentation with Boundary Neural Fields. 3602-3610 - Gellért Máttyus, Shenlong Wang, Sanja Fidler, Raquel Urtasun:
HD Maps: Fine-Grained Road Segmentation by Parsing Ground and Aerial Images. 3611-3619 - Bing Shuai, Zhen Zuo, Bing Wang, Gang Wang:
DAG-Recurrent Neural Networks for Scene Labeling. 3620-3629 - Baisheng Lai, Xiaojin Gong:
Saliency Guided Dictionary Learning for Weakly-Supervised Image Parsing. 3630-3639 - Liang-Chieh Chen, Yi Yang, Jiang Wang, Wei Xu, Alan L. Yuille:
Attention to Scale: Scale-Aware Semantic Image Segmentation. 3640-3649 - Nasim Souly, Mubarak Shah:
Scene Labeling Using Sparse Precision Matrix. 3650-3658 - Ke Li, Bharath Hariharan, Jitendra Malik:
Iterative Instance Segmentation. 3659-3667 - Jason Kuen, Zhenhua Wang, Gang Wang:
Recurrent Attentional Networks for Saliency Detection. 3668-3677 - Guillaume Seguin, Piotr Bojanowski, Rémi Lajugie, Ivan Laptev:
Instance-Level Video Segmentation from Object Tracks. 3678-3687 - Jun Xie, Martin Kiefel, Ming-Ting Sun, Andreas Geiger:
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer. 3688-3697 - Amir Kolaman, Maxim Lvov, Rami R. Hagege, Hugo Guterman:
Amplitude Modulated Video Camera - Light Separation in Dynamic Scenes. 3698-3706 - Boxin Shi, Zhe Wu, Zhipeng Mo, Dinglong Duan, Sai-Kit Yeung, Ping Tan:
A Benchmark Dataset and Evaluation for Non-Lambertian and Uncalibrated Photometric Stereo. 3707-3716 - Ting-Chun Wang, Manohar Srikanth, Ravi Ramamoorthi:
Depth from Semi-Calibrated Stereo and Defocus. 3717-3726 - Ying Fu, Yinqiang Zheng, Imari Sato, Yoichi Sato:
Exploiting Spectral-Spatial Correlation for Coded Hyperspectral Image Restoration. 3727-3736 - Julie Chang, Isaac Kauvar, Xuemei Hu, Gordon Wetzstein:
Variable Aperture Light Field Photography: Overcoming the Diffraction-Limited Spatio-Angular Resolution Tradeoff. 3737-3745 - Stefan Heber, Thomas Pock:
Convolutional Networks for Shape from Light Field. 3746-3754 - Rajat Aggarwal, Amrisha Vohra, Anoop M. Namboodiri:
Panoramic Stereo Videos with a Single Camera. 3755-3763 - Mark Sheinin, Yoav Y. Schechner:
The Next Best Underwater View. 3764-3773 - Yoshie Kobayashi, Tetsuro Morimoto, Imari Sato, Yasuhiro Mukaigawa, Takao Tomono, Katsushi Ikeuchi:
Reconstructing Shapes and Appearances of Thin Film Objects Using RGB Images. 3774-3782 - Tomas F. Yago Vicente, Minh Hoai, Dimitris Samaras:
Noisy Label Recovery for Shadow Detection in Unfamiliar Domains. 3783-3792
Oral & Spotlight Session 3-2A
O3-2A: Video Understanding
- Oscar Koller, Hermann Ney, Richard Bowden:
Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled. 3793-3802 - Bo Li, Tianfu Wu, Caiming Xiong, Song-Chun Zhu:
Recognizing Car Fluents from Video. 3803-3812 - Edward Johns, Stefan Leutenegger, Andrew J. Davison:
Pairwise Decomposition of Image Sequences for Active Multi-view Recognition. 3813-3822 - Yixin Zhu, Chenfanfu Jiang, Yibiao Zhao, Demetri Terzopoulos, Song-Chun Zhu:
Inferring Forces and Learning Human Utilities from Videos. 3823-3833 - Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi:
Force from Motion: Decoding Physical Sensation in a First Person Video. 3834-3842
S3-2A: Video Analysis 2
- Pan Ji, Hongdong Li, Mathieu Salzmann, Yiran Zhong:
Robust Multi-Body Feature Tracker: A Segmentation-Free Approach. 3843-3851 - Dinesh Jayaraman, Kristen Grauman:
Slow and Steady Feature Analysis: Higher Order Temporal Coherence in Video. 3852-3861 - Chun-Hao Huang, Benjamin Allain, Jean-Sébastien Franco, Nassir Navab, Slobodan Ilic, Edmond Boyer:
Volumetric 3D Tracking by Detection. 3862-3870 - Shoou-I Yu, Deyu Meng, Wangmeng Zuo, Alexander G. Hauptmann:
The Solution Path Algorithm for Identity-Aware Multi-object Tracking. 3871-3879 - Tianzhu Zhang, Adel Bibi, Bernard Ghanem:
In Defense of Sparse Tracking: Circulant Sparse Tracker. 3880-3888 - Laura Sevilla-Lara, Deqing Sun, Varun Jampani, Michael J. Black:
Optical Flow with Semantic Segmentation and Localized Layers. 3889-3898 - Yi-Hsuan Tsai, Ming-Hsuan Yang, Michael J. Black:
Video Segmentation via Object Flow. 3899-3908
Oral & Spotlight Session 3-2B
O3-2B: Grouping and Optimization Methods
- Marc T. Law, Yaoliang Yu, Matthieu Cord, Eric P. Xing:
Closed-Form Training of Mahalanobis Distance for Supervised Clustering. 3909-3917 - Chong You, Daniel P. Robinson, René Vidal:
Scalable Sparse Subspace Clustering by Orthogonal Matching Pursuit. 3918-3927 - Chong You, Chun-Guang Li, Daniel P. Robinson, René Vidal:
Oracle Based Active Set Algorithm for Scalable Elastic Net Subspace Clustering. 3928-3937 - Wen-bing Huang, Fuchun Sun, Le-le Cao, Deli Zhao, Huaping Liu, Mehrtash Harandi:
Sparse Coding and Dictionary Learning with Linear Dynamical Systems. 3938-3947 - Thomas Möllenhoff, Emanuel Laude, Michael Möller, Jan Lellmann, Daniel Cremers:
Sublabel-Accurate Relaxation of Nonconvex Energies. 3948-3956
S3-2B: Statistical Methods and Transfer Learning
- Etai Littwin, Lior Wolf:
The Multiverse Loss for Robust Transfer Learning. 3957-3966 - Viktoriia Sharmanska, Novi Quadrianto:
Learning from the Mistakes of Others: Matching Errors in Cross-Dataset Learning. 3967-3975 - Rudrasis Chakraborty, Dohyung Seo, Baba C. Vemuri:
An Efficient Exact-PGA Algorithm for Constant Curvature Manifolds. 3976-3984 - Samuel Rota Bulò, Peter Kontschieder:
Online Learning with Bayesian Classification Trees. 3985-3993 - Ishan Misra, Abhinav Shrivastava, Abhinav Gupta, Martial Hebert:
Cross-Stitch Networks for Multi-task Learning. 3994-4003 - Hyun Oh Song, Yu Xiang, Stefanie Jegelka, Silvio Savarese:
Deep Metric Learning via Lifted Structured Feature Embedding. 4004-4012 - Andrew Lavin, Scott Gray:
Fast Algorithms for Convolutional Neural Networks. 4013-4021
Poster Session P3-2
- Ang Li, Dapeng Chen, Yuanliu Liu, Zejian Yuan:
Coordinating Multiple Disparity Proposals for Stereo Computation. 4022-4030 - Chi Zhang, Zhiwei Li, Rui Cai, Hongyang Chao, Yong Rui:
Joint Multiview Segmentation and Localization of RGB-D Images Using Depth-Induced Silhouette Consistency. 4031-4039 - Nikolaus Mayer, Eddy Ilg, Philip Häusser, Philipp Fischer, Daniel Cremers, Alexey Dosovitskiy, Thomas Brox:
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation. 4040-4048 - Wei Feng, Fei-Peng Tian, Qian Zhang, Jizhou Sun:
6D Dynamic Camera Relocalization from Single Reference Image. 4049-4057 - René Ranftl, Vibhav Vineet, Qifeng Chen, Vladlen Koltun:
Dense Monocular Depth Estimation in Complex Dynamic Scenes. 4058-4066 - Christian Mostegel, Markus Rumpler, Friedrich Fraundorfer, Horst Bischof:
Using Self-Contradiction to Learn Confidence Measures in Stereo Vision. 4067-4076 - Ankur Handa, Viorica Patraucean, Vijay Badrinarayanan, Simon Stent, Roberto Cipolla:
Understanding RealWorld Indoor Scenes with Synthetic Data. 4077-4085 - Hae-Gon Jeon, Joon-Young Lee, Sunghoon Im, Hyowon Ha, In-So Kweon:
Stereo Matching with Color and Monochrome Cameras in Low-Light Conditions. 4086-4094 - Gil Ben-Artzi, Yoni Kasten, Shmuel Peleg, Michael Werman:
Camera Calibration from Dynamic Silhouettes Using Motion Barcodes. 4095-4103 - Johannes L. Schönberger, Jan-Michael Frahm:
Structure-from-Motion Revisited. 4104-4113 - Wencheng Wang, Tianhao Gao:
Constructing Canonical Regions for Fast and Effective View Selection. 4114-4122 - Chen Kong, Simon Lucey:
Prior-Less Compressible Structure from Motion. 4123-4131 - Yuchao Dai, Hongdong Li, Laurent Kneip:
Rolling Shutter Camera Relative Pose: Generalized Epipolar Geometry. 4132-4140 - Marco Crocco, Cosimo Rubino, Alessio Del Bue:
Structure from Motion with Objects. 4141-4149 - Ayan Sinha, Chiho Choi, Karthik Ramani:
DeepHand: Robust Hand Pose Estimation by Completing a Matrix Imputed with Deep Features. 4150-4158 - Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai:
Multi-oriented Text Detection with Fully Convolutional Networks. 4159-4167 - Baoguang Shi, Xinggang Wang, Pengyuan Lyu, Cong Yao, Xiang Bai:
Robust Scene Text Recognition with Automatic Rectification. 4168-4176 - George Trigeorgis, Patrick Snape, Mihalis A. Nicolaou, Epameinondas Antonakos, Stefanos Zafeiriou:
Mnemonic Descent Method: A Recurrent Process Applied for End-to-End Face Alignment. 4177-4187 - Amin Jourabloo, Xiaoming Liu:
Large-Pose Face Alignment via CNN-Based Dense 3D Model Fitting. 4188-4196 - Joseph Roth, Yiying Tong, Xiaoming Liu:
Adaptive 3D Face Reconstruction from Unconstrained Photo Collections. 4197-4206 - Pavlo Molchanov, Xiaodong Yang, Shalini Gupta, Kihwan Kim, Stephen Tyree, Jan Kautz:
Online Detection and Classification of Dynamic Hand Gestures with Recurrent 3D Convolutional Neural Networks. 4207-4215 - Hyung Jin Chang, Tobias Fischer, Maxime Petit, Martina Zambelli, Yiannis Demiris:
Kinematic Structure Correspondences via Hypergraph Matching. 4216-4225 - Binod Bhattarai, Gaurav Sharma, Frédéric Jurie:
CP-mtML: Coupled Projection Multi-Task Metric Learning for Large Scale Face Retrieval. 4226-4235 - David Gadot, Lior Wolf:
PatchBatch: A Batch Augmented Loss for Optical Flow. 4236-4245 - Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato:
Joint Recovery of Dense Correspondence and Cosegmentation in Two Images. 4246-4255 - Yuanlu Xu, Xiaobai Liu, Yang Liu, Song-Chun Zhu:
Multi-view People Tracking via Hierarchical Trajectory Composition. 4256-4265 - Jifeng Ning, Jimei Yang, Shaojie Jiang, Lei Zhang, Ming-Hsuan Yang:
Object Tracking via Dual Linear Structured SVM and Explicit Feature Map. 4266-4274 - Taiki Sekii:
Robust, Real-Time 3D Tracking of Multiple Objects with Similar Appearances. 4275-4283 - Yedid Hoshen, Shmuel Peleg:
An Egocentric Look at Video Photographer Identity. 4284-4292 - Hyeonseob Nam, Bohyung Han:
Learning Multi-domain Convolutional Neural Networks for Visual Tracking. 4293-4302 - Yuankai Qi, Shengping Zhang, Lei Qin, Hongxun Yao, Qingming Huang, Jongwoo Lim, Ming-Hsuan Yang:
Hedged Deep Tracking. 4303-4311 - Si Liu, Tianzhu Zhang, Xiaochun Cao, Changsheng Xu:
Structural Correlation Filter for Robust Visual Tracking. 4312-4320 - Jongwon Choi, Hyung Jin Chang, Jiyeoup Jeong, Yiannis Demiris, Jin Young Choi:
Visual Tracking Using Attention-Modulated Disintegration and Integration. 4321-4330 - Vikas Dhiman, Quoc-Huy Tran, Jason J. Corso, Manmohan Chandraker:
A Continuous Occlusion Model for Road Scene Understanding. 4331-4339 - Adrien Gaidon, Qiao Wang, Yohann Cabon, Eleonora Vig:
VirtualWorlds as Proxy for Multi-object Tracking Analysis. 4340-4349 - Keisuke Midorikawa, Toshihiko Yamasaki, Kiyoharu Aizawa:
Uncalibrated Photometric Stereo by Stepwise Optimization Using Principal Components of Isotropic BRDFs. 4350-4358 - Yvain Quéau, Roberto Mecca, Jean-Denis Durou:
Unbiased Photometric Stereo for Colored Surfaces: A Variational Approach. 4359-4368 - Yiming Qian, Minglun Gong, Yee-Hong Yang:
3D Reconstruction of Transparent Objects with Position-Normal Consistency. 4369-4377 - Roy Or-El, Rom Hershkovitz, Aaron Wetzler, Guy Rosman, Alfred M. Bruckstein, Ron Kimmel:
Real-Time Depth Refinement for Specular Objects. 4378-4386 - Kenichiro Tanaka, Yasuhiro Mukaigawa, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi:
Recovering Transparent Shape from Time-of-Flight Distortion. 4387-4395 - Williem, In Kyu Park:
Robust Light Field Depth Estimation for Noisy Scene with Occlusion. 4396-4404 - Nianyi Li, Haiting Lin, Bilin Sun, Mingyuan Zhou, Jingyi Yu:
Rotational Crossed-Slit Light Fields. 4405-4413 - Fabrizio Natola, Valsamis Ntouskos, Fiora Pirri, Marta Sanzari:
Single Image Object Modeling Based on BRDF and r-Surfaces Learning. 4414-4423 - Monami Banerjee, Rudrasis Chakraborty, Edward Ofori, Michael S. Okun, David E. Vaillancourt, Baba C. Vemuri:
A Nonlinear Regression Technique for Manifold Valued Data with Applications to Medical Image Analysis. 4424-4432 - Qilong Wang, Peihua Li, Wangmeng Zuo, Lei Zhang:
RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian with Application to Material Recognition. 4433-4441 - Nikolaos Karianakis, Jingming Dong, Stefano Soatto:
An Empirical Evaluation of Current Convolutional Architectures' Ability to Manage Nuisance Location and Scale Variability. 4442-4451 - Varun Jampani, Martin Kiefel, Peter V. Gehler:
Learning Sparse High Dimensional Filters: Image Filtering, Dense CRFs and Bilateral Neural Networks. 4452-4461 - Fujiao Ju, Yanfeng Sun, Junbin Gao, Simeng Liu, Yongli Hu, Baocai Yin:
Mixture of Bilateral-Projection Two-Dimensional Probabilistic Principal Component Analysis. 4462-4470 - Raviteja Vemulapalli, Rama Chellappa:
Rolling Rotations for Recognizing Human Actions from 3D Skeletal Data. 4471-4479 - Stephan Zheng, Yang Song, Thomas Leung, Ian J. Goodfellow:
Improving the Robustness of Deep Neural Networks via Stability Training. 4480-4488 - Chao Xing, Xin Geng, Hui Xue:
Logistic Boosting Regression for Label Distribution Learning. 4489-4497 - Xikang Zhang, Yin Wang, Mengran Gou, Mario Sznaier, Octavia I. Camps:
Efficient Temporal Sequence Comparison and Classification Using Gram Matrix Embeddings on a Riemannian Manifold. 4498-4507 - Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Efstratios Gavves, Tinne Tuytelaars:
Deep Reflectance Maps. 4508-4516 - Qingxiong Yang:
Semantic Filtering. 4517-4526 - Amir M. Rahimi, Raphael Ruschel, B. S. Manjunath:
UAVSensor Fusion with Latent-Dynamic Conditional Random Fields in Coronal Plane Estimation. 4527-4534 - Elena Stumm, Christopher Mei, Simon Lacroix, Juan I. Nieto, Marco Hutter, Roland Siegwart:
Robust Visual Place Recognition with Graph Kernels. 4535-4544 - Liang-Chieh Chen, Jonathan T. Barron, George Papandreou, Kevin Murphy, Alan L. Yuille:
Semantic Image Segmentation with Task-Specific Edge Detection Using CNNs and a Discriminatively Trained Domain Transform. 4545-4554
Oral & Spotlight Session 4-1A
O4-1A: Image & Video Captioning and Descriptions
- Ronghang Hu, Huazhe Xu, Marcus Rohrbach, Jiashi Feng, Kate Saenko, Trevor Darrell:
Natural Language Object Retrieval. 4555-4564 - Justin Johnson, Andrej Karpathy, Li Fei-Fei:
DenseCap: Fully Convolutional Localization Networks for Dense Captioning. 4565-4574 - Jean-Baptiste Alayrac, Piotr Bojanowski, Nishant Agrawal, Josef Sivic, Ivan Laptev, Simon Lacoste-Julien:
Unsupervised Learning from Narrated Instruction Videos. 4575-4583 - Haonan Yu, Jiang Wang, Zhiheng Huang, Yi Yang, Wei Xu:
Video Paragraph Captioning Using Hierarchical Recurrent Neural Networks. 4584-4593 - Yingwei Pan, Tao Mei, Ting Yao, Houqiang Li, Yong Rui:
Jointly Modeling Embedding and Translation to Bridge Video and Language. 4594-4602
S4-1A: High Level Semantics
- Arjun Chandrasekaran, Ashwin K. Vijayakumar, Stanislaw Antol, Mohit Bansal, Dhruv Batra, C. Lawrence Zitnick, Devi Parikh:
We are Humor Beings: Understanding and Predicting Visual Humor. 4603-4612 - Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Where to Look: Focus Regions for Visual Question Answering. 4613-4621 - Qi Wu, Peng Wang, Chunhua Shen, Anthony R. Dick, Anton van den Hengel:
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources. 4622-4630 - Makarand Tapaswi, Yukun Zhu, Rainer Stiefelhagen, Antonio Torralba, Raquel Urtasun, Sanja Fidler:
MovieQA: Understanding Stories in Movies through Question-Answering. 4631-4640 - Yuncheng Li, Yale Song, Liangliang Cao, Joel R. Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo:
TGIF: A New Dataset and Benchmark on Animated GIF Description. 4641-4650 - Quanzeng You, Hailin Jin, Zhaowen Wang, Chen Fang, Jiebo Luo:
Image Captioning with Semantic Attention. 4651-4659
Oral & Spotlight Session 4-1B
O4-1B: Non-rigid Reconstruction and Motion Analysis
- Armin Mustafa, Hansung Kim, Jean-Yves Guillemaut, Adrian Hilton:
Temporally Coherent 4D Reconstruction of Complex Dynamic Scenes. 4660-4669 - Minsik Lee, Jungchan Cho, Songhwai Oh:
Consensus of Non-rigid Reconstructions. 4670-4678 - Shaifali Parashar, Daniel Pizarro, Adrien Bartoli:
Isometric Non-rigid Shape-from-Motion in Linear Time. 4679-4687 - Jianhui Chen, Hoang Minh Le, Peter Carr, Yisong Yue, James J. Little:
Learning Online Smooth Predictors for Realtime Camera Planning Using Recurrent Decision Trees. 4688-4696 - Hyun Soo Park, Jyh-Jing Hwang, Yedong Niu, Jianbo Shi:
Egocentric Future Localization. 4697-4705 - Qifeng Chen, Vladlen Koltun:
Full Flow: Optical Flow Estimation By Global Optimization over Regular Grids. 4706-4714
S4-1B: Human Pose Estimation
- Xiao Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Structured Feature Learning for Pose Estimation. 4715-4723 - Shih-En Wei, Varun Ramakrishna, Takeo Kanade, Yaser Sheikh:
Convolutional Pose Machines. 4724-4732 - João Carreira, Pulkit Agrawal, Katerina Fragkiadaki, Jitendra Malik:
Human Pose Estimation with Iterative Error Feedback. 4733-4742
Poster Session P4-1
- Thibaut Durand, Nicolas Thome, Matthieu Cord:
WELDON: Weakly Supervised Learning of Deep Convolutional Neural Networks. 4743-4752 - Lingxi Xie, Jingdong Wang, Zhen Wei, Meng Wang, Qi Tian:
DisturbLabel: Regularizing CNN on the Loss Layer. 4753-4762 - Leslie N. Smith, Emily M. Hand, Timothy Doster:
Gradual DropIn of Layers to Train Very Deep Neural Networks. 4763-4771 - Zhiwei Deng, Arash Vahdat, Hexiang Hu, Greg Mori:
Structure Inference Machines: Recurrent Neural Networks for Analyzing Relations in Group Activity Recognition. 4772-4781 - Nadav Cohen, Or Sharir, Amnon Shashua:
Deep SimNets. 4782-4791 - Zhangyang Wang, Shiyu Chang, Yingzhen Yang, Ding Liu, Thomas S. Huang:
Studying Very Low Resolution Recognition Using Deep Networks. 4792-4800 - Raviteja Vemulapalli, Oncel Tuzel, Ming-Yu Liu:
Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising. 4801-4809 - Yufei Wang, Zhe Lin, Xiaohui Shen, Radomír Mech, Gavin S. P. Miller, Garrison W. Cottrell:
Event-Specific Image Importance. 4810-4819 - Jiaxiang Wu, Cong Leng, Yuhang Wang, Qinghao Hu, Jian Cheng:
Quantized Convolutional Neural Networks for Mobile Devices. 4820-4828 - Alexey Dosovitskiy, Thomas Brox:
Inverting Visual Representations with Convolutional Networks. 4829-4837 - Iacopo Masi, Stephen Rawls, Gérard G. Medioni, Prem Natarajan:
Pose-Aware Face Recognition in the Wild. 4838-4846 - Meina Kan, Shiguang Shan, Xilin Chen:
Multi-view Deep Network for Cross-View Classification. 4847-4855 - Yi Sun, Xiaogang Wang, Xiaoou Tang:
Sparsifying Neural Network Connections for Face Recognition. 4856-4864 - Qingxiang Feng, Yicong Zhou, Rushi Lan:
Pairwise Linear Regression Classification for Image Set Retrieval. 4865-4872 - Ira Kemelmacher-Shlizerman, Steven M. Seitz, Daniel Miller, Evan Brossard:
The MegaFace Benchmark: 1 Million Faces for Recognition at Scale. 4873-4882 - Ognjen Arandjelovic:
Learnt Quasi-Transitive Similarity for Retrieval from Large Collections of Faces. 4883-4892 - Yandong Wen, Zhifeng Li, Yu Qiao:
Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition. 4893-4901 - Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Maja Pantic:
Copula Ordinal Regression for Joint Estimation of Facial Action Unit Intensity. 4902-4910 - Timo Bolkart, Stefanie Wuhrer:
A Robust Multilinear Model Learning Framework for 3D Faces. 4911-4919 - Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua:
Ordinal Regression with Multiple Output CNN for Age Estimation. 4920-4928 - Leonid Pishchulin, Eldar Insafutdinov, Siyu Tang, Bjoern Andres, Mykhaylo Andriluka, Peter V. Gehler, Bernt Schiele:
DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation. 4929-4937 - Suha Kwak, Minsu Cho, Ivan Laptev:
Thin-Slicing for Pose: Learning to Understand Pose without Explicit Pose Estimation. 4938-4947 - Hashim Yasin, Umar Iqbal, Björn Krüger, Andreas Weber, Juergen Gall:
A Dual-Source Approach for 3D Pose Estimation from a Single Image. 4948-4956 - Markus Oberweger, Gernot Riegler, Paul Wohlhart, Vincent Lepetit:
Efficiently Creating 3D Training Data for Fine Hand Pose Estimation. 4957-4965 - Xiaowei Zhou, Menglong Zhu, Spyridon Leonardos, Konstantinos G. Derpanis, Kostas Daniilidis:
Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video. 4966-4975 - Kushal Kafle, Christopher Kanan:
Answer-Type Prediction for Visual Question Answering. 4976-4984 - Satwik Kottur, Ramakrishna Vedantam, José M. F. Moura, Devi Parikh:
VisualWord2Vec (Vis-W2V): Learning Visually Grounded Word Embeddings Using Abstract Scenes. 4985-4994 - Yuke Zhu, Oliver Groth, Michael S. Bernstein, Li Fei-Fei:
Visual7W: Grounded Question Answering in Images. 4995-5004 - Liwei Wang, Yin Li, Svetlana Lazebnik:
Learning Deep Structure-Preserving Image-Text Embeddings. 5005-5013 - Peng Zhang, Yash Goyal, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Yin and Yang: Balancing and Answering Binary Visual Questions. 5014-5022 - Song Bai, Xiang Bai, Zhichao Zhou, Zhaoxiang Zhang, Longin Jan Latecki:
GIFT: A Real-Time and Scalable 3D Shape Search Engine. 5023-5032 - Chao Zhang, William A. P. Smith, Arnaud Dessein, Nick E. Pears, Hang Dai:
Functional Faces: Groupwise Dense Correspondence Using Functional Maps. 5033-5041 - Girum G. Demisse, Djamila Aouada, Björn E. Ottersten:
Similarity Metric for Curved Shapes in Euclidean Space. 5042-5050 - Jie Shi, Wen Zhang, Yalin Wang:
Shape Analysis with Hyperbolic Wasserstein Distance. 5051-5061 - Xinchu Shi, Haibin Ling, Weiming Hu, Junliang Xing, Yanning Zhang:
Tensor Power Iteration for Multi-graph Matching. 5062-5070 - Yongxin Yang, Timothy M. Hospedales:
Multivariate Regression on the Grassmannian for Predicting Novel Domains. 5071-5080 - Yao-Hung Hubert Tsai, Yi-Ren Yeh, Yu-Chiang Frank Wang:
Learning Cross-Domain Landmarks for Heterogeneous Domain Adaptation. 5081-5090 - Diego Marcos, Raffay Hamid, Devis Tuia:
Geospatial Correspondences for Multimodal Registration. 5091-5100 - Yue Wu, Qiang Ji:
Constrained Deep Transfer Feature Learning and Its Applications. 5101-5109 - George Trigeorgis, Mihalis A. Nicolaou, Stefanos Zafeiriou, Björn W. Schuller:
Deep Canonical Time Warping. 5110-5118 - Xianglong Liu, Xinjie Fan, Cheng Deng, Zhujin Li, Hao Su, Dacheng Tao:
Multilinear Hyperplane Hashing. 5119-5127 - Olivier Canévet, François Fleuret:
Large Scale Hard Sample Mining with Monte Carlo Tree Search. 5128-5137 - Atsushi Kanehira, Tatsuya Harada:
Multi-label Ranking from Positive and Unlabeled Data. 5138-5146 - Jianwei Yang, Devi Parikh, Dhruv Batra:
Joint Unsupervised Learning of Deep Representations and Image Clusters. 5147-5156 - Ming Yin, Yi Guo, Junbin Gao, Zhaoshui He, Shengli Xie:
Kernel Sparse Subspace Clustering on Symmetric Positive Definite Manifolds. 5157-5164 - Christopher Funk, Yanxi Liu:
Symmetry reCAPTCHA. 5165-5174 - Chen Huang, Chen Change Loy, Xiaoou Tang:
Unsupervised Learning of Discriminative Attributes and Visual Representations. 5175-5184 - Mehrtash Tafazzoli Harandi, Mathieu Salzmann, Fatih Porikli:
When VLAD Met Hilbert. 5185-5194 - Ha Quang Minh, Marco San-Biagio, Loris Bazzani, Vittorio Murino:
Approximate Log-Hilbert-Schmidt Distances between Covariance Operators for Image Classification. 5195-5203 - Yongfang Cheng, Yin Wang, Mario Sznaier, Octavia I. Camps:
Subspace Clustering with Priors via Sparse Quadratically Constrained Quadratic Programming. 5204-5212 - Xiai Chen, Zhi Han, Yao Wang, Qian Zhao, Deyu Meng, Yandong Tang:
Robust Tensor Factorization with Unknown Noise. 5213-5221 - Yusuke Mukuta, Tatsuya Harada:
Kernel Approximation via Empirical Orthogonal Decomposition for Unsupervised Feature Learning. 5222-5230 - Agata Mosinska-Domanska, Raphael Sznitman, Przemyslaw Glowacki, Pascal Fua:
Active Learning for Delineation of Curvilinear Structures. 5231-5239 - Xavier Alameda-Pineda, Elisa Ricci, Yan Yan, Nicu Sebe:
Recognizing Emotions from Abstract Paintings Using Non-Linear Matrix Completion. 5240-5248 - Canyi Lu, Jiashi Feng, Yudong Chen, Wei Liu, Zhouchen Lin, Shuicheng Yan:
Tensor Robust Principal Component Analysis: Exact Recovery of Corrupted Low-Rank Tensors via Convex Optimization. 5249-5257 - Soheil Kolouri, Yang Zou, Gustavo K. Rohde:
Sliced Wasserstein Kernels for Probability Distributions. 5258-5267 - Xian Wei, Hao Shen, Martin Kleinsteuber:
Trace Quotient Meets Sparsity: A Method for Learning Low Dimensional Image Representations. 5268-5277 - Hisham Cholakkal, Jubin Johnson, Deepu Rajan:
Backtracking ScSPM Image Classifier for Weakly Supervised Top-Down Saliency. 5278-5287 - Jun Xu, Tao Mei, Ting Yao, Yong Rui:
MSR-VTT: A Large Video Description Dataset for Bridging Video and Language. 5288-5296
Oral & Spotlight Session 4-2A
O4-2A: Learning and CNN Architectures
- Relja Arandjelovic, Petr Gronát, Akihiko Torii, Tomás Pajdla, Josef Sivic:
NetVLAD: CNN Architecture for Weakly Supervised Place Recognition. 5297-5307 - Ashesh Jain, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena:
Structural-RNN: Deep Learning on Spatio-Temporal Graphs. 5308-5317 - Yong-Deok Kim, Taewoong Jang, Bohyung Han, Seungjin Choi:
Learning to Select Pre-Trained Deep Representations with Bayesian Evidence Framework. 5318-5326 - Soravit Changpinyo, Wei-Lun Chao, Boqing Gong, Fei Sha:
Synthesized Classifiers for Zero-Shot Learning. 5327-5336 - Yanwei Fu, Leonid Sigal:
Semi-supervised Vocabulary-Informed Learning. 5337-5346
S4-2A: Learning and Optimization
- Zhuwen Li, Shuoguang Yang, Loong-Fah Cheong, Kim-Chuan Toh:
Simultaneous Clustering and Model Selection for Tensor Affinities. 5347-5355 - Jinglin Xu, Junwei Han, Feiping Nie:
Discriminatively Embedded K-Means for Multi-view Clustering. 5356-5364 - Ishant Shanu, Chetan Arora, Parag Singla:
Min Norm Point Algorithm for Higher Order MRF-MAP Inference. 5365-5374 - Chen Huang, Yining Li, Chen Change Loy, Xiaoou Tang:
Learning Deep Representation for Imbalanced Classification. 5375-5384 - Vijay Kumar B. G, Gustavo Carneiro, Ian D. Reid:
Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions. 5385-5394 - Piotr Koniusz, Anoop Cherian:
Sparse Coding for Third-Order Super-Symmetric Tensor Descriptors with Application to Texture Recognition. 5395-5403 - Jen-Hao Rick Chang, Aswin C. Sankaranarayanan, B. V. K. Vijaya Kumar:
Random Features for Sparse Signal Classification. 5404-5412
Oral & Spotlight Session 4-2B
O4-2B: 3D Shape Reconstruction
- Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, In-So Kweon:
High-Quality Depth from Uncalibrated Small Motion Clip. 5413-5421 - Hao Yang, Hui Zhang:
Efficient 3D Room Shape Recovery from a Single Panorama. 5422-5430 - Michael Firman, Oisin Mac Aodha, Simon J. Julier, Gabriel J. Brostow:
Structured Prediction of Unobserved Voxels from a Single Depth Image. 5431-5440 - Sean Ryan Fanello, Christoph Rhemann, Vladimir Tankovich, Adarsh Kowdle, Sergio Orts-Escolano, David Kim, Shahram Izadi:
HyperDepth: Learning Depth from Structured Light without Matching. 5441-5450 - Ting-Chun Wang, Manmohan Chandraker, Alexei A. Efros, Ravi Ramamoorthi:
SVBRDF-Invariant Shape and Reflectance Estimation from Light-Field Cameras. 5451-5459
S4-2B: 3D Reconstruction
- Nikolay Savinov, Christian Häne, Lubor Ladicky, Marc Pollefeys:
Semantic 3D Reconstruction with Continuous Regularization and Ray Potentials Using a Visibility Consistency Constraint. 5460-5469 - Carolina Raposo, João P. Barreto:
Theory and Practice of Structure-From-Motion Using Affine Correspondences. 5470-5478 - Silvano Galliani, Konrad Schindler:
Just Look at the Image: Viewpoint-Specific Surface Normal Prediction for Improved Multi-View Reconstruction. 5479-5487 - Filip Radenovic, Johannes L. Schönberger, Dinghuang Ji, Jan-Michael Frahm, Ondrej Chum, Jiri Matas:
From Dusk Till Dawn: Modeling in the Dark. 5488-5496 - Benjamin Eckart, Kihwan Kim, Alejandro J. Troccoli, Alonzo Kelly, Jan Kautz:
Accelerated Generative Models for 3D Point Cloud Data. 5497-5505 - Anirban Roy, Sinisa Todorovic:
Monocular Depth Estimation Using Neural Regression Forest. 5506-5514 - John Flynn, Ivan Neulander, James Philbin, Noah Snavely:
Deep Stereo: Learning to Predict New Views from the World's Imagery. 5515-5524
Oral & Spotlight Session 4-3A
O4-3A: Face, Gesture, & Situation Recognition: Algorithms and Datasets
- Shuo Yang, Ping Luo, Chen Change Loy, Xiaoou Tang:
WIDER FACE: A Face Detection Benchmark. 5525-5533 - Mark Yatskar, Luke Zettlemoyer, Ali Farhadi:
Situation Recognition: Visual Semantic Role Labeling for Image Understanding. 5534-5542
S4-3A: People and Faces
- James Booth, Anastasios Roussos, Stefanos Zafeiriou, Allan Ponniah, David J. Dunaway:
A 3D Morphable Model Learnt from 10, 000 Faces. 5543-5552 - Rasmus Rothe, Radu Timofte, Luc Van Gool:
Some Like It Hot - Visual Guidance for Preference Prediction. 5553-5561 - Carlos Fabian Benitez-Quiroz, Ramprakash Srinivasan, Aleix M. Martínez:
EmotioNet: An Accurate, Real-Time Algorithm for the Automatic Annotation of a Million Facial Expressions in the Wild. 5562-5570 - Shuxin Ouyang, Timothy M. Hospedales, Yi-Zhe Song, Xueming Li:
ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching. 5571-5579 - Karan Sikka, Gaurav Sharma, Marian Stewart Bartlett:
LOMo: Latent Ordinal Model for Facial Analysis in Videos. 5580-5589 - Dipan K. Pal, Felix Juefei-Xu, Marios Savvides:
Discriminative Invariant Kernel Features: A Bells-and-Whistles-Free Approach to Unsupervised Face Recognition and Pose Estimation. 5590-5599 - Peiyun Hu, Deva Ramanan:
Bottom-Up and Top-Down Reasoning with Hierarchical Rectified Gaussians. 5600-5609 - David Joseph Tan, Thomas J. Cashman, Jonathan Taylor, Andrew W. Fitzgibbon, Daniel Tarlow, Sameh Khamis, Shahram Izadi, Jamie Shotton:
Fits Like a Glove: Rapid and Reliable Hand Shape Personalization. 5610-5619 - Jing Shao, Chen Change Loy, Kai Kang, Xiaogang Wang:
Slicing Convolutional Neural Network for Crowd Video Understanding. 5620-5628
Spotlight Session 4-3B
S4-3B: 3D, Stereo, Matching, and Saliency Estimation
- Florian Bernard, Peter Gemmar, Frank Hertel, Jorge M. Gonçalves, Johan Thunberg:
Linear Shape Deformation Models with Local Support Using Graph-Based Structured Matrix Factorisation. 5629-5638 - Jayakorn Vongkulbhisal, Ricardo Silveira Cabral, Fernando De la Torre, João Paulo Costeira:
Motion from Structure (MfS): Searching for 3D Objects in Cluttered Point Trajectories. 5639-5647 - Charles Ruizhongtai Qi, Hao Su, Matthias Nießner, Angela Dai, Mengyuan Yan, Leonidas J. Guibas:
Volumetric and Multi-view CNNs for Object Classification on 3D Data. 5648-5656 - Menghua Zhai, Scott Workman, Nathan Jacobs:
Detecting Vanishing Points Using Global Image Context in a Non-ManhattanWorld. 5657-5665 - Chunyuan Li, Andrew Stevens, Changyou Chen, Yunchen Pu, Zhe Gan, Lawrence Carin:
Learning Weight Uncertainty with Stochastic Gradient MCMC for Shape Classification. 5666-5675 - Duc Thanh Nguyen, Binh-Son Hua, Minh-Khoi Tran, Quang-Hieu Pham, Sai-Kit Yeung:
A Field Model for Repairing 3D Shapes. 5676-5684 - Dylan Campbell, Lars Petersson:
GOGMA: Globally-Optimal Gaussian Mixture Alignment. 5685-5694 - Wenjie Luo, Alexander G. Schwing, Raquel Urtasun:
Efficient Deep Learning for Stereo Matching. 5695-5703 - Yinlin Hu, Rui Song, Yunsong Li:
Efficient Coarse-to-Fine Patch Match for Large Displacement Optical Flow. 5704-5712 - Ben Harwood, Tom Drummond:
FANNG: Fast Approximate Nearest Neighbour Graphs. 5713-5722 - Shengfeng He, Rynson W. H. Lau:
Exemplar-Driven Top-Down Saliency Detection via Deep Association. 5723-5732 - Jianming Zhang, Stan Sclaroff, Zhe Lin, Xiaohui Shen, Brian L. Price, Radomír Mech:
Unconstrained Salient Object Detection via Proposal Subset Optimization. 5733-5742 - Sina Honari, Jason Yosinski, Pascal Vincent, Christopher J. Pal:
Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation. 5743-5752 - Saumya Jetley, Naila Murray, Eleonora Vig:
End-to-End Saliency Mapping via Probability Distribution Prediction. 5753-5761
Poster Session P4-2
- Shaojing Fan, Tian-Tsong Ng, Bryan L. Koenig, Ming Jiang, Qi Zhao:
A Paradigm for Building Generalized Models of Human Image Perception through Data Fusion. 5762-5771 - Chi Nhan Duong, Khoa Luu, Kha Gia Quach, Tien D. Bui:
Longitudinal Face Modeling via Temporal Deep Restricted Boltzmann Machines. 5772-5780 - Srinivas S. S. Kruthiventi, Vennela Gudisa, Jaley H. Dholakiya, R. Venkatesh Babu:
Saliency Unified: A Deep Architecture for simultaneous Eye Fixation Prediction and Salient Object Segmentation. 5781-5790 - Yuxiang Zhou, Epameinondas Antonakos, Joan Alabort-i-Medina, Anastasios Roussos, Stefanos Zafeiriou:
Estimating Correspondences of Deformable Objects "In-the-Wild". 5791-5801 - Vladislav Golyanik, Sk Aziz Ali, Didier Stricker:
Gravitational Approach for Point Set Registration. 5802-5810 - Gang Wang, Zhicheng Wang, Yufei Chen, Qiangqiang Zhou, Weidong Zhao:
Context-Aware Gaussian Fields for Non-rigid Point Set Registration. 5811-5819 - Magnus Oskarsson, Kenneth Batstone, Kalle Åström:
Trust No One: Low Rank Matrix Factorization Using Hierarchical RANSAC. 5820-5829 - Chen Wang, Ramin Zabih:
Relaxation-Based Preprocessing Techniques for Markov Random Field Inference. 5830-5838 - Yuhui Quan, Yong Xu, Yuping Sun, Yan Huang, Hui Ji:
Sparse Coding for Classification via Discrimination Ensemble. 5839-5847 - Pierre Baqué, Timur M. Bagautdinov, François Fleuret, Pascal Fua:
Principled Parallel Mean-Field Inference for Discrete Random Fields. 5848-5857 - Tat-Jun Chin, Yang Heng Kee, Anders P. Eriksson, Frank Neumann:
Guaranteed Outlier Removal with Mixed Integer Linear Programs. 5858-5866 - Thalaiyasingam Ajanthan, Richard I. Hartley, Mathieu Salzmann:
Memory Efficient Max Flow for Multi-label Submodular MRFs. 5867-5876 - Mingkui Tan, Shijie Xiao, Junbin Gao, Dong Xu, Anton van den Hengel, Qinfeng Shi:
Proximal Riemannian Pursuit for Large-Scale Trace-Norm Minimization. 5877-5886 - Erik Bylow, Carl Olsson, Fredrik Kahl, Mikael G. Nilsson:
Minimizing the Maximal Rank. 5887-5895 - Caglayan Dicle, Burak Yilmaz, Octavia I. Camps, Mario Sznaier:
Solving Temporal Puzzles. 5896-5905 - Sohil Shah, Tom Goldstein, Christoph Studer:
Estimating Sparse Signals with Smooth Support via Convex Programming and Block Sparsity. 5906-5915 - Na Qi, Yunhui Shi, Xiaoyan Sun, Baocai Yin:
TenSR: Multi-dimensional Tensor Sparse Representation. 5916-5925 - Florian Jug, Evgeny Levinkov, Corinna Blasse, Eugene W. Myers, Bjoern Andres:
Moral Lineage Tracing. 5926-5935 - Behrooz Nasihatkon, Frida Fejne, Fredrik Kahl:
Globally Optimal Rigid Intensity Based Registration: A Fast Fourier Domain Approach. 5936-5944 - Haichuan Yang, Yijun Huang, Lam Tran, Ji Liu, Shuai Huang:
On Benefits of Selection Diversity via Bilevel Exclusive Sparsity. 5945-5954 - Bohan Zhuang, Guosheng Lin, Chunhua Shen, Ian D. Reid:
Fast Training of Triplet-Based Deep Binary Embedding Networks. 5955-5964 - Aayush Bansal, Bryan C. Russell, Abhinav Gupta:
Marr Revisited: 2D-3D Alignment via Surface Normal Prediction. 5965-5974 - Ziad Al-Halah, Makarand Tapaswi, Rainer Stiefelhagen:
Recovering the Missing Link: Predicting Class-Attribute Associations for Unsupervised Zero-Shot Learning. 5975-5984 - Yang Zhang, Boqing Gong, Mubarak Shah:
Fast Zero-Shot Image Tagging. 5985-5994 - Anran Wang, Jianfei Cai, Jiwen Lu, Tat-Jen Cham:
Modality and Component Aware Feature Fusion for RGB-D Scene Classification. 5995-6004 - Yilin Wang, Suhang Wang, Jiliang Tang, Huan Liu, Baoxin Li:
PPP: Joint Pointwise and Pairwise Image Label Prediction. 6005-6013 - Jan Dirk Wegner, Steve Branson, David Hall, Konrad Schindler, Pietro Perona:
Cataloging Public Objects Using Aerial and Street-Level Images - Urban Trees. 6014-6023 - Francisco Massa, Bryan C. Russell, Mathieu Aubry:
Deep Exemplar 2D-3D Detection by Adapting from Real to Rendered Views. 6024-6033 - Ziming Zhang, Venkatesh Saligrama:
Zero-Shot Learning via Joint Latent Similarity Embedding. 6034-6042 - Bin Yang, Junjie Yan, Zhen Lei, Stan Z. Li:
CRAFT Objects from Images. 6043-6051
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.