Oral 1
3D Vision
- Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence (PDF)
- Dylan Campbell, Lars Petersson, Laurent Kneip, Hongdong Li
- Robust Pseudo Random Fields for Light-Field Stereo Matching (PDF)
- Chao-Tsung Huang
- A Lightweight Approach for On-The-Fly Reflectance Estimation (PDF)
- Kihwan Kim, Jinwei Gu, Stephen Tyree, Pavlo Molchanov, Matthias Nießner, Jan Kautz
- Distributed Very Large Scale Bundle Adjustment by Global Camera Consensus (PDF)
- Runze Zhang, Siyu Zhu, Tian Fang, Long Quan
- Practical Projective Structure From Motion (P2SfM) (PDF, videos)
- Ludovic Magerand, Alessio Del Bue
Spotlight 1
3D Vision & Video Analysis
- Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing
- Tz-Ying Wu, Ting-An Chien, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun
- Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction From a Single Image
- Rui Zhu, Hamed Kiani Galoogahi, Chaoyang Wang, Simon Lucey
- End-To-End Learning of Geometry and Context for Deep Stereo Regression
- Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry, Ryan Kennedy, Abraham Bachrach, Adam Bry
- Using Sparse Elimination for Solving Minimal Problems in Computer Vision
- Janne Heikkilä
- High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference
- Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu
- Temporal Tessellation: A Unified Approach for Video Analysis
- Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf
- Learning Policies for Adaptive Tracking With Deep Feature Cascades
- Chen Huang, Simon Lucey, Deva Ramanan
- Temporal Shape Super-Resolution by Intra-Frame Motion Encoding Using High-Fps Structured Light
- Yuki Shiba, Satoshi Ono, Ryo Furukawa, Shinsaku Hiura, Hiroshi Kawasaki
Poster 1
Oral O1 Posters
- Globally-Optimal Inlier Set Maximisation for Simultaneous Camera Pose and Feature Correspondence
- Dylan Campbell, Lars Petersson, Laurent Kneip, Hongdong Li
- Robust Pseudo Random Fields for Light-Field Stereo Matching
- Chao-Tsung Huang
- A Lightweight Approach for On-The-Fly Reflectance Estimation
- Kihwan Kim, Jinwei Gu, Stephen Tyree, Pavlo Molchanov, Matthias Nießner, Jan Kautz
- Distributed Very Large Scale Bundle Adjustment by Global Camera Consensus
- Runze Zhang, Siyu Zhu, Tian Fang, Long Quan
- Practical Projective Structure From Motion (P2SfM)
- Ludovic Magerand, Alessio Del Bue
Spotlight S1 Posters
- Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing
- Tz-Ying Wu, Ting-An Chien, Cheng-Sheng Chan, Chan-Wei Hu, Min Sun
- Rethinking Reprojection: Closing the Loop for Pose-Aware Shape Reconstruction From a Single Image
- Rui Zhu, Hamed Kiani Galoogahi, Chaoyang Wang, Simon Lucey
- End-To-End Learning of Geometry and Context for Deep Stereo Regression
- Alex Kendall, Hayk Martirosyan, Saumitro Dasgupta, Peter Henry, Ryan Kennedy, Abraham Bachrach, Adam Bry
- Using Sparse Elimination for Solving Minimal Problems in Computer Vision
- Janne Heikkilä
- High-Resolution Shape Completion Using Deep Neural Networks for Global Structure and Local Geometry Inference
- Xiaoguang Han, Zhen Li, Haibin Huang, Evangelos Kalogerakis, Yizhou Yu
- Temporal Tessellation: A Unified Approach for Video Analysis
- Dotan Kaufman, Gil Levi, Tal Hassner, Lior Wolf
- Learning Policies for Adaptive Tracking With Deep Feature Cascades
- Chen Huang, Simon Lucey, Deva Ramanan
- Temporal Shape Super-Resolution by Intra-Frame Motion Encoding Using High-Fps Structured Light
- Yuki Shiba, Satoshi Ono, Ryo Furukawa, Shinsaku Hiura, Hiroshi Kawasaki
3D Computer Vision
- Real-Time Monocular Pose Estimation of 3D Objects Using Temporally Consistent Local Color Histograms
- Henning Tjaden, Ulrich Schwanecke, Elmar Schömer
- CAD Priors for Accurate and Flexible Instance Reconstruction
- Tolga Birdal, Slobodan Ilic
- Colored Point Cloud Registration Revisited
- Jaesik Park, Qian-Yi Zhou, Vladlen Koltun
- Learning Compact Geometric Features
- Marc Khoury, Qian-Yi Zhou, Vladlen Koltun
- Joint Layout Estimation and Global Multi-View Registration for Indoor Reconstruction
- Jeong-Kyun Lee, Jaewon Yea, Min-Gyu Park, Kuk-Jin Yoon
Biomedical Image Analysis
- A Geometric Framework for Statistical Analysis of Trajectories With Distinct Temporal Spans
- Rudrasis Chakraborty, Vikas Singh, Nagesh Adluru, Baba C. Vemuri
- An Optimal Transportation Based Univariate Neuroimaging Index
- Liang Mi, Wen Zhang, Junwei Zhang, Yonghui Fan, Dhruman Goradia, Kewei Chen, Eric M. Reiman, Xianfeng Gu, Yalin Wang
Face & Gesture
- S3FD: Single Shot Scale-Invariant Face Detector
- Shifeng Zhang, Xiangyu Zhu, Zhen Lei, Hailin Shi, Xiaobo Wang, Stan Z. Li
Low-Level Vision & Image Processing
- Amulet: Aggregating Multi-Level Convolutional Features for Salient Object Detection
- Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Xiang Ruan
- Learning Uncertain Convolutional Features for Accurate Saliency Detection
- Pingping Zhang, Dong Wang, Huchuan Lu, Hongyu Wang, Baocai Yin
- Zero-Order Reverse Filtering
- Xin Tao, Chao Zhou, Xiaoyong Shen, Jue Wang, Jiaya Jia
- Learning Blind Motion Deblurring
- Patrick Wieschollek, Michael Hirsch, Bernhard Schölkopf, Hendrik P. A. Lensch
- Joint Adaptive Sparsity and Low-Rankness on the Fly: An Online Tensor Reconstruction Scheme for Video Denoising
- Bihan Wen, Yanjun Li, Luke Pfister, Yoram Bresler
- Learning to Super-Resolve Blurry Face and Text Images
- Xiangyu Xu, Deqing Sun, Jinshan Pan, Yujin Zhang, Hanspeter Pfister, Ming-Hsuan Yang
- Video Frame Interpolation via Adaptive Separable Convolution
- Simon Niklaus, Long Mai, Feng Liu
Motion & Tracking
- Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection
- Pierre Baqué, François Fleuret, Pascal Fua
- Encouraging LSTMs to Anticipate Actions Very Early
- Mohammad Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann, Basura Fernando, Lars Petersson, Lars Andersson
- PathTrack: Fast Trajectory Annotation With Path Supervision
- Santiago Manen, Michael Gygli, Dengxin Dai, Luc Van Gool
- Tracking the Untrackable: Learning to Track Multiple Cues With Long-Term Dependencies
- Amir Sadeghian, Alexandre Alahi, Silvio Savarese
- MirrorFlow: Exploiting Symmetries in Joint Optical Flow and Occlusion Estimation
- Junhwa Hur, Stefan Roth
- Tracking as Online Decision-Making: Learning a Policy From Streaming Videos With Reinforcement Learning
- James Supančič, III, Deva Ramanan
Optimization Methods
- Non-Convex Rank/Sparsity Regularization and Local Minima
- Carl Olsson, Marcus Carlsson, Fredrik Andersson, Viktor Larsson
- A Revisit of Sparse Coding Based Anomaly Detection in Stacked RNN Framework
- Weixin Luo, Wen Liu, Shenghua Gao
Recognition
- HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis
- Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, Jing Shao, Shuai Yi, Junjie Yan, Xiaogang Wang
- No Fuss Distance Metric Learning Using Proxies
- Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey Ioffe, Saurabh Singh
- Benchmarking and Error Diagnosis in Multi-Instance Pose Estimation
- Matteo Ruggero Ronchi, Pietro Perona
- Orientation Invariant Feature Embedding and Spatial Temporal Regularization for Vehicle Re-Identification
- Zhongdao Wang, Luming Tang, Xihui Liu, Zhuliang Yao, Shuai Yi, Jing Shao, Junjie Yan, Shengjin Wang, Hongsheng Li, Xiaogang Wang
- Fashion Forward: Forecasting Visual Style in Fashion
- Ziad Al-Halah, Rainer Stiefelhagen, Kristen Grauman
- Towards 3D Human Pose Estimation in the Wild: A Weakly-Supervised Approach
- Xingyi Zhou, Qixing Huang, Xiao Sun, Xiangyang Xue, Yichen Wei
- Flow-Guided Feature Aggregation for Video Object Detection
- Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei
- Reasoning About Fine-Grained Attribute Phrases Using Reference Games
- Jong-Chyi Su, Chenyun Wu, Huaizu Jiang, Subhransu Maji
- DeNet: Scalable Real-Time Object Detection With Directed Sparse Sampling
- Lachlan Tychsen-Smith, Lars Petersson
- MIHash: Online Hashing With Mutual Information
- Fatih Cakir, Kun He, Sarah Adel Bargal, Stan Sclaroff
- SafetyNet: Detecting and Rejecting Adversarial Examples Robustly
- Jiajun Lu, Theerasit Issaranon, David Forsyth
- Recurrent Models for Situation Recognition
- Svetlana Lazebnik, Arun Mallya
- Multi-Label Image Recognition by Recurrently Discovering Attentional Regions
- Zhouxia Wang, Tianshui Chen, Guanbin Li, Ruijia Xu, Liang Lin
- Deep Determinantal Point Process for Large-Scale Multi-Label Classification
- Pengtao Xie, Ruslan Salakhutdinov, Luntian Mou, Eric P. Xing
- Visual Semantic Planning Using Deep Successor Representations
- Yuke Zhu, Daniel Gordon, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi
- Neural Person Search Machines
- Hao Liu, Jiashi Feng, Zequn Jie, Karlekar Jayashree, Bo Zhao, Meibin Qi, Jianguo Jiang, Shuicheng Yan
- DualNet: Learn Complementary Features for Image Recognition
- Saihui Hou, Xu Liu, Zilei Wang
- Higher-Order Integration of Hierarchical Convolutional Activations for Fine-Grained Visual Categorization
- Sijia Cai, Wangmeng Zuo, Lei Zhang
- Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner
- Tseng-Hung Chen, Yuan-Hong Liao, Ching-Yao Chuang, Wan-Ting Hsu, Jianlong Fu, Min Sun
- Attribute Recognition by Joint Recurrent Learning of Context and Correlation
- Jingya Wang, Xiatian Zhu, Shaogang Gong, Wei Li
- VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization
- Saihui Hou, Yushan Feng, Zilei Wang
- Increasing CNN Robustness to Occlusions by Reducing Filter Support
- Elad Osherov, Michael Lindenbaum
- Exploiting Multi-Grain Ranking Constraints for Precisely Searching Visually-Similar Vehicles
- Ke Yan, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang
- Recurrent Scale Approximation for Object Detection in CNN
- Yu Liu, Hongyang Li, Junjie Yan, Fangyin Wei, Xiaogang Wang, Xiaoou Tang
Segmentation, Grouping & Shape
- Embedding 3D Geometric Features for Rigid Object Part Segmentation
- Yafei Song, Xiaowu Chen, Jia Li, Qinping Zhao
Statistical Methods & Learning
- Towards Context-Aware Interaction Recognition for Visual Relationship Detection
- Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Ian Reid
- When Unsupervised Domain Adaptation Meets Tensor Representations
- Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, Ke Xian, Chunhua Shen, Anton van den Hengel
- Look, Listen and Learn
- Relja Arandjelović, Andrew Zisserman
- Grad-CAM: Visual Explanations From Deep Networks via Gradient-Based Localization
- Ramprasaath R. Selvaraju, Michael Cogswell, Abhishek Das, Ramakrishna Vedantam, Devi Parikh, Dhruv Batra
- Image-Based Localization Using LSTMs for Structured Feature Correlation
- Florian Walch, Caner Hazirbas, Laura Leal-Taixé, Torsten Sattler, Sebastian Hilsenbeck, Daniel Cremers
- Personalized Image Aesthetics
- Jian Ren, Xiaohui Shen, Zhe Lin, Radomír Měch, David J. Foran
- Predicting Deeper Into the Future of Semantic Segmentation
- Pauline Luc, Natalia Neverova, Camille Couprie, Jakob Verbeek, Yann LeCun
- Coordinating Filters for Faster Deep Neural Networks
- Wei Wen, Cong Xu, Chunpeng Wu, Yandan Wang, Yiran Chen, Hai Li
- Unsupervised Representation Learning by Sorting Sequences
- Hsin-Ying Lee, Jia-Bin Huang, Maneesh Singh, Ming-Hsuan Yang
Video
- A Read-Write Memory Network for Movie Story Understanding
- Seil Na, Sangho Lee, Jisung Kim, Gunhee Kim
- SegFlow: Joint Learning for Video Object Segmentation and Optical Flow
- Jingchun Cheng, Yi-Hsuan Tsai, Shengjin Wang, Ming-Hsuan Yang
- Unsupervised Action Discovery and Localization in Videos
- Khurram Soomro, Mubarak Shah
- Dense-Captioning Events in Videos
- Ranjay Krishna, Kenji Hata, Frederic Ren, Li Fei-Fei, Juan Carlos Niebles
- Learning Long-Term Dependencies for Action Recognition With a Biologically-Inspired Deep Network
- Yemin Shi, Yonghong Tian, Yaowei Wang, Wei Zeng, Tiejun Huang
- Compressive Quantization for Fast Object Instance Search in Videos
- Tan Yu, Zhenzhen Wang, Junsong Yuan
- Complex Event Detection by Identifying Reliable Shots From Untrimmed Videos
- Hehe Fan, Xiaojun Chang, De Cheng, Yi Yang, Dong Xu, Alexander G. Hauptmann
Vision for X
- Deep Direct Regression for Multi-Oriented Scene Text Detection
- Wenhao He, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu
Oral 2
Recognition I
- Open Set Domain Adaptation
- Pau Panareda Busto, Juergen Gall
- Deformable Convolutional Networks
- Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei
- Ensemble Diffusion for Retrieval
- Song Bai, Zhichao Zhou, Jingdong Wang, Xiang Bai, Longin Jan Latecki, Qi Tian
- FoveaNet: Perspective-Aware Urban Scene Parsing
- Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng
- Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild
- Christopher Funk, Yanxi Liu
Spotlight 2
Recognition I
- Learning to Reason: End-To-End Module Networks for Visual Question Answering
- Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko
- Hard-Aware Deeply Cascaded Embedding
- Yuhui Yuan, Kuiyuan Yang, Chao Zhang
- Query-Guided Regression Network With Context Policy for Phrase Grounding
- Kan Chen, Rama Kovvuri, Ram Nevatia
- SUBIC: A Supervised, Structured Binary Code for Image Search
- Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval
- Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
- Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
- A Generative Model of People in Clothing
- Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler
- Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models
- Roman Klokov, Victor Lempitsky
- Improved Image Captioning via Policy Gradient Optimization of SPIDEr
- Siqi Liu, Zhenhai Zhu, Ning Ye, Sergio Guadarrama, Kevin Murphy
Poster 2
Oral O2 Posters
- Open Set Domain Adaptation
- Pau Panareda Busto, Juergen Gall
- Deformable Convolutional Networks
- Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei
- Ensemble Diffusion for Retrieval
- Song Bai, Zhichao Zhou, Jingdong Wang, Xiang Bai, Longin Jan Latecki, Qi Tian
- FoveaNet: Perspective-Aware Urban Scene Parsing
- Xin Li, Zequn Jie, Wei Wang, Changsong Liu, Jimei Yang, Xiaohui Shen, Zhe Lin, Qiang Chen, Shuicheng Yan, Jiashi Feng
- Beyond Planar Symmetry: Modeling Human Perception of Reflection and Rotation Symmetries in the Wild
- Christopher Funk, Yanxi Liu
Spotlight S2 Posters
- Learning to Reason: End-To-End Module Networks for Visual Question Answering
- Ronghang Hu, Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Kate Saenko
- Hard-Aware Deeply Cascaded Embedding
- Yuhui Yuan, Kuiyuan Yang, Chao Zhang
- Query-Guided Regression Network With Context Policy for Phrase Grounding
- Kan Chen, Rama Kovvuri, Ram Nevatia
- SUBIC: A Supervised, Structured Binary Code for Image Search
- Himalaya Jain, Joaquin Zepeda, Patrick Pérez, Rémi Gribonval
- Revisiting Unreasonable Effectiveness of Data in Deep Learning Era
- Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta
- A Generative Model of People in Clothing
- Christoph Lassner, Gerard Pons-Moll, Peter V. Gehler
- Escape From Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models
- Roman Klokov, Victor Lempitsky
- Improved Image Captioning via Policy Gradient Optimization of SPIDEr
- Siqi Liu, Zhenhai Zhu, Ning Ye, Sergio Guadarrama, Kevin Murphy
3D Computer Vision
- Rolling Shutter Correction in Manhattan World
- Pulak Purkait, Christopher Zach, Ale&scaron, Leonardis
- Local-To-Global Point Cloud Registration Using a Dictionary of Viewpoint Descriptors (PDF)
- David Avidar, David Malah, Meir Barzohar
- 3D-PRNN: Generating Shape Primitives With Recurrent Neural Networks
- Chuhang Zou, Ersin Yumer, Jimei Yang, Duygu Ceylan, Derek Hoiem
- BodyFusion: Real-Time Capture of Human Motion and Surface Geometry Using a Single Depth Camera
- Tao Yu, Kaiwen Guo, Feng Xu, Yuan Dong, Zhaoqi Su, Jianhui Zhao, Jianguo Li, Qionghai Dai, Yebin Liu
- Quasiconvex Plane Sweep for Triangulation With Outliers
- Qianggong Zhang, Tat-Jun Chin, David Suter
- "Maximizing Rigidity" Revisited: A Convex Programming Approach for Generic 3D Shape Reconstruction From Multiple Perspective Views
- Pan Ji, Hongdong Li, Yuchao Dai, Ian Reid
- Surface Registration via Foliation
- Xiaopeng Zheng, Chengfeng Wen, Na Lei, Ming Ma, Xianfeng Gu
- Rolling-Shutter-Aware Differential SfM and Image Rectification
- Bingbing Zhuang, Loong-Fah Cheong, Gim Hee Lee
- Corner-Based Geometric Calibration of Multi-Focus Plenoptic Cameras
- Sotiris Nousias, François Chadebecq, Jonas Pichat, Pearse Keane, Sébastien Ourselin, Christos Bergeles
Computational Photography
- Focal Track: Depth and Accommodation With Oscillating Lens Deformation
- Qi Guo, Emma Alexander, Todd Zickler
- Reconfiguring the Imaging Pipeline for Computer Vision
- Mark Buckler, Suren Jayasuriya, Adrian Sampson
- Catadioptric HyperSpectral Light Field Imaging
- Yujia Xue, Kang Zhu, Qiang Fu, Xilin Chen, Jingyi Yu
Face & Gesture
- Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification
- Hong-Xing Yu, Ancong Wu, Wei-Shi Zheng
- Real Time Eye Gaze Tracking With 3D Deformable Eye-Face Model
- Kang Wang, Qiang Ji
- Ensemble Deep Learning for Skeleton-Based Action Recognition Using Temporal Sliding LSTM Networks
- Inwoong Lee, Doyoung Kim, Seoungyoon Kang, Sanghoon Lee
- How Far Are We From Solving the 2D & 3D Face Alignment Problem? (And a Dataset of 230,000 3D Facial Landmarks)
- Adrian Bulat, Georgios Tzimiropoulos
- Large Pose 3D Face Reconstruction From a Single Image via Direct Volumetric CNN Regression
- Aaron S. Jackson, Adrian Bulat, Vasileios Argyriou, Georgios Tzimiropoulos
Low-Level Vision & Image Processing
- RankIQA: Learning From Rankings for No-Reference Image Quality Assessment
- Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov
- Look, Perceive and Segment: Finding the Salient Objects in Images via Two-Stream Fixation-Semantic CNNs
- Xiaowu Chen, Anlin Zheng, Jia Li, Feng Lu
- Delving Into Salient Object Subitizing and Detection
- Shengfeng He, Jianbo Jiao, Xiaodan Zhang, Guoqiang Han, Rynson W.H. Lau
- Learning Discriminative Data Fitting Functions for Blind Image Deblurring
- Jinshan Pan, Jiangxin Dong, Yu-Wing Tai, Zhixun Su, Ming-Hsuan Yang
- Video Deblurring via Semantic Segmentation and Pixel-Wise Non-Linear Kernel
- Wenqi Ren, Jinshan Pan, Xiaochun Cao, Ming-Hsuan Yang
- On-Demand Learning for Deep Image Restoration
- Ruohan Gao, Kristen Grauman
- Multi-Channel Weighted Nuclear Norm Minimization for Real Color Image Denoising
- Jun Xu, Lei Zhang, David Zhang, Xiangchu Feng
- Coherent Online Video Style Transfer
- Dongdong Chen, Jing Liao, Lu Yuan, Nenghai Yu, Gang Hua
Motion & Tracking
- SHaPE: A Novel Graph Theoretic Algorithm for Making Consensus-Based Decisions in Person Re-Identification Systems
- Arko Barman, Shishir K. Shah
- Need for Speed: A Benchmark for Higher Frame Rate Object Tracking
- Hamed Kiani Galoogahi, Ashton Fagg, Chen Huang, Deva Ramanan, Simon Lucey
- Learning Background-Aware Correlation Filters for Visual Tracking
- Hamed Kiani Galoogahi, Ashton Fagg, Simon Lucey
- Robust Object Tracking Based on Temporal and Spatial Deep Networks
- Zhu Teng, Junliang Xing, Qiang Wang, Congyan Lang, Songhe Feng, Yi Jin
- Real-Time Hand Tracking Under Occlusion From an Egocentric RGB-D Sensor
- Franziska Mueller, Dushyant Mehta, Oleksandr Sotnychenko, Srinath Sridhar, Dan Casas, Christian Theobalt
- Predicting Human Activities Using Stochastic Grammar
- Siyuan Qi, Siyuan Huang, Ping Wei, Song-Chun Zhu
- ProbFlow: Joint Optical Flow and Uncertainty Estimation
- Anne S. Wannenwetsch, Margret Keuper, Stefan Roth
Optimization Methods
- Sublabel-Accurate Discretization of Nonconvex Free-Discontinuity Problems
- Thomas Möllenhoff, Daniel Cremers
Recognition
- DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding
- Yinda Zhang, Mingru Bai, Pushmeet Kohli, Shahram Izadi, Jianxiong Xiao
- BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography
- Michael J. Wilber, Chen Fang, Hailin Jin, Aaron Hertzmann, John Collomosse, Serge Belongie
- Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation
- Yu Chen, Chunhua Shen, Xiu-Shen Wei, Lingqiao Liu, Jian Yang
- An Empirical Study of Language CNN for Image Captioning
- Jiuxiang Gu, Gang Wang, Jianfei Cai, Tsuhan Chen
- Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning
- Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis
- Areas of Attention for Image Captioning
- Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek
- Generative Modeling of Audible Shapes for Object Perception
- Zhoutong Zhang, Jiajun Wu, Qiujia Li, Zhengjia Huang, James Traer, Josh H. McDermott, Joshua B. Tenenbaum, William T. Freeman
- Scene Graph Generation From Objects, Phrases and Region Captions
- Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, Xiaogang Wang
- Recurrent Multimodal Interaction for Referring Image Segmentation
- Chenxi Liu, Zhe Lin, Xiaohui Shen, Jimei Yang, Xin Lu, Alan Yuille
- Learning Feature Pyramids for Human Pose Estimation
- Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, Xiaogang Wang
- Structured Attentions for Visual Question Answering
- Chen Zhu, Yanpeng Zhao, Shuaiyi Huang, Kewei Tu, Yi Ma
- Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection
- Debidatta Dwibedi, Ishan Misra, Martial Hebert
Segmentation, Grouping & Shape
- Cascaded Feature Network for Semantic Segmentation of RGB-D Images
- Di Lin, Guangyong Chen, Daniel Cohen-Or, Pheng-Ann Heng, Hui Huang
Statistical Methods & Learning
- Encoder Based Lifelong Learning
- Amal Rannen, Rahaf Aljundi, Matthew B. Blaschko, Tinne Tuytelaars
- Transitive Invariance for Self-Supervised Visual Representation Learning
- Xiaolong Wang, Kaiming He, Abhinav Gupta
- Weakly Supervised Learning of Deep Metrics for Stereo Reconstruction
- Stepan Tulyakov, Anton Ivanov, François Fleuret
- Fine-Grained Recognition in the Wild: A Multi-Task Domain Adaptation Approach
- Timnit Gebru, Judy Hoffman, Li Fei-Fei
- SORT: Second-Order Response Transform for Visual Recognition
- Yan Wang, Lingxi Xie, Chenxi Liu, Siyuan Qiao, Ya Zhang, Wenjun Zhang, Qi Tian, Alan Yuille
- Adversarial Examples for Semantic Segmentation and Object Detection
- Cihang Xie, Jianyu Wang, Zhishuai Zhang, Yuyin Zhou, Lingxi Xie, Alan Yuille
- Genetic CNN
- Lingxi Xie, Alan Yuille
- Channel Pruning for Accelerating Very Deep Neural Networks
- Yihui He, Xiangyu Zhang, Jian Sun
- Infinite Latent Feature Selection: A Probabilistic Latent Graph-Based Ranking Approach
- Giorgio Roffo, Simone Melzi, Umberto Castellani, Alessandro Vinciarelli
Video
- Video Fill in the Blank Using LR/RL LSTMs With Spatial-Temporal Attentions
- Amir Mazaheri, Dong Zhang, Mubarak Shah
- Primary Video Object Segmentation via Complementary CNNs and Neighborhood Reversible Flow
- Jia Li, Anlin Zheng, Xiaowu Chen, Bin Zhou
- Attentive Semantic Video Generation Using Captions
- Tanya Marwah, Gaurav Mittal, Vineeth N. Balasubramanian
- Following Gaze in Video
- Adrià, Recasens, Carl Vondrick, Aditya Khosla, Antonio Torralba
- Adaptive RNN Tree for Large-Scale Human Action Recognition
- Wenbo Li, Longyin Wen, Ming-Ching Chang, Ser Nam Lim, Siwei Lyu
- Spatio-Temporal Person Retrieval via Natural Language Queries
- Masataka Yamaguchi, Kuniaki Saito, Yoshitaka Ushiku, Tatsuya Harada
Vision for X
- Automatic Spatially-Aware Fashion Concept Discovery
- Xintong Han, Zuxuan Wu, Phoenix X. Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis
- ChromaTag: A Colored Marker and Fast Detection Algorithm
- Joseph DeGol, Timothy Bretl, Derek Hoiem
- Adversarial Image Perturbation for Privacy Protection — A Game Theory Perspective
- Seong Joon Oh, Mario Fritz, Bernt Schiele
- WeText: Scene Text Detection Under Weak Supervision
- Shangxuan Tian, Shijian Lu, Chongshou Li
Oral 3
Vision for X
- Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization
- Xun Huang, Serge Belongie
- Photographic Image Synthesis With Cascaded Refinement Networks
- Qifeng Chen, Vladlen Koltun
- SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again
- Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir Navab
- Unsupervised Creation of Parameterized Avatars
- Lior Wolf, Yaniv Taigman, Adam Polyak
- Learning for Active 3D Mapping
- Karel Zimmermann, Tomá&scaron, Petříček, Vojtěch Šalanský, Tomá&scaron, Svoboda
Poster 3
Oral O3 Posters
- Arbitrary Style Transfer in Real-Time With Adaptive Instance Normalization
- Xun Huang, Serge Belongie
- Photographic Image Synthesis With Cascaded Refinement Networks
- Qifeng Chen, Vladlen Koltun
- SSD-6D: Making RGB-Based 3D Detection and 6D Pose Estimation Great Again
- Wadim Kehl, Fabian Manhardt, Federico Tombari, Slobodan Ilic, Nassir Navab
- Unsupervised Creation of Parameterized Avatars
- Lior Wolf, Yaniv Taigman, Adam Polyak
- Learning for Active 3D Mapping
- Karel Zimmermann, Tomá&scaron, Petříček, Vojtěch Šalanský, Tomá&scaron, Svoboda
3D Computer Vision
- Toward Perceptually-Consistent Stereo: A Scanline Study
- Jialiang Wang, Daniel Glasner, Todd Zickler
- Surface Normals in the Wild
- Weifeng Chen, Donglai Xiang, Jia Deng
- Unsupervised Learning of Stereo Matching
- Chao Zhou, Hong Zhang, Xiaoyong Shen, Jiaya Jia
- Unrestricted Facial Geometry Reconstruction Using Image-To-Image Translation
- Matan Sela, Elad Richardson, Ron Kimmel
- Learned Multi-Patch Similarity
- Wilfried Hartmann, Silvano Galliani, Michal Havlena, Luc Van Gool, Konrad Schindler
- Click Here: Human-Localized Keypoints as Guidance for Viewpoint Estimation
- Ryan Szeto, Jason J. Corso
- Unsupervised Adaptation for Deep Stereo
- Alessio Tonioni, Matteo Poggi, Stefano Mattoccia, Luigi Di Stefano
Computational Photography
- Composite Focus Measure for High Quality Depth Maps
- Parikshit Sakurikar, P. J. Narayanan
Face & Gesture
- Reconstruction-Based Disentanglement for Pose-Invariant Face Recognition
- Xi Peng (Group: Work group, Company,... - optional), Xiang Yu (Group: Work group, Company,... - optional), Kihyuk Sohn (Group: Work group, Company,... - optional), Dimitris N. Metaxas (Group: Work group, Company,... - optional), Manmohan Chandraker (Group: Work group, Company,... - optional)
- Recurrent 3D-2D Dual Learning for Large-Pose Facial Landmark Detection
- Shengtao Xiao, Jiashi Feng, Luoqi Liu, Xuecheng Nie, Wei Wang, Shuicheng Yan, Ashraf Kassim
- Anchored Regression Networks Applied to Age Estimation and Super Resolution
- Eirikur Agustsson, Radu Timofte, Luc Van Gool
- Infant Footprint Recognition
- Eryun Liu
Low-Level Vision & Image Processing
- Self-Paced Kernel Estimation for Robust Blind Image Deblurring
- Dong Gong, Mingkui Tan, Yanning Zhang, Anton van den Hengel, Qinfeng Shi
- Super-Trajectory for Video Segmentation
- Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli
- Be Your Own Prada: Fashion Synthesis With Structural Coherence
- Shizhan Zhu, Raquel Urtasun, Sanja Fidler, Dahua Lin, Chen Change Loy
- Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution
- Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan
- Learning Gaze Transitions From Depth to Improve Video Saliency Estimation
- George Leifman, Dmitry Rudoy, Tristan Swedish, Eduardo Bayro-Corrochano, Ramesh Raskar
- Joint Convolutional Analysis and Synthesis Sparse Representation for Single Image Layer Separation
- Shuhang Gu, Deyu Meng, Wangmeng Zuo, Lei Zhang
- Modelling the Scene Dependent Imaging in Cameras With a Deep Neural Network
- Seonghyeon Nam, Seon Joo Kim
- Transformed Low-Rank Model for Line Pattern Noise Removal
- Yi Chang, Luxin Yan, Sheng Zhong
- Weakly Supervised Manifold Learning for Dense Semantic Object Correspondence
- Utkarsh Gaur, B. S. Manjunath
- PanNet: A Deep Network Architecture for Pan-Sharpening
- Junfeng Yang, Xueyang Fu, Yuwen Hu, Yue Huang, Xinghao Ding, John Paisley
Motion & Tracking
- Dual Motion GAN for Future-Flow Embedded Video Prediction
- Xiaodan Liang, Lisa Lee, Wei Dai, Eric P. Xing
- Online Robust Image Alignment via Subspace Learning From Gradient Orientations
- Qingqing Zheng, Yi Wang, Pheng-Ann Heng
- Learning Dynamic Siamese Network for Visual Object Tracking
- Qing Guo, Wei Feng, Ce Zhou, Rui Huang, Liang Wan, Song Wang
Optimization Methods
- High Order Tensor Formulation for Convolutional Sparse Coding
- Adel Bibi, Bernard Ghanem
- Learning Proximal Operators: Using Denoising Networks for Regularizing Inverse Imaging Problems
- Tim Meinhardt, Michael Möller, Caner Hazirbas, Daniel Cremers
Recognition
- ScaleNet: Guiding Object Proposal Generation in Supermarkets and Beyond
- Siyuan Qiao, Wei Shen, Weichao Qiu, Chenxi Liu, Alan Yuille
- Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection
- Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta
- VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
- Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong
- Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering
- Zhou Yu, Jun Yu, Jianping Fan, Dacheng Tao
- SCNet: Learning Semantic Correspondence
- Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce
- Soft Proposal Networks for Weakly Supervised Object Localization
- Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao
- Class Rectification Hard Mining for Imbalanced Deep Learning
- Qi Dong, Shaogang Gong, Xiatian Zhu
- Generating High-Quality Crowd Density Maps Using Contextual Pyramid CNNs
- Vishwanath A. Sindagi, Vishal M. Patel
- See the Glass Half Full: Reasoning About Liquid Containers, Their Volume and Content
- Roozbeh Mottaghi, Connor Schenck, Dieter Fox, Ali Farhadi
- Hierarchical Multimodal LSTM for Dense Visual-Semantic Embedding
- Zhenxing Niu, Mo Zhou, Le Wang, Xinbo Gao, Gang Hua
- Identity-Aware Textual-Visual Matching With Latent Co-Attention
- Shuang Li, Tong Xiao, Hongsheng Li, Wei Yang, Xiaogang Wang
- Learning Deep Neural Networks for Vehicle Re-ID With Visual-Spatio-Temporal Path Proposals
- Yantao Shen, Tong Xiao, Hongsheng Li, Shuai Yi, Xiaogang Wang
- Learning From Noisy Labels With Distillation
- Yuncheng Li, Jianchao Yang, Yale Song, Liangliang Cao, Jiebo Luo, Li-Jia Li
- DSOD: Learning Deeply Supervised Object Detectors From Scratch
- Zhiqiang Shen, Zhuang Liu, Jianguo Li, Yu-Gang Jiang, Yurong Chen, Xiangyang Xue
- Phrase Localization and Visual Relationship Detection With Comprehensive Image-Language Cues
- Bryan A. Plummer, Arun Mallya, Christopher M. Cervantes, Julia Hockenmaier, Svetlana Lazebnik
- Chained Cascade Network for Object Detection
- Wanli Ouyang, Kun Wang, Xin Zhu, Xiaogang Wang
- VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition
- Seokju Lee, Junsik Kim, Jae Shin Yoon, Seunghak Shin, Oleksandr Bailo, Namil Kim, Tae-Hee Lee, Hyun Seok Hong, Seung-Hoon Han, In So Kweon
- Unsupervised Learning of Important Objects From First-Person Videos
- Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
- An Analysis of Visual Question Answering Algorithms
- Kushal Kafle, Christopher Kanan
- Visual Relationship Detection With Internal and External Linguistic Knowledge Distillation
- Ruichi Yu, Ang Li, Vlad I. Morariu, Larry S. Davis
- A Two Stream Siamese Convolutional Neural Network for Person Re-Identification
- Dahjung Chung, Khalid Tahboub, Edward J. Delp
- Joint Learning of Object and Action Detectors
- Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid
Segmentation, Grouping & Shape
- No More Discrimination: Cross City Adaptation of Road Scene Segmenters
- Yi-Hsin Chen, Wei-Yu Chen, Yu-Ting Chen, Bo-Cheng Tsai, Yu-Chiang Frank Wang, Min Sun
- Open Vocabulary Scene Parsing
- Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, Antonio Torralba
- Learned Watershed: End-To-End Learning of Seeded Segmentation
- Steffen Wolf, Lukas Schott, Ullrich Köthe, Fred Hamprecht
- Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes (PDF, code)
- Yang Zhang, Philip David, Boqing Gong
- Scale-Adaptive Convolutions for Scene Parsing
- Rui Zhang, Sheng Tang, Yongdong Zhang, Jintao Li, Shuicheng Yan
Statistical Methods & Learning
- Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption
- Ryo Yonetani, Vishnu Naresh Boddeti, Kris M. Kitani, Yoichi Sato
- Multi-Task Self-Supervised Visual Learning
- Carl Doersch, Andrew Zisserman
- A Self-Balanced Min-Cut Algorithm for Image Clustering
- Xiaojun Chen, Joshua Zhexue Haung, Feiping Nie, Renjie Chen, Qingyao Wu
- Is Second-Order Information Helpful for Large-Scale Visual Recognition?
- Peihua Li, Jiangtao Xie, Qilong Wang, Wangmeng Zuo
- Factorized Bilinear Models for Image Recognition
- Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou
- Octree Generating Networks: Efficient Convolutional Architectures for High-Resolution 3D Outputs
- Maxim Tatarchenko, Alexey Dosovitskiy, Thomas Brox
- Truncating Wide Networks Using Binary Tree Architectures
- Yan Zhang, Mete Ozay, Shuohao Li, Takayuki Okatani
Video
- Bringing Background Into the Foreground: Making All Classes Equal in Weakly-Supervised Video Semantic Segmentation
- Fatemeh Sadat Saleh, Mohammad Sadegh Aliakbarian, Mathieu Salzmann, Lars Petersson, Jose M. Álvarez
- View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition From Skeleton Data
- Pengfei Zhang, Cuiling Lan, Junliang Xing, Wenjun Zeng, Jianru Xue, Nanning Zheng
- Joint Discovery of Object States and Manipulation Actions
- Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Simon Lacoste-Julien
- What Actions Are Needed for Understanding Human Actions in Videos?
- Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta
- Lattice Long Short-Term Memory for Human Action Recognition
- Lin Sun, Kui Jia, Kevin Chen, Dit-Yan Yeung, Bertram E. Shi, Silvio Savarese
- Common Action Discovery and Localization in Unconstrained Videos
- Jiong Yang, Junsong Yuan
- Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks
- Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, Seunghak Shin, In So Kweon
- Am I a Baller? Basketball Performance Assessment From First-Person Videos
- Gedas Bertasius, Hyun Soo Park, Stella X. Yu, Jianbo Shi
Vision for X
- Deep Cropping via Attention Box Prediction and Aesthetics Assessment
- Wenguan Wang, Jianbing Shen
- Raster-To-Vector: Revisiting Floorplan Transformation
- Chen Liu, Jiajun Wu, Pushmeet Kohli, Yasutaka Furukawa
- Deep TextSpotter: An End-To-End Trainable Scene Text Localization and Recognition Framework
- Michal Bušta, Luká&scaron, Neumann, Jiří, Matas
Spotlight 3
Vision for X & Computational Phtography
- Playing for Benchmarks
- Stephan R. Richter, Zeeshan Hayder, Vladlen Koltun
- Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks
- Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros
- GANs for Biological Image Synthesis
- Anton Osokin, Anatole Chessel, Rafael E. Carazo Salas, Federico Vaggi
- Learning to Synthesize a 4D RGBD Light Field From a Single Image
- Pratul P. Srinivasan, Tongzhou Wang, Ashwin Sreelal, Ravi Ramamoorthi, Ren Ng
- Neural EPI-Volume Networks for Shape From Light Field
- Stefan Heber, Wei Yu, Thomas Pock
- Material Editing Using a Physically Based Rendering Network
- Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, Jyh-Ming Lien
- Turning Corners Into Cameras: Principles and Methods
- Katherine L. Bouman, Vickie Ye, Adam B. Yedidia, Frédo Durand, Gregory W. Wornell, Antonio Torralba, William T. Freeman
- Linear Differential Constraints for Photo-Polarimetric Height Estimation
- Silvia Tozza, William A. P. Smith, Dizhong Zhu, Ravi Ramamoorthi, Edwin R. Hancock
Poster 4
Spotlight S3 Posters
- Playing for Benchmarks
- Stephan R. Richter, Zeeshan Hayder, Vladlen Koltun
- Unpaired Image-To-Image Translation Using Cycle-Consistent Adversarial Networks
- Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros
- GANs for Biological Image Synthesis
- Anton Osokin, Anatole Chessel, Rafael E. Carazo Salas, Federico Vaggi
- Learning to Synthesize a 4D RGBD Light Field From a Single Image
- Pratul P. Srinivasan, Tongzhou Wang, Ashwin Sreelal, Ravi Ramamoorthi, Ren Ng
- Neural EPI-Volume Networks for Shape From Light Field
- Stefan Heber, Wei Yu, Thomas Pock
- Material Editing Using a Physically Based Rendering Network
- Guilin Liu, Duygu Ceylan, Ersin Yumer, Jimei Yang, Jyh-Ming Lien
- Turning Corners Into Cameras: Principles and Methods
- Katherine L. Bouman, Vickie Ye, Adam B. Yedidia, Frédo Durand, Gregory W. Wornell, Antonio Torralba, William T. Freeman
- Linear Differential Constraints for Photo-Polarimetric Height Estimation
- Silvia Tozza, William A. P. Smith, Dizhong Zhu, Ravi Ramamoorthi, Edwin R. Hancock
3D Computer Vision
- Polynomial Solvers for Saturated Ideals
- Viktor Larsson, Kalle Åström, Magnus Oskarsson
- Shape Inpainting Using 3D Generative Adversarial Network and Recurrent Convolutional Networks
- Weiyue Wang, Qiangui Huang, Suya You, Chao Yang, Ulrich Neumann
- SurfaceNet: An End-To-End 3D Neural Network for Multiview Stereopsis
- Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, Lu Fang
- Making Minimal Solvers for Absolute Pose Estimation Compact and Robust
- Viktor Larsson, Zuzana Kukelova, Yinqiang Zheng
- 3D Surface Detail Enhancement From a Single Normal Map
- Wuyuan Xie, Miaohui Wang, Xianbiao Qi, Lei Zhang
- RMPE: Regional Multi-Person Pose Estimation
- Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu
- Online Video Object Detection Using Association LSTM
- Yongyi Lu, Cewu Lu, Chi-Keung Tang
- PolyFit: Polygonal Surface Reconstruction From Point Clouds
- Liangliang Nan, Peter Wonka
- Progressive Large Scale-Invariant Image Matching in Scale Space (PDF)
- Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan
- Efficient Global 2D-3D Matching for Camera Localization in a Large-Scale 3D Map (PDF)
- Liu Liu, Hongdong Li, Yuchao Dai
- Multi-View Non-Rigid Refinement and Normal Selection for High Quality 3D Reconstruction (PDF)
- Sk. Mohammadul Haque, Venu Madhav Govindu
Biomedical Image Analysis
- Multi-Stage Multi-Recursive-Input Fully Convolutional Networks for Neuronal Boundary Detection
- Wei Shen, Bin Wang, Yuan Jiang, Yan Wang, Alan Yuille
Computational Photography
- Depth and Image Restoration From Light Field in a Scattering Medium
- Jiandong Tian, Zachary Murez, Tong Cui, Zhen Zhang, David Kriegman, Ravi Ramamoorthi
- Video Reflection Removal Through Spatio-Temporal Optimization
- Ajay Nandoriya, Mohamed Elgharib, Changil Kim, Mohamed Hefeeda, Wojciech Matusik
Face & Gesture
- Efficient Online Local Metric Adaptation via Negative Samples for Person Re-Identification
- Jiahuan Zhou, Pei Yu, Wei Tang, Ying Wu
- Stepwise Metric Promotion for Unsupervised Video Person Re-Identification
- Zimo Liu, Dong Wang, Huchuan Lu
- Beyond Face Rotation: Global and Local Perception GAN for Photorealistic and Identity Preserving Frontal View Synthesis
- Rui Huang, Shu Zhang, Tianyu Li, Ran He
- Group Re-Identification via Unsupervised Transfer of Sparse Features Encoding
- Giuseppe Lisanti, Niki Martinel, Alberto Del Bimbo, Gian Luca Foresti
- Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification
- Hamdi Dibeklioğlu
Low-Level Vision & Image Processing
- Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer
- Ming Lu, Hao Zhao, Anbang Yao, Feng Xu, Yurong Chen, Li Zhang
- Blind Image Deblurring With Outlier Handling
- Jiangxin Dong, Jinshan Pan, Zhixun Su, Ming-Hsuan Yang
- Paying Attention to Descriptions Generated by Image Captioning Models
- Hamed R. Tavakoli, Rakshith Shetty, Ali Borji, Jorma Laaksonen
- Fast Image Processing With Fully-Convolutional Networks
- Qifeng Chen, Jia Xu, Vladlen Koltun
- Robust Video Super-Resolution With Learned Temporal Dynamics
- Ding Liu, Zhaowen Wang, Yuchen Fan, Xianming Liu, Zhangyang Wang, Shiyu Chang, Thomas Huang
- Should We Encode Rain Streaks in Video as Deterministic or Stochastic?
- Wei Wei, Lixuan Yi, Qi Xie, Qian Zhao, Deyu Meng, Zongben Xu
- Joint Bi-Layer Optimization for Single-Image Rain Streak Removal
- Lei Zhu, Chi-Wing Fu, Dani Lischinski, Pheng-Ann Heng
Motion & Tracking
- Low-Dimensionality Calibration Through Local Anisotropic Scaling for Robust Hand Model Personalization
- Edoardo Remelli, Anastasia Tkach, Andrea Tagliasacchi, Mark Pauly
- Non-Markovian Globally Consistent Multi-Object Tracking
- Andrii Maksai, Xinchao Wang, François Fleuret, Pascal Fua
- CREST: Convolutional Residual Learning for Visual Tracking
- Yibing Song, Chao Ma, Lijun Gong, Jiawei Zhang, Rynson W. H. Lau, Ming-Hsuan Yang
- Volumetric Flow Estimation for Incompressible Fluids Using the Stationary Stokes Equations
- Katrin Lasinger, Christoph Vogel, Konrad Schindler
- Bounding Boxes, Segmentations and Object Coordinates: How Important Is Recognition for 3D Scene Flow Estimation in Autonomous Driving Scenarios?
- Aseem Behl, Omid Hosseini Jafari, Siva Karthik Mustikovela, Hassan Abu Alhaija, Carsten Rother, Andreas Geiger
Optimization Methods
- Performance Guaranteed Network Acceleration via High-Order Residual Quantization
- Zefan Li, Bingbing Ni, Wenjun Zhang, Xiaokang Yang, Wen Gao
Recognition
- Deep Metric Learning With Angular Loss
- Jian Wang, Feng Zhou, Shilei Wen, Xiao Liu, Yuanqing Lin
- Compositional Human Pose Regression
- Xiao Sun, Jiaxiang Shang, Shuang Liang, Yichen Wei
- MUTAN: Multimodal Tucker Fusion for Visual Question Answering
- Hedi Ben-younes, Remi Cadene, Matthieu Cord, Nicolas Thome
- Revisiting IM2GPS in the Deep Learning Era
- Nam Vo, Nathan Jacobs, James Hays
- Scene Parsing With Global Context Embedding
- Wei-Chih Hung, Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang
- A Simple yet Effective Baseline for 3D Human Pose Estimation
- Julieta Martinez, Rayat Hossain, Javier Romero, James J. Little
- Dual-Glance Model for Deciphering Social Relationships
- Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli
- Sketching With Style: Visual Search With Sketches and Aesthetic Context
- John Collomosse, Tu Bui, Michael J. Wilber, Chen Fang, Hailin Jin
- Point Set Registration With Global-Local Correspondence and Transformation Estimation
- Su Zhang, Yang Yang, Kun Yang, Yi Luo, Sim-Heng Ong
Segmentation, Grouping & Shape
- SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?
- John McCormac, Ankur Handa, Stefan Leutenegger, Andrew J. Davison
- A Unified Model for Near and Remote Sensing
- Scott Workman, Menghua Zhai, David J. Crandall, Nathan Jacobs
- Directionally Convolutional Networks for 3D Shape Segmentation
- Haotian Xu, Ming Dong, Zichun Zhong
- AMAT: Medial Axis Transform for Natural Images
- Stavros Tsogkas, Sven Dickinson
- Deep Dual Learning for Semantic Image Segmentation
- Ping Luo, Guangrun Wang, Liang Lin, Xiaogang Wang
- Regional Interactive Image Segmentation Networks
- Jun Hao Liew, Yunchao Wei, Wei Xiong, Sim-Heng Ong, Jiashi Feng
Statistical Methods & Learning
- Learning Efficient Convolutional Networks Through Network Slimming
- Zhuang Liu, Jianguo Li, Zhiqiang Shen, Gao Huang, Shoumeng Yan, Changshui Zhang
- CVAE-GAN: Fine-Grained Image Generation Through Asymmetric Training
- Jianmin Bao, Dong Chen, Fang Wen, Houqiang Li, Gang Hua
- Universal Adversarial Perturbations Against Semantic Image Segmentation
- Jan Hendrik Metzen, Mummadi Chaithanya Kumar, Thomas Brox, Volker Fischer
- Associative Domain Adaptation
- Philip Haeusser, Thomas Frerix, Alexander Mordvintsev, Daniel Cremers
- Introspective Neural Networks for Generative Modeling
- Justin Lazarow, Long Jin, Zhuowen Tu
- Towards a Unified Compositional Model for Visual Pattern Modeling
- Wei Tang, Pei Yu, Jiahuan Zhou, Ying Wu
- Least Squares Generative Adversarial Networks
- Xudong Mao, Qing Li, Haoran Xie, Raymond Y.K. Lau, Zhen Wang, Stephen Paul Smolley
- Centered Weight Normalization in Accelerating Training of Deep Neural Networks
- Lei Huang, Xianglong Liu, Yang Liu, Bo Lang, Dacheng Tao
- Deep Growing Learning
- Guangcong Wang, Xiaohua Xie, Jianhuang Lai, Jiaxuan Zhuo
- Smart Mining for Deep Metric Learning
- Ben Harwood, Vijay Kumar B G, Gustavo Carneiro, Ian Reid, Tom Drummond
- Temporal Generative Adversarial Nets With Singular Value Clipping
- Masaki Saito, Eiichi Matsumoto, Shunta Saito
- Sampling Matters in Deep Embedding Learning
- Chao-Yuan Wu, R. Manmatha, Alexander J. Smola, Philipp Krähenbühl
- DualGAN: Unsupervised Dual Learning for Image-To-Image Translation
- Zili Yi, Hao Zhang, Ping Tan, Minglun Gong
Video
- Learning View-Invariant Features for Person Identification in Temporally Synchronized Videos Taken by Wearable Cameras
- Kang Zheng, Xiaochuan Fan, Yuewei Lin, Hao Guo, Hongkai Yu, Dazhou Guo, Song Wang
- MarioQA: Answering Questions by Watching Gameplay Videos
- Jonghwan Mun, Paul Hongsuck Seo, Ilchae Jung, Bohyung Han
- SBGAR: Semantics Based Group Activity Recognition
- Xin Li, Mooi Choo Chuah
- Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video
- Davide Moltisanti, Michael Wray, Walterio Mayol-Cuevas, Dima Damen
- Unmasking the Abnormal Events in Video
- Radu Tudor Ionescu, Sorina Smeureanu, Bogdan Alexe, Marius Popescu
- Chained Multi-Stream Networks Exploiting Pose, Motion, and Appearance for Action Classification and Detection
- Mohammadreza Zolfaghari, Gabriel L. Oliveira, Nima Sedaghat, Thomas Brox
- Temporal Action Detection With Structured Segment Networks
- Yue Zhao, Yuanjun Xiong, Limin Wang, Zhirong Wu, Xiaoou Tang, Dahua Lin
- Jointly Recognizing Object Fluents and Tasks in Egocentric Videos
- Yang Liu, Ping Wei, Song-Chun Zhu
- Transferring Objects: Joint Inference of Container and Human Pose
- Hanqing Wang, Wei Liang, Lap-Fai Yu
Vision for X
- Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention
- Jinkyu Kim, John Canny
Oral 4
Recognition 2
- Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning
- Abhishek Das, Satwik Kottur, José, M. F. Moura, Stefan Lee, Dhruv Batra
- Mask R-CNN
- Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick
- Towards Diverse and Natural Image Descriptions via a Conditional GAN
- Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin
- Focal Loss for Dense Object Detection
- Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár
- Inferring and Executing Programs for Visual Reasoning
- Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
Spotlight 4
Recognition 2
- Visual Forecasting by Imitating Dynamics in Natural Sequences
- Kuo-Hao Zeng, William B. Shen, De-An Huang, Min Sun, Juan Carlos Niebles
- TorontoCity: Seeing the World With a Million Eyes
- Shenlong Wang, Min Bai, Gellért Máttyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun
- Low-Shot Visual Recognition by Shrinking and Hallucinating Features
- Bharath Hariharan, Ross Girshick
- A Coarse-Fine Network for Keypoint Localization
- Shaoli Huang, Mingming Gong, Dacheng Tao
- Detect to Track and Track to Detect
- Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman
- Single Shot Text Detector With Regional Attention
- Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li
- SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition
- Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Richard Bowden
- A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition
- Isma Hadji, Richard P. Wildes
Poster 5
Oral O4 Posters
- Learning Cooperative Visual Dialog Agents With Deep Reinforcement Learning
- Abhishek Das, Satwik Kottur, José, M. F. Moura, Stefan Lee, Dhruv Batra
- Mask R-CNN
- Kaiming He, Georgia Gkioxari, Piotr Dollár, Ross Girshick
- Towards Diverse and Natural Image Descriptions via a Conditional GAN
- Bo Dai, Sanja Fidler, Raquel Urtasun, Dahua Lin
- Focal Loss for Dense Object Detection
- Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár
- Inferring and Executing Programs for Visual Reasoning
- Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Judy Hoffman, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick
Spotlight S4 Posters
- Visual Forecasting by Imitating Dynamics in Natural Sequences
- Kuo-Hao Zeng, William B. Shen, De-An Huang, Min Sun, Juan Carlos Niebles
- TorontoCity: Seeing the World With a Million Eyes
- Shenlong Wang, Min Bai, Gellért Máttyus, Hang Chu, Wenjie Luo, Bin Yang, Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun
- Low-Shot Visual Recognition by Shrinking and Hallucinating Features
- Bharath Hariharan, Ross Girshick
- A Coarse-Fine Network for Keypoint Localization
- Shaoli Huang, Mingming Gong, Dacheng Tao
- Detect to Track and Track to Detect
- Christoph Feichtenhofer, Axel Pinz, Andrew Zisserman
- Single Shot Text Detector With Regional Attention
- Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li
- SubUNets: End-To-End Hand Shape and Continuous Sign Language Recognition
- Necati Cihan Camgoz, Simon Hadfield, Oscar Koller, Richard Bowden
- A Spatiotemporal Oriented Energy Network for Dynamic Texture Recognition
- Isma Hadji, Richard P. Wildes
3D Computer Vision
- Probabilistic Structure From Motion With Objects (PSfMO)
- Paul Gay, Cosimo Rubino, Vaibhav Bansal, Alessio Del Bue
- A 3D Morphable Model of Craniofacial Shape and Texture Variation
- Hang Dai, Nick Pears, William A. P. Smith, Christian Duncan
- Multi-View Dynamic Shape Refinement Using Local Temporal Integration
- Vincent Leroy, Jean-Sebastien Franco, Edmond Boyer
- Learning Hand Articulations by Hallucinating Heat Distribution
- Chiho Choi, Sangpil Kim, Karthik Ramani
- Intrinsic3D: High-Quality 3D Reconstruction by Joint Appearance and Geometry Optimization With Spatially-Varying Lighting
- Robert Maier, Kihwan Kim, Daniel Cremers, Jan Kautz, Matthias Nießner
- Robust Hand Pose Estimation During the Interaction With an Unknown Object
- Chiho Choi, Sang Ho Yoon, Chin-Ning Chen, Karthik Ramani
- Detailed Surface Geometry and Albedo Recovery From RGB-D Video Under Natural Illumination
- Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang
- Monocular Free-Head 3D Gaze Tracking With Deep Learning and Geometry Constraints
- Wangjiang Zhu, Haoping Deng
Computational Photography
- Filter Selection for Hyperspectral Estimation
- Boaz Arad, Ohad Ben-Shahar
- A Microfacet-Based Reflectance Model for Photometric Stereo With Highly Specular Surfaces
- Lixiong Chen, Yinqiang Zheng, Boxin Shi, Art Subpa-Asa, Imari Sato
Face & Gesture
- Detecting Faces Using Inside Cascaded Contextual CNN
- Kaipeng Zhang, Zhanpeng Zhang, Hao Wang, Zhifeng Li, Yu Qiao, Wei Liu
- A Novel Space-Time Representation on the Positive Semidefinite Cone for Facial Expression Recognition
- Anis Kacem, Mohamed Daoudi, Boulbaba Ben Amor, Juan Carlos Alvarez-Paiva
- DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding
- Dieu Linh Tran, Robert Walecki, Ognjen (Oggi) Rudovic, Stefanos Eleftheriadis, Björn Schuller, Maja Pantic
- Pose-Invariant Face Alignment With a Single CNN
- Amin Jourabloo, Mao Ye, Xiaoming Liu, Liu Ren
- Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos
- Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker
- Deeply-Learned Part-Aligned Representations for Person Re-Identification
- Liming Zhao, Xi Li, Yueting Zhuang, Jingdong Wang
Low-Level Vision & Image Processing
- Semantic Line Detection and Its Applications
- Jun-Tae Lee, Han-Ul Kim, Chul Lee, Chang-Su Kim
- A Generic Deep Architecture for Single Image Reflection Removal and Image Smoothing
- Qingnan Fan, Jiaolong Yang, Gang Hua, Baoquan Chen, David Wipf
- Revisiting Cross-Channel Information Transfer for Chromatic Aberration Correction
- Tiancheng Sun, Yifan Peng, Wolfgang Heidrich
- High-Quality Correspondence and Segmentation Estimation for Dual-Lens Smart-Phone Portraits
- Xiaoyong Shen, Hongyun Gao, Xin Tao, Chao Zhou, Jiaya Jia
- Learning Visual Attention to Identify People With Autism Spectrum Disorder
- Ming Jiang, Qi Zhao
- DSLR-Quality Photos on Mobile Devices With Deep Convolutional Networks
- Andrey Ignatov, Nikolay Kobyshev, Radu Timofte, Kenneth Vanhoey, Luc Van Gool
- Non-Uniform Blind Deblurring by Reblurring
- Yuval Bahat, Netalee Efrat, Michal Irani
- Misalignment-Robust Joint Filter for Cross-Modal Image Pairs
- Takashi Shibata, Masayuki Tanaka, Masatoshi Okutomi
- Low-Rank Tensor Completion: A Pseudo-Bayesian Learning Approach
- Wei Chen, Nan Song
- DeepCD: Learning Deep Complementary Descriptors for Patch Representations
- Tsun-Yi Yang, Jo-Han Hsu, Yen-Yu Lin, Yung-Yu Chuang
Motion & Tracking
- Beyond Standard Benchmarks: Parameterizing Performance Evaluation in Visual Object Tracking
- Luka Čehovin Zajc, Alan Lukeič, Ale&scaron, Leonardis, Matej Kristan
- The Pose Knows: Video Forecasting by Generating Pose Futures
- Jacob Walker, Kenneth Marino, Abhinav Gupta, Martial Hebert
- What Will Happen Next? Forecasting Player Moves in Sports Videos
- Panna Felsen, Pulkit Agrawal, Jitendra Malik
Optimization Methods
- Robust Kronecker-Decomposable Component Analysis for Low-Rank Modeling
- Mehdi Bahri, Yannis Panagakis, Stefanos Zafeiriou
Recognition
- Recurrent Topic-Transition GAN for Visual Paragraph Generation
- Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing
- A Two-Streamed Network for Estimating Fine-Scaled Depth Maps From Single RGB Images
- Jun Li, Reinhard Klein, Angela Yao
- Weakly Supervised Object Localization Using Things and Stuff Transfer
- Miaojing Shi, Holger Caesar, Vittorio Ferrari
- Single Image Action Recognition Using Semantic Body Part Actions
- Zhichen Zhao, Huimin Ma, Shaodi You
- Incremental Learning of Object Detectors Without Catastrophic Forgetting
- Konstantin Shmelkov, Cordelia Schmid, Karteek Alahari
- Generative Adversarial Networks Conditioned by Brain Signals
- Simone Palazzo, Concetto Spampinato, Isaak Kavasidis, Daniela Giordano, Mubarak Shah
- Learning to Disambiguate by Asking Discriminative Questions
- Yining Li, Chen Huang, Xiaoou Tang, Chen Change Loy
- Interpretable Explanations of Black Boxes by Meaningful Perturbation
- Ruth C. Fong, Andrea Vedaldi
- DeepRoadMapper: Extracting Road Topology From Aerial Images
- Gellért Máttyus, Wenjie Luo, Raquel Urtasun
- Monocular 3D Human Pose Estimation by Predicting Depth on Joints
- Bruce Xiaohan Nie, Ping Wei, Song-Chun Zhu
- Large-Scale Image Retrieval With Attentive Deep Local Features
- Hyeonwoo Noh, Andre Araujo, Jack Sim, Tobias Weyand, Bohyung Han
- Deep Globally Constrained MRFs for Human Pose Estimation
- Ioannis Marras, Petar Palasek, Ioannis Patras
- Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning
- Soravit Changpinyo, Wei-Lun Chao, Fei Sha
- Multi-Label Learning of Part Detectors for Heavily Occluded Pedestrian Detection
- Chunluan Zhou, Junsong Yuan
- SGN: Sequential Grouping Networks for Instance Segmentation
- Shu Liu, Jiaya Jia, Sanja Fidler, Raquel Urtasun
- Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors
- Hong-Yu Zhou, Bin-Bin Gao, Jianxin Wu
- Aesthetic Critiques Generation for Photos
- Kuang-Yu Chang, Kung-Hung Lu, Chu-Song Chen
- Hide-And-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization
- Krishna Kumar Singh, Yong Jae Lee
Segmentation, Grouping & Shape
- Two-Phase Learning for Weakly Supervised Object Localization
- Dahun Kim, Donghyeon Cho, Donggeun Yoo, In So Kweon
Statistical Methods & Learning
- Curriculum Dropout
- Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, René, Vidal, Vittorio Murino
- Predictor Combination at Test Time
- Kwang In Kim, James Tompkin, Christian Richardt
- Guided Perturbations: Self-Corrective Behavior in Convolutional Neural Networks
- Swami Sankaranarayanan, Arpit Jain, Ser Nam Lim
- Learning Robust Visual-Semantic Embeddings
- Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov
- PUnDA: Probabilistic Unsupervised Domain Adaptation for Knowledge Transfer Across Visual Categories
- Behnam Gholami, Ognjen (Oggi) Rudovic, Vladimir Pavlovic
- Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses
- Christian Rupprecht, Iro Laina, Robert DiPietro, Maximilian Baust, Federico Tombari, Nassir Navab, Gregory D. Hager
Video
- CDTS: Collaborative Detection, Tracking, and Segmentation for Online Multiple Object Segmentation in Videos
- Yeong Jun Koh, Chang-Su Kim
- Temporal Superpixels Based on Proximity-Weighted Patch Matching
- Se-Ho Lee, Won-Dong Jang, Chang-Su Kim
- Joint Detection and Recounting of Abnormal Events by Learning Deep Generic Knowledge
- Ryota Hinami, Tao Mei, Shin'ichi Satoh
- TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals
- Jiyang Gao, Zhenheng Yang, Kan Chen, Chen Sun, Ram Nevatia
- Online Real-Time Multiple Spatiotemporal Action Localisation and Prediction
- Gurkirt Singh, Suman Saha, Michael Sapienza, Philip H. S. Torr, Fabio Cuzzolin
- Leveraging Weak Semantic Relevance for Complex Video Event Classification
- Chao Li, Jiewei Cao, Zi Huang, Lei Zhu, Heng Tao Shen
- Weakly Supervised Summarization of Web Videos
- Rameswar Panda, Abir Das, Ziyan Wu, Jan Ernst, Amit K. Roy-Chowdhury
- FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras
- Shanghang Zhang, Guanhang Wu, João P. Costeira, José, M. F. Moura
Vision for X
- Fast Face-Swap Using Convolutional Neural Networks
- Iryna Korshunova, Wenzhe Shi, Joni Dambre, Lucas Theis
- Towards a Visual Privacy Advisor: Understanding and Predicting Privacy Risks in Images
- Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz
Oral 5
Face and Human Behaviour Analysis
- First-Person Activity Forecasting With Online Inverse Reinforcement Learning
- Nicholas Rhinehart, Kris M. Kitani
- Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources
- Adrian Bulat, Georgios Tzimiropoulos
- MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction
- Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Pérez, Christian Theobalt
- RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos
- Wenbin Du, Yali Wang, Yu Qiao
- Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition
- Chi Nhan Duong, Kha Gia Quach, Khoa Luu, Ngan Le, Marios Savvides
Spotlight 5
Face and Human Behaviour Analysis
- Attribute-Enhanced Face Recognition With Neural Tensor Fusion Networks
- Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang
- Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro
- Zhedong Zheng, Liang Zheng, Yi Yang
- Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks With Spatiotemporal Transformer Modules
- Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, Jian Cheng
- Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
- Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen
- Learning Discriminative Aggregation Network for Video-Based Face Recognition
- Yongming Rao, Ji Lin, Jiwen Lu, Jie Zhou
- Synergy Between Face Alignment and Tracking via Discriminative Global Consensus Optimization
- Muhammad Haris Khan, John McDonagh, Georgios Tzimiropoulos
- SVDNet for Pedestrian Retrieval
- Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang
- Towards More Accurate Iris Recognition Using Deeply Learned Spatially Corresponding Features
- Zijing Zhao, Ajay Kumar
Poster 6
Oral O5 Posters
- First-Person Activity Forecasting With Online Inverse Reinforcement Learning
- Nicholas Rhinehart, Kris M. Kitani
- Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment With Limited Resources
- Adrian Bulat, Georgios Tzimiropoulos
- MoFA: Model-Based Deep Convolutional Face Autoencoder for Unsupervised Monocular Reconstruction
- Ayush Tewari, Michael Zollhöfer, Hyeongwoo Kim, Pablo Garrido, Florian Bernard, Patrick Pérez, Christian Theobalt
- RPAN: An End-To-End Recurrent Pose-Attention Network for Action Recognition in Videos
- Wenbin Du, Yali Wang, Yu Qiao
- Temporal Non-Volume Preserving Approach to Facial Age-Progression and Age-Invariant Face Recognition
- Chi Nhan Duong, Kha Gia Quach, Khoa Luu, Ngan Le, Marios Savvides
Spotlight S5 Posters
- Attribute-Enhanced Face Recognition With Neural Tensor Fusion Networks
- Guosheng Hu, Yang Hua, Yang Yuan, Zhihong Zhang, Zheng Lu, Sankha S. Mukherjee, Timothy M. Hospedales, Neil M. Robertson, Yongxin Yang
- Unlabeled Samples Generated by GAN Improve the Person Re-Identification Baseline in Vitro
- Zhedong Zheng, Liang Zheng, Yi Yang
- Egocentric Gesture Recognition Using Recurrent 3D Convolutional Neural Networks With Spatiotemporal Transformer Modules
- Congqi Cao, Yifan Zhang, Yi Wu, Hanqing Lu, Jian Cheng
- Recursive Spatial Transformer (ReST) for Alignment-Free Face Recognition
- Wanglong Wu, Meina Kan, Xin Liu, Yi Yang, Shiguang Shan, Xilin Chen
- Learning Discriminative Aggregation Network for Video-Based Face Recognition
- Yongming Rao, Ji Lin, Jiwen Lu, Jie Zhou
- Synergy Between Face Alignment and Tracking via Discriminative Global Consensus Optimization
- Muhammad Haris Khan, John McDonagh, Georgios Tzimiropoulos
- SVDNet for Pedestrian Retrieval
- Yifan Sun, Liang Zheng, Weijian Deng, Shengjin Wang
- Towards More Accurate Iris Recognition Using Deeply Learned Spatially Corresponding Features
- Zijing Zhao, Ajay Kumar
3D Computer Vision
- Semantically Informed Multiview Surface Refinement
- Maro&scaron, Bláha, Mathias Rothermel, Martin R. Oswald, Torsten Sattler, Audrey Richard, Jan D. Wegner, Marc Pollefeys, Konrad Schindler
- BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects Without Using Depth
- Mahdi Rad, Vincent Lepetit
- Modeling Urban Scenes From Pointclouds
- William Nguatem, Helmut Mayer
- Parameter-Free Lens Distortion Calibration of Central Cameras
- Filippo Bergamasco, Luca Cosmo, Andrea Gasparetto, Andrea Albarelli, Andrea Torsello
- Pose Guided RGBD Feature Learning for 3D Object Pose Estimation
- Vassileios Balntas, Andreas Doumanoglou, Caner Sahin, Juil Sock, Rigas Kouskouridas, Tae-Kyun Kim
- Efficient Global Illumination for Morphable Models
- Andreas Schneider, Sandro Schönborn, Lavrenti Frobeen, Bernhard Egger, Thomas Vetter
- Low Compute and Fully Parallel Computer Vision With HashMatch
- Sean Ryan Fanello, Julien Valentin, Adarsh Kowdle, Christoph Rhemann, Vladimir Tankovich, Carlo Ciliberto, Philip Davidson, Shahram Izadi
- Dense Non-Rigid Structure-From-Motion and Shading With Unknown Albedos
- Mathias Gallardo, Toby Collins, Adrien Bartoli
- From Point Clouds to Mesh Using Regression
- Ľubor Ladický, Olivier Saurer, SoHyeon Jeong, Fabio Maninchedda, Marc Pollefeys
- Stereo DSO: Large-Scale Direct Sparse Visual Odometry With Stereo Cameras
- Rui Wang, Martin Schwörer, Daniel Cremers
- Space-Time Localization and Mapping
- Minhaeng Lee, Charless C. Fowlkes
Computational Photography
- Benchmarking Single-Image Reflection Removal Algorithms
- Renjie Wan, Boxin Shi, Ling-Yu Duan, Ah-Hwee Tan, Alex C. Kot
Face & Gesture
- Attention-Aware Deep Reinforcement Learning for Video Face Recognition
- Yongming Rao, Jiwen Lu, Jie Zhou
- Learning to Fuse 2D and 3D Image Cues for Monocular Body Pose Estimation
- Bugra Tekin, Pablo Márquez-Neila, Mathieu Salzmann, Pascal Fua
- Deep Facial Action Unit Recognition From Partially Labeled Data
- Shan Wu, Shangfei Wang, Bowen Pan, Qiang Ji
- Pose-Driven Deep Convolutional Model for Person Re-Identification
- Chi Su, Jianing Li, Shiliang Zhang, Junliang Xing, Wen Gao, Qi Tian
- Recognition of Action Units in the Wild With Deep Nets and a New Global-Local Loss
- C. Fabian Benitez-Quiroz, Yan Wang, Aleix M. Martinez
- Faster Than Real-Time Facial Alignment: A 3D Spatial Transformer Network Approach in Unconstrained Poses
- Chandrasekhar Bhagavatula, Chenchen Zhu, Khoa Luu, Marios Savvides
- Towards Large-Pose Face Frontalization in the Wild
- Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker
Low-Level Vision & Image Processing
- A Joint Intrinsic-Extrinsic Prior Model for Retinex
- Bolun Cai, Xianming Xu, Kailing Guo, Kui Jia, Bin Hu, Dacheng Tao
- Going Unconstrained With Rolling Shutter Deblurring
- Mahesh Mohan M. R., A. N. Rajagopalan, Gunasekaran Seetharaman
- A Stagewise Refinement Model for Detecting Salient Objects in Images
- Tiantian Wang, Ali Borji, Lihe Zhang, Pingping Zhang, Huchuan Lu
- From Square Pieces to Brick Walls: The Next Challenge in Solving Jigsaw Puzzles
- Shir Gur, Ohad Ben-Shahar
- Online Video Deblurring via Dynamic Temporal Blending Network
- Tae Hyun Kim, Kyoung Mu Lee, Bernhard Schölkopf, Michael Hirsch
- Supervision by Fusion: Towards Unsupervised Learning of Deep Salient Object Detector
- Dingwen Zhang, Junwei Han, Yu Zhang
- Fast Multi-Image Matching via Density-Based Clustering
- Roberto Tron, Xiaowei Zhou, Carlos Esteves, Kostas Daniilidis
- Characterizing and Improving Stability in Neural Style Transfer
- Agrim Gupta, Justin Johnson, Alexandre Alahi, Li Fei-Fei
Recognition
- Cross-Modal Deep Variational Hashing
- Venice Erin Liong, Jiwen Lu, Yap-Peng Tan, Jie Zhou
- Spatial Memory for Context Reasoning in Object Detection
- Xinlei Chen, Abhinav Gupta
- Deep Binaries: Encoding Semantic-Rich Cues for Efficient Textual-Visual Cross Retrieval
- Yuming Shen, Li Liu, Ling Shao, Jingkuan Song
- Learning a Recurrent Residual Fusion Network for Multimodal Matching
- Yu Liu, Yanming Guo, Erwin M. Bakker, Michael S. Lew
- Rotational Subgroup Voting and Pose Clustering for Robust 3D Object Recognition
- Anders Glent Buch, Lilita Kiforenko, Dirk Kraft
- CoupleNet: Coupling Global Structure With Local Parts for Object Detection
- Yousong Zhu, Chaoyang Zhao, Jinqiao Wang, Xu Zhao, Yi Wu, Hanqing Lu
- Speaking the Same Language: Matching Machine to Human Captions by Adversarial Training
- Rakshith Shetty, Marcus Rohrbach, Lisa Anne Hendricks, Mario Fritz, Bernt Schiele
- Drone-Based Object Counting by Spatially Regularized Regional Proposal Network
- Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu
- BlitzNet: A Real-Time Deep Network for Scene Understanding
- Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid
- Situation Recognition With Graph Neural Networks
- Ruiyu Li, Makarand Tapaswi, Renjie Liao, Jiaya Jia, Raquel Urtasun, Sanja Fidler
- Learning Visual N-Grams From Web Data
- Ang Li, Allan Jabri, Armand Joulin, Laurens van der Maaten
- Attention-Based Multimodal Fusion for Video Description
- Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, Bret Harsham, John R. Hershey, Tim K. Marks, Kazuhiko Sumi
- Learning the Latent "Look": Unsupervised Discovery of a Style-Coherent Embedding From Fashion Images
- Wei-Lin Hsiao, Kristen Grauman
- Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
- Tanmay Gupta, Kevin Shih, Saurabh Singh, Derek Hoiem
- Learning Discriminative Latent Attributes for Zero-Shot Classification
- Huajie Jiang, Ruiping Wang, Shiguang Shan, Yi Yang, Xilin Chen
- PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
- Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, Shih-Fu Chang
Segmentation, Grouping & Shape
- Higher-Order Minimum Cost Lifted Multicuts for Motion Segmentation
- Margret Keuper
- Deep Free-Form Deformation Network for Object-Mask Registration
- Haoyang Zhang, Xuming He
- Region-Based Correspondence Between 3D Shapes via Spatially Smooth Biclustering
- Matteo Denitto, Simone Melzi, Manuele Bicego, Umberto Castellani, Alessandro Farinelli, Mário A. T. Figueiredo, Yanir Kleiman, Maks Ovsjanikov
Statistical Methods & Learning
- Learning Discriminative ab-Divergences for Positive Definite Matrices
- Anoop Cherian, Panagiotis Stanitsas, Mehrtash Harandi, Vassilios Morellas, Nikolaos Papanikolopoulos
- Consensus Convolutional Sparse Coding
- Biswarup Choudhury, Robin Swanson, Felix Heide, Gordon Wetzstein, Wolfgang Heidrich
- Domain-Adaptive Deep Network Compression
- Marc Masana, Joost van de Weijer, Luis Herranz, Andrew D. Bagdanov, Jose M. Álvarez
- Self-Supervised Learning of Pose Embeddings From Spatiotemporal Relations in Videos
- Ömer Sümer, Tobias Dencker, Björn Ommer
- Approximate Grassmannian Intersections: Subspace-Valued Subspace Learning
- Calvin Murdock, Fernando De la Torre
- Side Information in Robust Principal Component Analysis: Algorithms and Applications
- Niannan Xue, Yannis Panagakis, Stefanos Zafeiriou
- Summarization and Classification of Wearable Camera Streams by Learning the Distributions Over Deep Features of Out-Of-Sample Image Sequences
- Alessandro Perina, Sadegh Mohammadi, Nebojsa Jojic, Vittorio Murino
- Unsupervised Learning From Video to Detect Foreground Objects in Single Images
- Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu
- Supplementary Meta-Learning: Towards a Dynamic Model for Deep Neural Networks
- Feihu Zhang, Benjamin W. Wah
- Adversarial Inverse Graphics Networks: Learning 2D-To-3D Lifting and Image-To-Image Translation From Unpaired Supervision
- Hsiao-Yu Fish Tung, Adam W. Harley, William Seto, Katerina Fragkiadaki
- Active Learning for Human Pose Estimation
- Buyu Liu, Vittorio Ferrari
- Interleaved Group Convolutions
- Ting Zhang, Guo-Jun Qi, Bin Xiao, Jingdong Wang
Video
- Learning-Based Cloth Material Recovery From Video
- Shan Yang, Junbang Liang, Ming C. Lin
- Unsupervised Video Understanding by Reconciliation of Posture Similarities
- Timo Milbich, Miguel Bautista, Ekaterina Sutter, Björn Ommer
- Action Tubelet Detector for Spatio-Temporal Action Localization
- Vicky Kalogeiton, Philippe Weinzaepfel, Vittorio Ferrari, Cordelia Schmid
- AMTnet: Action-Micro-Tube Regression by End-To-End Trainable Deep Architecture
- Suman Saha, Gurkirt Singh, Fabio Cuzzolin
Vision for X
- Constrained Convolutional Sparse Coding for Parametric Based Reconstruction of Line Drawings
- Sara Shaheen, Lama Affara, Bernard Ghanem
- Neural Ctrl-F: Segmentation-Free Query-By-String Word Spotting in Handwritten Manuscript Collections
- Tomas Wilkinson, Jonas Lindström, Anders Brun
Oral 6
Video Analysis
- Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions (PDF)
- Pascal Mettes, Cees G. M. Snoek
- Semantic Video CNNs Through Representation Warping
- Raghudeep Gadde, Varun Jampani, Peter V. Gehler
- Video Frame Synthesis Using Deep Voxel Flow
- Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala
- Detail-Revealing Deep Video Super-Resolution
- Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia
- Learning Video Object Segmentation With Visual Memory
- Pavel Tokmakov, Karteek Alahari, Cordelia Schmid
Oral 7
Low-Level vision
- EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis
- Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch
- Makeup-Go: Blind Reversion of Portrait Edit
- Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
- Shadow Detection With Conditional Generative Adversarial Networks
- Vu Nguyen, Tomas F. Yago Vicente, Maozheng Zhao, Minh Hoai, Dimitris Samaras
- Learning High Dynamic Range From Outdoor Panoramas
- Jinsong Zhang, Jean-François Lalonde
- DCTM: Discrete-Continuous Transformation Matching for Semantic Flow
- Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn
Spotlight 6
Low-Level vision
- MemNet: A Persistent Memory Network for Image Restoration
- Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu
- Structure-Measure: A New Way to Evaluate Foreground Maps
- Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, Ali Borji
- Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
- Donghyeon Cho, Jinsun Park, Tae-Hyun Oh, Yu-Wing Tai, In So Kweon
- Practical and Efficient Multi-View Matching
- Eleonora Maset, Federica Arrigoni, Andrea Fusiello
- Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations
- Yu-Sheng Lin, Wei-Chao Chen, Shao-Yi Chien
- Learning to Push the Limits of Efficient FFT-Based Image Deconvolution
- Jakob Kruse, Carsten Rother, Uwe Schmidt
- Learning Spread-Out Local Feature Descriptors
- Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang
- Visual Odometry for Pixel Processor Arrays
- Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas
Poster 7
Oral O6 Posters
- Spatial-Aware Object Embeddings for Zero-Shot Localization and Classification of Actions
- Pascal Mettes, Cees G. M. Snoek
- Semantic Video CNNs Through Representation Warping
- Raghudeep Gadde, Varun Jampani, Peter V. Gehler
- Video Frame Synthesis Using Deep Voxel Flow
- Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala
- Detail-Revealing Deep Video Super-Resolution
- Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia
- Learning Video Object Segmentation With Visual Memory
- Pavel Tokmakov, Karteek Alahari, Cordelia Schmid
Oral O7 Posters
- EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis
- Mehdi S. M. Sajjadi, Bernhard Schölkopf, Michael Hirsch
- Makeup-Go: Blind Reversion of Portrait Edit
- Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia
- Shadow Detection With Conditional Generative Adversarial Networks
- Vu Nguyen, Tomas F. Yago Vicente, Maozheng Zhao, Minh Hoai, Dimitris Samaras
- Learning High Dynamic Range From Outdoor Panoramas
- Jinsong Zhang, Jean-François Lalonde
- DCTM: Discrete-Continuous Transformation Matching for Semantic Flow
- Seungryong Kim, Dongbo Min, Stephen Lin, Kwanghoon Sohn
Spotlight S6 Posters
- MemNet: A Persistent Memory Network for Image Restoration
- Ying Tai, Jian Yang, Xiaoming Liu, Chunyan Xu
- Structure-Measure: A New Way to Evaluate Foreground Maps
- Deng-Ping Fan, Ming-Ming Cheng, Yun Liu, Tao Li, Ali Borji
- Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
- Donghyeon Cho, Jinsun Park, Tae-Hyun Oh, Yu-Wing Tai, In So Kweon
- Practical and Efficient Multi-View Matching
- Eleonora Maset, Federica Arrigoni, Andrea Fusiello
- Unrolled Memory Inner-Products: An Abstract GPU Operator for Efficient Vision-Related Computations
- Yu-Sheng Lin, Wei-Chao Chen, Shao-Yi Chien
- Learning to Push the Limits of Efficient FFT-Based Image Deconvolution
- Jakob Kruse, Carsten Rother, Uwe Schmidt
- Learning Spread-Out Local Feature Descriptors
- Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang
- Visual Odometry for Pixel Processor Arrays
- Laurie Bose, Jianing Chen, Stephen J. Carey, Piotr Dudek, Walterio Mayol-Cuevas
3D Computer Vision
- Joint Estimation of Camera Pose, Depth, Deblurring, and Super-Resolution From a Blurred Image Sequence
- Haesol Park, Kyoung Mu Lee
- 2D-Driven 3D Object Detection in RGB-D Images
- Jean Lahoud, Bernard Ghanem
- Ray Space Features for Plenoptic Structure-From-Motion
- Yingliang Zhang, Peihong Yu, Wei Yang, Yuanxi Ma, Jingyi Yu
- Depth Estimation Using Structured Light Flow — Analysis of Projected Pattern Flow on an Object's Surface
- Ryo Furukawa, Ryusuke Sagawa, Hiroshi Kawasaki
- Monocular Dense 3D Reconstruction of a Complex Dynamic Scene From Two Perspective Frames
- Suryansh Kumar, Yuchao Dai, Hongdong Li
- Optimal Transformation Estimation With Semantic Cues
- Danda Pani Paudel, Adlane Habed, Luc Van Gool
- Dynamics Enhanced Multi-Camera Motion Segmentation From Unsynchronized Videos
- Xikang Zhang, Bengisu Ozbay, Mario Sznaier, Octavia Camps
- Taking the Scenic Route to 3D: Optimising Reconstruction From Moving Cameras
- Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden
- FLaME: Fast Lightweight Mesh Estimation Using Variational Smoothing on Delaunay Graphs (poster, PDF)
- W. Nicholas Greene, Nicholas Roy
Biomedical Image Analysis
- Efficient Algorithms for Moral Lineage Tracing
- Markus Rempfler, Jan-Hendrik Lange, Florian Jug, Corinna Blasse, Eugene W. Myers, Bjoern H. Menze, Bjoern Andres
Computational Photography
- From RGB to Spectrum for Natural Scenes via Manifold-Based Mapping (PDF)
- Yan Jia, Yinqiang Zheng, Lin Gu, Art Subpa-Asa, Antony Lam, Yoichi Sato, Imari Sato
- DeepFuse: A Deep Unsupervised Approach for Exposure Fusion With Extreme Exposure Image Pairs
- K. Ram Prabhakar, V Sai Srikar, R. Venkatesh Babu
Face & Gesture
- Learning Dense Facial Correspondences in Unconstrained Images
- Ronald Yu, Shunsuke Saito, Haoxiang Li, Duygu Ceylan, Hao Li
- Jointly Attentive Spatial-Temporal Pooling Networks for Video-Based Person Re-Identification
- Shuangjie Xu, Yu Cheng, Kang Gu, Yang Yang, Shiyu Chang, Pan Zhou
Low-Level Vision & Image Processing
- Automatic Content-Aware Projection for 360° Videos
- Yeong Won Kim, Chang-Ryeol Lee, Dae-Yong Cho, Yong Hoon Kwon, Hyeok-Jae Choi, Kuk-Jin Yoon
- Blur-Invariant Deep Learning for Blind-Deblurring
- T. M. Nimisha, Akash Kumar Singh, A. N. Rajagopalan
- Non-Linear Convolution Filters for CNN-Based Learning
- Georgios Zoumpourlis, Alexandros Doumanoglou, Nicholas Vretos, Petros Daras
- AOD-Net: All-In-One Dehazing Network
- Boyi Li, Xiulian Peng, Zhangyang Wang, Jizheng Xu, Dan Feng
- Simultaneous Detection and Removal of High Altitude Clouds From an Image
- Tushar Sandhan, Jin Young Choi
- Understanding Low- and High-Level Contributions to Fixation Prediction
- Matthias Kümmerer, Thomas S. A. Wallis, Leon A. Gatys, Matthias Bethge
- Image Super-Resolution Using Dense Skip Connections
- Tong Tong, Gen Li, Xiejie Liu, Qinquan Gao
- Convergence Analysis of MAP Based Blur Kernel Estimation
- Sunghyun Cho, Seungyong Lee
- Blob Reconstruction Using Unilateral Second Order Gaussian Kernels With Application to High-ISO Long-Exposure Image Denoising
- Gang Wang, Carlos Lopez-Molina, Bernard De Baets
- Deep Generative Adversarial Compression Artifact Removal
- Leonardo Galteri, Lorenzo Seidenari, Marco Bertini, Alberto Del Bimbo
Motion & Tracking
- Online Multi-Object Tracking Using CNN-Based Single Object Tracker With Spatial-Temporal Attention Mechanism
- Qi Chu, Wanli Ouyang, Hongsheng Li, Xiaogang Wang, Bin Liu, Nenghai Yu
Recognition
- Mutual Enhancement for Detection of Multiple Logos in Sports Videos
- Yuan Liao, Xiaoqing Lu, Chengcui Zhang, Yongtao Wang, Zhi Tang
- Referring Expression Generation and Comprehension via Attributes
- Jingyu Liu, Liang Wang, Ming-Hsuan Yang
- RoomNet: End-To-End Room Layout Estimation
- Chen-Yu Lee, Vijay Badrinarayanan, Tomasz Malisiewicz, Andrew Rabinovich
- SSH: Single Stage Headless Face Detector
- Mahyar Najibi, Pouya Samangouei, Rama Chellappa, Larry S. Davis
- AnnArbor: Approximate Nearest Neighbors Using Arborescence Coding
- Artem Babenko, Victor Lempitsky
- Boosting Image Captioning With Attributes
- Ting Yao, Yingwei Pan, Yehao Li, Zhaofan Qiu, Tao Mei
- Learning to Estimate 3D Hand Pose From Single RGB Images
- Christian Zimmermann, Thomas Brox
- Locally-Transferred Fisher Vectors for Texture Classification
- Yang Song, Fan Zhang, Qing Li, Heng Huang, Lauren J. O'Donnell, Weidong Cai
- Object-Level Proposals
- Jianxiang Ma, Anlong Ming, Zilong Huang, Xinggang Wang, Yu Zhou
- Extreme Clicking for Efficient Object Annotation
- Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari
- WordSup: Exploiting Word Annotations for Character Based Text Detection
- Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, Junyu Han, Errui Ding
- Illuminating Pedestrians via Simultaneous Detection & Segmentation
- Garrick Brazil, Xi Yin, Xiaoming Liu
- Generalized Orderless Pooling Performs Implicit Salient Matching
- Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, Erik Rodner
Segmentation, Grouping & Shape
- Exploiting Spatial Structure for Localizing Manipulated Image Regions
- Jawadul H. Bappy, Amit K. Roy-Chowdhury, Jason Bunk, Lakshmanan Nataraj, B. S. Manjunath
- RDFNet: RGB-D Multi-Level Residual Feature Fusion for Indoor Semantic Segmentation
- Seong-Jin Park, Ki-Sang Hong, Seungyong Lee
- The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes
- Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulò, Peter Kontschieder
- Self-Organized Text Detection With Minimal Post-Processing via Border Learning
- Yue Wu, Prem Natarajan
Statistical Methods & Learning
- Sparse Exact PGA on Riemannian Manifolds
- Monami Banerjee, Rudrasis Chakraborty, Baba C. Vemuri
- Tensor RPCA by Bayesian CP Factorization With Complex Noise
- Qiong Luo, Zhi Han, Xi'ai Chen, Yao Wang, Deyu Meng, Dong Liang, Yandong Tang
- Multimodal Gaussian Process Latent Variable Models With Harmonization
- Guoli Song, Shuhui Wang, Qingming Huang, Qi Tian
- Segmentation-Aware Convolutional Networks Using Local Attention Masks
- Adam W. Harley, Konstantinos G. Derpanis, Iasonas Kokkinos
- Rotation Equivariant Vector Field Networks
- Diego Marcos, Michele Volpi, Nikos Komodakis, Devis Tuia
- ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
- Jian-Hao Luo, Jianxin Wu, Weiyao Lin
- AutoDIAL: Automatic DomaIn Alignment Layers
- Fabio Maria Carlucci, Lorenzo Porzi, Barbara Caputo, Elisa Ricci, Samuel Rota Bulò
- Focusing Attention: Towards Accurate Text Recognition in Natural Images
- Zhanzhan Cheng, Fan Bai, Yunlu Xu, Gang Zheng, Shiliang Pu, Shuigeng Zhou
- Unsupervised Object Segmentation in Video by Efficient Selection of Highly Probable Positive Features
- Emanuela Haller, Marius Leordeanu
- Nonparametric Variational Auto-Encoders for Hierarchical Representation Learning
- Prasoon Goyal, Zhiting Hu, Xiaodan Liang, Chenyu Wang, Eric P. Xing
- Dense and Low-Rank Gaussian CRFs Using Deep Embeddings
- Siddhartha Chandra, Nicolas Usunier, Iasonas Kokkinos
Video
- A Multimodal Deep Regression Bayesian Network for Affective Video Content Analyses
- Quan Gan, Shangfei Wang, Longfei Hao, Qiang Ji
- Moving Object Detection in Time-Lapse or Motion Trigger Image Sequences Using Low-Rank and Invariant Sparse Decomposition
- Moein Shakeri, Hong Zhang
- A Multilayer-Based Framework for Online Background Subtraction With Freely Moving Cameras
- Yizhe Zhu, Ahmed Elgammal
- Dynamic Label Graph Matching for Unsupervised Video Re-Identification
- Mang Ye, Andy J. Ma, Liang Zheng, Jiawei Li, Pong C. Yuen
- Spatiotemporal Modeling for Crowd Counting in Videos
- Feng Xiong, Xingjian Shi, Dit-Yan Yeung
Vision for X
- Personalized Cinemagraphs Using Semantic Understanding and Collaborative Learning
- Tae-Hyun Oh, Kyungdon Joo, Neel Joshi, Baoyuan Wang, In So Kweon, Sing Bing Kang
- What Is Around the Camera? (PDF)
- Stamatios Georgoulis, Konstantinos Rematas, Tobias Ritschel, Mario Fritz, Tinne Tuytelaars, Luc Van Gool
Oral 8
Recognition 3
- Weakly-Supervised Learning of Visual Relations
- Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid
- BIER - Boosting Independent Embeddings Robustly
- Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
- 3D Graph Neural Networks for RGBD Semantic Segmentation
- Xiaojuan Qi, Renjie Liao, Jiaya Jia, Sanja Fidler, Raquel Urtasun
- Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
- Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo
- Learning 3D Object Categories by Looking Around Them
- David Novotny, Diane Larlus, Andrea Vedaldi
Spotlight 7
Recognition 3
- Quantitative Evaluation of Confidence Measures in a Machine Learning World
- Matteo Poggi, Fabio Tosi, Stefano Mattoccia
- Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks
- Hui Li, Peng Wang, Chunhua Shen
- DeepSetNet: Predicting Sets With Deep Neural Networks
- S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid
- Learning From Video and Text via Large-Scale Discriminative Clustering
- Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic
- TALL: Temporal Activity Localization via Language Query
- Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia
- End-To-End Face Detection and Cast Grouping in Movies Using Erdős-Rényi Clustering
- SouYoung Jin, Hang Su, Chris Stauffer, Erik Learned-Miller
- Active Decision Boundary Annotation With Deep Generative Models
- Miriam Huijser, Jan C. van Gemert
- Convolutional Dictionary Learning via Local Processing
- Vardan Papyan, Yaniv Romano, Jeremias Sulam, Michael Elad
Poster 8
Oral O8 Posters
- Weakly-Supervised Learning of Visual Relations
- Julia Peyre, Josef Sivic, Ivan Laptev, Cordelia Schmid
- BIER - Boosting Independent Embeddings Robustly
- Michael Opitz, Georg Waltner, Horst Possegger, Horst Bischof
- 3D Graph Neural Networks for RGBD Semantic Segmentation
- Xiaojuan Qi, Renjie Liao, Jiaya Jia, Sanja Fidler, Raquel Urtasun
- Learning Multi-Attention Convolutional Neural Network for Fine-Grained Image Recognition
- Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo
- Learning 3D Object Categories by Looking Around Them
- David Novotny, Diane Larlus, Andrea Vedaldi
Spotlight S7 Posters
- Quantitative Evaluation of Confidence Measures in a Machine Learning World
- Matteo Poggi, Fabio Tosi, Stefano Mattoccia
- Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks
- Hui Li, Peng Wang, Chunhua Shen
- DeepSetNet: Predicting Sets With Deep Neural Networks
- S. Hamid Rezatofighi, Vijay Kumar B G, Anton Milan, Ehsan Abbasnejad, Anthony Dick, Ian Reid
- Learning From Video and Text via Large-Scale Discriminative Clustering
- Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic
- TALL: Temporal Activity Localization via Language Query
- Jiyang Gao, Chen Sun, Zhenheng Yang, Ram Nevatia
- End-To-End Face Detection and Cast Grouping in Movies Using Erdős-Rényi Clustering
- SouYoung Jin, Hang Su, Chris Stauffer, Erik Learned-Miller
- Active Decision Boundary Annotation With Deep Generative Models
- Miriam Huijser, Jan C. van Gemert
- Convolutional Dictionary Learning via Local Processing
- Vardan Papyan, Yaniv Romano, Jeremias Sulam, Michael Elad
Oral O9 Posters
- Deep Adaptive Image Clustering
- Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan
- One Network to Solve Them All — Solving Linear Inverse Problems Using Deep Projection Models
- J. H. Rick Chang, Chun-Liang Li, Barnabás Póczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan
- Representation Learning by Learning to Count
- Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro
- StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
- Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas
- Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings
- James Thewlis, Hakan Bilen, Andrea Vedaldi
3D Computer Vision
- Editable Parametric Dense Foliage From 3D Capture
- Gaurav Chaurasia, Paul Beardsley
- Refractive Structure-From-Motion Through a Flat Refractive Interface
- François Chadebecq, Francisco Vasconcelos, George Dwyer, René, Lacher, Sébastien Ourselin, Tom Vercauteren, Danail Stoyanov
- Submodular Trajectory Optimization for Aerial 3D Scanning (PDF)
- Mike Roberts, Debadeepta Dey, Anh Truong, Sudipta Sinha, Shital Shah, Ashish Kapoor, Pat Hanrahan, Neel Joshi
- Camera Calibration by Global Constraints on the Motion of Silhouettes
- Gil Ben-Artzi
- Deltille Grids for Geometric Camera Calibration
- Hyowon Ha, Michal Perdoch, Hatem Alismail, In So Kweon, Yaser Sheikh
Computational Photography
- A Lightweight Single-Camera Polarization Compass With Covariance Estimation
- Wolfgang Stürzl
- Reflectance Capture Using Univariate Sampling of BRDFs
- Zhuo Hui, Kalyan Sunkavalli, Joon-Young Lee, Sunil Hadap, Jian Wang, Aswin C. Sankaranarayanan
- Estimating Defocus Blur via Rank of Local Patches
- Guodong Xu, Yuhui Quan, Hui Ji
Face & Gesture
- RGB-Infrared Cross-Modality Person Re-Identification
- Wei-Shi Zheng, Ancong Wu, Hong-Xing Yu, Shaogang Gong, Jianhuang Lai
- Intrinsic 3D Dynamic Surface Tracking Based on Dynamic Ricci Flow and Teichmüller Map
- Xiaokang Yu, Na Lei, Yalin Wang, Xianfeng Gu
- Multi-Scale Deep Learning Architectures for Person Re-Identification
- Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, Xiangyang Xue
- Range Loss for Deep Face Recognition With Long-Tailed Training Data
- Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, Yu Qiao
- Face Sketch Matching via Coupled Deep Transform Learning
- Shruti Nagpal, Maneet Singh, Richa Singh, Mayank Vatsa, Afzel Noore, Angshul Majumdar
Low-Level Vision & Image Processing
- Realistic Dynamic Facial Textures From a Single Image Using GANs
- Kyle Olszewski, Zimo Li, Chao Yang, Yi Zhou, Ronald Yu, Zeng Huang, Sitao Xiang, Shunsuke Saito, Pushmeet Kohli, Hao Li
- Pixel Recursive Super Resolution
- Ryan Dahl, Mohammad Norouzi, Jonathon Shlens
- Recurrent Color Constancy
- Yanlin Qian, Ke Chen, Jarno Nikkanen, Joni-Kristian Kämäräinen, Jiří, Matas
- Saliency Pattern Detection by Ranking Structured Trees
- Lei Zhu, Haibin Ling, Jin Wu, Huiping Deng, Jin Liu
Motion & Tracking
- Monocular Video-Based Trailer Coupler Detection Using Multiplexer Convolutional Neural Network
- Yousef Atoum, Joseph Roth, Michael Bliss, Wende Zhang, Xiaoming Liu
- Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy Visual Tracking
- Heng Fan, Haibin Ling
- Non-Rigid Object Tracking via Deformable Patches Using Shape-Preserved KCF and Level Sets
- Xin Sun, Ngai-Man Cheung, Hongxun Yao, Yiluan Guo
Optimization Methods
- A Discriminative View of MRF Pre-Processing Algorithms
- Chen Wang, Charles Herrmann, Ramin Zabih
Recognition
- Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis
- Elias N. Zois, Ilias Theodorakopoulos, George Economou
- Long Short-Term Memory Kalman Filters: Recurrent Neural Estimators for Pose Regularization
- Huseyin Coskun, Felix Achilles, Robert DiPietro, Nassir Navab, Federico Tombari
- Learning Spatio-Temporal Representation With Pseudo-3D Residual Networks
- Zhaofan Qiu, Ting Yao, Tao Mei
- Deeper, Broader and Artier Domain Generalization
- Da Li, Yongxin Yang, Yi-Zhe Song, Timothy M. Hospedales
- Deep Spatial-Semantic Attention for Fine-Grained Sketch-Based Image Retrieval
- Jifei Song, Qian Yu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
- Soft-NMS — Improving Object Detection With One Line of Code
- Navaneeth Bodla, Bharat Singh, Rama Chellappa, Larry S. Davis
- Semantic Jitter: Dense Supervision for Visual Comparisons via Synthetic Images
- Aron Yu, Kristen Grauman
- Video Scene Parsing With Predictive Feature Learning
- Xiaojie Jin, Xin Li, Huaxin Xiao, Xiaohui Shen, Zhe Lin, Jimei Yang, Yunpeng Chen, Jian Dong, Luoqi Liu, Zequn Jie, Jiashi Feng, Shuicheng Yan
- Understanding and Mapping Natural Beauty
- Scott Workman, Richard Souvenir, Nathan Jacobs
- Human Pose Estimation Using Global and Local Normalization
- Ke Sun, Cuiling Lan, Junliang Xing, Wenjun Zeng, Dong Liu, Jingdong Wang
- HashNet: Deep Learning to Hash by Continuation
- Zhangjie Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu
- Scaling the Scattering Transform: Deep Hybrid Networks
- Edouard Oyallon, Eugene Belilovsky, Sergey Zagoruyko
- Flip-Invariant Motion Representation
- Takumi Kobayashi
- Scene Categorization With Spectral Features
- Salman H. Khan, Munawar Hayat, Fatih Porikli
- Image2song: Song Retrieval via Bridging Image Content and Lyric Words
- Xuelong Li, Di Hu, Xiaoqiang Lu
Segmentation, Grouping & Shape
- Deep Functional Maps: Structured Prediction for Dense Shape Correspondence
- Or Litany, Tal Remez, Emanuele Rodolà, Alex Bronstein, Michael Bronstein
- Training Deep Networks to Be Spatially Sensitive
- Nicholas Kolkin, Eli Shechtman, Gregory Shakhnarovich
- 3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds
- Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, Rongtian Ye, Yuebin Wang, Jiwen Lu
- Semi Supervised Semantic Segmentation Using Generative Adversarial Network
- Nasim Souly, Concetto Spampinato, Mubarak Shah
Statistical Methods & Learning
- Efficient Low Rank Tensor Ring Completion
- Wenqi Wang, Vaneet Aggarwal, Shuchin Aeron
- Semantic Image Synthesis via Adversarial Learning
- Hao Dong, Simiao Yu, Chao Wu, Yike Guo
- Unified Deep Supervised Domain Adaptation and Generalization
- Saeid Motiian, Marco Piccirilli, Donald A. Adjeroh, Gianfranco Doretto
- Interpretable Transformations With Encoder-Decoder Networks
- Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow
- Deep Clustering via Joint Convolutional Autoencoder Embedding and Relative Entropy Minimization
- Kamran Ghasedi Dizaji, Amirhossein Herandi, Cheng Deng, Weidong Cai, Heng Huang
- Scene Classification - formely: Deep Scene Image Classification With the MFAFVNet
- Yunsheng Li, Mandar Dixit, Nuno Vasconcelos
- Learning Bag-Of-Features Pooling for Deep Convolutional Neural Networks
- Nikolaos Passalis, Anastasios Tefas
- Adversarial Examples Detection in Deep Networks With Convolutional Filter Statistics
- Xin Li, Fuxin Li
Video
- Joint Prediction of Activity Labels and Starting Times in Untrimmed Videos
- Tahmida Mahmud, Mahmudul Hasan, Amit K. Roy-Chowdhury
- R-C3D: Region Convolutional 3D Network for Temporal Activity Detection
- Huijuan Xu, Abir Das, Kate Saenko
- Temporal Context Network for Activity Localization in Videos
- Xiyang Dai, Bharat Singh, Guyue Zhang, Larry S. Davis, Yan Qiu Chen
- Localizing Moments in Video With Natural Language
- Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell
- TORNADO: A Spatio-Temporal Convolutional Regression Network for Video Action Proposal
- Hongyuan Zhu, Romain Vial, Shijian Lu
- Tube Convolutional Neural Network (T-CNN) for Action Detection in Videos
- Rui Hou, Chen Chen, Mubarak Shah
- Learning Action Recognition Model From Depth and Skeleton Videos
- Hossein Rahmani, Mohammed Bennamoun
- The "Something Something" Video Database for Learning and Evaluating Visual Common Sense
- Raghav Goyal; Samira Ebrahimi Kahou, Vincent Michalski, Joanna Materzyńska, Susanne Westphal, Heuna Kim, Valentin Haenel, Ingo Fruend, Peter Yianilos, Moritz Mueller-Freitag, Florian Hoppe, Christian Thurau, Ingo Bax, Roland Memisevic
Vision for X
- GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images
- Avi Singh, Larry Yang, Sergey Levine
- Semi-Global Weighted Least Squares in Image Filtering
- Wei Liu, Xiaogang Chen, Chuanhua Shen, Zhi Liu, Jie Yang
- Scale Recovery for Monocular Visual Odometry Using Depth Estimated With Deep Convolutional Neural Fields
- Xiaochuan Yin, Xiangwei Wang, Xiaoguo Du, Qijun Chen
Oral 9
Machine Learning
- Deep Adaptive Image Clustering
- Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, Chunhong Pan
- One Network to Solve Them All — Solving Linear Inverse Problems Using Deep Projection Models
- J. H. Rick Chang, Chun-Liang Li, Barnabás Póczos, B. V. K. Vijaya Kumar, Aswin C. Sankaranarayanan
- Representation Learning by Learning to Count
- Mehdi Noroozi, Hamed Pirsiavash, Paolo Favaro
- StackGAN: Text to Photo-Realistic Image Synthesis With Stacked Generative Adversarial Networks
- Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris N. Metaxas
- Unsupervised Learning of Object Landmarks by Factorized Spatial Embeddings
- James Thewlis, Hakan Bilen, Andrea Vedaldi

Home
Changelog
Forum
RSS
Twitter