| 401 |
An Empirical Study of Language CNN for Image Captioning
Jiuxiang Gu, Gang Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
150 |
9 years ago |
| 402 |
Learning Visual N-Grams from Web Data
Ang Li, Allan Jabri, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
150 |
9 years ago |
| 403 |
BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography
Michael J. Wilber, Chen Fang, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
148 |
9 years ago |
| 404 |
Learning Spread-out Local Feature Descriptors
Xu Zhang, Felix X. Yu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
148 |
8 years ago |
| 405 |
Improved Techniques for Training Adaptive Deep Networks
Hao Li, Hong Zhang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
148 |
6 years ago |
| 406 |
Improving Pedestrian Attribute Recognition With Weakly-Supervised Multi-Scale Attribute-Specific Localization
Chufeng Tang, Lu Sheng, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
148 |
6 years ago |
| 407 |
Adaptive Context Network for Scene Parsing
Jun Fu, Jing Liu, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
147 |
6 years ago |
| 408 |
On the Adversarial Robustness of Multi-Modal Foundation Models
Christian Schlarmann, Matthias Hein
|
👻
Ghosted
|
cs.LG
|
147 |
2 years ago |
| 409 |
Learning a Discriminative Model for the Perception of Realism in Composite Images
Jun-Yan Zhu, Philipp Krähenbühl, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
146 |
10 years ago |
| 410 |
Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner
Tseng-Hung Chen, Yuan-Hong Liao, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
146 |
9 years ago |
| 411 |
Convolutional Dictionary Learning via Local Processing
Vardan Papyan, Yaniv Romano, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
146 |
9 years ago |
| 412 |
Dense Face Alignment
Yaojie Liu, Amin Jourabloo, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
146 |
8 years ago |
| 413 |
Query-guided Regression Network with Context Policy for Phrase Grounding
Kan Chen, Rama Kovvuri, Ram Nevatia
|
👻
Ghosted
|
cs.CV
|
145 |
8 years ago |
| 414 |
SROBB: Targeted Perceptual Loss for Single Image Super-Resolution
Mohammad Saeed Rad, Behzad Bozorgtabar, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
145 |
6 years ago |
| 415 |
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks
Humam Alwassel, Silvio Giancola, Bernard Ghanem
|
👻
Ghosted
|
cs.CV
|
145 |
5 years ago |
| 416 |
First-Person Activity Forecasting with Online Inverse Reinforcement Learning
Nicholas Rhinehart, Kris M. Kitani
|
👻
Ghosted
|
cs.CV
|
144 |
9 years ago |
| 417 |
Towards Context-aware Interaction Recognition
Bohan Zhuang, Lingqiao Liu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
143 |
9 years ago |
| 418 |
Visual Semantic Planning using Deep Successor Representations
Yuke Zhu, Daniel Gordon, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
143 |
9 years ago |
| 419 |
Two-Phase Learning for Weakly Supervised Object Localization
Dahun Kim, Donghyeon Cho, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
143 |
8 years ago |
| 420 |
Total Denoising: Unsupervised Learning of 3D Point Cloud Cleaning
Pedro Hermosilla, Tobias Ritschel, Timo Ropinski
|
👻
Ghosted
|
cs.CV
|
143 |
7 years ago |
| 421 |
Efficient Learning on Point Clouds with Basis Point Sets
Sergey Prokudin, Christoph Lassner, Javier Romero
|
👻
Ghosted
|
cs.CV
|
143 |
6 years ago |
| 422 |
Large-scale Robust Deep AUC Maximization: A New Surrogate Loss and Empirical Studies on Medical Image Classification
Zhuoning Yuan, Yan Yan, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
143 |
5 years ago |
| 423 |
Context-aware CNNs for person head detection
Tuan-Hung Vu, Anton Osokin, Ivan Laptev
|
👻
Ghosted
|
cs.CV
|
142 |
10 years ago |
| 424 |
PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN
Hanwang Zhang, Zawlin Kyaw, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
142 |
8 years ago |
| 425 |
Submodular Trajectory Optimization for Aerial 3D Scanning
Mike Roberts, Debadeepta Dey, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
141 |
9 years ago |
| 426 |
Understanding and Comparing Deep Neural Networks for Age and Gender Classification
Sebastian Lapuschkin, Alexander Binder, ... (+2 more)
|
👻
Ghosted
|
stat.ML
|
141 |
8 years ago |
| 427 |
Weakly-Supervised Alignment of Video With Text
Piotr Bojanowski, Rémi Lajugie, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
140 |
11 years ago |
| 428 |
Situation Recognition with Graph Neural Networks
Ruiyu Li, Makarand Tapaswi, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
140 |
8 years ago |
| 429 |
Spectral Feature Transformation for Person Re-identification
Chuanchen Luo, Yuntao Chen, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
140 |
7 years ago |
| 430 |
AutoFocus: Efficient Multi-Scale Inference
Mahyar Najibi, Bharat Singh, Larry S. Davis
|
👻
Ghosted
|
cs.CV
|
140 |
7 years ago |
| 431 |
Towards Precise End-to-end Weakly Supervised Object Detection Network
Ke Yang, Dongsheng Li, Yong Dou
|
👻
Ghosted
|
cs.CV
|
139 |
6 years ago |
| 432 |
Personalized Age Progression with Aging Dictionary
Xiangbo Shu, Jinhui Tang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
138 |
10 years ago |
| 433 |
Towards Unconstrained End-to-End Text Spotting
Siyang Qin, Alessandro Bissacco, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
138 |
6 years ago |
| 434 |
Learning Spatial Awareness to Improve Crowd Counting
Zhi-Qi Cheng, Jun-Xiu Li, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
138 |
6 years ago |
| 435 |
Re-ID Driven Localization Refinement for Person Search
Chuchu Han, Jiacheng Ye, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
138 |
6 years ago |
| 436 |
What Actions are Needed for Understanding Human Actions in Videos?
Gunnar A. Sigurdsson, Olga Russakovsky, Abhinav Gupta
|
👻
Ghosted
|
cs.CV
|
137 |
8 years ago |
| 437 |
Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data
Fabian Manhardt, Diego Martin Arroyo, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
137 |
7 years ago |
| 438 |
Harvesting Discriminative Meta Objects with Deep CNN Features for Scene Classification
Ruobing Wu, Baoyuan Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
136 |
10 years ago |
| 439 |
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation
Chuang Gan, Yandong Li, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
136 |
8 years ago |
| 440 |
Copy-and-Paste Networks for Deep Video Inpainting
Sungho Lee, Seoung Wug Oh, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
134 |
6 years ago |
| 441 |
Learning Compositional Representations for Few-Shot Recognition
Pavel Tokmakov, Yu-Xiong Wang, Martial Hebert
|
👻
Ghosted
|
cs.CV
|
133 |
7 years ago |
| 442 |
Efficient Decomposition of Image and Mesh Graphs by Lifted Multicuts
Margret Keuper, Evgeny Levinkov, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
131 |
11 years ago |
| 443 |
Exploiting temporal consistency for real-time video depth estimation
Haokui Zhang, Chunhua Shen, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
131 |
6 years ago |
| 444 |
Improving Image Classification with Location Context
Kevin Tang, Manohar Paluri, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
130 |
11 years ago |
| 445 |
Coherent Motion Segmentation in Moving Camera Videos using Optical Flow Orientations
Manjunath Narayana, Allen Hanson, Erik Learned-Miller
|
👻
Ghosted
|
cs.CV
|
130 |
10 years ago |
| 446 |
Open Vocabulary Scene Parsing
Hang Zhao, Xavier Puig, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
130 |
9 years ago |
| 447 |
Online Model Distillation for Efficient Video Inference
Ravi Teja Mullapudi, Steven Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
129 |
7 years ago |
| 448 |
Unsupervised Object Discovery and Tracking in Video Collections
Suha Kwak, Minsu Cho, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
127 |
11 years ago |
| 449 |
Boundless: Generative Adversarial Networks for Image Extension
Piotr Teterwak, Aaron Sarna, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
127 |
6 years ago |
| 450 |
Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense
Yixin Chen, Siyuan Huang, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
127 |
6 years ago |