| 1 |
UnitBox: An Advanced Object Detection Network
Jiahui Yu, Yuning Jiang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
1.7K |
9 years ago |
| 2 |
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
Devamanyu Hazarika, Roger Zimmermann, Soujanya Poria
|
👻
Ghosted
|
cs.CL
|
1.0K |
5 years ago |
| 3 |
AVEC 2016 - Depression, Mood, and Emotion Recognition Workshop and Challenge
Michel Valstar, Jonathan Gratch, ... (+8 more)
|
👻
Ghosted
|
cs.CV
|
646 |
9 years ago |
| 4 |
CrowdNet: A Deep Convolutional Network for Dense Crowd Counting
Lokesh Boominathan, Srinivas S S Kruthiventi, R. Venkatesh Babu
|
👻
Ghosted
|
cs.CV
|
566 |
9 years ago |
| 5 |
Single Shot Temporal Action Detection
Tianwei Lin, Xu Zhao, Zheng Shou
|
👻
Ghosted
|
cs.CV
|
473 |
8 years ago |
| 6 |
Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification
Zuxuan Wu, Xi Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
463 |
11 years ago |
| 7 |
GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval
Longhui Wei, Shiliang Zhang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
409 |
8 years ago |
| 8 |
Learning Fashion Compatibility with Bidirectional LSTMs
Xintong Han, Zuxuan Wu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
401 |
8 years ago |
| 9 |
Action Recognition Based on Joint Trajectory Maps Using Convolutional Neural Networks
Pichao Wang, Zhaoyang Li, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
378 |
9 years ago |
| 10 |
Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition
Yi-Fan Song, Zhang Zhang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
334 |
5 years ago |
| 11 |
Image Captioning with Deep Bidirectional LSTMs
Cheng Wang, Haojin Yang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
295 |
10 years ago |
| 12 |
OpenVSLAM: A Versatile Visual SLAM Framework
Shinya Sumikura, Mikiya Shibuya, Ken Sakurada
|
👻
Ghosted
|
cs.CV
|
292 |
6 years ago |
| 13 |
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild
Samuel Albanie, Arsha Nagrani, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
285 |
7 years ago |
| 14 |
Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining
Guanbin Li, Xiang He, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
253 |
7 years ago |
| 15 |
MS$^2$L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition
Lilang Lin, Sijie Song, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
236 |
5 years ago |
| 16 |
Deep Cross-Modal Audio-Visual Generation
Lele Chen, Sudhanshu Srivastava, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
232 |
8 years ago |
| 17 |
DFEW: A Large-Scale Database for Recognizing Dynamic Facial Expressions in the Wild
Xingxun Jiang, Yuan Zong, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
231 |
5 years ago |
| 18 |
Not made for each other- Audio-Visual Dissonance-based Deepfake Detection and Localization
Komal Chugh, Parul Gupta, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
220 |
5 years ago |
| 19 |
DAWN: Dynamic Adversarial Watermarking of Neural Networks
Sebastian Szyller, Buse Gul Atli, ... (+2 more)
|
👻
Ghosted
|
cs.CR
|
215 |
6 years ago |
| 20 |
Detecting Sarcasm in Multimodal Social Platforms
Rossano Schifanella, Paloma de Juan, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
213 |
9 years ago |
| 21 |
Efficient Dense Modules of Asymmetric Convolution for Real-Time Semantic Segmentation
Shao-Yuan Lo, Hsueh-Ming Hang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
206 |
7 years ago |
| 22 |
Focus Your Attention: A Bidirectional Focal Attention Network for Image-Text Matching
Chunxiao Liu, Zhendong Mao, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
204 |
6 years ago |
| 23 |
Exploit the Connectivity: Multi-Object Tracking with TrackletNet
Gaoang Wang, Yizhou Wang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
192 |
7 years ago |
| 24 |
Sharp Multiple Instance Learning for DeepFake Video Detection
Xiaodan Li, Yining Lang, ... (+6 more)
|
👻
Ghosted
|
cs.CV
|
191 |
5 years ago |
| 25 |
Multi-View Image Generation from a Single-View
Bo Zhao, Xiao Wu, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
190 |
9 years ago |
| 26 |
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition
Haoxuan You, Yifan Feng, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
185 |
7 years ago |
| 27 |
Audio Event Detection using Weakly Labeled Data
Anurag Kumar, Bhiksha Raj
|
👻
Ghosted
|
cs.SD
|
178 |
9 years ago |
| 28 |
Geometry Guided Adversarial Facial Expression Synthesis
Lingxiao Song, Zhihe Lu, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
163 |
8 years ago |
| 29 |
Black-box Adversarial Attacks on Video Recognition Models
Linxi Jiang, Xingjun Ma, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
163 |
7 years ago |
| 30 |
CubeMLP: An MLP-based Model for Multimodal Sentiment Analysis and Depression Estimation
Hao Sun, Hongyi Wang, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
160 |
3 years ago |
| 31 |
Multi-View Graph Convolutional Network for Multimedia Recommendation
Penghang Yu, Zhiyi Tan, ... (+2 more)
|
👻
Ghosted
|
cs.IR
|
157 |
2 years ago |
| 32 |
Topic Modeling Based Multi-modal Depression Detection
Yuan Gong, Christian Poellabauer
|
👻
Ghosted
|
cs.CL
|
156 |
8 years ago |
| 33 |
Semantic Human Matting
Quan Chen, Tiezheng Ge, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
155 |
7 years ago |
| 34 |
Zero-Shot Hashing via Transferring Supervised Knowledge
Yang Yang, Weilun Chen, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
154 |
9 years ago |
| 35 |
FedGH: Heterogeneous Federated Learning with Generalized Global Header
Liping Yi, Gang Wang, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
152 |
3 years ago |
| 36 |
ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network
Weiqing Min, Linhu Liu, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
146 |
5 years ago |
| 37 |
Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Shuyu Yang, Yinan Zhou, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
146 |
2 years ago |
| 38 |
Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization
Daizong Liu, Xiaoye Qu, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
145 |
5 years ago |
| 39 |
Deep CTR Prediction in Display Advertising
Junxuan Chen, Baigui Sun, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
143 |
9 years ago |
| 40 |
User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks
Yuanzheng Ci, Xinzhu Ma, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
141 |
7 years ago |
| 41 |
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM
Yuxiao Chen, Jianbo Yuan, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
140 |
7 years ago |
| 42 |
Temporal Localization of Fine-Grained Actions in Videos by Domain Transfer from Web Images
Chen Sun, Sanketh Shetty, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
137 |
11 years ago |
| 43 |
Aesthetic-Driven Image Enhancement by Adversarial Learning
Yubin Deng, Chen Change Loy, Xiaoou Tang
|
👻
Ghosted
|
cs.CV
|
133 |
8 years ago |
| 44 |
PopMAG: Pop Music Accompaniment Generation
Yi Ren, Jinzheng He, ... (+4 more)
|
👻
Ghosted
|
cs.SD
|
133 |
5 years ago |
| 45 |
DeepFont: Identify Your Font from An Image
Zhangyang Wang, Jianchao Yang, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
131 |
10 years ago |
| 46 |
Visual Affect Around the World: A Large-scale Multilingual Visual Sentiment Ontology
Brendan Jou, Tao Chen, ... (+4 more)
|
👻
Ghosted
|
cs.MM
|
129 |
10 years ago |
| 47 |
Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
Junfu Pu, Wengang Zhou, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
127 |
5 years ago |
| 48 |
LIME: A Method for Low-light IMage Enhancement
Xiaojie Guo
|
👻
Ghosted
|
cs.CV
|
125 |
9 years ago |
| 49 |
Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning Model
Chen Li, Mai Xu, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
125 |
7 years ago |
| 50 |
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection
Yongcheng Liu, Lu Sheng, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
125 |
7 years ago |