| 151 | Auto-captions on GIF: A Large-scale Video-sentence Dataset for Vision-language Pre-training<br>Yingwei Pan, Yehao Li, ... (+4 more) | 👻 Ghosted | cs.CV | 61 | 5 years ago |
| 152 | DDGHM: Dual Dynamic Graph with Hybrid Metric Training for Cross-Domain Sequential Recommendation<br>Xiaolin Zheng, Jiajie Su, ... (+2 more) | 👻 Ghosted | cs.IR | 61 | 3 years ago |
| 153 | ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors<br>Jingwen Chen, Yingwei Pan, ... (+2 more) | 👻 Ghosted | cs.CV | 60 | 2 years ago |
| 154 | GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning<br>Siyu Huang, Xi Li, ... (+3 more) | 👻 Ghosted | cs.NE | 59 | 8 years ago |
| 155 | Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks<br>Sijie Yan, Ziwei Liu, ... (+4 more) | 👻 Ghosted | cs.CV | 59 | 8 years ago |
| 156 | Target-Guided Composed Image Retrieval<br>Haokun Wen, Xian Zhang, ... (+3 more) | 👻 Ghosted | cs.MM | 57 | 2 years ago |
| 157 | Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning<br>Yiyan Chen, Li Tao, ... (+2 more) | 👻 Ghosted | cs.CV | 57 | 6 years ago |
| 158 | Deep Multimodal Speaker Naming<br>Yongtao Hu, Jimmy Ren, ... (+4 more) | 👻 Ghosted | cs.CV | 57 | 10 years ago |
| 159 | CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model<br>Zhen Ye, Wei Xue, ... (+4 more) | 👻 Ghosted | cs.SD | 56 | 2 years ago |
| 160 | Adversarial Bipartite Graph Learning for Video Domain Adaptation<br>Yadan Luo, Zi Huang, ... (+3 more) | 👻 Ghosted | cs.CV | 56 | 5 years ago |
| 161 | ChainerCV: a Library for Deep Learning in Computer Vision<br>Yusuke Niitani, Toru Ogawa, ... (+2 more) | 👻 Ghosted | cs.CV | 54 | 8 years ago |
| 162 | Counterfactual Reasoning for Out-of-distribution Multimodal Sentiment Analysis<br>Teng Sun, Wenjie Wang, ... (+4 more) | 👻 Ghosted | cs.CL | 53 | 3 years ago |
| 163 | Who, Where, and What to Wear? Extracting Fashion Knowledge from Social Media<br>Yunshan Ma, Xun Yang, ... (+3 more) | 👻 Ghosted | cs.CV | 52 | 6 years ago |
| 164 | Context-Dependent Diffusion Network for Visual Relationship Detection<br>Zhen Cui, Chunyan Xu, ... (+2 more) | 👻 Ghosted | cs.CV | 52 | 7 years ago |
| 165 | Multimedia Semantic Integrity Assessment Using Joint Embedding Of Images And Text<br>Ayush Jaiswal, Ekraam Sabir, ... (+2 more) | 👻 Ghosted | cs.MM | 51 | 8 years ago |
| 166 | Video Imagination from a Single Image with Transformation Generation<br>Baoyang Chen, Wenmin Wang, ... (+2 more) | 👻 Ghosted | cs.CV | 51 | 8 years ago |
| 167 | FakingRecipe: Detecting Fake News on Short Video Platforms from the Perspective of Creative Process<br>Yuyan Bu, Qiang Sheng, ... (+4 more) | 👻 Ghosted | cs.CV | 51 | 1 year ago |
| 168 | Outfit Compatibility Prediction and Diagnosis with Multi-Layered Comparison Network<br>Xin Wang, Bo Wu, ... (+2 more) | 👻 Ghosted | cs.CV | 50 | 6 years ago |
| 169 | Deeply-Supervised Recurrent Convolutional Neural Network for Saliency Detection<br>Youbao Tang, Xiangqian Wu, Wei Bu | 👻 Ghosted | cs.CV | 50 | 9 years ago |
| 170 | Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN<br>Xiangteng He, Yuxin Peng, Junjie Zhao | 👻 Ghosted | cs.CV | 49 | 8 years ago |
| 171 | Learning a Target Sample Re-Generator for Cross-Database Micro-Expression Recognition<br>Yuan Zong, Xiaohua Huang, ... (+3 more) | 👻 Ghosted | cs.CV | 49 | 8 years ago |
| 172 | Temporally Guided Music-to-Body-Movement Generation<br>Hsuan-Kai Kao, Li Su | 👻 Ghosted | cs.MM | 48 | 5 years ago |
| 173 | Salvage Reusable Samples from Noisy Data for Robust Learning<br>Zeren Sun, Xian-Sheng Hua, ... (+4 more) | 👻 Ghosted | cs.CV | 48 | 5 years ago |
| 174 | Amora: Black-box Adversarial Morphing Attack<br>Run Wang, Felix Juefei-Xu, ... (+5 more) | 👻 Ghosted | cs.CV | 48 | 6 years ago |
| 175 | Enabling My Robot To Play Pictionary : Recurrent Neural Networks For Sketch Recognition<br>Ravi Kiran Sarvadevabhatla, Jogendra Kundu, Babu R. Venkatesh | 🌅 Old Age | cs.CV | 48 | 9 years ago |
| 176 | Adversarial Privacy-preserving Filter<br>Jiaming Zhang, Jitao Sang, ... (+4 more) | 👻 Ghosted | cs.CR | 47 | 5 years ago |
| 177 | Attention Transfer from Web Images for Video Recognition<br>Junnan Li, Yongkang Wong, ... (+2 more) | 👻 Ghosted | cs.CV | 47 | 8 years ago |
| 178 | Leveraging Contextual Cues for Generating Basketball Highlights<br>Vinay Bettadapura, Caroline Pantofaru, Irfan Essa | 👻 Ghosted | cs.MM | 47 | 9 years ago |
| 179 | PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music<br>Hongru Liang, Wenqiang Lei, ... (+4 more) | 👻 Ghosted | cs.SD | 46 | 5 years ago |
| 180 | Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation<br>Kwanyong Park, Sanghyun Woo, ... (+3 more) | 👻 Ghosted | cs.CV | 46 | 6 years ago |
| 181 | GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation<br>Zhanyu Wang, Longyue Wang, ... (+8 more) | 👻 Ghosted | cs.CV | 46 | 2 years ago |
| 182 | Progressive Spatio-temporal Perception for Audio-Visual Question Answering<br>Guangyao Li, Wenxuan Hou, Di Hu | 💀 404 Not Found | cs.CV | 45 | 2 years ago |
| 183 | Transform-Invariant Convolutional Neural Networks for Image Classification and Search<br>Xu Shen, Xinmei Tian, ... (+3 more) | 🌅 Old Age | cs.CV | 45 | 6 years ago |
| 184 | SMP Challenge: An Overview of Social Media Prediction Challenge 2019<br>Bo Wu, Wen-Huang Cheng, ... (+4 more) | 👻 Ghosted | cs.MM | 45 | 6 years ago |
| 185 | Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches<br>Zhi Chen, Sen Wang, ... (+2 more) | 👻 Ghosted | cs.CV | 43 | 5 years ago |
| 186 | Deep Priority Hashing<br>Zhangjie Cao, Ziping Sun, ... (+3 more) | 👻 Ghosted | cs.CV | 43 | 7 years ago |
| 187 | Gradual Network for Single Image De-raining<br>Zhe Huang, Weijiang Yu, ... (+3 more) | 👻 Ghosted | cs.CV | 42 | 6 years ago |
| 188 | Learning Comprehensive Representations with Richer Self for Text-to-Image Person Re-Identification<br>Shuanglin Yan, Neng Dong, ... (+3 more) | 👻 Ghosted | cs.CV | 41 | 2 years ago |
| 189 | SGDiff: A Style Guided Diffusion Model for Fashion Synthesis<br>Zhengwentai Sun, Yanghong Zhou, ... (+2 more) | 💀 404 Not Found | cs.CV | 41 | 2 years ago |
| 190 | Domain Adaptive Person Re-Identification via Coupling Optimization<br>Xiaobin Liu, Shiliang Zhang | 👻 Ghosted | cs.CV | 41 | 5 years ago |
| 191 | Unpaired Cross-lingual Image Caption Generation with Self-Supervised Rewards<br>Yuqing Song, Shizhe Chen, ... (+2 more) | 👻 Ghosted | cs.CV | 41 | 6 years ago |
| 192 | Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur Images<br>Dingquan Li, Tingting Jiang, Ming Jiang | 👻 Ghosted | eess.IV | 41 | 7 years ago |
| 193 | PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation<br>Ana García del Molino, Michael Gygli | 👻 Ghosted | cs.CV | 41 | 8 years ago |
| 194 | Do LLMs Understand Visual Anomalies? Uncovering LLM's Capabilities in Zero-shot Anomaly Detection<br>Jiaqi Zhu, Shaofeng Cai, ... (+3 more) | 👻 Ghosted | cs.CV | 41 | 2 years ago |
| 195 | Learning Causality-inspired Representation Consistency for Video Anomaly Detection<br>Yang Liu, Zhaoyang Xia, ... (+8 more) | 👻 Ghosted | cs.MM | 40 | 2 years ago |
| 196 | Multi-modal Cooking Workflow Construction for Food Recipes<br>Liangming Pan, Jingjing Chen, ... (+6 more) | 👻 Ghosted | cs.CL | 40 | 5 years ago |
| 197 | Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction<br>Yiting Shao, Qi Zhang, ... (+2 more) | 👻 Ghosted | cs.MM | 40 | 8 years ago |
| 198 | Improving Zero-shot Visual Question Answering via Large Language Models with Reasoning Question Prompts<br>Yunshi Lan, Xiang Li, ... (+4 more) | 💀 404 Not Found | cs.CV | 40 | 2 years ago |
| 199 | DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation<br>Qiaosong Qi, Le Zhuo, ... (+5 more) | 👻 Ghosted | cs.GR | 39 | 2 years ago |
| 200 | Text-to-image Synthesis via Symmetrical Distillation Networks<br>Mingkuan Yuan, Yuxin Peng | 👻 Ghosted | cs.CV | 39 | 7 years ago |