| 201 |
Learning Latent Spatio-Temporal Compositional Model for Human Action Recognition
Xiaodan Liang, Liang Lin, Liangliang Cao
|
👻
Ghosted
|
cs.CV
|
39 |
11 years ago |
| 202 |
Exploring the Robustness of Decision-Level Through Adversarial Attacks on LLM-Based Embodied Models
Shuyuan Liu, Jiawei Chen, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
39 |
1 year ago |
| 203 |
Arondight: Red Teaming Large Vision Language Models with Auto-generated Multi-modal Jailbreak Prompts
Yi Liu, Chengjun Cai, ... (+3 more)
|
⏳
Coming Soon™
|
cs.LG
|
38 |
1 year ago |
| 204 |
Cross-Modal Generalization: Learning in Low Resource Modalities via Meta-Alignment
Paul Pu Liang, Peter Wu, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
37 |
5 years ago |
| 205 |
LIGHTEN: Learning Interactions with Graph and Hierarchical TEmporal Networks for HOI in videos
Sai Praneeth Reddy Sunkesula, Rishabh Dabral, Ganesh Ramakrishnan
|
🌅
Old Age
|
cs.CV
|
37 |
5 years ago |
| 206 |
Webpage Segmentation for Extracting Images and Their Surrounding Contextual Information
F. Fauzi, H. J. Long, M. Belkhatir
|
👻
Ghosted
|
cs.MM
|
37 |
5 years ago |
| 207 |
Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space
Sicheng Zhao, Yaxian Li, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
37 |
5 years ago |
| 208 |
Human Pose Estimation from Depth Images via Inference Embedded Multi-task Learning
Keze Wang, Shengfu Zhai, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
37 |
9 years ago |
| 209 |
Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object Detection
Lv Tang, Peng-Tao Jiang, ... (+4 more)
|
💀
404 Not Found
|
cs.CV
|
37 |
2 years ago |
| 210 |
Few-shot Multimodal Sentiment Analysis based on Multimodal Probabilistic Fusion Prompts
Xiaocui Yang, Shi Feng, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
37 |
3 years ago |
| 211 |
SDIT: Scalable and Diverse Cross-domain Image Translation
Yaxing Wang, Abel Gonzalez-Garcia, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
36 |
6 years ago |
| 212 |
Support Neighbor Loss for Person Re-Identification
Kai Li, Zhengming Ding, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
36 |
7 years ago |
| 213 |
Knowledge Prompt-tuning for Sequential Recommendation
Jianyang Zhai, Xiawu Zheng, ... (+3 more)
|
💀
404 Not Found
|
cs.IR
|
36 |
2 years ago |
| 214 |
Hierarchical Dynamic Image Harmonization
Haoxing Chen, Zhangxuan Gu, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
36 |
3 years ago |
| 215 |
Hybrid Multimodal Feature Extraction, Mining and Fusion for Sentiment Analysis
Jia Li, Ziyang Zhang, ... (+10 more)
|
👻
Ghosted
|
cs.CV
|
36 |
3 years ago |
| 216 |
Kalman Filter-based Head Motion Prediction for Cloud-based Mixed Reality
Serhan Gül, Sebastian Bosse, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
35 |
5 years ago |
| 217 |
Neural Storyboard Artist: Visualizing Stories with Coherent Image Sequences
Shizhe Chen, Bei Liu, ... (+7 more)
|
👻
Ghosted
|
cs.LG
|
35 |
6 years ago |
| 218 |
Adversarial Colorization Of Icons Based On Structure And Color Conditions
Tsai-Ho Sun, Chien-Hsun Lai, ... (+2 more)
|
👻
Ghosted
|
cs.LG
|
35 |
6 years ago |
| 219 |
Progressive Tree-Structured Prototype Network for End-to-End Image Captioning
Pengpeng Zeng, Jinkuan Zhu, ... (+2 more)
|
💤
Eternal Rest
|
cs.CV
|
35 |
3 years ago |
| 220 |
Equivariant and Invariant Grounding for Video Question Answering
Yicong Li, Xiang Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
35 |
3 years ago |
| 221 |
The MuSe 2023 Multimodal Sentiment Analysis Challenge: Mimicked Emotions, Cross-Cultural Humour, and Personalisation
Lukas Christ, Shahin Amiriparian, ... (+11 more)
|
👻
Ghosted
|
cs.LG
|
34 |
2 years ago |
| 222 |
Online Distillation-enhanced Multi-modal Transformer for Sequential Recommendation
Wei Ji, Xiangyan Liu, ... (+4 more)
|
👻
Ghosted
|
cs.IR
|
34 |
2 years ago |
| 223 |
Combating Online Misinformation Videos: Characterization, Detection, and Future Directions
Yuyan Bu, Qiang Sheng, ... (+4 more)
|
📜
Death by README
|
cs.CV
|
33 |
3 years ago |
| 224 |
Fully Quantized Image Super-Resolution Networks
Hu Wang, Peng Chen, ... (+2 more)
|
👻
Ghosted
|
eess.IV
|
33 |
5 years ago |
| 225 |
Label Tree Embeddings for Acoustic Scene Classification
Huy Phan, Lars Hertel, ... (+3 more)
|
👻
Ghosted
|
cs.MM
|
33 |
9 years ago |
| 226 |
Who are the Devils Wearing Prada in New York City?
KuanTing Chen, Kezhen Chen, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
33 |
10 years ago |
| 227 |
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description
Zeyu Jin, Jia Jia, ... (+6 more)
|
👻
Ghosted
|
cs.MM
|
33 |
1 year ago |
| 228 |
GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting
Hongyun Yu, Zhan Qu, ... (+9 more)
|
👻
Ghosted
|
cs.CV
|
33 |
2 years ago |
| 229 |
Making Users Indistinguishable: Attribute-wise Unlearning in Recommender Systems
Yuyuan Li, Chaochao Chen, ... (+5 more)
|
👻
Ghosted
|
cs.LG
|
33 |
2 years ago |
| 230 |
A Four-Pronged Defense Against Byzantine Attacks in Federated Learning
Wei Wan, Shengshan Hu, ... (+5 more)
|
👻
Ghosted
|
cs.CR
|
32 |
2 years ago |
| 231 |
Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars
Yang Liu, Xiang Huang, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
32 |
2 years ago |
| 232 |
Compositional Few-Shot Recognition with Primitive Discovery and Enhancing
Yixiong Zou, Shanghang Zhang, ... (+4 more)
|
👻
Ghosted
|
cs.CV
|
32 |
5 years ago |
| 233 |
Deep Concept-wise Temporal Convolutional Networks for Action Localization
Xin Li, Tianwei Lin, ... (+8 more)
|
👻
Ghosted
|
cs.CV
|
32 |
6 years ago |
| 234 |
Sentence Specified Dynamic Video Thumbnail Generation
Yitian Yuan, Lin Ma, Wenwu Zhu
|
🌅
Old Age
|
cs.CV
|
32 |
6 years ago |
| 235 |
Pinterest Board Recommendation for Twitter Users
Xitong Yang, Yuncheng Li, Jiebo Luo
|
👻
Ghosted
|
cs.SI
|
32 |
10 years ago |
| 236 |
Interpretable Embedding for Ad-hoc Video Search
Jiaxin Wu, Chong-Wah Ngo
|
👻
Ghosted
|
cs.CV
|
32 |
2 years ago |
| 237 |
RecolorNeRF: Layer Decomposed Radiance Fields for Efficient Color Editing of 3D Scenes
Bingchen Gong, Yuehao Wang, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
31 |
3 years ago |
| 238 |
Understanding User Behavior in Volumetric Video Watching: Dataset, Analysis and Prediction
Kaiyuan Hu, Haowen Yang, ... (+5 more)
|
👻
Ghosted
|
cs.MM
|
31 |
2 years ago |
| 239 |
Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection
Xinhao Deng, Pingping Zhang, ... (+2 more)
|
💤
Eternal Rest
|
cs.CV
|
31 |
2 years ago |
| 240 |
Dual Semantic Fusion Network for Video Object Detection
Lijian Lin, Haosheng Chen, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
31 |
5 years ago |
| 241 |
Efficient Adaptation of Neural Network Filter for Video Compression
Yat-Hong Lam, Alireza Zare, ... (+3 more)
|
👻
Ghosted
|
eess.IV
|
31 |
5 years ago |
| 242 |
Synthetic Data Supervised Salient Object Detection
Zhenyu Wu, Lin Wang, ... (+5 more)
|
💤
Eternal Rest
|
cs.CV
|
31 |
3 years ago |
| 243 |
Adaptive Structural Similarity Preserving for Unsupervised Cross Modal Hashing
Liang Li, Baihua Zheng, Weiwei Sun
|
👻
Ghosted
|
cs.IR
|
31 |
3 years ago |
| 244 |
Dance with You: The Diversity Controllable Dancer Generation via Diffusion Models
Siyue Yao, Mingjie Sun, ... (+4 more)
|
👻
Ghosted
|
cs.HC
|
30 |
2 years ago |
| 245 |
Learning Segment Similarity and Alignment in Large-Scale Content Based Video Retrieval
Chen Jiang, Kaiming Huang, ... (+10 more)
|
👻
Ghosted
|
cs.CV
|
30 |
2 years ago |
| 246 |
PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation
Hanbing Liu, Jun-Yan He, ... (+9 more)
|
💤
Eternal Rest
|
cs.CV
|
30 |
2 years ago |
| 247 |
Cross-Modal Retrieval and Synthesis (X-MRS): Closing the Modality Gap in Shared Representation Learning
Ricardo Guerrero, Hai Xuan Pham, Vladimir Pavlovic
|
👻
Ghosted
|
cs.CV
|
30 |
5 years ago |
| 248 |
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams
Ana Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan
|
👻
Ghosted
|
cs.CV
|
30 |
7 years ago |
| 249 |
4D Gaussian Splatting with Scale-aware Residual Field and Adaptive Optimization for Real-time Rendering of Temporally Complex Dynamic Scenes
Jinbo Yan, Rui Peng, ... (+2 more)
|
👻
Ghosted
|
cs.CV
|
30 |
1 year ago |
| 250 |
MultiHateClip: A Multilingual Benchmark Dataset for Hateful Video Detection on YouTube and Bilibili
Han Wang, Tan Rui Yang, ... (+2 more)
|
👻
Ghosted
|
cs.MM
|
30 |
1 year ago |