| 1 |
BEM: Training-Free Background Embedding Memory for False-Positive Suppression in Real-Time Fixed-Background Camera
Junwoo Park, Jangho Lee, Sunho Lim
|
|
cs.CV
|
0 |
2 months ago |
| 2 |
TAG-Head: Time-Aligned Graph Head for Plug-and-Play Fine-grained Action Recognition
Imtiaz Ul Hassan, Nik Bessis, Ardhendu Behera
|
|
cs.CV
|
0 |
2 months ago |
| 3 |
Demographic and Linguistic Bias Evaluation in Omnimodal Language Models
Alaa Elobaid
|
|
cs.CV
|
0 |
2 months ago |
| 4 |
PlankFormer: Robust Plankton Instance Segmentation via MAE-Pretrained Vision Transformers and Pseudo Community Image Generation
Masaharu Miyazaki, Yurie Otake, ... (+4 more)
|
|
cs.CV
|
0 |
1 month ago |
| 5 |
DGSSM: Diffusion guided state-space models for multimodal salient object detection
Suklav Ghosh, Arijit Sur, Pinaki Mitra
|
|
cs.CV
|
0 |
1 month ago |
| 6 |
CollideNet: Hierarchical Multi-scale Video Representation Learning with Disentanglement for Time-To-Collision Forecasting
Nishq Poorav Desai, Ali Etemad, Michael Greenspan
|
|
cs.CV
|
0 |
1 month ago |
| 7 |
ADP-DiT: Text-Guided Diffusion Transformer for Brain Image Generation in Alzheimer's Disease Progression
Juneyong Lee, Geonwoo Baek, Ikbeom Jang
|
|
cs.CV
|
0 |
2 months ago |
| 8 |
Semantically Stable Image Composition Analysis via Saliency and Gradient Vector Flow Fusion
Armin Dadras, Robert Sablatnig, ... (+2 more)
|
|
cs.CV
|
0 |
2 months ago |
| 9 |
The Pragmatic Persona: Discovering LLM Persona through Bridging Inference
Jisoo Yang, Jongwon Ryu, ... (+3 more)
|
|
cs.CL
|
0 |
1 month ago |
| 10 |
AMAVA: Adaptive Motion-Aware Video-to-Audio Framework for Visually-Impaired Assistance
Benjamin Klein, Kazi Ruslan Rahman, Sanchita Ghose
|
|
cs.CV
|
0 |
1 month ago |
| 11 |
Deep kernel video approximation for unsupervised action segmentation
Silvia L. Pintea, Jouke Dijkstra
|
|
cs.CV
|
0 |
1 month ago |
| 12 |
SIFT-VTON: Geometric Correspondence Supervision on Cross-Attention for Virtual Try-On
Kosuke Takemoto, Takafumi Koshinaka
|
|
cs.CV
|
0 |
1 month ago |
| 13 |
A More Word-like Image Tokenization for MLLMs
Hyun Lee, Hyemin Jeong, ... (+5 more)
|
|
cs.CV
|
0 |
28 days ago |
| 14 |
What Do Students Learn? A Feature-Level Analysis of Dark Knowledge
Seungu Kang, Songkuk Kim
|
|
cs.LG
|
0 |
13 days ago |
| 15 |
Online K-d tree for approximate neighborhood search in data streams
Eduardo V. L. Barboza, Robert Sabourin, Rafael M. O. Cruz
|
|
cs.DS
|
0 |
14 days ago |