| 1 | WorldCache: Content-Aware Caching for Accelerated Video World Models | Umair Nawaz, Ahmed Heakl, ... (+4 more) | cs.CV | 0 | 2 months ago |
| 2 | VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding | Ruoliu Yang, Chu Wu, ... (+3 more) | cs.CV | 0 | 2 months ago |
| 3 | End-to-End Training for Unified Tokenization and Latent Denoising | Shivam Duggal, Xingjian Bai, ... (+6 more) | cs.CV | 0 | 2 months ago |
| 4 | UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation | Ziyi Wang, Xinshun Wang, ... (+3 more) | cs.CV | 0 | 2 months ago |
| 5 | ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model | Haichao Zhang, Yijiang Li, ... (+6 more) | cs.CV | 0 | 2 months ago |
| 6 | DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models | Zhide Zhong, Junfeng Li, ... (+11 more) | cs.CV | 0 | 2 months ago |
| 7 | 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing | Haoyu Zhen, Xiaolong Li, ... (+6 more) | cs.CV | 0 | 2 months ago |
| 8 | The Dual Mechanisms of Spatial Reasoning in Vision-Language Models | Kelly Cui, Nikhil Prakash, ... (+4 more) | cs.CV | 0 | 2 months ago |
| 9 | Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels | Alexandra Zelenin, Alexandra Zhuravlyova | cs.LG | 0 | 2 months ago |
| 10 | Repurposing Geometric Foundation Models for Multi-view Diffusion | Wooseok Jang, Seonghu Jeon, ... (+6 more) | cs.CV | 0 | 2 months ago |
| 11 | Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration | Zakaria Mhammedi, James Cohan | cs.LG | 0 | 2 months ago |
| 12 | DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution | Zhengyao Lv, Menghan Xia, ... (+2 more) | cs.CV | 0 | 2 months ago |
| 13 | GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning | Yixuan Luo, Feng Qiao, ... (+3 more) | cs.CV | 0 | 2 months ago |
| 14 | TiCo: Time-Controllable Training for Spoken Dialogue Models | Kai-Wei Chang, Wei-Chih Chen, ... (+3 more) | cs.CL | 0 | 2 months ago |
| 15 | UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos | Gu Zhang, Qicheng Xu, ... (+17 more) | cs.RO | 0 | 2 months ago |
| 16 | DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming | Hung-Chieh Fang, Amber Xie, ... (+3 more) | cs.RO | 0 | 2 months ago |
| 17 | Greater accessibility can amplify discrimination in generative AI | Carolin Holtermann, Minh Duc Bui, ... (+4 more) | cs.CL | 0 | 2 months ago |
| 18 | Characterizing High-Capacity Janus Aminobenzene-Graphene Anode for Sodium-Ion Batteries with Machine Learning | Claudia Islas-Vargas, L. Ricardo Montoya, ... (+4 more) | cond-mat.mtrl-sci | 0 | 2 months ago |
| 19 | exaCB: Reproducible Continuous Benchmark Collections at Scale Leveraging an Incremental Approach | Jayesh Badwaik, Mathis Bode, ... (+2 more) | cs.DC | 0 | 2 months ago |
| 20 | EgoGroups: A Benchmark For Detecting Social Groups of People in the Wild | Jeffri Murrugarra-Llerena, Pranav Chitale, ... (+5 more) | cs.CV | 0 | 2 months ago |
| 21 | Confidence-Based Decoding is Provably Efficient for Diffusion Language Models | Changxiao Cai, Gen Li | cs.LG | 0 | 2 months ago |
| 22 | MemDLM: Memory-Enhanced DLM Training | Zehua Pei, Hui-Ling Zhen, ... (+5 more) | cs.CL | 0 | 2 months ago |
| 23 | A Dividing Line for Structural Kernelization of Component Order Connectivity via Distance to Bounded Pathwidth | Jakob Greilhuber, Roohani Sharma | cs.DS | 0 | 2 months ago |
| 24 | Structure-aware divergences for comparing probability distributions | Rohit Sahasrabuddhe, Renaud Lambiotte | cs.IT | 0 | 2 months ago |
| 25 | ShapDBM: Exploring Decision Boundary Maps in Shapley Space | Luke Watkin, Daniel Archambault, Alex Telea | cs.HC | 0 | 2 months ago |
| 26 | One Model, Two Markets: Bid-Aware Generative Recommendation | Yanchen Jiang, Zhe Feng, ... (+3 more) | cs.IR | 0 | 2 months ago |
| 27 | Riverine Land Cover Mapping through Semantic Segmentation of Multispectral Point Clouds | Sopitta Thurachen, Josef Taher, ... (+7 more) | cs.CV | 0 | 2 months ago |
| 28 | Benchmarking Deep Learning Models for Aerial LiDAR Point Cloud Semantic Segmentation under Real Acquisition Conditions: A Case Study in Navarre | Alex Salvatierra, José Antonio Sanz, ... (+2 more) | cs.CV | 0 | 2 months ago |
| 29 | SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation | Sashuai Zhou, Qiang Zhou, ... (+10 more) | cs.CV | 0 | 2 months ago |
| 30 | Dyadic: A Scalable Platform for Human-Human and Human-AI Conversation Research | David M. Markowitz | cs.HC | 0 | 2 months ago |
| 31 | Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson's Disease | Abner Hernandez, Eunjung Yeo, ... (+13 more) | cs.CL | 0 | 2 months ago |
| 32 | Accelerating Fresh Data Exploration with Fluid ETL Pipelines | Maxwell Norfolk, Dong Xie | cs.DB | 0 | 2 months ago |
| 33 | Noise Titration: Exact Distributional Benchmarking for Probabilistic Time Series Forecasting | Qilin Wang | cs.LG | 0 | 2 months ago |
| 34 | Gumbel Distillation for Parallel Text Generation | Chi Zhang, Xixi Hu, ... (+2 more) | cs.CL | 0 | 2 months ago |
| 35 | Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models | Tom Biskupski, Stephan Kleber | cs.CR | 0 | 2 months ago |
| 36 | SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection | Kexian Tang, Jiani Wang, ... (+2 more) | cs.LG | 0 | 2 months ago |
| 37 | Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models | Meiqi Wu, Zhixin Cai, ... (+14 more) | cs.CV | 0 | 2 months ago |
| 38 | Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs | Kangqi Ni, Wenyue Hua, ... (+4 more) | cs.LG | 0 | 2 months ago |
| 39 | Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control | Qingrui Zhao, Kaiyue Yang, ... (+8 more) | cs.RO | 0 | 2 months ago |
| 40 | Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning | Daniel Shao, Joel Runevic, ... (+5 more) | cs.CV | 0 | 2 months ago |
| 41 | CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks | A. Chervov, F. Levkovich-Maslyuk, ... (+42 more) | hep-th | 0 | 2 months ago |
| 42 | PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation | Mingju Gao, Kaisen Yang, ... (+13 more) | cs.CV | 0 | 2 months ago |
| 43 | Stable Algorithms Lower Bounds for Estimation | Xifan Yu, Ilias Zadik | math.ST | 0 | 2 months ago |
| 44 | Framework for Risk-Based IoT Cybersecurity Audit Engagements | Danielle Hanson, Jeremy Straub | cs.CR | 0 | 2 months ago |
| 45 | A Backbone Benchmarking Study on Self-supervised Learning as a Auxiliary Task with Texture-based Local Descriptors for Face Analysis | Shukesh Reddy, Abhijit Das | cs.CV | 0 | 2 months ago |
| 46 | Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement | Junrong Guo, Shancheng Fang, ... (+2 more) | cs.CV | 0 | 2 months ago |
| 47 | Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation | Ireh Kim, Tesia Sker, Chanwoo Kim | cs.CL | 0 | 2 months ago |
| 48 | Revisiting Quantum Code Generation: Where Should Domain Knowledge Live? | Oscar Novo, Oscar Bastidas-Jossa, ... (+3 more) | cs.LG | 0 | 2 months ago |
| 49 | Cross-Modal Reinforcement Learning for Navigation with Degraded Depth Measurements | Omkar Sawant, Luca Zanatta, ... (+2 more) | cs.RO | 0 | 2 months ago |
| 50 | MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management | Jack W O'Sullivan, Mohammad Asadi, ... (+9 more) | cs.AI | 0 | 2 months ago |