| 1 | Improving Reasoning Capabilities in Small Models through Mixture-of-Layers Distillation with Stepwise Attention on Key Information | Yao Chen, Jiawei Sheng, ... (+2 more) | cs.CL | 0 | 2 months ago |
| 2 | MultiDocFusion: Hierarchical and Multimodal Chunking Pipeline for Enhanced RAG on Long Industrial Documents | Joongmin Shin, Chanjun Park, ... (+3 more) | cs.AI | 0 | 2 months ago |
| 3 | QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering | Woojun Jung, Junyeong Kim | cs.CV | 0 | 1 month ago |
| 4 | Prompts Don't Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control | Rohith Uppala | cs.CR | 0 | 1 month ago |
| 5 | AI-Associated Lexical Shifts Across 34 Languages: Cross-Lingual Convergence and Diachronic Uptake in News Writing | Thomas Stephan Juzek | cs.CL | 0 | 25 days ago |
| 6 | IterInject: Indirect Prompt Injection Against LLM Agents via Feedback-Guided Iterative Optimization | Zixuan Chen, Jiaxiang Chen, ... (+5 more) | cs.LG | 0 | 27 days ago |
| 7 | SafeSteer: Localized On-Policy Distillation for Efficient Safety Alignment | Hao Li, Jingkun An, ... (+9 more) | cs.AI | 0 | 18 days ago |
| 8 | MENTIS: What Belief Changes Under Alignment? Measuring Multi-Scale Latent Torsion in Language Models | Partha Pratim Saha, Samarth Raina, ... (+5 more) | cs.CL | 0 | 19 days ago |
| 9 | MUDIDI: A Two-Stage Framework for Multilingual Dictionary Digitization with Language Models | David Setiawan, Temuulen Khishigsuren, ... (+4 more) | cs.CL | 0 | 11 days ago |
| 10 | Inside the LLM Word Factory | Benzi Busigin, Yuval Pinter | cs.CL | 0 | 12 days ago |
| 11 | Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning | Jonathan von Rad, Louis Arts, ... (+7 more) | cs.CL | 0 | 15 days ago |
| 12 | Consistency Training Along the Transformer Stack | Sukrati Gautam, Neil Shah, ... (+8 more) | cs.LG | 0 | 15 days ago |
| 13 | AURA: Intent-Directed Probing for Implicit-Need Surfacing in Situated LLM Agents | Yang Li, Jiaxiang Liu, ... (+2 more) | cs.CL | 0 | 15 days ago |
| 14 | Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM | SooHwan Eom, Jay Shim, ... (+5 more) | cs.CL | 0 | 16 days ago |
| 15 | Cross-Lingual Token Arbitrage: Optimizing Code Agent Context Windows via Local LLM Preprocessing | Mehmet Utku Colak | cs.AI | 0 | 17 days ago |
| 16 | Lost at the End: Primacy Bias in Multimodal Retrieval-Augmented Question Answering | Jieyuan Liu, Jianyang Gu, ... (+3 more) | cs.CL | 0 | 4 days ago |
| 17 | Beyond Layer Importance in Layer-wise Sparsity: An Inter-Layer Perturbation-Absorption Perspective | Tao Jing, Ningxin Wu, ... (+4 more) | cs.CL | 0 | 6 days ago |
| 18 | PACUTE: Phonology-, Affix-, and Character-level Understanding of Tokens for Filipino | Jann Railey Montalan, David Demitri Africa, ... (+4 more) | cs.CL | 0 | 6 days ago |
| 19 | CORA: Analyzing and bridging thinking-answer gap in Multimodal RLVR via Consistency-Oriented Reasoning Alignment | Jiayue Cao, Zhicong Lu, ... (+7 more) | cs.CL | 0 | 7 days ago |
| 20 | Keep Policy Gradient in Charge: Sibling-Guided Credit Distillation for Long-Horizon Tool-Use Agents | Tianyu Ding, Jianhong Xin, Juan Pablo De la Cruz Weinstein | cs.LG | 0 | 9 days ago |
| 21 | Do Vision-Language Models See or Guess? Measuring and Reducing Textual-Prior Reliance with a Phrasing-Controlled Benchmark | Pratham Singla, Shivank Garg, ... (+2 more) | cs.CL | 0 | 10 days ago |