| 1001 |
Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization
Björn Deiseroth, Max Meuer, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
3 |
2 years ago |
| 1002 |
Anti-LM Decoding for Zero-shot In-context Machine Translation
Suzanna Sia, Alexandra DeLucia, Kevin Duh
|
👻
Ghosted
|
cs.CL
|
3 |
2 years ago |
| 1003 |
Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning
Athul Paul Jacob, Gabriele Farina, Jacob Andreas
|
👻
Ghosted
|
cs.CL
|
3 |
2 years ago |
| 1004 |
Solving Data-centric Tasks using Large Language Models
Shraddha Barke, Christian Poelitz, ... (+11 more)
|
👻
Ghosted
|
cs.PL
|
3 |
2 years ago |
| 1005 |
Exploring Language Model's Code Generation Ability with Auxiliary Functions
Seonghyeon Lee, Sanghwan Jang, ... (+3 more)
|
👻
Ghosted
|
cs.SE
|
3 |
2 years ago |
| 1006 |
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems
Clemencia Siro, Mohammad Aliannejadi, Maarten de Rijke
|
👻
Ghosted
|
cs.CL
|
3 |
2 years ago |
| 1007 |
Improving and Assessing the Fidelity of Large Language Models Alignment to Online Communities
Minh Duc Chu, Zihao He, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1008 |
Concept-Reversed Winograd Schema Challenge: Evaluating and Improving Robust Reasoning in Large Language Models via Abstraction
Kaiqiao Han, Tianqing Fang, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1009 |
DomainSum: A Hierarchical Benchmark for Fine-Grained Domain Shift in Abstractive Text Summarization
Haohan Yuan, Haopeng Zhang
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1010 |
$C^2$: Scalable Auto-Feedback for LLM-based Chart Generation
Woosung Koh, Jang Han Yoon, ... (+8 more)
|
👻
Ghosted
|
cs.LG
|
3 |
1 year ago |
| 1011 |
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech
Eric Battenberg, RJ Skerry-Ryan, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1012 |
VisualCoder: Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
Cuong Chi Le, Hoang-Chau Truong-Vinh, ... (+4 more)
|
👻
Ghosted
|
cs.SE
|
3 |
1 year ago |
| 1013 |
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models
Tanmay Parekh, Pradyot Prakash, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1014 |
Uncertainty Quantification for Clinical Outcome Predictions with (Large) Language Models
Zizhang Chen, Peizhao Li, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1015 |
Self-Training Meets Consistency: Improving LLMs' Reasoning with Consistency-Driven Rationale Evaluation
Jaehyeok Lee, Keisuke Sakaguchi, JinYeong Bak
|
👻
Ghosted
|
cs.LG
|
3 |
1 year ago |
| 1016 |
Sneaking Syntax into Transformer Language Models with Tree Regularization
Ananjan Nandi, Christopher D. Manning, Shikhar Murty
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1017 |
Does Self-Attention Need Separate Weights in Transformers?
Md Kowsher, Nusrat Jahan Prottasha, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1018 |
Efficient Continual Pre-training of LLMs for Low-resource Languages
Arijit Nag, Soumen Chakrabarti, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1019 |
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis
Zeao Tu, Xiangdi Meng, ... (+5 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1020 |
ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots
Shani Goren, Oren Kalinsky, ... (+7 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1021 |
CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models
Yifan Zhang, Xue Yang
|
👻
Ghosted
|
cs.AI
|
3 |
1 year ago |
| 1022 |
Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert Parallelism Design
Mohan Zhang, Pingzhi Li, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
3 |
1 year ago |
| 1023 |
TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables
Abhilash Shankarampeta, Harsh Mahajan, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
3 |
1 year ago |
| 1024 |
Structured Prediction with Output Embeddings for Semantic Image Annotation
Ariadna Quattoni, Arnau Ramisa, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
2 |
10 years ago |
| 1025 |
Japanese Predicate Conjugation for Neural Machine Translation
Michiki Kurosawa, Yukio Matsumura, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2 |
8 years ago |
| 1026 |
Cross-topic distributional semantic representations via unsupervised mappings
Eleftheria Briakou, Nikos Athanasiou, Alexandros Potamianos
|
👻
Ghosted
|
cs.CL
|
2 |
7 years ago |
| 1027 |
A framework for streamlined statistical prediction using topic models
Vanessa Glenny, Jonathan Tuke, ... (+2 more)
|
👻
Ghosted
|
stat.AP
|
2 |
7 years ago |
| 1028 |
INS: An Interactive Chinese News Synthesis System
Hui Liu, Wentao Qin, Xiaojun Wan
|
👻
Ghosted
|
cs.CL
|
2 |
6 years ago |
| 1029 |
On the Transferability of Minimal Prediction Preserving Inputs in Question Answering
Shayne Longpre, Yi Lu, Christopher DuBois
|
👻
Ghosted
|
cs.CL
|
2 |
5 years ago |
| 1030 |
SOrT-ing VQA Models : Contrastive Gradient Learning for Improved Consistency
Sameer Dharur, Purva Tendulkar, ... (+3 more)
|
👻
Ghosted
|
cs.CV
|
2 |
5 years ago |
| 1031 |
CERES: Pretraining of Graph-Conditioned Transformer for Semi-Structured Session Data
Rui Feng, Chen Luo, ... (+4 more)
|
👻
Ghosted
|
cs.IR
|
2 |
4 years ago |
| 1032 |
Fast and Light-Weight Answer Text Retrieval in Dialogue Systems
Hui Wan, Siva Sankalp Patel, ... (+3 more)
|
👻
Ghosted
|
cs.IR
|
2 |
4 years ago |
| 1033 |
Do Trajectories Encode Verb Meaning?
Dylan Ebert, Chen Sun, Ellie Pavlick
|
👻
Ghosted
|
cs.CL
|
2 |
4 years ago |
| 1034 |
Phrase translation using a bilingual dictionary and n-gram data: A case study from Vietnamese to English
Khang Nhut Lam, Feras Al Tarouti, Jugal Kalita
|
👻
Ghosted
|
cs.CL
|
2 |
3 years ago |
| 1035 |
Multi-stage Distillation Framework for Cross-Lingual Semantic Similarity Matching
Kunbo Ding, Weijie Liu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
2 |
3 years ago |
| 1036 |
Subspace Representations for Soft Set Operations and Sentence Similarities
Yoichi Ishibashi, Sho Yokoi, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2 |
3 years ago |
| 1037 |
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
Yunchao Zhang, Zonglin Di, ... (+3 more)
|
👻
Ghosted
|
cs.AI
|
2 |
3 years ago |
| 1038 |
LatticeGen: A Cooperative Framework which Hides Generated Text in a Lattice for Privacy-Aware Generation on Cloud
Mengke Zhang, Tianxing He, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1039 |
XAL: EXplainable Active Learning Makes Classifiers Better Low-resource Learners
Yun Luo, Zhen Yang, ... (+6 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1040 |
Non-contrastive sentence representations via self-supervision
Marco Farina, Duccio Pappadopulo
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1041 |
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
Youbo Lei, Feifei He, ... (+5 more)
|
👻
Ghosted
|
cs.CV
|
2 |
2 years ago |
| 1042 |
The Unreasonable Effectiveness of Random Target Embeddings for Continuous-Output Neural Machine Translation
Evgeniia Tokarchuk, Vlad Niculae
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1043 |
FAMuS: Frames Across Multiple Sources
Siddharth Vashishtha, Alexander Martin, ... (+3 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1044 |
Does Pre-trained Language Model Actually Infer Unseen Links in Knowledge Graph Completion?
Yusuke Sakai, Hidetaka Kamigaito, ... (+2 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1045 |
Code Models are Zero-shot Precondition Reasoners
Lajanugen Logeswaran, Sungryull Sohn, ... (+6 more)
|
👻
Ghosted
|
cs.AI
|
2 |
2 years ago |
| 1046 |
On Retrieval Augmentation and the Limitations of Language Model Training
Ting-Rui Chiang, Xinyan Velocity Yu, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1047 |
Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning
Kazuma Hashimoto, Karthik Raman, Michael Bendersky
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1048 |
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization
Joseph J. Peper, Wenzhao Qiu, Lu Wang
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1049 |
Lost in Overlap: Exploring Logit-based Watermark Collision in LLMs
Yiyang Luo, Ke Lin, ... (+4 more)
|
👻
Ghosted
|
cs.CL
|
2 |
2 years ago |
| 1050 |
A (More) Realistic Evaluation Setup for Generalisation of Community Models on Malicious Content Detection
Ivo Verhoeven, Pushkar Mishra, ... (+3 more)
|
👻
Ghosted
|
cs.LG
|
2 |
2 years ago |