A Review of Evaluation Techniques for Social Dialogue Systems

September 13, 2017 ยท The Cartographer ยท ๐Ÿ› ISIAA@ICMI

๐Ÿ“š THE CARTOGRAPHER: The Cartographer
Survey/review paper โ€” maps the landscape rather than implementing a method.

"No code URL or promise found in abstract"
"Title-pattern auto-detect: A Review of Evaluation Techniques for Social Dialogue Systems"

Evidence collected by the PWNC Scanner

Authors Amanda Cercas Curry, Helen Hastie, Verena Rieser arXiv ID 1709.04409 Category cs.CL: Computation & Language Citations 14 Venue ISIAA@ICMI Last Checked 2 days ago
Abstract
In contrast with goal-oriented dialogue, social dialogue has no clear measure of task success. Consequently, evaluation of these systems is notoriously hard. In this paper, we review current evaluation methods, focusing on automatic metrics. We conclude that turn-based metrics often ignore the context and do not account for the fact that several replies are valid, while end-of-dialogue rewards are mainly hand-crafted. Both lack grounding in human perceptions.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

๐Ÿ“œ Similar Papers

In the same crypt โ€” Computation & Language

๐ŸŒ… ๐ŸŒ… Old Age

Attention Is All You Need

Ashish Vaswani, Noam Shazeer, ... (+6 more)

cs.CL ๐Ÿ› NeurIPS ๐Ÿ“š 166.0K cites 8 years ago