Papers
Topics
Authors
Recent
Search
2000 character limit reached

Pre-Trained Language Models Augmented with Synthetic Scanpaths for Natural Language Understanding

Published 23 Oct 2023 in cs.CL | (2310.14676v1)

Abstract: Human gaze data offer cognitive information that reflects natural language comprehension. Indeed, augmenting LLMs with human scanpaths has proven beneficial for a range of NLP tasks, including language understanding. However, the applicability of this approach is hampered because the abundance of text corpora is contrasted by a scarcity of gaze data. Although models for the generation of human-like scanpaths during reading have been developed, the potential of synthetic gaze data across NLP tasks remains largely unexplored. We develop a model that integrates synthetic scanpath generation with a scanpath-augmented LLM, eliminating the need for human gaze data. Since the model's error gradient can be propagated throughout all parts of the model, the scanpath generator can be fine-tuned to downstream tasks. We find that the proposed model not only outperforms the underlying LLM, but achieves a performance that is comparable to a LLM augmented with real human gaze data. Our code is publicly available.

Citations (3)

Summary

No one has generated a summary of this paper yet.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.