2000 character limit reached
Oracle-Checker Scheme for Evaluating a Generative Large Language Model
Published 6 May 2024 in cs.CL | (2405.03170v1)
Abstract: This work presents a novel approach called oracle-checker scheme for evaluating the answer given by a generative LLM. Two types of checkers are presented. The first type of checker follows the idea of property testing. The second type of checker follows the idea of program checking. Their applications are demonstrated in two separate contexts, entity extraction and paraphrase decision, respectively.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.