Transferability of decomposed supervision to the authentic Japanese bar exam format
Determine whether large language models trained on decomposition-based datasets that reformulate Japanese bar examination multiple-choice questions into independent true/false statements (for example, the Japanese Bar Exam Question Answering (JBE-QA) paradigm) can successfully answer intact multiple-choice questions from the Japanese bar examination when evaluated under the original joint-proposition format and official scoring scheme requiring reasoning over multiple interacting propositions.
References
As a result, it remains unclear whether models trained under this paradigm can succeed when confronted with intact exam questions that require reasoning over multiple interacting propositions and adherence to the original scoring rules.
— Self-Verification is All You Need To Pass The Japanese Bar Examination
(2601.03144 - Shin, 6 Jan 2026) in Related Work, Section 2