Addressing the long‑tail problem in data‑driven astronomy models
Develop robust techniques to handle non‑Gaussian, long‑tailed distributions and out‑of‑distribution samples in the OJALA transformer‑based autoregressive model trained on J-PAS narrow‑band photometry, so that performance does not degrade for edge‑of‑parameter‑space objects such as extreme emission‑line galaxies.
References
Addressing this long tail problem remains an open challenge in data-driven astronomy.
— OJALÁ: Optimizing J-PAS Astronomy for Large-scale Analysis. A foundation model for the SED of galaxies, QSOs and stars
(2604.00661 - Martínez-Solaeche et al., 1 Apr 2026) in Section 6 (Discussion)