Ray Regression Predictor Model
- Ray Regression Predictor is a regression methodology for positive real-valued responses that extends the Rayleigh distribution framework with generalized techniques.
- It leverages bias-adjusted maximum likelihood estimation using methods such as Firth’s and Cox & Snell corrections to improve parameter estimation.
- Applications include SAR imaging amplitude analysis and biomedical survival studies, using flexible link functions to capture covariate effects.
A Ray Regression Predictor is a regression-based methodology built on parameterizations and extensions of the Rayleigh and generalized Rayleigh (GR) distributions. It enables predictive modeling for positive real-valued responses, particularly in contexts such as amplitude data in synthetic aperture radar (SAR) imaging and time-to-event biomedical applications. Two principal frameworks are established: the standard Rayleigh regression model with bias-adjusted estimation, and an extended four-parameter Rayleigh regression based on the generalized odd log-logistic generalized Rayleigh (GOLLGR) law, with link functions mapping covariates to distribution parameters and enabling prediction analysis through maximum likelihood and advanced inference techniques (Cordeiro et al., 2022, &&&1&&&).
1. Foundations of Rayleigh Regression
The standard Rayleigh density for a response with scale (mean) parameter is
%%%%2%%%%
with and . Regression is constructed by linking for sample to covariates via a strictly monotone, twice-differentiable link function , typically the log-link :
where is the regression coefficient vector. This regression structure captures systematic effects of covariates on the Rayleigh parameters, supporting modeling of various physical or biomedical phenomena (Palm et al., 2022).
2. Extended and Generalized Rayleigh Predictor: GOLLGR Model
The GOLLGR distribution generalizes the GR law by incorporating two additional shape parameters (, ) using the odd-log-logistic generator. The GR model of Vodă (1976) for , , has
where is the normalized lower-incomplete gamma ratio. The GOLLGR cumulative distribution function and density for are:
Special cases include: GR (), odd-logistic GR (), and exponentiated GR (). Parameters , , , can be linked to covariates via log-linear link functions,
admitting a flexible regression structure for lifetime or amplitude data (Cordeiro et al., 2022).
3. Inference, Estimation, and Bias Correction
3.1 Maximum Likelihood Estimation
The log-likelihood for observed responses with corresponding covariates is:
The score vector is
and Fisher information is
Global maximization is achieved using iterative methods such as Fisher scoring and, for GOLLGR, Newton–Raphson or BFGS applied to the parameterized log-likelihood (Cordeiro et al., 2022, Palm et al., 2022).
3.2 Analytical and Numerical Bias Adjustment
MLE estimates of exhibit bias, especially in small-sample SAR windows. Three correction schemes are implemented:
- Cox & Snell’s Analytical Correction:
and the bias-adjusted estimator is .
- Firth’s Penalized-Likelihood Correction:
The score is augmented by , with estimation via Fisher scoring for reduced bias.
- Parametric Bootstrap:
Multiple Rayleigh samples are simulated at , yielding bootstrap-corrected estimates .
Bias correction is crucial for SAR pixel windows ($9$–$49$ pixels), with Firth’s method offering optimal bias–variance trade-off in empirical tests (Palm et al., 2022).
4. Prediction, Confidence, and Interval Construction
4.1 Quantile and Mean Prediction
For any new covariate vector , construct , then predict:
- Mean:
- Median: In Rayleigh regression, ; in GOLLGR, the quantile function inverts the CDF:
for arbitrary quantiles (Cordeiro et al., 2022).
4.2 Confidence and Prediction Intervals
- Confidence Intervals: For , delta-method yields ; standard 95% intervals are constructed as .
- Prediction Intervals: Combine the fitted variance with model variance, i.e., for Rayleigh, . For GOLLGR, intervals can be constructed using plug-in quantiles or empirical percentiles from bootstrap simulations of (Cordeiro et al., 2022, Palm et al., 2022).
5. Applications and Empirical Results
Rayleigh regression predictors have been applied to amplitude modeling in SAR images and survival analysis in COVID-19 patient datasets. In SAR applications, the regression structure with covariate-dependent mean provides a model for pixel intensity variation; bias corrections are particularly effective in small pixel windows, with Firth’s correction demonstrating favorable properties (Palm et al., 2022). In biomedical survival analysis, the GOLLGR extension enables nuanced modeling of lifetimes using covariates such as age and comorbidity indicators. Implementation in R’s gamlss framework reports highly significant covariate effects, strong model selection preference (AIC/BIC), and accurate quantile residual diagnostics. Individual prediction for new cases (e.g., given age and diabetes status) is supported using plug-in median and interval estimates, enabling patient-level prognostication (Cordeiro et al., 2022).
6. Algorithmic Implementation and Practical Guidance
Implementation of Ray Regression Predictors proceeds via iterative estimation algorithms:
- Fisher scoring for MLE:
- Penalized scoring for Firth’s method
- Convergence criterion: , for example,
- Regression fitting: R’s nlm(), gamlss(), SAS PROC NLMIXED accept the required likelihood and gradients
- Bias correction selection: plain MLE for ; Cox–Snell or Firth for ; bootstrap when a fully simulation-based adjustment is preferable
This methodology offers flexible and accurate predictive modeling for positive-valued data, robust under both substantial and limited sample regimes, with full support for model diagnostics, confidence assessment, and empirical application (Cordeiro et al., 2022, Palm et al., 2022).