Papers
Topics
Authors
Recent
Search
2000 character limit reached

SWEET-Cat & MAISTEP: Stellar Host Analysis

Updated 23 January 2026
  • SWEET-Cat and MAISTEP are complementary tools that deliver high-precision stellar parameters crucial for exoplanet demographic studies.
  • SWEET-Cat provides uniform spectroscopic measurements while MAISTEP uses grid-based machine learning to infer stellar masses, radii, and ages with robust uncertainty estimates.
  • Their combined application refines exoplanet population analysis, such as the radius valley, supporting insights into atmospheric evolution and host star demographics.

SWEET-Cat is a homogeneous, continually maintained spectroscopic catalog of stellar parameters for planet-hosting stars, while MAISTEP is a grid-based machine learning tool for inferring fundamental stellar properties—including radius, mass, and age—from atmospheric observables. Used in combination, these resources enable precise and consistent host-star characterization, which is essential for exoplanet demographic studies, such as resolving the occurrence and parameter dependence of the exoplanet radius valley. Below follows a comprehensive overview of their construction, methodology, and scientific impact.

1. SWEET-Cat: Homogeneous Spectroscopic Stellar Parameters

SWEET-Cat (Stars With ExoplanETs Catalogue) provides a high-precision, uniform database of key spectroscopic parameters (effective temperature TeffT_{\mathrm{eff}}, logarithmic surface gravity logg\log g, and metallicity [Fe/H]) and masses for stars known to host exoplanets (Santos et al., 2013, Kamulali et al., 18 Jan 2026). The catalog is designed to enable robust statistical studies of planet populations by minimizing heterogeneity in host-star properties, which can otherwise hinder cross-sample comparisons or demographic inferences.

Principles and Methodology

  • Target Selection: Stars identified in exoplanet.eu as exoplanet hosts by radial velocity, transit, or astrometry methods; direct-imaging, microlensing, and degenerate-timing hosts are typically excluded.
  • Spectral Types: Primary focus on FGK dwarfs and subgiants, but includes cool giants, evolved stars, and M dwarfs (the latter via empirical color or index calibrations).
  • Spectroscopic Pipeline: Atmospheric parameters and masses are derived under LTE using:
    • Automatic Equivalent Width (EW) measurement from high-RR, high-S/N spectra (e.g., ARES code).
    • Forced excitation/ionization balance via MOOG with ATLAS9 model atmospheres.
    • Key equilibrium conditions (in LaTeX):
    • Excitation equilibrium: A(FeI)χ=0\frac{\partial\,\overline{A(\mathrm{Fe\,I})}}{\partial\chi} = 0
    • Microturbulence equilibrium: A(FeI)log(EW/λ)=0\frac{\partial\,\overline{A(\mathrm{Fe\,I})}}{\partial\log(\mathrm{EW}/\lambda)} = 0
    • Ionization equilibrium: A(FeI)=A(FeII)\overline{A(\mathrm{Fe\,I})} = \overline{A(\mathrm{Fe\,II})}
  • Mass Determination: Initial mass estimates via the Torres relation,

logMT=a1+a2logTeff+a3logg+a4[Fe/H]\log M_{T} = a_1 + a_2\log T_{\rm eff} + a_3\log g + a_4\,[\mathrm{Fe/H}]

corrected to Padova isochrone scale:

Miso=0.791MT20.575MT+0.701M_{\rm iso} = 0.791\,M_T^2 - 0.575\,M_T + 0.701

Uncertainties are propagated through Monte Carlo sampling.

  • Validation: Cross-matching with fundamental scale benchmarks (asteroseismology, interferometry), and analysis of systematics across literature catalogs.

Catalog Content and Uncertainties

The database (currently covering >>4200 planet hosts) provides the following fields per star: identifiers, coordinates, VV magnitude, parallax, TeffT_{\mathrm{eff}}, logg\log g, microturbulence, [Fe/H], stellar mass, error estimates, and provenance. Typical errors:

2. MAISTEP: Grid-Based Machine Learning Stellar Parameter Inference

MAISTEP (Machine learning Algorithm for Inferring STEllar Parameters) is a tool that infers stellar radius, mass, and age using only atmospheric inputs—TeffT_{\mathrm{eff}}, [Fe/H], and luminosity LL—meeting the need for robust parameter estimation when asteroseismic constraints are unavailable (Kamulali et al., 4 Feb 2025, Kamulali et al., 18 Jan 2026).

Model Construction

  • Training Data: Synthetic data from MESA evolutionary tracks spanning 0.70M/M1.600.70 \leq M_\star/M_\odot \leq 1.60, 0.50[Fe/H]+0.50-0.50 \leq \mathrm{[Fe/H]} \leq +0.50, ΔY/ΔZ=0.4\Delta Y/\Delta Z = 0.4 to $2.4$, from ZAMS to core-H exhaustion. Sampling along each track is optimized for uniform XcX_c coverage.
  • Features: Raw TeffT_{\mathrm{eff}}, [Fe/H], and LL (derived via Gaia DR3 parallax, GG magnitude, bolometric corrections, and extinction), directly ingested by tree-based algorithms without additional scaling.

Machine Learning Architecture

Four base regressors—Random Forest (RF), Extra Trees (XT), XGBoost, and CatBoost—are independently optimized (Optuna, 50 trials, 10-fold cross-validation) and stacked via non-negative least squares combination: y^i=j=14βjpij,βj0,βj=1\hat{y}_i = \sum_{j=1}^4 \beta_j\,p_{ij}, \hspace{6pt} \beta_j \geq 0, \sum \beta_j = 1 with pijp_{ij} the prediction of regressor jj on sample ii.

Uncertainty Estimation

Given observational errors in TeffT_{\mathrm{eff}}, [Fe/H], and LL, each star is realized 10,000 times in a Monte Carlo ensemble; the median and quartile-based bounds of the resulting parameter distributions define the output value and uncertainty.

Performance Metrics

Validation against asteroseismic samples (APOKASC, LEGACY) demonstrates negligible bias and competitive scatter:

  • Radius: 0.5%-0.5\% (bias), 5%5\% (scatter) vs. APOKASC; 0.2%-0.2\%, 2%2\% vs. LEGACY
  • Mass: 6%6\%, 5%5\% (APOKASC); 2%-2\%, 3%3\% (LEGACY)
  • Age: 9%-9\%, 16%16\% (APOKASC); 7%7\%, 23%23\% (LEGACY) (Kamulali et al., 4 Feb 2025)

3. SWEET-Cat and MAISTEP: Combined Application for Exoplanet Science

The power of SWEET-Cat and MAISTEP emerges most strikingly in joint applications, where homogeneous spectroscopy feeds into MAISTEP's inference framework for host-star parameters, which are then propagated into planetary property determination and exoplanet population studies (Kamulali et al., 18 Jan 2026).

Workflow Integration

  • Data Matching: SWEET-Cat’s TeffT_{\mathrm{eff}} and [Fe/H], combined with Gaia luminosities, are supplied to MAISTEP. For any existing SWEET-Cat table, ages and radii can be appended via a database join on star identifier or coordinates.
  • Propagation: The uncertainties on input parameters from both pipelines can be combined to produce robust final confidence intervals.

Case Study: The Exoplanet Radius Valley

In (Kamulali et al., 18 Jan 2026), SWEET-Cat and MAISTEP are used to recompute radii, masses, and ages for over 1,200 main-sequence planet-hosting stars, producing a sample for refined analysis of the "radius valley"—a deficit of planets near 2R2\,R_\oplus thought to demarcate the transition between super-Earths and sub-Neptunes. With stellar radii determined to 2%2\% precision and propagating to 5%5\% in RpR_p, the valley is revealed as deeper and partially filled, with trends quantified as follows:

  • Period dependence: αrv=0.120.01+0.02\alpha_{rv} = -0.12^{+0.02}_{-0.01} in log10(Rp)\log_{10}(R_p) versus log10(P)\log_{10}(P)
  • Flux dependence: αrv=0.100.03+0.02\alpha_{rv} = 0.10^{+0.02}_{-0.03}
  • Stellar mass dependence: Slope 0.190.07+0.090.19^{+0.09}_{-0.07} (stronger for sub-Neptunes)
  • Age dependence: NE/NNN_E/N_N (super-Earth to sub-Neptune ratio) rises from 0.510.08+0.110.51^{+0.11}_{-0.08} for <3<3 Gyr to 0.640.11+0.110.64^{+0.11}_{-0.11} for 3\geq 3 Gyr; the valley becomes broader and shifts to larger RpR_p with increasing age, consistent with gradual atmospheric loss.

A multidimensional fit yields: log10(Rp)=αlog10P+βlog10M+γlog10τ+δ\log_{10}(R_p) = \alpha\log_{10}P + \beta\log_{10}M_\star + \gamma\log_{10}\tau + \delta with γ=0.070.04+0.03\gamma = 0.07^{+0.03}_{-0.04}, showing the age trend is weaker but significant (Kamulali et al., 18 Jan 2026).

4. Scientific Implications of Precision Host-Star Characterization

The joint use of SWEET-Cat and MAISTEP has direct impact on key exoplanetary problems:

  • Atmospheric loss diagnostics: Trends with period, flux, and mass support both photoevaporation and core-powered mass loss as valley-sculpting processes. The presence of gigayear-scale valley evolution and rising NE/NNN_E/N_N with age strongly suggests an important role for long-lived envelope loss mechanisms, favoring core-powered mass loss (Kamulali et al., 18 Jan 2026).
  • Exoplanet demographics: Enhanced precision in host properties enables detection of subtle demographic features, resolving previous ambiguities due to parameter systematic errors.
  • Giant-planet host ages: Application of MAISTEP to Jupiter-mass planet hosts reinforces prior findings: hosts of hot Jupiters are statistically younger (median 1.98 Gyr) than those of warm or cold Jupiters (medians 2.98 and 3.51 Gyr, respectively), consistent with tidal decay and evolutionary scenarios (Kamulali et al., 4 Feb 2025).

5. Future Directions and Expansion

Continued growth and refinement of SWEET-Cat—especially as new high-resolution spectroscopic surveys expand the catalog—and enhancements in MAISTEP’s evolutionary grid and ML architecture are expected. Further improvement in precision and age accuracy is anticipated from upcoming missions such as PLATO, which will yield direct age and radius benchmarks via asteroseismology across broader stellar populations (Kamulali et al., 18 Jan 2026).

A plausible implication is that future studies leveraging SWEET-Cat and MAISTEP will provide increasingly robust constraints on exoplanet formation and evolution scenarios, as systematic uncertainties in host parameters are suppressed to the few-percent level at scale.


References:

  • "MAISTEP -- a new grid-based machine learning tool for inferring stellar parameters I. Ages of giant-planet host stars" (Kamulali et al., 4 Feb 2025)
  • "Revisiting the exoplanet radius valley with host stars from SWEET-Cat" (Kamulali et al., 18 Jan 2026)
  • "SWEET-Cat: A catalogue of parameters for Stars With ExoplanETs I. New atmospheric parameters and masses for 48 stars with planets" (Santos et al., 2013)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to SWEET-Cat and MAISTEP Tool.