Papers
Topics
Authors
Recent
Search
2000 character limit reached

GWcosmo Pipeline for Cosmology Inference

Updated 6 February 2026
  • GWcosmo pipeline is a computational framework that integrates gravitational-wave data with galaxy and galaxy-cluster catalogs to infer cosmological parameters such as the Hubble constant H0.
  • It employs a Bayesian hierarchical model and advanced completeness corrections to build accurate line-of-sight redshift priors from EM surveys, enhancing cosmological inferences.
  • The pipeline is modular and extensible, facilitating robust cross-correlation with cosmological backgrounds and future integration of diverse survey data.

The GWcosmo pipeline is a set of methods and computational tools for inferring cosmological parameters, especially the Hubble constant H0H_0, by combining gravitational-wave (GW) observations with electromagnetic (EM) galaxy or galaxy-cluster catalogs. Originating with works by Gray et al., its scope includes both traditional “dark siren” cosmology (where host redshifts are estimated statistically) and more recent extensions incorporating galaxy cluster information, robust catalogue completeness estimates, and cross-correlation with cosmological backgrounds. The pipeline is architected to handle the selection effects, completeness limits, and likelihood formulations that arise in GW cosmology with incomplete or multi-resolution EM datasets, achieving competitive constraints on H0H_0 through hierarchical Bayesian inference and advanced catalogue handling (Datrier et al., 20 Feb 2025, Beirnaert et al., 20 May 2025, Schulze et al., 2023).

1. Core Bayesian Framework and Likelihood Construction

GWcosmo adopts a Bayesian hierarchical model targeting inference on cosmological parameters (H0H_0, extensions to ΩM\Omega_M, w0w_0, modified gravity). For a set of NN GW events with strain data {di}\{d_i\}, the central posterior factorizes as:

p(H0,{zi}{di},C)p(H0)i=1Np(dizi,H0)p(ziC)×1S(H0)p(H_0, \{z_i\} | \{d_i\}, \mathcal{C}) \propto p(H_0) \prod_{i=1}^{N} p(d_i|z_i,H_0) p(z_i|\mathcal{C}) \times \frac{1}{S(H_0)}

  • p(H0)p(H_0): user-prior over H0H_0
  • p(diz,H0)p(d_i|z, H_0): single-event GW likelihood, recast from distances to redshifts via luminosity distance dL(z,H0)d_L(z, H_0)
  • p(zC)p(z|\mathcal{C}): line-of-sight redshift prior from EM catalogue(s) C\mathcal{C}
  • S(H0)S(H_0): selection-function normalization, accounting for cosmology-dependent GW detectability

Marginalizing over the individual source redshifts leads to

p(H0{di},C)p(H0)i=1N[0dzp(diz,H0)p(zC)]/S(H0)Np(H_0|\{d_i\}, \mathcal{C}) \propto p(H_0) \prod_{i=1}^{N} \Bigg[ \int_0^\infty dz\, p(d_i|z,H_0)\, p(z|\mathcal{C}) \Bigg] / S(H_0)^N

This formalism is adaptable—handling cluster catalogs, mixed catalogs, or hypothetical future catalogs by adjusting p(zC)p(z|\mathcal{C}) and the completeness model (Beirnaert et al., 20 May 2025).

2. EM Catalog Integration: Galaxy and Cluster Priors

Initially, GWcosmo utilized full-sky galaxy catalogs (e.g., GLADE, GLADE+) to supply p(zC)p(z|\mathcal{C}) by associating candidate galaxies (with photometric/spectroscopic zz) to GW localization regions and marginalizing over possible hosts. With the introduction of robust completeness limits, the pipeline discards galaxies beyond a magnitude threshold mthrm_\text{thr}, determined per-sky-pixel, for assigning host probabilities (Datrier et al., 20 Feb 2025).

Recent advancements adapt the pipeline to leverage galaxy cluster catalogs such as PSZ2 (Planck) and eRASS (eROSITA). Here, a “ClusterCatalogue” class constructs the LOS-zz prior as a sum over clusters in each GW localization pixel, weighting by cluster mass and convolving with zz-uncertainties (typ. Gaussian, width via physical cluster size). For sky areas or redshift ranges where the catalog is incomplete, the missing prior is supplied from a Press–Schechter mass function, normalized by measured completeness curves. The full prior is:

p(zΩi,Λ,s)=1p(sΩi,Λ)[p(CΩi,Λ)fin(z)+p(C~Ωi,Λ)fout(z)]p(z|\Omega_i, \Lambda, s) = \frac{1}{p(s|\Omega_i, \Lambda)} \left[ p(C|\Omega_i, \Lambda) f_\text{in}(z) + p(\tilde{C}|\Omega_i, \Lambda) f_\text{out}(z) \right]

with finf_\text{in} and foutf_\text{out} built as in (Beirnaert et al., 20 May 2025).

3. Statistical Treatment of Catalogue Completeness

GWcosmo initially used a “median-magnitude” heuristic to estimate catalogue completeness, often discarding a significant fraction of faint galaxies even when well detected. The robust method introduced in (Datrier et al., 20 Feb 2025) operationalizes the statistical test of Rauzy (2001):

  • For each galaxy kk, the rank-ratio statistic ζk=rk/(nk+1)\zeta_k = r_k / (n_k+1) is constructed, where rkr_k (resp. nkn_k) is the count of galaxies with lower μ\mu and M<MkM<M_k (resp. MMlim(μk)M\leq M_\text{lim}(\mu_k)).
  • Under the null hypothesis (catalogue complete to mlimm_\text{lim}), ζk\zeta_k are i.i.d. uniform[0,1][0,1].
  • The global TC(mlim)=k(ζk1/2)/kVkT_C(m_\text{lim}) = \sum_k(\zeta_k-1/2)/\sqrt{\sum_k V_k} is tracked; the threshold mthrm_\text{thr} is where TCT_C first drops below 3-3.
  • No galaxy luminosity function is assumed; only spatial stationarity within band/redshift is needed.
  • Implementation subsamples pixels for speed (e.g., N=400N=400 galaxies per B-band HEALPix pixel, averaged over 30 runs), incorporates photometric redshift uncertainties, and produces a per-pixel mthr(Ω)m_\text{thr}(\Omega) map.

The pipeline then integrates this limit into the likelihood, discarding galaxies with m>mthr(Ω)m > m_\text{thr}(\Omega) and modifying both discrete sum and missing-mass (out-of-catalog) terms.

4. Pipeline Modularity and Workflow

The “gwcosmo” pipeline is structured for extensibility:

  • Inputs: GW parameter-estimation posteriors (luminosity distance, sky position, mass, spins), EM catalogs (galaxy or cluster, with magnitude, zz, mass).
  • Pixelation: Catalogs subdivided into HEALPix pixels (Nside=32N_\text{side}=32 typical).
  • Completeness: Per-pixel mthrm_\text{thr} map computed and cached.
  • Likelihood Evaluation: For each GW event, the pipeline reads the mthrm_\text{thr} map, constructs LOS-zz prior, and evaluates the GW likelihood, including completeness corrections. For clusters, additional Press–Schechter terms supplement out-of-catalog regions.
  • Inference: The core sampling engine is unchanged, except for reading in updated priors and completeness limits; base posterior sampling over H0H_0 and ancillary parameters proceeds as standard.

Integration with external frameworks—such as using GW_CLASS and MontePython for anisotropy studies or joint GW+CMB cosmology—is handled via standard data flows and output chaining (Schulze et al., 2023).

5. Quantitative Performance and Improvements

Application of these methods has resulted in quantifiable gains in cosmological parameter inference:

  • For GLADE+ (BJ_J-band), robust completeness cuts from the Rauzy test resulted in an 8.6%8.6\% narrower H0H_0 credible interval for dark sirens and 6.3%6.3\% narrower interval when combining with GW170817, compared to previous median-cut approaches (Datrier et al., 20 Feb 2025).
  • Using cluster catalogs (PSZ2 and eRASS) rather than galaxies, the interval width on H0H_0 improved from approximately 22kms1Mpc122 \,\text{km}\,\text{s}^{-1}\,\text{Mpc}^{-1} (GLADE+) to 20kms1Mpc120 \,\text{km}\,\text{s}^{-1}\,\text{Mpc}^{-1} (PSZ2, 10%10\% gain) and 16kms1Mpc116 \,\text{km}\,\text{s}^{-1}\,\text{Mpc}^{-1} (eRASS, 38%38\% gain), attributable to the deeper redshift reach of modern cluster catalogs (Beirnaert et al., 20 May 2025).
  • For GWTC-3 BBH events in K-band (GLADE+), both median and robust methods yielded no improvement due to catalog incompleteness at the relevant distances: posteriors reverted to empty-catalogue limits with <1%<1\% difference (Datrier et al., 20 Feb 2025).

The table below summarizes relevant H0H_0 uncertainties (rounded; see original works for details):

Catalog/Band Method H0H_0 Posterior Width (km/s/Mpc) Gain vs. Median
GLADE+, BJB_J Median +41.8/22.7+41.8/-22.7 --
GLADE+, BJB_J Robust +35.1/23.8+35.1/-23.8 8.6%8.6\%
PSZ2 Cluster +10/10+10/-10 10%10\%
eRASS Cluster +8/8+8/-8 38%38\%

6. Selection Effects, Systematics, and Future Prospects

Selection function normalization S(H0)S(H_0) is included in all likelihoods, reflecting the joint sensitivity to changes in H0H_0 and event detection probability. GWcosmo pipelines also adjust for sky coverage of catalogs (PSZ2: 83.6%83.6\%, eRASS: 31.8%31.8\%), redshift uncertainty models (e.g., Gaussian for clusters, scatter from physical size or X-ray scaling), and EM survey incompleteness via analytic or empirical completeness curves.

Planned extensions include:

  • Combining both galaxy and cluster catalogues such that well-mapped low-zz galaxies and high-zz clusters provide joined p(zC)p(z|\mathcal{C}) support.
  • Adding a term to the LOS–z prior accounting for mergers outside clusters (voids/filaments), as neither catalog necessarily captures all hosts.
  • Generalizing completeness methodology to accommodate both faint and bright magnitude limits (e.g., for DESI, LSST).
  • Adapting to multi-parameter cosmology inference (ΩM\Omega_M, w0w_0, modified gravity), achievable by replacing H0H_0-only grids with multidimensional cosmology grids.

Robustness checks—varying input mass-function parameters, redshift scatter, completeness curves—have consistently shown negligible impact on H0H_0 posteriors.

A complete end-to-end mock data challenge, with known injected cosmological parameters, is recommended before application to the next-generation observing runs.

7. Integration with Cosmological GW Background Studies

The pipeline can be embedded in broader frameworks, connecting GW event-based inference with studies of cosmological GW backgrounds (CGWB) and their cross-correlation with CMB anisotropies (Schulze et al., 2023). The mathematical structure allows for inclusion of CGWB angular power spectra CGWC_\ell^{\text{GW}}, cross-spectra with CMB temperature CGW×CMBC_\ell^{\text{GW}\times\text{CMB}}, and computation of projected constraints with joint CMB + GW datasets. A modified version of the CLASS Boltzmann solver (GW_CLASS) computes these observables, and interfacing with MontePython enables joint posterior sampling over both conventional and GW-specific cosmological parameters.

The GWcosmo pipeline thus constitutes a central computational framework in the emerging field of gravitational-wave cosmology, capable of adapting to new catalogs, survey depths, and parameter inference challenges as the capabilities of both GW detectors and EM surveys expand.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to GWcosmo Pipeline.