Cosmicflows-4 Compilation
- Cosmicflows-4 is the largest catalog of galaxy distances and peculiar velocities, unifying eight measurement methods through robust Bayesian calibration.
- It synthesizes photometric, spectroscopic, and geometric indicators to construct a high-fidelity 3D map of the local universe’s velocity and density fields.
- The compilation enables precise cosmological tests, yielding accurate measurements of the Hubble constant, bulk flows, and structure formation.
The Cosmicflows-4 (CF4) compilation is the largest and most comprehensive catalog of galaxy distances and inferred peculiar velocities assembled to date, comprising approximately 56,000 galaxies grouped into 38,000 systems. It synthesizes photometric, spectroscopic, and geometric distance indicators to construct a high-fidelity three-dimensional map of the local universe's velocity and density fields, enabling precise cosmological tests and measurements of the Hubble constant and bulk flows. CF4 is the result of systematic data acquisition from multiple international radio and optical surveys, advanced calibration methods, and robust Bayesian statistical merging.
1. Dataset Structure and Composition
CF4 compiles individual distances for 55,877–56,000 galaxies (depending on the precise cut and grouping protocol) merged into 38,000–38,065 groups, using eight primary methodologies. The catalog brings together three principal subsamples:
- “Others” (CF2-like): ~23,000 groups at , distributed nearly isotropically outside the Zone of Avoidance; includes classical Tully–Fisher (TF), Fundamental Plane (FP), and smaller surveys.
- 6dFGS: ~7,500 groups at ; dominant in the Southern Galactic hemisphere.
- SDSS: ~7,000 groups at ; confined to the Northern Galactic hemisphere.
The catalog relies on the following primary tracers and distance methodologies:
| Method | Number of Galaxies | Typical Distance Uncertainty |
|---|---|---|
| TF/BTFR | 12,412 / 9,967 | 0.3–0.4 mag (15–20%) |
| Fundamental Plane (FP) | 42,223 | 0.3 mag (15%) |
| SN Ia | 1,008 | 0.1–0.15 mag (5–7%) |
| SBF | 480 | 0.05 mag (SBF method) |
| SN II | 94 | 0.3 mag (15%) |
| TRGB | 489 | — |
| Cepheid | 76 | — |
| Maser | 6 | — |
The sample extends out to km s (), with the SN Ia subsample reaching . Typical individual distance errors for galaxies are , corresponding to km s for .
2. Distance Indicators and Calibration Protocols
The absolute zero-point for all distance measurements is anchored by local geometric methods—chiefly Cepheid period–luminosity relations, Tip of the Red Giant Branch (TRGB) in the color-corrected -band, and water maser distances to NGC 4258. Cross-calibration across methodologies leverages group overlaps using Bayesian Markov Chain Monte Carlo (MCMC) techniques, marginalizing over intrinsic zero-point offsets.
Tully–Fisher and Baryonic Tully–Fisher
For spirals, the TF relation in a given band is parameterized as
where is the inclination-corrected HI profile width, and , are empirically fit. Extensions to the Baryonic TF Relation (BTFR) incorporate the sum of stellar and gas mass:
Calibration employs a two-step process: (1) cluster regression for the TF/BTFR slope and (2) zero-point alignment using 64 galaxies with independent Cepheid and/or TRGB distances, referenced to LMC and NGC 4258 (Kourkchi et al., 2022, Kourkchi et al., 2020, Kourkchi et al., 2020).
Fundamental Plane (FP)
For early-type galaxies, the FP in SDSS bands is formulated as
with the effective radius, the velocity dispersion, and the mean surface brightness. Uncertainties are controlled via strict photometric and kinematic cuts (Tully et al., 2022).
Other Methods
- SN Ia: Standardized using light-curve shape and color corrections, yielding uncertainties.
- Supernova II: Standardized through expansion velocity and color.
- SBF: I-band and IR HST calibrations yield mag precision.
All methodologies are merged onto a common scale by maximizing the joint likelihood over group/distance correlations, with zero-point priors sampled in the MCMC merging (Tully et al., 2022).
3. HI Data Acquisition, Linewidths, and Sample Selection
The backbone of TF distances is the All-Digital HI Catalog, aggregating 21 cm width measurements from Parkes, Green Bank Telescope (GBT), and Arecibo (ALFALFA), uniformized through a common reduction pipeline.
Key HI width parameter:
- : Width at 50% of mean flux within the 90% flux window.
- Standardization through instrumental, redshift, and turbulence corrections yields , corrected for inclination as .
Inclinations are determined via the Galaxy Inclination Zoo (GIZ), a crowdsourced visual classification system, minimizing systematic bias and reducing error floors to 1°–5°, depending on (Kourkchi et al., 2020, Dupuy et al., 2021, Courtois et al., 2014).
Selection criteria for TF inclusion: S/N 10, km s in , , morphologically classified as Sa or later, and well-determined photometric parameters. ALFALFA widths are harmonized as km s. Internal extinction corrections exploit both parametric and machine learning (random forest) models, incorporating colors and surface-brightness.
4. Grouping Algorithms and Bayesian Merging
Galaxies are assigned to groups or clusters via friends-of-friends and virial scaling (using ), providing a robust statistical basis for cross-method calibration and error suppression.
Bayesian inference treats each methodology's zero-point as a free parameter; the likelihood for group having distances from method is
with the posterior sampled by the \texttt{emcee} implementation of affine-invariant MCMC (Tully et al., 2022).
Group merges are essential for suppressing non-linear virial motions, reducing peculiar-velocity noise, and leveraging overlaps in hybrid clusters (e.g., Coma: FP, TF, 7 SN Ia).
5. Velocity Field Reconstruction and Statistics
Peculiar velocities are extracted for groups via
with the relativistic correction, or with alternative logarithmic formulations at large .
Reconstruction of the 3D density () and velocity () fields utilizes Bias Gaussianization correction (BGc) to address the lognormal error distribution of distances. After Gaussianizing, the Wiener Filter (WF) and Constrained Realizations (CRs) deliver minimum-variance estimates:
with the prior drawn from the linear CDM power spectrum.
Key statistics:
- Mean overdensity:
- Bulk velocity:
CF4 finds and are within of cosmic variance using “Others” alone. Inclusion of the 6dFGS introduces a bulk flow excess at Mpc, aligning mostly with the Supergalactic X axis (Shapley Concentration), and a underdensity at Mpc (Hoffman et al., 2023).
6. Cosmological Results and Implications
The joint dataset yields the following key cosmological measures:
- Hubble constant (): (stat) (sys) km s Mpc (global CF4), with internal consistency among TF, BTFR, and SN Ia—e.g., TF: (stat) km s Mpc; BTFR: (Kourkchi et al., 2022, Kourkchi et al., 2020, Tully et al., 2022).
- Large-scale bulk flow: At Mpc, km s, directed toward the Sloan Great Wall; tidal analysis finds km s, indicating of CMB-frame motion arises from external structures (Hoffman et al., 2023, Courtois et al., 2022).
- Structure growth (): (grouped), (ungrouped); SN Ia: (Courtois et al., 2022).
- The inferred velocity field exhibits moderate – excess for bulk flows on $150$– Mpc scales, but within plausible cosmic-variance fluctuations of CDM.
7. Significance and Applications
CF4 provides the densest grid of peculiar velocities and distances to date, enabling rigorous mapping and analysis of large-scale flows, bulk motions, density monopoles, and comparisons against cosmological simulations. The systematic merging of eight distance indicators, anchored by geometric calibrations and robust group assembly, establishes a low-bias, high-precision foundation for cosmic velocity field studies and presents vital empirical constraints on and structure formation.
CF4 data products underpin analyses of gravitational basins (e.g., Laniakea), statistical isotropy, and tests for tension with CDM, facilitating critical investigations into both local and global cosmological parameters (Hoffman et al., 2023, Courtois et al., 2022, Tully et al., 2022).