Demographic Salience Score Overview

Updated 15 November 2025

Demographic Salience Score (DSS) is a metric family that quantifies the prominence and retention of demographic attributes in network data and generated summaries.
The framework employs entropy calculations, normalization, and entity matching to provide interpretable, mathematically grounded measures across identity-graph and LLM summary contexts.
Empirical analyses using DSS reveal significant discrepancies in demographic representation, offering a tool for diagnosing bias in social media follow patterns and biomedical summarization.

The Demographic Salience Score (DSS) is a family of metrics designed to quantify the prominence and retention of demographic characteristics within relational or generated data. It has been applied most notably to the analysis of social media follow patterns across identity dimensions (Fulay et al., 2023) and to the evaluation of demographic fidelity in LLM summaries of biomedical evidence (Aghaebe et al., 8 Nov 2025). DSS frameworks formalize salience as the degree to which a particular demographic dimension stands out or is preserved, offering directly interpretable measures for both internal (ego-centric) and external (audience-centric) perspectives, as well as for entity retention and hallucination within generative workflows.

1. Formalization and Mathematical Foundations

DSS is contextually instantiated for two distinct data modalities:

Identity-Graph DSS (Fulay et al., 2023):

Ego-centric DSS: For user $j$ and dimension $d$, $e_j^d$ is calculated to reflect the normalized salience of $d$ among $j$'s followees in set $K$ (influencers).
Audience-centric DSS: For influencer $k_i$, $a_i^d$ is defined as the mean $e_j^d$ over all audience members $j$ who follow $k_i$.

For categorical dimensions (e.g., race, gender), the process is:

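Count followee-category distributions:

$u_j^d = D^d f_j, \quad v_j^d = u_j^d / \|u_j^d\|_1$

Here $f_j$ is read as user $j$'s binary followee vector, taken from the followee matrix $F$, and $D^d$ as the $C_d \times |K|$ matrix assigning each influencer to a category of dimension $d$, so that $u_j^d$ holds per-category followee counts and $v_j^d$ the corresponding proportions.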

Calculate entropy:

$w_j^d = -\sum_{c=1}^{C_d} v_{j,c}^d \log v_{j,c}^d$

Z-normalization and negation:

$e_j^d = -\frac{w_j^d - \mu_d}{\sigma_d}$

where $\mu_d$ and $\sigma_d$ are the mean and standard deviation of $w_j^d$ over all $j$.
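
The three categorical steps above (counts, entropy, negated z-score) can be sketched in a few lines of standard-library Python. This is an illustrative sketch, not the authors' implementation: the toy users, the function names, and the use of the population standard deviation are assumptions, and every user is assumed to follow at least one influencer.

```python
import math
from statistics import mean, pstdev

def category_entropy(followee_categories, num_categories):
    """Shannon entropy (natural log) of one user's followee-category distribution."""
    counts = [0] * num_categories            # u_j^d: followees per category
    for c in followee_categories:
        counts[c] += 1
    total = sum(counts)                      # L1 norm of the count vector
    probs = [n / total for n in counts]      # v_j^d: normalized proportions
    # Empty categories contribute nothing (0 * log 0 is treated as 0).
    return -sum(p * math.log(p) for p in probs if p > 0)

def ego_dss(entropies):
    """Negated z-score: low entropy (concentrated follows) maps to high salience."""
    mu = mean(entropies)
    sigma = pstdev(entropies)  # assumption: population std; the text does not specify
    return [-(w - mu) / sigma for w in entropies]

# Hypothetical toy data: category index of each followee, for three users.
users = {
    "u1": [0, 0, 0, 0],   # all followees in one category
    "u2": [0, 0, 1, 1],   # split evenly across two categories
    "u3": [0, 1, 2, 0],   # spread across three categories
}
entropies = [category_entropy(f, num_categories=3) for f in users.values()]
scores = dict(zip(users, ego_dss(entropies)))
```

In this toy case `u1` receives the highest score and `u3` the lowest, which is the intended reading of the negation: the more concentrated a user's follows are within one category, the more salient the dimension.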

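For partially-tagged dimensions (e.g., religion, politics, LGBTQIA+), the entropy is replaced by a ratio, read here as the share of user $j$'s $n_j$ followees that carry a tag on dimension $d$ (a count written $t_j^d$):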

$w_j^d = t_j^d / n_j$

$e_j^d = -\frac{w_j^d - \mu_d}{\sigma_d}$
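
A minimal sketch of the tagged-dimension variant follows. It applies the two formulas exactly as written above, including the negation; the interpretation of $t_j^d$ as a tagged-followee count and $n_j$ as the total followee count, the toy numbers, and the function names are assumptions.

```python
from statistics import mean, pstdev

def tagged_ratio(num_tagged_followees, num_followees):
    """w_j^d = t_j^d / n_j: share of a user's followees tagged on dimension d."""
    return num_tagged_followees / num_followees

def z_negated(values):
    """Apply e = -(w - mu) / sigma exactly as the formula above is written."""
    mu, sigma = mean(values), pstdev(values)
    return [-(w - mu) / sigma for w in values]

# Hypothetical (t_j^d, n_j) pairs for three users.
pairs = [(1, 10), (5, 10), (9, 10)]
ratios = [tagged_ratio(t, n) for t, n in pairs]
scores = z_negated(ratios)
```

Under the formula as written, the user with the largest tagged share receives the lowest score. Anyone reusing this should check the sign convention for tagged dimensions against the source paper before relying on it.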

Entity-Retention DSS in Document Generation (Aghaebe et al., 8 Nov 2025):

Entity Retention Score (ERS):

$ERS = \frac{|Ent_{summary} \cap Ent_{gold}|}{|Ent_{gold}|}$

Hallucination Penalty (HP):

$HP = \frac{|Ent_{summary} \setminus Match(Ent_{gold})|}{|Ent_{summary}|}$

where $Match(Ent_{gold})$ includes entities matched by exact string or cosine-similarity $\geq \tau$.

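Over-length Penalty (OP), where $T_{gen}$ is the length of the generated summary and $T_{max}$ the length budget (the unit, presumably tokens, is not stated here):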

$OP = \max(0, T_{gen} - T_{max}) / T_{max}$

Adjusted Hallucination:

$HP' = HP + OP$

Raw DSS:

$DSS_{raw} = \alpha \cdot ERS - \gamma \cdot HP'$

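Normalized DSS (floored at zero):

$DSS = \max\left(0, \frac{DSS_{raw}}{\alpha N}\right)$

Parameters $\alpha, \gamma$ are tunable weights; $N$ normalizes across cases. Because $ERS \leq 1$ and $HP' \geq 0$, $DSS_{raw} \leq \alpha$, so the score lies in $[0,1]$ whenever $N \geq 1$.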
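
The entity-retention formulas above combine into a single function. The sketch below is an assumption-laden illustration rather than the paper's code: it defaults to exact-string matching (the similarity model behind the $\tau$ threshold is not specified, so matching is left as a pluggable, hypothetical `is_match` argument), uses $\alpha=\gamma=2$ and $N=1$ as defaults, and treats an empty summary as having zero hallucination.

```python
def exact_match(summary_entity, gold_entity):
    """Default matcher: exact string equality only."""
    return summary_entity == gold_entity

def entity_dss(summary_ents, gold_ents, gen_tokens, max_tokens,
               alpha=2.0, gamma=2.0, n=1.0, is_match=exact_match):
    """Return (ERS, HP, OP, DSS) for one summary, following the formulas above."""
    summary_ents, gold_ents = set(summary_ents), set(gold_ents)
    # ERS: share of gold entities that reappear verbatim in the summary.
    ers = len(summary_ents & gold_ents) / len(gold_ents)
    # HP: share of summary entities that match no gold entity.
    unmatched = [s for s in summary_ents
                 if not any(is_match(s, g) for g in gold_ents)]
    # Assumption: an empty summary has no hallucinated entities.
    hp = len(unmatched) / len(summary_ents) if summary_ents else 0.0
    # OP: relative overshoot beyond the length budget.
    op = max(0, gen_tokens - max_tokens) / max_tokens
    raw = alpha * ers - gamma * (hp + op)
    return ers, hp, op, max(0.0, raw / (alpha * n))

# Hypothetical entities for a child-focused review.
gold = ["children", "adolescents", "infants", "aged 5-12"]
summary = ["children", "adolescents", "infants", "prisoners"]

ers, hp, op, dss = entity_dss(summary, gold, gen_tokens=100, max_tokens=100)
_, _, op_long, dss_long = entity_dss(summary, gold, gen_tokens=200, max_tokens=100)
```

The first call retains three of four gold entities and adds one unmatched entity, giving a DSS of 0.5. In the second call the summary is twice the length budget, and the over-length penalty alone drives the raw score below zero, so the floor returns 0.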

2. Computational Pipeline and Workflow

Social Graph DSS (Fulay et al., 2023):

Step 1: Sample influencer set $K$, audience set $A$ (with minimum engagement threshold).
Step 2: Tag influencers via semi-automated intersection of Wikipedia categories and external lists for partially-tagged dimensions.
Step 3: Construct binary followee matrix $F$ by crawl/query.
Step 4: For each user and dimension:
- Categorical: build $u_j^d, v_j^d$, compute entropy $w_j^d$, z-score to $e_j^d$.
- Tagged: compute $w_j^d$ (ratio), z-score to $e_j^d$.
Step 5: For influencers, average audience $e_j^d$ to obtain $a_i^d$.
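
Step 5 is a grouped mean, which can be sketched as below. The follow relationships and ego-centric scores are hypothetical toy values, and the dictionary-based representation stands in for the followee matrix $F$.

```python
from statistics import mean

def audience_dss(followees_by_user, ego_scores):
    """a_i^d: mean ego-centric score e_j^d over the users who follow each influencer."""
    scores_by_influencer = {}
    for user, followees in followees_by_user.items():
        for influencer in followees:
            scores_by_influencer.setdefault(influencer, []).append(ego_scores[user])
    return {inf: mean(vals) for inf, vals in scores_by_influencer.items()}

# Hypothetical follow sets (user -> influencers followed) and ego-centric scores.
follows = {"u1": {"k1", "k2"}, "u2": {"k1"}, "u3": {"k2", "k3"}}
ego = {"u1": 1.0, "u2": -0.5, "u3": 0.25}

audience = audience_dss(follows, ego)
```

Here influencer `k1` is followed by `u1` and `u2`, so its audience-centric score is the mean of 1.0 and -0.5, that is 0.25.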

LLM Summary DSS (Aghaebe et al., 8 Nov 2025):

Step 1: Extract gold entities from reference abstracts (regex, LLM-assisted NER).
Step 2: Extract entities from generated summaries by identical pipeline.
Step 3: Compute ERS (retention), HP (hallucination), OP (overlength) per summary.
Step 4: Aggregate DSS scores per review and age stratum.
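
Step 4 can be read in more than one way. The sketch below assumes a macro-average: per-summary scores are first averaged within each review, and the review-level means are then averaged within each age stratum. The record layout and values are hypothetical, and each review is assumed to belong to a single stratum.

```python
from collections import defaultdict
from statistics import mean

def aggregate_dss(records):
    """Average per-summary DSS within each review, then across reviews per age stratum."""
    per_review = defaultdict(list)
    for age_group, review_id, dss in records:
        per_review[(age_group, review_id)].append(dss)

    review_means = {key: mean(vals) for key, vals in per_review.items()}

    per_stratum = defaultdict(list)
    for (age_group, _review_id), review_mean in review_means.items():
        per_stratum[age_group].append(review_mean)
    stratum_means = {group: mean(vals) for group, vals in per_stratum.items()}
    return review_means, stratum_means

# Hypothetical (age_group, review_id, dss) records, one per generated summary.
records = [
    ("Children", "rev1", 0.5), ("Children", "rev1", 0.75),
    ("Children", "rev2", 0.375),
    ("Adults", "rev3", 0.25),
]
review_means, stratum_means = aggregate_dss(records)
```

Averaging within reviews first keeps a review with many generated summaries from dominating its stratum; a pooled mean over all summaries would be an equally simple alternative.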

Both procedures are specified step by step so that they can be reproduced and scaled in automated pipelines.

3. Empirical DSS Analysis and Interpretation

Identity-Based DSS (Fulay et al., 2023)

Across $M=12{,}593$ influencers:

Race: Salience markedly higher in ego-profiles versus audience-centric profiles (mean difference $\approx +0.18$ z-units, $p\ll.01$).
Religion & Politics: Audiences display higher salience than influencers themselves (mean differences $\approx -0.08$ and $-0.12$, both $p\ll.01$).
Gender & LGBTQIA+: Pronounced right-skew observed; some influencer cliques (notably athletes) exhibit near-exclusive same-gender followership.

Significance is robust under bootstrap, paired t-test, Wilcoxon, and KS tests, with Bonferroni correction at significance level $\alpha=0.01$ (unrelated to the weight $\alpha$ in the entity-retention DSS).
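
Of the checks listed, the bootstrap and the Bonferroni adjustment are simple enough to sketch with the standard library. The example below is a generic percentile bootstrap on paired differences, not the authors' exact procedure; the scores, the resample count, and the choice of five tests in the Bonferroni family (one per identity dimension) are assumptions.

```python
import random
from statistics import mean

def paired_bootstrap_ci(ego, audience, n_resamples=2000, level=0.99, seed=0):
    """Percentile bootstrap interval for the mean paired difference (ego - audience)."""
    rng = random.Random(seed)                 # fixed seed for reproducibility
    diffs = [e - a for e, a in zip(ego, audience)]
    n = len(diffs)
    # Resample the paired differences with replacement and record each mean.
    means = sorted(mean(rng.choices(diffs, k=n)) for _ in range(n_resamples))
    lo_idx = int((1 - level) / 2 * n_resamples)
    hi_idx = int((1 + level) / 2 * n_resamples) - 1
    return means[lo_idx], means[hi_idx]

def bonferroni_threshold(alpha, num_tests):
    """Per-test significance level after Bonferroni correction."""
    return alpha / num_tests

# Hypothetical ego-centric and audience-centric scores for eight influencers.
ego = [0.3, 0.1, 0.25, 0.2, 0.15, 0.4, 0.05, 0.22]
audience = [0.0, -0.05, 0.1, 0.05, 0.0, 0.1, 0.0, 0.02]

low, high = paired_bootstrap_ci(ego, audience)
per_test_alpha = bonferroni_threshold(0.01, num_tests=5)
```

An interval that excludes zero supports a nonzero ego-versus-audience difference at the chosen level. The parametric and rank-based tests named above would normally come from a statistics library rather than being hand-written.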

LLM Summary DSS (Aghaebe et al., 8 Nov 2025)

Empirical DSS distributions by age group and model:

Age Group	Model	ERS	HP	Omission	DSS
Adults	GPT-4.1 Nano	0.81	0.12	0.19	0.69
Adults	Qwen-2.5	0.78	0.74	0.22	0
Adults	Longformer	0.45	0.18	0.50	0.27
Children	GPT-4.1	0.84	0.12	0.16	0.72
Children	Qwen-2.5	0.97	0.58	0.02	0
Children	Longformer	0.91	0.33	0.09	0.63
Older adults	GPT-4.1	0.92	0.14	0.08	0.78
Older adults	Qwen-2.5	0.98	0.11	0.02	0.79
Older adults	Longformer	0.95	0.07	0.05	0.78

Key observations:

DSS $>0.8$: Very high fidelity. No cell in the table above exceeds this threshold; older adults are the best-preserved group and come closest, at 0.78 to 0.79.
DSS $0.5$–$0.8$: Moderate fidelity (children).
DSS $<0.5$: Poor fidelity (adults under-represented, frequent omission/hallucination).
Qwen-2.5 produces high entity counts but simultaneously high hallucination rates, nullifying DSS gains ($DSS\approx0$ for adults/children).
GPT-4.1 Nano exhibits balanced, consistently superior demographic fidelity.
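
The fidelity bands above can be applied mechanically to the reported scores. The sketch below copies the DSS column from the table; the function name and the treatment of the boundary values 0.5 and 0.8 as "moderate" are assumptions.

```python
def fidelity_band(dss):
    """Map a DSS value to the three interpretive bands listed above."""
    if dss > 0.8:
        return "very high"
    if dss >= 0.5:
        return "moderate"
    return "poor"

# DSS values as reported in the table, keyed by (age group, model).
reported = {
    ("Adults", "GPT-4.1 Nano"): 0.69,
    ("Adults", "Qwen-2.5"): 0.0,
    ("Adults", "Longformer"): 0.27,
    ("Children", "GPT-4.1"): 0.72,
    ("Children", "Qwen-2.5"): 0.0,
    ("Children", "Longformer"): 0.63,
    ("Older adults", "GPT-4.1"): 0.78,
    ("Older adults", "Qwen-2.5"): 0.79,
    ("Older adults", "Longformer"): 0.78,
}

bands = {key: fidelity_band(value) for key, value in reported.items()}
poor_cells = sorted(key for key, band in bands.items() if band == "poor")
```

Applied to the table, three cells fall in the poor band (Qwen-2.5 for adults and children, Longformer for adults), and every older-adult cell lands in the moderate band, just under the 0.8 cutoff.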

4. Conceptual Significance and Use Cases

DSS provides:

An interpretable score for demographic prominence, facilitating direct comparison across user groups or model outputs.
A mechanism for identifying "bridging" influencers: high DSS gap (ego-centric vs. audience-centric) indicates capacity to channel diverse perspectives (e.g., Dolly Parton as a bridge to women, Allen Iverson to communities of color).
In generative systems, DSS quantifies the preservation of demographic specificity, supporting diagnostic and regulatory practices in biomedical evidence synthesis.

A plausible implication is the adoption of DSS as a diagnostic tool for identifying representational bias, evaluating fairness interventions, and guiding post-hoc review protocols where demographic fidelity is critical (e.g., medical guideline summarization, social platform diversity analysis).

5. Limitations and Recommendations

Limitations:

Coverage: DSS as implemented explores only select axes—race, gender, religion, politics, LGBTQIA+ (influencer graphs) or age (LLM summaries); latent/confounding attributes (age, occupation, nationality) are acknowledged but not measured.
Entity extraction and tagging: Reliance on Wikipedia categories and curated lists introduces selection and coverage bias.
Proxy nature: Following patterns and summary entity retention are noisy proxies; causal inference regarding underlying divergence drivers is explicitly out of scope.
Generalization: Weighting and normalization choices (e.g., $\tau=0.7$, $\alpha=\gamma=2$) may require contextual tuning.

Recommendations:

Incorporate DSS alongside standard metrics (BLEU, BERTScore, FactCC) to detect representational bias.
Use demographic-aware prompting with caution: slot-filling or two-stage pipelines may control over-generation and hallucination.
Apply post-hoc entity-to-gold matching and review protocols, flagging summaries when DSS $<0.5$.
Extend DSS to additional dimensions (gender, race), and explore demographic-anchored objectives in model training/fine-tuning.
Expand real-time monitoring, feedback dashboards, and intersectional/dynamic DSS analytics for high-stakes intervention.

6. Illustrative Examples and Practical Impact

Social Identity Bridging: Dolly Parton's ego-centric gender score ($e_i^{gender}\approx+2.9$) vs. audience mean ($a_i^{gender}\approx-0.4$) produces a DSS gap of $3.3$, exemplifying her role as a conduit for women's representation.
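
The gap in this example is the ego-centric score minus the audience-centric score, and ranking influencers by it is one simple way to surface bridging candidates. In the sketch below only the Dolly Parton figures come from the text; the second entry and the function name are hypothetical.

```python
def dss_gap(ego_score, audience_score):
    """Ego-centric minus audience-centric salience for one influencer and dimension."""
    return ego_score - audience_score

# (ego-centric, audience-centric) gender scores; the second entry is made up.
profiles = {
    "Dolly Parton": (2.9, -0.4),
    "hypothetical_influencer": (0.3, 0.2),
}

gaps = {name: dss_gap(e, a) for name, (e, a) in profiles.items()}
ranked = sorted(gaps, key=gaps.get, reverse=True)   # largest gap first
```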

Generative Model Hallucination: Qwen in a child-focused review retained most gold age entities (high ERS) but fabricated "prisoners" as a descriptor (high HP), resulting in $DSS\approx0$. Longformer's omission of "midlife women" in adult reviews yielded $ERS\approx0.45$, $DSS\approx0.27$. For older adults, all models demonstrated high retention ($DSS\geq0.78$).
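
As a worked check, the Longformer adult figures can be reproduced from the Section 1 formulas under settings that the text does not confirm for this row: $\alpha=\gamma=2$ (the values quoted under Limitations), $N=1$, and no over-length penalty ($OP=0$), with $HP=0.18$ taken from the results table:

$DSS_{raw} = 2 \cdot 0.45 - 2 \cdot (0.18 + 0) = 0.54, \quad DSS = \max\left(0, \frac{0.54}{2 \cdot 1}\right) = 0.27$

Under the same assumptions the Qwen-2.5 child row ($ERS=0.97$, $HP=0.58$) would give $0.39$ rather than $0$, so reaching zero would require $OP \geq 0.39$ or different settings; the text does not say which applies.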

A plausible implication is that DSS can reveal both overt and latent representational gaps in machine-generated and social attention landscapes, guiding interventions for improved demographic coverage and fidelity.

7. Extensions, Future Directions, and Contextual Integration

Proposed DSS extensions involve:

Incorporating additional data modalities (retweet/mention graphs), developing intersectional salience analytics (e.g., race $\times$ gender), and supporting time-varying, event-driven salience monitoring.
Broadening the schema to encompass further axes of sensitive information, tailored to the demands of fairness-aware systems, federated social analysis, and biomedical NLP.
Application in evaluating pipeline-level bias, guiding influencer network gatekeeper identification, diversity interventions, and real-time demographic shift reporting.

Collectively, the Demographic Salience Score forms an evidentiary and analytical backbone for quantitative demographic analysis in both network-centric and generative AI workflows, enabling systematic diagnosis and remediation of under-represented group bias and facilitating more equitable information ecosystems.


References

Fulay et al. (2023). Divergences in Following Patterns between Influential Twitter Users and Their Audiences across Dimensions of Identity.

Aghaebe et al. (8 Nov 2025). LLMs Do Not See Age: Assessing Demographic Bias in Automated Systematic Review Synthesis.
