Papers
Topics
Authors
Recent
Search
2000 character limit reached

Unified Generative Recommender Architectures

Updated 15 August 2025
  • Unified generative recommender architectures are models that recast recommendation subtasks as sequence generation, enabling end-to-end parameter sharing and semantic grounding.
  • They tokenize diverse inputs into semantically meaningful representations using techniques like hierarchical quantization and Transformers to fuse multi-modal and cross-domain information.
  • Empirical evaluations show improved ranking metrics, efficient cold-start handling, and scalable deployment across industrial and academic benchmarks.

Unified generative recommender architectures refer to models that recast the recommendation task as a sequence or content generation problem, often within a single system that unites formerly distinct recommendation subtasks such as retrieval, ranking, explanation, cross-domain transfer, and even multi-modal personalization. By adopting generative paradigms––typically leveraging LLMs, sequence-to-sequence Transformers, or specially devised tokenization mechanisms––these architectures move beyond discriminative, cascaded pipelines and aim for end-to-end, parameter-shared, semantically grounded, and highly adaptable solutions.

1. Foundational Principles and Motivations

Unified generative recommender architectures are fundamentally motivated by limitations in traditional pipelines, such as:

  • Pipeline fragmentation and information loss: Conventional multi-stage systems (e.g., retrieve–rank–rerank) often suffer from suboptimal parameter sharing and information leakage between stages (Zhang et al., 23 Apr 2025).
  • Cold-start and domain transfer barriers: ID-based systems are brittle in cold-start situations and require domain-specific retraining, hindering adaptation and cross-domain generalization (Jiang et al., 6 Jun 2025).
  • Multiplicity of models and inefficiency: Maintaining separate models for retrieval, ranking, generation, and explanation is resource-intensive and constrains deployment scalability (Cui et al., 2022).

Unified generative architectures address these challenges by:

2. Semantic Tokenization and Item Representation

A pivotal innovation in unified generative recommender architectures is the use of discrete, semantic tokenization to replace or augment traditional unique item IDs:

Tokenization Approach Embedding Source Generalization
RQ-VAE/Hierarchical Text/Content, CF, Image Multi-domain, cross-modality
FSQ Text (MPNet/BERT) Domain-invariant, cold-start
Bi-encoder Joint Search + Rec (ENMF) Balanced cross-task performance

3. Unified Sequence Generation and Task Handling

Unified generative architectures recast recommendation subtasks as generation over sequences:

  • Sequence-to-sequence models: Both user histories and items are encoded as sequences of tokens, and the generative model is trained to predict the next item(s) or content (Rajput et al., 2023, Wang et al., 2024, Deng et al., 26 Feb 2025).
  • Session-wise and listwise generation: Rather than predicting items one at a time, some frameworks generate entire recommendation sessions or slates, capturing intra-list correlations and diversity (Liu et al., 2023, Deng et al., 26 Feb 2025).
  • Constrained generative retrieval: Catalog-aware beam search with Trie-based prefix matching ensures generated sequences correspond to real items amidst enormous candidate spaces (Jiang et al., 6 Jun 2025, Rajput et al., 2023).
  • Integration of retrieval and ranking: End-to-end frameworks unify retrieval and ranking processes within a single model via shared sequence generation, with inter-stage enhancer modules and gradient-guided weighting to synchronize and optimize both objectives (Zhang et al., 23 Apr 2025, Deng et al., 26 Feb 2025).

4. Cross-Task and Multi-Modal Knowledge Fusion

Advanced unified architectures leverage the fusion of knowledge across tasks, domains, or modalities:

5. Evaluation Metrics, Empirical Results, and Trade-offs

Unified generative recommender architectures have been empirically validated across multiple dimensions:

6. Theoretical Perspectives and Future Directions

Several works ground the unification of generative architectures in formal or theoretical considerations:

  • Information theory: The GenSR framework justifies prompt-based task partitioning by showing that it increases mutual information between input features and outputs, thus reducing gradient conflict and manual design overhead for multi-task models (Zhao et al., 9 Apr 2025).
  • Representation regularization: Joint training across tasks (search, recommendation) regularizes both popularity estimation and item latent representations, improving robustness and coverage in real-world scenarios (Penha et al., 2024, Shi et al., 8 Apr 2025).
  • Scaling and deployment: The ability to train on heterogeneous, large-scale data—paired with rapid adaptation to new tasks, users, or domains—positions these models as general-purpose foundations for recommendation and information retrieval systems in industry (Cui et al., 2022, Jiang et al., 6 Jun 2025).

Anticipated future directions include:

7. Practical and Industrial Impact

Unified generative recommender architectures are now demonstrated not only on academic benchmarks but also in large-scale production environments:


In sum, unified generative recommender architectures represent a paradigm shift in recommender system design: using discrete, semantically meaningful token representations, generation-centric models, and shared parameterization to enable transferability, efficiency, and cross-task flexibility. By integrating learnings from foundation modeling, semantic and collaborative information fusion, instruction prompting, and large-scale empirical validation, these architectures are steadily redefining best practices in both research and industrial deployment of recommender systems.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (17)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Unified Generative Recommender Architectures.