Sufficiency of Compute in Processing-in-Memory under DRAM Process Constraints
Determine whether processing-in-memory (PIM) architectures fabricated in DRAM technology process nodes can provide sufficient compute capability given the very limited power and thermal budgets of memory dies, particularly in the context of large language model inference workloads.
References
It is also unclear if the compute can be sufficient in PIM given the very limited budget for power and thermal of a DRAM technology process node.
— Challenges and Research Directions for Large Language Model Inference Hardware
(2601.05047 - Ma et al., 8 Jan 2026) in Section 2: Processing-Near-Memory for high bandwidth (preceding Table 4)