Papers
Topics
Authors
Recent
Search
2000 character limit reached

Miss Rate (MR): Definition & Applications

Updated 16 January 2026
  • Miss Rate (MR) is a metric that quantifies the probability of a missed event, such as cache misses or deadline misses in real-time systems.
  • Researchers apply a range of techniques, including exact combinatorial methods and Che's approximation, to compute MR in various domains.
  • Optimizing MR is crucial for enhancing system efficiency, balancing trade-offs between energy, latency, and resource allocation in applications like caching and WLAN.

Miss Rate (MR) is a fundamental metric quantifying the proportion of expected retrieval or access events that result in a miss, i.e., the desired item, measurement, or result is absent or not received. MR serves as a performance indicator for diverse systems, including caching mechanisms, networked measurement acquisition, and real-time scheduling. It is typically formalized as the long-run ratio of misses to total expected accesses, with precise mathematical definitions varying across domains but consistently reflecting the probability or fraction of missed events conditioned on expected opportunity.

1. Formal Definitions and Domain-Specific Formulations

In caching, the miss rate MR\mathrm{MR} for a cache of size %%%%1%%%% under the independent reference model (IRM) is defined as the steady-state probability that a requested item is not resident in the cache, equivalently the long-run fraction of cache requests incurring a miss:

MR(C)=P{miss at a generic request}\mathrm{MR}(C) = P\{\text{miss at a generic request}\}

This complements the hit rate and, for LRU caching, relates to the probability that the stack distance for the requested item exceeds the cache capacity (Berthet, 2016, Berthet, 2017).

For periodic soft real-time systems, deadline miss rate (DMR) is given by

DMRN=1Nj=1NDM(j)\mathrm{DMR}_N = \frac{1}{N} \sum_{j=1}^N \mathrm{DM}(j)

where DM(j)\mathrm{DM}(j) is an indicator for whether job jj missed its deadline. The long-run DMR is defined as the almost sure limit as NN \rightarrow \infty (Chen et al., 2024).

In networked measurement domains, such as WLAN-based indoor positioning, MR for received signal strength (RSS) samples is: MR=NexpNobsNexp=1NobsNexp\mathrm{MR} = \frac{N_{\rm exp} - N_{\rm obs}}{N_{\rm exp}} = 1 - \frac{N_{\rm obs}}{N_{\rm exp}} where NexpN_{\rm exp} is the expected number of samples and NobsN_{\rm obs} the number actually observed in a window (Schmidt et al., 2019).

In constrained on-device expert caching such as SliceMoE, MR is formulated as: MR=#cache misses#expert-selection events\mathrm{MR} = \frac{\# \text{cache misses}}{\# \text{expert-selection events}} and is treated as a constraint in energy-minimization optimization (Choi et al., 15 Dec 2025).

2. Exact and Approximate MR Computation in Caching

In LRU caching, MR admits both exact combinatorial and approximate analytical formulations. King's 1971 formula computes MR via summation over all subsets and permutations,

MR[j]=1J=jIJ\mathrm{MR}[j] = 1 - \sum_{|J|=j} I_J

where IJI_J aggregates products over probabilities pip_i and complements 1pi1-p_i according to subset and ordering structure (Berthet, 2016). Flajolet et al. (1992) provide an equivalent integral-generating function formula, with correspondence proven by combinatorial means.

For large-scale or complex access patterns, exact MR computations are combinatorially infeasible. Che's approximation uses the concept of characteristic time τ\tau,

MRChe(C)i=1mpiepiτ\mathrm{MR}_{\rm Che}(C) \approx \sum_{i=1}^m p_i e^{-p_i \tau}

where τ\tau is the unique time such that i=1m(1epiτ)=C\sum_{i=1}^m (1-e^{-p_i \tau})=C. Under non-stationary request processes, corrections to Che's approximation are explicitly quantifiable, with asymptotic expansions providing order-O(1/C)O(1/C) error bounds (Olmos et al., 2015).

Table 1: Representative Cache MR Formulas

Formula Type Expression Remarks
King's exact (IRM) 1J=jIJ1-\sum_{|J|=j} I_J Permutations over all subsets
Flajolet's integral Integral+coefficient extraction (see above) Equivalent to King by combinatorial lemma
Che's approximation ipiepiτ\sum_i p_i e^{-p_i \tau} Characteristic time-based
Power-law closed-form [1705] aHN1aHN,aE1()\frac{a H_N^{1-a}}{H_{N,a}} E_1(\cdots) For Zipf/popularity laws

3. Analytical Models and Asymptotics

Under IRM and power-law popularity (piiap_i \sim i^{-a}), MR exhibits distinct asymptotic regimes depending on the tail index aa, cache fraction θ=D/N\theta=D/N, and request distribution. For heavy-tail (a>1a>1), small cache sizes yield suboptimal LRU performance relative to static caching; conversely, Zipf (a=1a=1) and light-tail ($0Berthet, 2017).

When modeling networked measurement or packet arrivals, MR can be interpreted via Poisson or binomial point process approximations—e.g., the Poisson arrival rate λeff=pλmax\lambda_{\rm eff}=p\lambda_{\max} yields

MR=1λobsλmax\mathrm{MR} = 1 - \frac{\lambda_{\rm obs}}{\lambda_{\max}}

which generalizes to binomial-loss models depending on packet success probability (Schmidt et al., 2019).

Table 2: MR Asymptotics in LRU Cache under Power-law Demand

Case MR Behavior Static Optimality
a>1a>1 (heavy-tail) D1/a1\sim D^{1/a-1} LRU can be \gg static
a=1a=1 (Zipf) (1θ)ln11θ(1-\theta)\ln \frac{1}{1-\theta} Near optimal
$0 (1θ)1a(1-\theta)^{1-a} Optimal as θ0\theta\to0

4. Impact, Measurement, and Optimization

MR directly governs system efficiency, energy usage, and latency. For instance, fast-rate WLAN RSS acquisition reduces MR from 41.7% (normal mode, –80 dBm) to 5.6% (monitor mode), yielding >7× improvement and accelerating indoor navigation survey duration by an order of magnitude (Schmidt et al., 2019). In SliceMoE inference, DRAM cache allocation subject to a miss-rate constraint enables model designers to minimize energy while bounding Flash retrieval events: minC,P  E(C,P)s.t.MR(C,P)ε\min_{C,P}\; E(C,P) \quad \text{s.t.} \quad \mathrm{MR}(C,P)\leq\varepsilon (Choi et al., 15 Dec 2025). MR can be fine-tuned via dynamic bit-sliced caching, precision routing, and predictive cache warmup.

Deadline miss rate in periodic scheduling is amenable to rigorous analysis using finite-state Markov chains whose stationary distribution encodes the long-run MR, with system parameters (e.g., dismiss point δ\delta) allowing designers to interpolate between immediate dismiss and never dismiss regimes (Chen et al., 2024).

5. Practical Trade-offs, Optimization, and Policy Design

Miss rate acts as a pivotal constraint and objective in system optimization: MR can be minimized at the expense of increased memory footprint, energy, or latency in hardware-constrained environments. In MoE models, bit-sliced caching and calibration-free quantization free up DRAM for storing critical expert slices, directly lowering MR under a fixed capacity (Choi et al., 15 Dec 2025). In general, tuning eviction policies, replacement strategies, and prefetching approaches can mitigate the adverse effects of high miss rates.

In real-time scheduling, the design of the dismissal policy (parameter δ\delta) quantitatively alters the long-run deadline miss rate and the trade-off between tardy task execution and interference suppression (Chen et al., 2024). In caching, the choice among LRU, static, and adaptive policies imparts distinct MR profiles, with replacement optimization—particularly under skewed or clustered access patterns—yielding significant practical gains.

6. Algorithmic and Implementation Notes

Exact MR formulas are combinatorially expensive; dynamic programming recurrences and integral approximations greatly enhance tractability for realistic system sizes (Berthet, 2016). In performance-sensitive domains, MR estimation via Che-type approximations, Poisson/TTL modeling, and clustering-aware corrections provide robust, scalable solutions (Olmos et al., 2015, Berthet, 2017). Algorithmic best practices involve exploiting monotonicity, set-partitioning, and access frequency statistics to manage or predict MR.

Table 3: MR Policies and Techniques

System MR Definition Policy/Optimization
LRU Cache Fraction of cache misses Che’s approximation, static
WLAN IPS [1907] Fraction of lost RSS samples Monitor mode, faster survey
MoE Inference DRAM cache misses DBSC, AMAT, PCW, constraint
Soft RT [2401] Fraction of jobs missing DL Markov-chain tuning, dismiss

7. Conceptual Significance and Directions

Miss rate provides a unifying lens for analyzing retrieval, scheduling, and resource-constrained systems. Its interpretation encompasses classical probability, queueing theory, combinatorial analysis, and stochastic process modeling. MR figures centrally in both theoretical performance bounds and practical user experience, guiding system evolution in domains from processor caches to real-time data acquisition and model serving. Future work extends MR analysis to non-iid, non-stationary, and adversarial access patterns, distributed caching, and energy-aware embedded inference.

The application-specific character of MR, together with its formal equivalence across diverse computational domains, underpins its continued centrality as a design metric and analytical construct.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Miss Rate (MR).