Papers
Topics
Authors
Recent
Search
2000 character limit reached

On Estimating the First Frequency Moment of Data Streams

Published 5 May 2010 in cs.DS | (1005.0809v1)

Abstract: Estimating the first moment of a data stream defined as $F_1 = \sum_{i \in {1, 2, \ldots, n}} \abs{f_i}$ to within $1 \pm \epsilon$-relative error with high probability is a basic and influential problem in data stream processing. A tight space bound of $O(\epsilon{-2} \log (mM))$ is known from the work of [Kane-Nelson-Woodruff-SODA10]. However, all known algorithms for this problem require per-update stream processing time of $\Omega(\epsilon{-2})$, with the only exception being the algorithm of [Ganguly-Cormode-RANDOM07] that requires per-update processing time of $O(\log2(mM)(\log n))$ albeit with sub-optimal space $O(\epsilon{-3}\log2(mM))$. In this paper, we present an algorithm for estimating $F_1$ that achieves near-optimality in both space and update processing time. The space requirement is $O(\epsilon{-2}(\log n + (\log \epsilon{-1})\log(mM)))$ and the per-update processing time is $O( (\log n)\log (\epsilon{-1}))$.

Summary

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (2)

Collections

Sign up for free to add this paper to one or more collections.